|
Name |
Accession |
Description |
Interval |
E-value |
| CH_PLEC-like_rpt1 |
cd21188 |
first calponin homology (CH) domain found in the plectin/dystonin/MACF1 family; This family ... |
41-150 |
2.19e-74 |
|
first calponin homology (CH) domain found in the plectin/dystonin/MACF1 family; This family includes plectin, dystonin and microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 (MACF1). Plectin, also called PCN, PLTN, hemidesmosomal protein 1 (HD1), or plectin-1, is a structural component of muscle. It interlinks intermediate filaments with microtubules and microfilaments, and anchors intermediate filaments to desmosomes or hemidesmosomes. It could also bind muscle proteins such as actin to membrane complexes in muscle. Dystonin, also called 230 kDa bullous pemphigoid antigen, 230/240 kDa bullous pemphigoid antigen, bullous pemphigoid antigen 1 (BPA or BPAG1), dystonia musculorum protein, or hemidesmosomal plaque protein, is a cytoskeletal linker protein that acts as an integrator of intermediate filaments, actin, and microtubule cytoskeleton networks. It is required for anchoring either intermediate filaments to the actin cytoskeleton in neural and muscle cells, or keratin-containing intermediate filaments to hemidesmosomes in epithelial cells. MACF1, also called 620 kDa actin-binding protein (ABP620), actin cross-linking family protein 7 (ACF7), macrophin-1, or trabeculin-alpha, is a large protein containing numerous spectrin and leucine-rich repeat (LRR) domains. It facilitates actin-microtubule interactions at the cell periphery and couples the microtubule network to cellular junctions. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409037 Cd Length: 105 Bit Score: 243.46 E-value: 2.19e-74
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 41 DRVQKKTFTKWVNKHLIKHWRaeaqrHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRHRQV 120
Cdd:cd21188 1 DAVQKKTFTKWVNKHLIKARR-----RVVDLFEDLRDGHNLISLLEVLSGESLPRERGRMRFHRLQNVQTALDFLKYRKI 75
|
90 100 110
....*....|....*....|....*....|
gi 1920237968 121 KLVNIRNDDIADGNPKLTLGLIWTIILHFQ 150
Cdd:cd21188 76 KLVNIRAEDIVDGNPKLTLGLIWTIILHFQ 105
|
|
| CH_PLEC_rpt1 |
cd21235 |
first calponin homology (CH) domain found in plectin and similar proteins; Plectin, also ... |
38-161 |
2.25e-71 |
|
first calponin homology (CH) domain found in plectin and similar proteins; Plectin, also called PCN, PLTN, hemidesmosomal protein 1 (HD1), or plectin-1, is a structural component of muscle. It interlinks intermediate filaments with microtubules and microfilaments, and anchors intermediate filaments to desmosomes or hemidesmosomes. It can also bind muscle proteins such as actin to membrane complexes in muscle. Plectin contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409084 Cd Length: 119 Bit Score: 235.30 E-value: 2.25e-71
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 38 DERDRVQKKTFTKWVNKHLIKhwraeAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRH 117
Cdd:cd21235 1 DERDRVQKKTFTKWVNKHLIK-----AQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRH 75
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 1920237968 118 RQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVSGQSE 161
Cdd:cd21235 76 RQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVSGQSE 119
|
|
| CH_PLEC_rpt2 |
cd21238 |
second calponin homology (CH) domain found in plectin and similar proteins; Plectin, also ... |
163-268 |
1.08e-69 |
|
second calponin homology (CH) domain found in plectin and similar proteins; Plectin, also called PCN, PLTN, hemidesmosomal protein 1 (HD1), or plectin-1, is a structural component of muscle. It interlinks intermediate filaments with microtubules and microfilaments and anchors intermediate filaments to desmosomes or hemidesmosomes. It can also bind muscle proteins such as actin to membrane complexes in muscle. Plectin contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409087 Cd Length: 106 Bit Score: 229.91 E-value: 1.08e-69
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 163 MTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 242
Cdd:cd21238 1 MTAKEKLLLWSQRMVEGYQGLRCDNFTSSWRDGRLFNAIIHRHKPMLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 80
|
90 100
....*....|....*....|....*.
gi 1920237968 243 PEDVDVPQPDEKSIITYVSSLYDAMP 268
Cdd:cd21238 81 PEDVDVPQPDEKSIITYVSSLYDAMP 106
|
|
| CH_DYST_rpt1 |
cd21236 |
first calponin homology (CH) domain found in dystonin and similar proteins; Dystonin, also ... |
37-159 |
1.71e-69 |
|
first calponin homology (CH) domain found in dystonin and similar proteins; Dystonin, also called 230 kDa bullous pemphigoid antigen, 230/240 kDa bullous pemphigoid antigen, bullous pemphigoid antigen 1 (BPA or BPAG1), dystonia musculorum protein, or hemidesmosomal plaque protein, is a cytoskeletal linker protein that acts as an integrator of intermediate filaments, actin, and microtubule cytoskeleton networks. It is required for anchoring either intermediate filaments to the actin cytoskeleton in neural and muscle cells, or keratin-containing intermediate filaments to hemidesmosomes in epithelial cells. Dystonin contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409085 Cd Length: 128 Bit Score: 230.26 E-value: 1.71e-69
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 37 QDERDRVQKKTFTKWVNKHLIKhwraeAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLR 116
Cdd:cd21236 11 KDERDKVQKKTFTKWINQHLMK-----VRKHVNDLYEDLRDGHNLISLLEVLSGDTLPREKGRMRFHRLQNVQIALDYLK 85
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 1920237968 117 HRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVSGQ 159
Cdd:cd21236 86 RRQVKLVNIRNDDITDGNPKLTLGLIWTIILHFQISDIHVTGE 128
|
|
| CH_PLEC-like_rpt2 |
cd21189 |
second calponin homology (CH) domain found in the plectin/dystonin/MACF1 family; This family ... |
164-268 |
3.84e-64 |
|
second calponin homology (CH) domain found in the plectin/dystonin/MACF1 family; This family includes plectin, dystonin and microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 (MACF1). Plectin, also called PCN, PLTN, hemidesmosomal protein 1 (HD1), or plectin-1, is a structural component of muscle. It interlinks intermediate filaments with microtubules and microfilaments, and anchors intermediate filaments to desmosomes or hemidesmosomes. It could also bind muscle proteins such as actin to membrane complexes in muscle. Dystonin, also called 230 kDa bullous pemphigoid antigen, 230/240 kDa bullous pemphigoid antigen, bullous pemphigoid antigen 1 (BPA or BPAG1), dystonia musculorum protein, or hemidesmosomal plaque protein, is a cytoskeletal linker protein that acts as an integrator of intermediate filaments, actin, and microtubule cytoskeleton networks. It is required for anchoring either intermediate filaments to the actin cytoskeleton in neural and muscle cells, or keratin-containing intermediate filaments to hemidesmosomes in epithelial cells. MACF1, also called 620 kDa actin-binding protein (ABP620), actin cross-linking family protein 7 (ACF7), macrophin-1, or trabeculin-alpha, is a large protein containing numerous spectrin and leucine-rich repeat (LRR) domains. It facilitates actin-microtubule interactions at the cell periphery and couples the microtubule network to cellular junctions. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409038 Cd Length: 105 Bit Score: 214.18 E-value: 3.84e-64
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 164 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 243
Cdd:cd21189 1 SAKEALLLWARRTTEGYPGVRVTNFTSSWRDGLAFNAIIHRNRPDLIDFRSVRNQSNRENLENAFNVAEKEFGVTRLLDP 80
|
90 100
....*....|....*....|....*
gi 1920237968 244 EDVDVPQPDEKSIITYVSSLYDAMP 268
Cdd:cd21189 81 EDVDVPEPDEKSIITYVSSLYDVFP 105
|
|
| CH_MACF1_rpt1 |
cd21237 |
first calponin homology (CH) domain found in microtubule-actin cross-linking factor 1, ... |
38-160 |
5.38e-61 |
|
first calponin homology (CH) domain found in microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 (MACF1) and similar proteins; MACF1, also called 620 kDa actin-binding protein (ABP620), actin cross-linking family protein 7 (ACF7), macrophin-1, or trabeculin-alpha, is a large protein containing numerous spectrin and leucine-rich repeat (LRR) domains. It facilitates actin-microtubule interactions at the cell periphery and couples the microtubule network to cellular junctions. MACF1 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409086 Cd Length: 118 Bit Score: 205.65 E-value: 5.38e-61
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 38 DERDRVQKKTFTKWVNKHLIKhwraeAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRH 117
Cdd:cd21237 1 DERDRVQKKTFTKWVNKHLMK-----VRKHINDLYEDLRDGHNLISLLEVLSGVKLPREKGRMRFHRLQNVQIALDFLKQ 75
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 1920237968 118 RQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVSGQS 160
Cdd:cd21237 76 RQVKLVNIRNDDITDGNPKLTLGLIWTIILHFQISDIYISGES 118
|
|
| CH_DYST_rpt2 |
cd21239 |
second calponin homology (CH) domain found in dystonin and similar proteins; Dystonin, also ... |
164-268 |
1.43e-56 |
|
second calponin homology (CH) domain found in dystonin and similar proteins; Dystonin, also called 230 kDa bullous pemphigoid antigen, 230/240 kDa bullous pemphigoid antigen, bullous pemphigoid antigen 1 (BPA or BPAG1), dystonia musculorum protein, or hemidesmosomal plaque protein, is a cytoskeletal linker protein that acts as an integrator of intermediate filaments, actin, and microtubule cytoskeleton networks. It is required for anchoring either intermediate filaments to the actin cytoskeleton in neural and muscle cells, or keratin-containing intermediate filaments to hemidesmosomes in epithelial cells. Dystonin contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409088 Cd Length: 104 Bit Score: 192.51 E-value: 1.43e-56
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 164 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLLDP 243
Cdd:cd21239 1 SAKERLLLWSQQMTEGYTGIRCENFTTCWRDGRLFNAIIHKYRPDLIDMNTVAVQSNLANLEHAFYVAEK-LGVTRLLDP 79
|
90 100
....*....|....*....|....*
gi 1920237968 244 EDVDVPQPDEKSIITYVSSLYDAMP 268
Cdd:cd21239 80 EDVDVSSPDEKSVITYVSSLYDVFP 104
|
|
| CH_MACF1_rpt2 |
cd21240 |
second calponin homology (CH) domain found in microtubule-actin cross-linking factor 1, ... |
162-268 |
6.26e-50 |
|
second calponin homology (CH) domain found in microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 (MACF1) and similar proteins; MACF1, also called 620 kDa actin-binding protein (ABP620), actin cross-linking family protein 7 (ACF7), macrophin-1, or trabeculin-alpha, is a large protein containing numerous spectrin and leucine-rich repeat (LRR) domains. It facilitates actin-microtubule interactions at the cell periphery and couples the microtubule network to cellular junctions. MACF1 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409089 Cd Length: 107 Bit Score: 173.30 E-value: 6.26e-50
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 162 DMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLL 241
Cdd:cd21240 2 DMSAKEKLLLWTQKVTAGYTGIKCTNFSSCWSDGKMFNALIHRYRPDLVDMERVQIQSNRENLEQAFEVAER-LGVTRLL 80
|
90 100
....*....|....*....|....*..
gi 1920237968 242 DPEDVDVPQPDEKSIITYVSSLYDAMP 268
Cdd:cd21240 81 DAEDVDVPSPDEKSVITYVSSIYDAFP 107
|
|
| CH_DMD-like_rpt1 |
cd21186 |
first calponin homology (CH) domain found in the dystrophin family; The dystrophin family ... |
43-151 |
1.09e-48 |
|
first calponin homology (CH) domain found in the dystrophin family; The dystrophin family includes dystrophin and its paralog, utrophin. Dystrophin, encoded by the DMD gene, is a large, submembrane cytoskeletal protein that is the main component of the dystrophin-glycoprotein complex (DGC) in skeletal muscles. It links the transmembrane DGC to the actin cytoskeleton through binding strongly to the cytoplasmic tail of beta-dystroglycan, the transmembrane subunit of a highly O-glycosylated cell-surface protein. Dystrophin is also involved in maintaining the structural integrity of cells, as well as in the formation of the blood-brain barrier (BBB). Utrophin, also called dystrophin-related protein 1 (DRP-1), is an autosomal dystrophin homolog that increases dystrophic muscle function and reduces pathology. It is broadly expressed in both the mRNA and protein levels, and occurs in the cerebrovascular endothelium. Utrophin forms the utrophin-glycoprotein complex (UGC) by interacting with dystroglycans (DGs) and sarcoglycan-dystroglycans, as well as sarcoglycan and sarcospan (SG-SSPN) subcomplexes. It may act as a scaffolding protein that stabilizes lipid microdomains and clusters mechanosensitive channel subunits, and links the F-actin cytoskeleton to the cell membrane via the associated glycoprotein complex. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409035 Cd Length: 107 Bit Score: 169.87 E-value: 1.09e-48
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 43 VQKKTFTKWVNKHLIKhwraEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRHRQVKL 122
Cdd:cd21186 2 VQKKTFTKWINSQLSK----ANKPPIKDLFEDLRDGTRLLALLEVLTGKKLKPEKGRMRVHHLNNVNRALQVLEQNNVKL 77
|
90 100
....*....|....*....|....*....
gi 1920237968 123 VNIRNDDIADGNPKLTLGLIWTIILHFQI 151
Cdd:cd21186 78 VNISSNDIVDGNPKLTLGLVWSIILHWQV 106
|
|
| CH_SPTB-like_rpt1 |
cd21246 |
first calponin homology (CH) domain found in the beta-I spectrin-like subfamily; The beta-I ... |
30-147 |
7.94e-47 |
|
first calponin homology (CH) domain found in the beta-I spectrin-like subfamily; The beta-I spectrin-like family includes beta-I, -II, -III and -IV spectrins. Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. Beta-I spectrin, also called spectrin beta chain, erythrocytic (SPTB), may be involved in anaemia pathogenesis. Beta-II spectrin, also called spectrin beta chain, non-erythrocytic 1 (SPTBN1), or fodrin beta chain, is a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. Beta-III spectrin, also called spectrin beta chain, non-erythrocytic 2 (SPTBN2), or spinocerebellar ataxia 5 protein (SCA5), may play a crucial role as a longer actin-membrane cross-linker or fulfill the need for greater extensible flexibility than can be provided by the other smaller conventional spectrins. Beta-IV spectrin is also called spectrin, non-erythroid beta chain 3 (SPTBN3) or spectrin beta chain, non-erythrocytic 4 (SPTBN4). Its mutation associates with congenital myopathy, neuropathy, and central deafness. Members of this subfamily contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409095 Cd Length: 117 Bit Score: 164.85 E-value: 7.94e-47
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 30 RRRVQDEQDERDRVQKKTFTKWVNKHLIKHwraeaQRHISDLYEDLRDGHNLISLLEVLSGDSLPR-EKGRMRFHKLQNV 108
Cdd:cd21246 3 RSRIKALADEREAVQKKTFTKWVNSHLARV-----GCRINDLYTDLRDGRMLIKLLEVLSGERLPKpTKGKMRIHCLENV 77
|
90 100 110
....*....|....*....|....*....|....*....
gi 1920237968 109 QIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIIL 147
Cdd:cd21246 78 DKALQFLKEQRVHLENMGSHDIVDGNHRLTLGLIWTIIL 116
|
|
| CH_beta_spectrin_rpt2 |
cd21194 |
second calponin homology (CH) domain found in the beta spectrin family; The beta spectrin ... |
164-264 |
6.67e-45 |
|
second calponin homology (CH) domain found in the beta spectrin family; The beta spectrin family includes beta-I, -II, -III, -IV and -V spectrins. Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. Beta-I spectrin, also called spectrin beta chain, erythrocytic (SPTB), may be involved in anaemia pathogenesis. Beta-II spectrin, also called spectrin beta chain, non-erythrocytic 1 (SPTBN1), or fodrin beta chain, is a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. Beta-IV spectrin is also called spectrin, non-erythroid beta chain 3 (SPTBN3) or spectrin beta chain, non-erythrocytic 4 (SPTBN4). Its mutation associates with congenital myopathy, neuropathy, and central deafness. Beta-III spectrin is also called spectrin beta chain, non-erythrocytic 2 (SPTBN2), or spinocerebellar ataxia 5 protein (SCA5). Beta-V spectrin, also called spectrin beta chain, non-erythrocytic 5 (SPTBN5), is a mammalian ortholog of Drosophila beta H spectrin. Beta-III and Beta-V spectrins may play crucial roles as longer actin-membrane cross-linkers or fulfill the need for greater extensible flexibility than can be provided by the other smaller conventional spectrins. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409043 Cd Length: 105 Bit Score: 159.11 E-value: 6.67e-45
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 164 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 243
Cdd:cd21194 2 SAKDALLLWCQRKTAGYPGVNIQNFTTSWRDGLAFNALIHAHRPDLIDYNRLDPNDHLGNLNNAFDVAEQELGIAKLLDA 81
|
90 100
....*....|....*....|.
gi 1920237968 244 EDVDVPQPDEKSIITYVSSLY 264
Cdd:cd21194 82 EDVDVARPDEKSIMTYVASYY 102
|
|
| SAC6 |
COG5069 |
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton]; |
36-375 |
2.68e-44 |
|
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton];
Pssm-ID: 227401 [Multi-domain] Cd Length: 612 Bit Score: 173.20 E-value: 2.68e-44
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 36 EQDERDRVQKKTFTKWVNKHLIKhwraEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR--EKGRMRFHKLQNVQIALD 113
Cdd:COG5069 2 EAKKWQKVQKKTFTKWTNEKLIS----GGQKEFGDLDTDLKDGVKLAQLLEALQKDNAGEynETPETRIHVMENVSGRLE 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 114 YLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQvsgQSEDMTAKEKLLLWSQRMVEGCQ-GLRCDNFTTSW 192
Cdd:COG5069 78 FIKGKGVKLFNIGPQDIVDGNPKLILGLIWSLISRLTIATIN---EEGELTKHINLLLWCDEDTGGYKpEVDTFDFFRSW 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 193 RDGRLFNAIIHRHKPTLIDMNKVYRQTNLE--NLDQAFSVAERDLGVTRLLDPEDV-DVPQPDEKSIITYVS------SL 263
Cdd:COG5069 155 RDGLAFSALIHDSRPDTLDPNVLDLQKKNKalNNFQAFENANKVIGIARLIGVEDIvNVSIPDERSIMTYVSwyiirfGL 234
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 264 YD----AMPRVPDVQDGVKANElQLRwQEYRELVLLLLQWIRAHTAGFEERRFPSSFEEIEILWCQFLKFKETE--LPAK 337
Cdd:COG5069 235 LEkidiALHRVYRLLEADETLI-QLR-LPYEIILLRLLNLIHLKQANWKVVNFSKDVSDGENYTDLLNQLNALCsrAPLE 312
|
330 340 350
....*....|....*....|....*....|....*....
gi 1920237968 338 EAD-KNRSKGIYQSLEgAVQAGQLKVPPGYHPLDVEKEW 375
Cdd:COG5069 313 TTDlHSLAGQILQNAE-KYDCRKYLPPAGNPKLDLAFVA 350
|
|
| CH_SYNE1_rpt1 |
cd21241 |
first calponin homology (CH) domain found in synaptic nuclear envelope protein 1 and similar ... |
39-151 |
2.75e-44 |
|
first calponin homology (CH) domain found in synaptic nuclear envelope protein 1 and similar proteins; Synaptic nuclear envelope protein 1 (SYNE-1), also called nesprin-1, enaptin, KASH domain-containing protein 1 (KASH1), myocyte nuclear envelope protein 1 (MYNE-1), or nuclear envelope spectrin repeat protein 1, is a multi-isomeric modular protein which forms a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. SYNE-1 also acts as a component of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. SYNE-1 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409090 Cd Length: 113 Bit Score: 157.54 E-value: 2.75e-44
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 39 ERDRVQKKTFTKWVNKHLIKHwraEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRM--RFHKLQNVQIALDYLR 116
Cdd:cd21241 1 EQERVQKKTFTNWINSYLAKR---KPPMKVEDLFEDIKDGTKLLALLEVLSGEKLPCEKGRRlkRVHFLSNINTALKFLE 77
|
90 100 110
....*....|....*....|....*....|....*
gi 1920237968 117 HRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQI 151
Cdd:cd21241 78 SKKIKLVNINPTDIVDGKPSIVLGLIWTIILYFQI 112
|
|
| CH_SYNE-like_rpt1 |
cd21190 |
first calponin homology (CH) domain found in the synaptic nuclear envelope protein family; The ... |
39-151 |
4.51e-44 |
|
first calponin homology (CH) domain found in the synaptic nuclear envelope protein family; The synaptic nuclear envelope (SYNE) family includes SYNE-1, -2 and calmin. SYNE-1 (also called nesprin-1, enaptin, KASH domain-containing protein 1, KASH1, myocyte nuclear envelope protein 1, MYNE-1, or nuclear envelope spectrin repeat protein 1) and SYNE-2 (also called nesprin-2, KASH domain-containing protein 2, KASH2, nuclear envelope spectrin repeat protein 2, nucleus and actin connecting element protein, or protein NUANCE) may act redundantly. They are multi-isomeric modular proteins which form a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. They also act as components of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. Calmin, also called calponin-like transmembrane domain protein, is a protein with calponin homology (CH) and transmembrane domains expressed in maturing spermatogenic cells. It may be involved in the development and/or maintenance of neuronal functions. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409039 Cd Length: 113 Bit Score: 156.96 E-value: 4.51e-44
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 39 ERDRVQKKTFTKWVNKHLIKHWRAEAqrhISDLYEDLRDGHNLISLLEVLSGDSLPREKGRM--RFHKLQNVQIALDYLR 116
Cdd:cd21190 1 EQERVQKKTFTNWINSHLAKLSQPIV---INDLFVDIKDGTALLRLLEVLSGQKLPIESGRVlqRAHKLSNIRNALDFLT 77
|
90 100 110
....*....|....*....|....*....|....*
gi 1920237968 117 HRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQI 151
Cdd:cd21190 78 KRCIKLVNINSTDIVDGKPSIVLGLIWTIILYFQI 112
|
|
| CH_beta_spectrin_rpt1 |
cd21193 |
first calponin homology (CH) domain found in the beta spectrin family; The beta spectrin ... |
30-147 |
7.52e-44 |
|
first calponin homology (CH) domain found in the beta spectrin family; The beta spectrin family includes beta-I, -II, -III, -IV and -V spectrins. Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. Beta-I spectrin, also called spectrin beta chain, erythrocytic (SPTB), may be involved in anaemia pathogenesis. Beta-II spectrin, also called spectrin beta chain, non-erythrocytic 1 (SPTBN1), or fodrin beta chain, is a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. Beta-IV spectrin is also called spectrin, non-erythroid beta chain 3 (SPTBN3) or spectrin beta chain, non-erythrocytic 4 (SPTBN4). Its mutation associates with congenital myopathy, neuropathy, and central deafness. Beta-III spectrin is also called spectrin beta chain, non-erythrocytic 2 (SPTBN2), or spinocerebellar ataxia 5 protein (SCA5). Beta-V spectrin, also called spectrin beta chain, non-erythrocytic 5 (SPTBN5), is a mammalian ortholog of Drosophila beta H spectrin. Beta-III and Beta-V spectrins may play crucial roles as longer actin-membrane cross-linkers or fulfill the need for greater extensible flexibility than can be provided by the other smaller conventional spectrins. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409042 Cd Length: 116 Bit Score: 156.30 E-value: 7.52e-44
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 30 RRRVQDEQDERDRVQKKTFTKWVNKHLIKHwraeaQRHISDLYEDLRDGHNLISLLEVLSGDSLPR-EKGRMRFHKLQNV 108
Cdd:cd21193 3 KGRIRALQEERINIQKKTFTKWINSFLEKA-----NLEIGDLFTDLSDGKLLLKLLEIISGEKLGKpNRGRLRVQKIENV 77
|
90 100 110
....*....|....*....|....*....|....*....
gi 1920237968 109 QIALDYLrHRQVKLVNIRNDDIADGNPKLTLGLIWTIIL 147
Cdd:cd21193 78 NKALAFL-KTKVRLENIGAEDIVDGNPRLILGLIWTIIL 115
|
|
| CH_SPTB_like_rpt2 |
cd21248 |
second calponin homology (CH) domain found in the beta-I spectrin-like subfamily; The beta-I ... |
164-264 |
4.63e-43 |
|
second calponin homology (CH) domain found in the beta-I spectrin-like subfamily; The beta-I spectrin-like family includes beta-I, -II, -III and -IV spectrins. Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. Beta-I spectrin, also called spectrin beta chain, erythrocytic (SPTB), may be involved in anaemia pathogenesis. Beta-II spectrin, also called spectrin beta chain, non-erythrocytic 1 (SPTBN1), or fodrin beta chain, is a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. Beta-III spectrin, also called spectrin beta chain, non-erythrocytic 2 (SPTBN2), or spinocerebellar ataxia 5 protein (SCA5), may play a crucial role as a longer actin-membrane cross-linker or fulfill the need for greater extensible flexibility than can be provided by the other smaller conventional spectrins. Beta-IV spectrin is also called spectrin, non-erythroid beta chain 3 (SPTBN3) or spectrin beta chain, non-erythrocytic 4 (SPTBN4). Its mutation associates with congenital myopathy, neuropathy, and central deafness. Members of this subfamily contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409097 Cd Length: 105 Bit Score: 153.71 E-value: 4.63e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 164 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 243
Cdd:cd21248 2 SAKDALLLWCQMKTAGYPNVNVRNFTTSWRDGLAFNALIHKHRPDLIDYDKLSKSNALYNLQNAFNVAEQKLGLTKLLDP 81
|
90 100
....*....|....*....|.
gi 1920237968 244 EDVDVPQPDEKSIITYVSSLY 264
Cdd:cd21248 82 EDVNVEQPDEKSIITYVVTYY 102
|
|
| CH_SYNE1_rpt2 |
cd21243 |
second calponin homology (CH) domain found in synaptic nuclear envelope protein 1 (SYNE-1) and ... |
163-268 |
5.46e-41 |
|
second calponin homology (CH) domain found in synaptic nuclear envelope protein 1 (SYNE-1) and similar proteins; SYNE-1, also called nesprin-1, enaptin, KASH domain-containing protein 1 (KASH1), myocyte nuclear envelope protein 1 (MYNE-1), or nuclear envelope spectrin repeat protein 1, is a multi-isomeric modular protein which forms a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. SYNE-1 also acts as a component of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. SYNE-1 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409092 Cd Length: 109 Bit Score: 147.85 E-value: 5.46e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 163 MTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 242
Cdd:cd21243 4 GGAKKALLKWVQNAAAKRFGIEVKDFGPSWRDGVAFNAIIHSIRPDLVDMESLKRRSNRENLETAFTVAEKELGIPRLLD 83
|
90 100
....*....|....*....|....*.
gi 1920237968 243 PEDVDVPQPDEKSIITYVSSLYDAMP 268
Cdd:cd21243 84 PEDVDVDKPDEKSIMTYVAQFLKKYP 109
|
|
| CH_ACTN_rpt2 |
cd21216 |
second calponin homology (CH) domain found in the alpha-actinin family; The alpha-actinin ... |
151-266 |
8.64e-40 |
|
second calponin homology (CH) domain found in the alpha-actinin family; The alpha-actinin (ACTN) family includes alpha-actinin-1, -2, -3, and -4. They are F-actin cross-linking proteins which are thought to anchor actin to a variety of intracellular structures. ACTN1 mutations cause congenital macrothrombocytopenia. ACTN2 mutations are associated with cardiomyopathies, as well as skeletal muscle disorder. ACTN3 is critical in anchoring the myofibrillar actin filaments and plays a key role in muscle contraction. ACTN4 is associated with cell motility and cancer invasion. It is probably involved in vesicular trafficking via its association with the CART complex, which is necessary for efficient transferrin receptor recycling but not for epidermal growth factor receptor (EGFR) degradation. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409065 Cd Length: 115 Bit Score: 144.81 E-value: 8.64e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 151 ISDIQVsgqsEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSV 230
Cdd:cd21216 1 IQDISV----EELSAKEGLLLWCQRKTAPYKNVNVQNFHTSWKDGLAFCALIHRHRPDLLDYDKLRKDDPRENLNLAFDV 76
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237968 231 AERDLGVTRLLDPED-VDVPQPDEKSIITYVSSLYDA 266
Cdd:cd21216 77 AEKHLDIPKMLDAEDiVNTPRPDERSVMTYVSCYYHA 113
|
|
| CH_SYNE2_rpt1 |
cd21242 |
first calponin homology (CH) domain found in synaptic nuclear envelope protein 2; Synaptic ... |
39-151 |
2.75e-39 |
|
first calponin homology (CH) domain found in synaptic nuclear envelope protein 2; Synaptic nuclear envelope protein 2 (SYNE-2), also called nesprin-2, KASH domain-containing protein 2 (KASH2), nuclear envelope spectrin repeat protein 2, nucleus and actin connecting element protein, or protein NUANCE, is a multi-isomeric modular protein which forms a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. SYNE-2 also acts as a component of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. SYNE-2 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409091 Cd Length: 111 Bit Score: 143.05 E-value: 2.75e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 39 ERDRVQKKTFTKWVNKHLIKHWRAEAqrhISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRHR 118
Cdd:cd21242 1 EQEQTQKRTFTNWINSQLAKHSPPSV---VSDLFTDIQDGHRLLDLLEVLSGQQLPREKGHNVFQCRSNIETALSFLKNK 77
|
90 100 110
....*....|....*....|....*....|...
gi 1920237968 119 QVKLVNIRNDDIADGNPKLTLGLIWTIILHFQI 151
Cdd:cd21242 78 SIKLINIHVPDIIEGKPSIILGLIWTIILHFHI 110
|
|
| CH_SPTBN2_rpt1 |
cd21317 |
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 2 (SPTBN2) ... |
30-147 |
1.49e-38 |
|
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 2 (SPTBN2) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN2, also called beta-III spectrin, or spinocerebellar ataxia 5 protein (SCA5), probably plays an important role in the neuronal membrane skeleton. Mutations in SPTBN2 is associated with spinocerebellar ataxia type 5. SPTBN2 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409166 Cd Length: 132 Bit Score: 142.11 E-value: 1.49e-38
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 30 RRRVQDEQDERDRVQKKTFTKWVNKHLikhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR-EKGRMRFHKLQNV 108
Cdd:cd21317 18 RSRIKALADEREAVQKKTFTKWVNSHL-----ARVTCRIGDLYTDLRDGRMLIRLLEVLSGEQLPKpTKGRMRIHCLENV 92
|
90 100 110
....*....|....*....|....*....|....*....
gi 1920237968 109 QIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIIL 147
Cdd:cd21317 93 DKALQFLKEQKVHLENMGSHDIVDGNHRLTLGLIWTIIL 131
|
|
| CH_SPTBN4_rpt1 |
cd21318 |
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 4 (SPTBN4) ... |
32-147 |
2.29e-38 |
|
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 4 (SPTBN4) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN4, also called beta-IV spectrin, or spectrin, non-erythroid beta chain 3 (SPTBN3), is a novel spectrin isolated as an interactor of the receptor tyrosine phosphatase-like protein ICA512. Its mutation associates with congenital myopathy, neuropathy, and central deafness. SPTBN4 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409167 Cd Length: 139 Bit Score: 141.70 E-value: 2.29e-38
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 32 RVQDEQDERDRVQKKTFTKWVNKHLikhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR-EKGRMRFHKLQNVQI 110
Cdd:cd21318 27 RIKALADEREAVQKKTFTKWVNSHL-----ARVPCRINDLYTDLRDGYVLTRLLEVLSGEQLPKpTRGRMRIHSLENVDK 101
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237968 111 ALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIIL 147
Cdd:cd21318 102 ALQFLKEQRVHLENVGSHDIVDGNHRLTLGLIWTIIL 138
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1307-1882 |
9.78e-38 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 156.64 E-value: 9.78e-38
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1307 DLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEErERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRM 1386
Cdd:COG1196 226 EAELLLLKLRELEAELEELEAELEELEAELEELEAELAELE-AELEELRLELEELELELEEAQAEEYELLAELARLEQDI 304
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1387 QEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLrIEEEIRVVRLQLEATERQRGGAEGELQA 1466
Cdd:COG1196 305 ARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEE-AEAELAEAEEALLEAEAELAEAEEELEE 383
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1467 LRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEA 1546
Cdd:COG1196 384 LAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLE 463
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1547 ERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEAtrraqqqaEAERARAEAERELERWQLK 1626
Cdd:COG1196 464 LLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLL--------AGLRGLAGAVAVLIGVEAA 535
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1627 ANEALRLRLQAeevAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIR 1706
Cdd:COG1196 536 YEAALEAALAA---ALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREAD 612
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1707 LRAETEQGEqqrqLLEEELARLQREAAAATqkRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfr 1786
Cdd:COG1196 613 ARYYVLGDT----LLGRTLVAARLEAALRR--AVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELE---- 682
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1787 ELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQR 1866
Cdd:COG1196 683 ELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDL 762
|
570
....*....|....*.
gi 1920237968 1867 RLLEEQAAQHKADIEA 1882
Cdd:COG1196 763 EELERELERLEREIEA 778
|
|
| CH_SPTBN2_rpt2 |
cd21321 |
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 2 (SPTBN2) ... |
160-264 |
2.06e-37 |
|
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 2 (SPTBN2) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN2, also called beta-III spectrin, or spinocerebellar ataxia 5 protein (SCA5), probably plays an important role in the neuronal membrane skeleton. Mutations in SPTBN2 is associated with spinocerebellar ataxia type 5. SPTBN2 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409170 Cd Length: 119 Bit Score: 138.27 E-value: 2.06e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 160 SEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTR 239
Cdd:cd21321 1 KEKKSAKDALLLWCQMKTAGYPNVNVHNFTTSWRDGLAFNAIVHKHRPDLIDFETLKKSNAHYNLQNAFNVAEKELGLTK 80
|
90 100
....*....|....*....|....*
gi 1920237968 240 LLDPEDVDVPQPDEKSIITYVSSLY 264
Cdd:cd21321 81 LLDPEDVNVDQPDEKSIITYVATYY 105
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1335-2040 |
4.47e-37 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 155.68 E-value: 4.47e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1335 EEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSiqEELQHL 1414
Cdd:PTZ00121 1101 EEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKA--EDAKKA 1178
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1415 RQSSEAEIQAKARQVEAAERSRlRIEEEIRVVRLQlEATERQRGGAEGELQALRaRAEEAeaqkRQAQEEAERLRRQVQD 1494
Cdd:PTZ00121 1179 EAARKAEEVRKAEELRKAEDAR-KAEAARKAEEER-KAEEARKAEDAKKAEAVK-KAEEA----KKDAEEAKKAEEERNN 1251
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1495 ETQRKRQAEAELALRVQAEAEAAREKQRAlqalEELRlQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASF 1574
Cdd:PTZ00121 1252 EEIRKFEEARMAHFARRQAAIKAEEARKA----DELK-KAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEE 1326
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1575 AEKTAQ-LERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEAlrlRLQAEEVAQQKSLTQaeaek 1653
Cdd:PTZ00121 1327 AKKKADaAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAA---KKKAEEKKKADEAKK----- 1398
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1654 qkeeaerearrrgKAEEQAVRQRELAEQELEKQR-QLAEGTAQQRLAAEqELIRLRAETEQGEQQRQLLE-----EELAR 1727
Cdd:PTZ00121 1399 -------------KAEEDKKKADELKKAAAAKKKaDEAKKKAEEKKKAD-EAKKKAEEAKKADEAKKKAEeakkaEEAKK 1464
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1728 LQREAAAATQKRRELE----AELAKVRAEmevllASKARAEEESRSTSEKSKqrleAEAGRFRELAEEAARLRAlAEEAK 1803
Cdd:PTZ00121 1465 KAEEAKKADEAKKKAEeakkADEAKKKAE-----EAKKKADEAKKAAEAKKK----ADEAKKAEEAKKADEAKK-AEEAK 1534
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1804 RQRQLAEEDAVRQRAEAERvlAEKLAAISEatrlKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEAR 1883
Cdd:PTZ00121 1535 KADEAKKAEEKKKADELKK--AEELKKAEE----KKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMK 1608
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1884 LAQLRKASESELERQKGLVEDTLRQRrqveEEILALKGSFEKAAAGKAELELELGRIRGT-----AEDTLRSKEQA--EQ 1956
Cdd:PTZ00121 1609 AEEAKKAEEAKIKAEELKKAEEEKKK----VEQLKKKEAEEKKKAEELKKAEEENKIKAAeeakkAEEDKKKAEEAkkAE 1684
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1957 EAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEE----VERLKAKVEEARRLRERAEQESARQLQLAQEAA 2032
Cdd:PTZ00121 1685 EDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEEnkikAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKK 1764
|
....*...
gi 1920237968 2033 QKRLQAEE 2040
Cdd:PTZ00121 1765 EEEKKAEE 1772
|
|
| CH_SPTB_rpt2 |
cd21319 |
second calponin homology (CH) domain found in spectrin beta chain, erythrocytic (SPTB) and ... |
160-264 |
4.48e-37 |
|
second calponin homology (CH) domain found in spectrin beta chain, erythrocytic (SPTB) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTB, also called beta-I spectrin, may be involved in anaemia pathogenesis. SPTB contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409168 Cd Length: 112 Bit Score: 137.06 E-value: 4.48e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 160 SEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTR 239
Cdd:cd21319 1 RETRSAKDALLLWCQMKTAGYPNVNVTNFTSSWKDGLAFNALIHKHRPDLVDFGKLKKSNARHNLEHAFNVAERQLGITK 80
|
90 100
....*....|....*....|....*
gi 1920237968 240 LLDPEDVDVPQPDEKSIITYVSSLY 264
Cdd:cd21319 81 LLDPEDVFTENPDEKSIITYVVAFY 105
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1340-1941 |
1.33e-36 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 153.17 E-value: 1.33e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1340 AEQQRAEERERLAEVEAAL-EKQRQLAEAHAQAKaQAEReAQGLQrrmqeevarreevaveAQEQKRSIQEELQHLRQSs 1418
Cdd:COG1196 177 AERKLEATEENLERLEDILgELERQLEPLERQAE-KAER-YRELK----------------EELKELEAELLLLKLREL- 237
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1419 EAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQR 1498
Cdd:COG1196 238 EAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEER 317
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1499 KRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALEtaQRSAEAELQSEHASFAEKT 1578
Cdd:COG1196 318 LEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAE--AEEELEELAEELLEALRAA 395
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1579 AQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEA 1658
Cdd:COG1196 396 AELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALL 475
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1659 EREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIR------LRAETEQGEQQRQLLEEELARLQREA 1732
Cdd:COG1196 476 EAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAgavavlIGVEAAYEAALEAALAAALQNIVVED 555
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1733 AAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEED 1812
Cdd:COG1196 556 DEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAA 635
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1813 AVRQRAEAERVLAEKLAA--ISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKA 1890
Cdd:COG1196 636 LRRAVTLAGRLREVTLEGegGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEE 715
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|.
gi 1920237968 1891 SESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIR 1941
Cdd:COG1196 716 RLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELE 766
|
|
| CH_ACTN_rpt1 |
cd21214 |
first calponin homology (CH) domain found in the alpha-actinin family; The alpha-actinin (ACTN) ... |
41-147 |
1.61e-36 |
|
first calponin homology (CH) domain found in the alpha-actinin family; The alpha-actinin (ACTN) family includes alpha-actinin-1, -2, -3, and -4. They are F-actin cross-linking proteins which are thought to anchor actin to a variety of intracellular structures. ACTN1 mutations cause congenital macrothrombocytopenia. ACTN2 mutations are associated with cardiomyopathies, as well as skeletal muscle disorder. ACTN3 is critical in anchoring the myofibrillar actin filaments and plays a key role in muscle contraction. ACTN4 is associated with cell motility and cancer invasion. It is probably involved in vesicular trafficking via its association with the CART complex, which is necessary for efficient transferrin receptor recycling but not for epidermal growth factor receptor (EGFR) degradation. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409063 Cd Length: 105 Bit Score: 135.21 E-value: 1.61e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 41 DRVQKKTFTKWVNKHLIKhwraeAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR-EKGRMRFHKLQNVQIALDYLRHRQ 119
Cdd:cd21214 3 EKQQRKTFTAWCNSHLRK-----AGTQIENIEEDFRDGLKLMLLLEVISGERLPKpERGKMRFHKIANVNKALDFIASKG 77
|
90 100
....*....|....*....|....*...
gi 1920237968 120 VKLVNIRNDDIADGNPKLTLGLIWTIIL 147
Cdd:cd21214 78 VKLVSIGAEEIVDGNLKMTLGMIWTIIL 105
|
|
| CH_SpAIN1-like_rpt1 |
cd21215 |
first calponin homology (CH) domain found in Schizosaccharomyces pombe alpha-actinin-like ... |
43-149 |
2.28e-36 |
|
first calponin homology (CH) domain found in Schizosaccharomyces pombe alpha-actinin-like protein 1 and similar proteins; Schizosaccharomyces pombe alpha-actinin-like protein 1 (SpAIN1) binds to actin and is involved in actin-ring formation and organization. It plays a role in cytokinesis and is involved in septation. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409064 Cd Length: 107 Bit Score: 134.84 E-value: 2.28e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 43 VQKKTFTKWVNKHLikhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR--EKGRMRFHKLQNVQIALDYLRHRQV 120
Cdd:cd21215 4 VQKKTFTKWLNTKL-----SSRGLSITDLVTDLSDGVRLIQLLEIIGDESLGRynKNPKMRVQKLENVNKALEFIKSRGV 78
|
90 100
....*....|....*....|....*....
gi 1920237968 121 KLVNIRNDDIADGNPKLTLGLIWTIILHF 149
Cdd:cd21215 79 KLTNIGAEDIVDGNLKLILGLLWTLILRF 107
|
|
| CH_DMD_rpt1 |
cd21231 |
first calponin homology (CH) domain found in dystrophin and similar proteins; Dystrophin, ... |
38-151 |
6.88e-36 |
|
first calponin homology (CH) domain found in dystrophin and similar proteins; Dystrophin, encoded by the DMD gene, is a large, submembrane cytoskeletal protein that is the main component of the dystrophin-glycoprotein complex (DGC) in skeletal muscles. It links the transmembrane DGC to the actin cytoskeleton through binding strongly to the cytoplasmic tail of beta-dystroglycan, the transmembrane subunit of a highly O-glycosylated cell-surface protein. It is involved in maintaining the structural integrity of cells, as well as in the formation of the blood-brain barrier (BBB). Mutations in dystrophin lead to Duchenne muscular dystrophy (DMD). Moreover, dystrophin deficiency is associated with abnormal cerebral diffusion and perfusion, as well as in acute Trypanosoma cruzi infection. The dystrophin subfamily has been characterized by a compact cluster of domains comprising four EF-hand-like motifs and a ZZ-domain, followed by a looser region with two coiled-coils. These domains are believed to be involved in protein-protein interactions. In addition, dystrophin contains two syntrophin binding sites (SBSs) and a long N-terminal extension that comprises two actin-binding calponin homology (CH) domains, approximately 24 spectrin repeats (SRs) and a WW domain. This model corresponds to the first CH domain.
Pssm-ID: 409080 Cd Length: 111 Bit Score: 133.51 E-value: 6.88e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 38 DERDRVQKKTFTKWVNKHLIKHWRaeaqRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRH 117
Cdd:cd21231 1 YEREDVQKKTFTKWINAQFAKFGK----PPIEDLFTDLQDGRRLLELLEGLTGQKLVKEKGSTRVHALNNVNKALQVLQK 76
|
90 100 110
....*....|....*....|....*....|....
gi 1920237968 118 RQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQI 151
Cdd:cd21231 77 NNVDLVNIGSADIVDGNHKLTLGLIWSIILHWQV 110
|
|
| CH_SPTBN5_rpt2 |
cd21249 |
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 5 (SPTBN5) ... |
163-264 |
1.17e-35 |
|
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 5 (SPTBN5) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN5, also called beta-V spectrin, is a mammalian ortholog of Drosophila beta H spectrin that may play a crucial role as a longer actin-membrane cross-linker or to fulfill the need for greater extensible flexibility than can be provided by the other smaller conventional spectrins. SPTBN5 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409098 Cd Length: 109 Bit Score: 132.68 E-value: 1.17e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 163 MTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 242
Cdd:cd21249 3 RSAKEALLIWCQRKTAGYTNVNVQDFSRSWRDGLAFNALIHAHRPDLIDYGSLRPDRPLYNLANAFLVAEQELGISQLLD 82
|
90 100
....*....|....*....|..
gi 1920237968 243 PEDVDVPQPDEKSIITYVSSLY 264
Cdd:cd21249 83 PEDVAVPHPDERSIMTYVSLYY 104
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1467-2067 |
2.94e-35 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 148.55 E-value: 2.94e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1467 LRARAEEAEAQKRQAQEEAERL---RRQVqdETQR---KRQAE-AELALRVQAEAEAaREKQRALQALEELRLQAEEAER 1539
Cdd:COG1196 170 YKERKEEAERKLEATEENLERLediLGEL--ERQLeplERQAEkAERYRELKEELKE-LEAELLLLKLRELEAELEELEA 246
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1540 RLRQAEAERARqvqvaLETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAvvQLREEATRRAQQQAEAERARAEAERE 1619
Cdd:COG1196 247 ELEELEAELEE-----LEAELAELEAELEELRLELEELELELEEAQAEEYEL--LAELARLEQDIARLEERRRELEERLE 319
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1620 LERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEaerearrrgKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLA 1699
Cdd:COG1196 320 ELEEELAELEEELEELEEELEELEEELEEAEEELEEAE---------AELAEAEEALLEAEAELAEAEEELEELAEELLE 390
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1700 AEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLE 1779
Cdd:COG1196 391 ALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLE 470
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1780 AEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLA----AISEATRLKTEAEIALKEKEAENERL 1855
Cdd:COG1196 471 EAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRglagAVAVLIGVEAAYEAALEAALAAALQN 550
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1856 RRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELEL 1935
Cdd:COG1196 551 IVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAA 630
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1936 ELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRE 2015
Cdd:COG1196 631 RLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELA 710
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 2016 RAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLER 2067
Cdd:COG1196 711 EAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDL 762
|
|
| CH_DMD-like_rpt2 |
cd21187 |
second calponin homology (CH) domain found in the dystrophin family; The dystrophin family ... |
167-268 |
1.23e-34 |
|
second calponin homology (CH) domain found in the dystrophin family; The dystrophin family includes dystrophin and its paralog, utrophin. Dystrophin, encoded by the DMD gene, is a large, submembrane cytoskeletal protein that is the main component of the dystrophin-glycoprotein complex (DGC) in skeletal muscles. It links the transmembrane DGC to the actin cytoskeleton through binding strongly to the cytoplasmic tail of beta-dystroglycan, the transmembrane subunit of a highly O-glycosylated cell-surface protein. Dystrophin is also involved in maintaining the structural integrity of cells, as well as in the formation of the blood-brain barrier (BBB). Utrophin, also called dystrophin-related protein 1 (DRP-1), is an autosomal dystrophin homolog that increases dystrophic muscle function and reduces pathology. It is broadly expressed in both the mRNA and protein levels, and occurs in the cerebrovascular endothelium. Utrophin forms the utrophin-glycoprotein complex (UGC) by interacting with dystroglycans (DGs) and sarcoglycan-dystroglycans, as well as sarcoglycan and sarcospan (SG-SSPN) subcomplexes. It may act as a scaffolding protein that stabilizes lipid microdomains and clusters mechanosensitive channel subunits, and link the F-actin cytoskeleton to the cell membrane via the associated glycoprotein complex. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409036 Cd Length: 104 Bit Score: 129.47 E-value: 1.23e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 167 EKLLL-WSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 245
Cdd:cd21187 2 EKTLLaWCRQSTRGYEQVDVKNFTTSWRDGLAFNALIHRHRPDLFDFDSLVKDSPESRLEHAFTVAHEHLGIEKLLDPED 81
|
90 100
....*....|....*....|...
gi 1920237968 246 VDVPQPDEKSIITYVSSLYDAMP 268
Cdd:cd21187 82 VNVEQPDKKSILMYVTSLFQVLP 104
|
|
| CH_SPTBN4_rpt2 |
cd21322 |
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 4 (SPTBN4) ... |
148-264 |
1.57e-34 |
|
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 4 (SPTBN4) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN4, also called beta-IV spectrin, or spectrin, non-erythroid beta chain 3 (SPTBN3), is a novel spectrin isolated as an interactor of the receptor tyrosine phosphatase-like protein ICA512. Its mutation associates with congenital myopathy, neuropathy, and central deafness. SPTBN4 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409171 Cd Length: 130 Bit Score: 130.56 E-value: 1.57e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 148 HFQISDIQVSGQSEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQA 227
Cdd:cd21322 1 QIQVIKIETEDNRETRSAKDALLLWCQMKTAGYPEVNIQNFTTSWRDGLAFNALIHRHRPDLIDFSKLTKSNATYNLQQA 80
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237968 228 FSVAERDLGVTRLLDPEDVDVPQPDEKSIITYVSSLY 264
Cdd:cd21322 81 FNTAEQHLGLTKLLDPEDVNMEAPDEKSIITYVVSFY 117
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1670-2238 |
2.37e-34 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 145.85 E-value: 2.37e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1670 EQAVRQRELAEQELEKQRQLAegtAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV 1749
Cdd:COG1196 210 EKAERYRELKEELKELEAELL---LLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEA 286
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1750 RAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQlAEEDAVRQRAEAERVLAEKLA 1829
Cdd:COG1196 287 QAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEE-ELEEAEEELEEAEAELAEAEE 365
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1830 AISEATRLKTEAEIALKEKEAENERLRRlaedEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQR 1909
Cdd:COG1196 366 ALLEAEAELAEAEEELEELAEELLEALR----AAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEE 441
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1910 RQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEA 1989
Cdd:COG1196 442 EALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRG 521
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1990 ARQRKAALEEVERLKAKVEEAR---RLRERAEQESARQLQLAQEAAQKRL-----QAEEKAHAFAVQQKEQELQQTLQQE 2061
Cdd:COG1196 522 LAGAVAVLIGVEAAYEAALEAAlaaALQNIVVEDDEVAAAAIEYLKAAKAgratfLPLDKIRARAALAAALARGAIGAAV 601
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2062 QSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQA 2141
Cdd:COG1196 602 DLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAEL 681
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2142 EQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFS 2221
Cdd:COG1196 682 EELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPD 761
|
570
....*....|....*..
gi 1920237968 2222 LRVQMEELGKLKARIEA 2238
Cdd:COG1196 762 LEELERELERLEREIEA 778
|
|
| CH_SPTBN1_rpt2 |
cd21320 |
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 1 (SPTBN1) ... |
164-264 |
5.65e-32 |
|
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 1 (SPTBN1) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN1, also called beta-II spectrin, fodrin beta chain, or spectrin, non-erythroid beta chain 1, is also a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. SPTBN1 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409169 Cd Length: 108 Bit Score: 122.13 E-value: 5.65e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 164 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 243
Cdd:cd21320 2 SAKDALLLWCQMKTAGYPNVNIHNFTTSWRDGMAFNALIHKHRPDLIDFDKLKKSNAHYNLQNAFNLAEQHLGLTKLLDP 81
|
90 100
....*....|....*....|.
gi 1920237968 244 EDVDVPQPDEKSIITYVSSLY 264
Cdd:cd21320 82 EDISVDHPDEKSIITYVVTYY 102
|
|
| CH_UTRN_rpt1 |
cd21232 |
first calponin homology (CH) domain found in utrophin and similar proteins; Utrophin, also ... |
43-151 |
9.48e-32 |
|
first calponin homology (CH) domain found in utrophin and similar proteins; Utrophin, also called dystrophin-related protein 1 (DRP-1), is an autosomal dystrophin homolog that increases dystrophic muscle function and reduces pathology. It is broadly expressed in both the mRNA and protein levels, and occurs in the cerebrovascular endothelium. Utrophin forms the utrophin-glycoprotein complex (UGC) by interacting with dystroglycans (DGs) and sarcoglycan-dystroglycans, as well as sarcoglycan and sarcospan (SG-SSPN) subcomplexes. It may act as a scaffolding protein that stabilizes lipid microdomains and clusters mechanosensitive channel subunits, and link the F-actin cytoskeleton to the cell membrane via the associated glycoprotein complex. Like dystrophin, utrophin has a compact cluster of domains comprising four EF-hand-like motifs and a ZZ-domain, followed by a looser region with two coiled-coils. These domains are believed to be involved in protein-protein interactions. In addition, it contains two syntrophin binding sites (SBSs) and a long N-terminal extension that comprises two actin-binding calponin homology (CH) domains, up to 24 spectrin repeats (SRs), and a WW domain. However, utrophin lacks the intrinsic microtubule binding activity of dystrophin SRs. This model corresponds to the first CH domain.
Pssm-ID: 409081 Cd Length: 107 Bit Score: 121.65 E-value: 9.48e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 43 VQKKTFTKWVNKHLIKhwraEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRHRQVKL 122
Cdd:cd21232 2 VQKKTFTKWINARFSK----SGKPPIKDMFTDLRDGRKLLDLLEGLTGKSLPKERGSTRVHALNNVNRVLQVLHQNNVEL 77
|
90 100
....*....|....*....|....*....
gi 1920237968 123 VNIRNDDIADGNPKLTLGLIWTIILHFQI 151
Cdd:cd21232 78 VNIGGTDIVDGNHKLTLGLLWSIILHWQV 106
|
|
| CH_dFLNA-like_rpt1 |
cd21311 |
first calponin homology (CH) domain found in Drosophila melanogaster filamin-A (dFLNA) and ... |
42-152 |
1.26e-31 |
|
first calponin homology (CH) domain found in Drosophila melanogaster filamin-A (dFLNA) and similar proteins; Drosophila melanogaster filamin-A (dFLNA or dFLN-A), also called actin-binding protein 280 (ABP-280) or filamin-1, is involved in germline ring canal formation. It may tether actin microfilaments within the ovarian ring canal to the cell membrane and contributes to actin microfilament organization. dFLNA contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409160 Cd Length: 124 Bit Score: 121.79 E-value: 1.26e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 42 RVQKKTFTKWVNKHLIKhwraeAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGR--MRFHKLQNVQIALDYLRHRQ 119
Cdd:cd21311 14 RIQQNTFTRWANEHLKT-----ANKHIADLETDLSDGLRLIALVEVLSGKKFPKFNKRptFRSQKLENVSVALKFLEEDE 88
|
90 100 110
....*....|....*....|....*....|....
gi 1920237968 120 -VKLVNIRNDDIADGNPKLTLGLIWTIILHFQIS 152
Cdd:cd21311 89 gIKIVNIDSSDIVDGKLKLILGLIWTLILHYSIS 122
|
|
| CH_jitterbug-like_rpt1 |
cd21227 |
first calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and ... |
43-151 |
1.38e-31 |
|
first calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and similar proteins; Protein jitterbug (Jbug) is an actin-meshwork organizing protein. It is required to maintain the shape and cell orientation of the Drosophila notum epithelium during flight muscle attachment to tendon cells. Jbug contains three copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409076 Cd Length: 109 Bit Score: 121.24 E-value: 1.38e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 43 VQKKTFTKWVNKHLikhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR--EKGRMRFHKLQNVQIALDYLRHRQV 120
Cdd:cd21227 4 IQKNTFTNWVNEQL-----KPTGMSVEDLATDLEDGVKLIALVEILQGRKLGRviKKPLNQHQKLENVTLALKAMAEDGI 78
|
90 100 110
....*....|....*....|....*....|.
gi 1920237968 121 KLVNIRNDDIADGNPKLTLGLIWTIILHFQI 151
Cdd:cd21227 79 KLVNIGNEDIVNGNLKLILGLIWHLILRYQI 109
|
|
| CH_SPTBN1_rpt1 |
cd21316 |
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 1 (SPTBN1) ... |
30-147 |
2.81e-31 |
|
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 1 (SPTBN1) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN1, also called beta-II spectrin, fodrin beta chain, or spectrin, non-erythroid beta chain 1, is also a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. SPTBN1 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409165 Cd Length: 154 Bit Score: 122.07 E-value: 2.81e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 30 RRRVQDEQDERDRVQKKTFTKWVNKHLikhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR-EKGRMRFHKLQNV 108
Cdd:cd21316 40 RSRIKALADEREAVQKKTFTKWVNSHL-----ARVSCRITDLYMDLRDGRMLIKLLEVLSGERLPKpTKGRMRIHCLENV 114
|
90 100 110
....*....|....*....|....*....|....*....
gi 1920237968 109 QIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIIL 147
Cdd:cd21316 115 DKALQFLKEQRVHLENMGSHDIVDGNHRLTLGLIWTIIL 153
|
|
| CH_SYNE-like_rpt2 |
cd21192 |
second calponin homology (CH) domain found in the synaptic nuclear envelope protein (SYNE) ... |
163-261 |
4.14e-31 |
|
second calponin homology (CH) domain found in the synaptic nuclear envelope protein (SYNE) family; The SYNE family includes SYNE-1, -2 and calmin. SYNE-1 (also called nesprin-1, enaptin, KASH domain-containing protein 1, KASH1, myocyte nuclear envelope protein 1, MYNE-1, or nuclear envelope spectrin repeat protein 1) and SYNE-2 (also called nesprin-2, KASH domain-containing protein 2, KASH2, nuclear envelope spectrin repeat protein 2, nucleus and actin connecting element protein, or protein NUANCE) may act redundantly. They are multi-isomeric modular proteins which form a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. They also act as components of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. Calmin, also called calponin-like transmembrane domain protein, is a protein with calponin homology (CH) and transmembrane domains expressed in maturing spermatogenic cells. It may be involved in the development and/or maintenance of neuronal functions. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409041 Cd Length: 107 Bit Score: 119.84 E-value: 4.14e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 163 MTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 242
Cdd:cd21192 2 GSAEKALLKWVQAEIGKYYGIRVTDFDKSWRDGVAFLALIHAIRPDLVDMKTVKNRSPRDNLELAFRIAEQHLNIPRLLE 81
|
90
....*....|....*....
gi 1920237968 243 PEDVDVPQPDEKSIITYVS 261
Cdd:cd21192 82 VEDVLVDKPDERSIMTYVS 100
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1328-1932 |
6.82e-31 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 135.27 E-value: 6.82e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1328 ETLRRMEEEERLAEQQRAE------------ERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREE 1395
Cdd:PTZ00121 1209 EEERKAEEARKAEDAKKAEavkkaeeakkdaEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEE 1288
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1396 V--AVEAQ--EQKRSIQEELQHLRQSSEAEiQAKARQVEA---AERSRLRIEEEirvvRLQLEATERQRGGAEGELQALR 1468
Cdd:PTZ00121 1289 KkkADEAKkaEEKKKADEAKKKAEEAKKAD-EAKKKAEEAkkkADAAKKKAEEA----KKAAEAAKAEAEAAADEAEAAE 1363
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1469 ARAEEAEAQKRQAQEEAERLRRQVQ-----DETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRlQAEEAERRLRQ 1543
Cdd:PTZ00121 1364 EKAEAAEKKKEEAKKKADAAKKKAEekkkaDEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKK-KADEAKKKAEE 1442
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1544 A--------EAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLR--EEATRRAQQQAEAERAR 1613
Cdd:PTZ00121 1443 AkkadeakkKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKkaAEAKKKADEAKKAEEAK 1522
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1614 AEAERELERWQLKANEALRlrlqAEEVAQQKSLTQAEAEKQKEEAEREARRRgKAEEqavrQRELAEQELEKQRQLAEGT 1693
Cdd:PTZ00121 1523 KADEAKKAEEAKKADEAKK----AEEKKKADELKKAEELKKAEEKKKAEEAK-KAEE----DKNMALRKAEEAKKAEEAR 1593
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1694 AQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEK 1773
Cdd:PTZ00121 1594 IEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEED 1673
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1774 SKQRLEAEagrfRELAEEAARLRALAEEAKRQRQLAEedaVRQRAEAERVLAEKLAAISEATRLKteAEIALKEKEAENE 1853
Cdd:PTZ00121 1674 KKKAEEAK----KAEEDEKKAAEALKKEAEEAKKAEE---LKKKEAEEKKKAEELKKAEEENKIK--AEEAKKEAEEDKK 1744
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 1854 RLRRLAEDEAFQRRLleeqaAQHKADIEARLAQLRKASESELErqKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAE 1932
Cdd:PTZ00121 1745 KAEEAKKDEEEKKKI-----AHLKKEEEKKAEEIRKEKEAVIE--EELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKE 1816
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1328-1960 |
9.96e-31 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 134.88 E-value: 9.96e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1328 ETLRRMEEEERLAEQQRAEERERLAEVEAALE--KQRQLAEAHAQAKAQAEREAQGLQR----------RMQEEVARREE 1395
Cdd:PTZ00121 1161 EDARKAEEARKAEDAKKAEAARKAEEVRKAEElrKAEDARKAEAARKAEEERKAEEARKaedakkaeavKKAEEAKKDAE 1240
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1396 VAVEAQEQKRSIQ-EELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRlQLEATERQRGGAEGELQALRAR-AEE 1473
Cdd:PTZ00121 1241 EAKKAEEERNNEEiRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKAD-EAKKAEEKKKADEAKKKAEEAKkADE 1319
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1474 AEAQKRQAQEEAERLRRQVQdETQRKRQAEAELALRVQAEAEAAREKQRALQ-ALEELRLQAEEAERRlrqaeAERARQV 1552
Cdd:PTZ00121 1320 AKKKAEEAKKKADAAKKKAE-EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEkKKEEAKKKADAAKKK-----AEEKKKA 1393
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1553 QVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARaeaerelerwqlKANEALR 1632
Cdd:PTZ00121 1394 DEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAE------------EAKKAEE 1461
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1633 LRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQElIRLRAETE 1712
Cdd:PTZ00121 1462 AKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEE-AKKADEAK 1540
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1713 QGEQQR---QLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEaEAGRFRELA 1789
Cdd:PTZ00121 1541 KAEEKKkadELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAE-EAKKAEEAK 1619
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1790 EEAARLRALAEEAKRQRQLAEEDA--------VRQRAEAERVLAEKLAAISEATrlKTEAEIALKEKEAENERLRRLAED 1861
Cdd:PTZ00121 1620 IKAEELKKAEEEKKKVEQLKKKEAeekkkaeeLKKAEEENKIKAAEEAKKAEED--KKKAEEAKKAEEDEKKAAEALKKE 1697
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1862 EAFQRRLleEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVeEEILALKGSFEKAAAGKAELELELGRIR 1941
Cdd:PTZ00121 1698 AEEAKKA--EELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKA-EEAKKDEEEKKKIAHLKKEEEKKAEEIR 1774
|
650
....*....|....*....
gi 1920237968 1942 GTAEDTLRSKEQAEQEAAR 1960
Cdd:PTZ00121 1775 KEKEAVIEEELDEEDEKRR 1793
|
|
| CH_SYNE2_rpt2 |
cd21244 |
second calponin homology (CH) domain found in synaptic nuclear envelope protein 2 (SYNE-2) and ... |
163-261 |
1.60e-30 |
|
second calponin homology (CH) domain found in synaptic nuclear envelope protein 2 (SYNE-2) and similar proteins; SYNE-2, also called nesprin-2, KASH domain-containing protein 2 (KASH2), nuclear envelope spectrin repeat protein 2, nucleus and actin connecting element protein, or protein NUANCE, is a multi-isomeric modular protein which forms a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. SYNE-2 also acts as a component of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. SYNE-2 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409093 Cd Length: 109 Bit Score: 118.01 E-value: 1.60e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 163 MTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 242
Cdd:cd21244 4 MSARKALLLWAQEQCAKVGSISVTDFKSSWRNGLAFLAIIHALRPGLVDMEKLKGRSNRENLEEAFRIAEQELKIPRLLE 83
|
90
....*....|....*....
gi 1920237968 243 PEDVDVPQPDEKSIITYVS 261
Cdd:cd21244 84 PEDVDVVNPDEKSIMTYVA 102
|
|
| CH_SpAIN1-like_rpt2 |
cd21291 |
second calponin homology (CH) domain found in Schizosaccharomyces pombe alpha-actinin-like ... |
151-266 |
1.60e-30 |
|
second calponin homology (CH) domain found in Schizosaccharomyces pombe alpha-actinin-like protein 1 and similar proteins; Schizosaccharomyces pombe alpha-actinin-like protein 1 (SpAIN1) binds to actin and is involved in actin-ring formation and organization. It plays a role in cytokinesis and is involved in septation. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409140 Cd Length: 115 Bit Score: 118.40 E-value: 1.60e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 151 ISDIQvsgqSEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSV 230
Cdd:cd21291 1 IADIN----EEGLTAKEGLLLWCQRKTAGYDEVDVQDFTTSWTDGLAFCALIHRHRPDLIDYDKLDKKDHRGNMQLAFDI 76
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237968 231 AERDLGVTRLLDPEDV-DVPQPDEKSIITYVSSLYDA 266
Cdd:cd21291 77 ASKEIGIPQLLDVEDVcDVAKPDERSIMTYVAYYFHA 113
|
|
| CH_UTRN_rpt2 |
cd21234 |
second calponin homology (CH) domain found in utrophin and similar proteins; Utrophin, also ... |
167-268 |
1.80e-30 |
|
second calponin homology (CH) domain found in utrophin and similar proteins; Utrophin, also called dystrophin-related protein 1 (DRP-1), is an autosomal dystrophin homolog that increases dystrophic muscle function and reduces pathology. It is broadly expressed in both the mRNA and protein levels, and occurs in the cerebrovascular endothelium. Utrophin forms the utrophin-glycoprotein complex (UGC) by interacting with dystroglycans (DGs) and sarcoglycan-dystroglycans, as well as sarcoglycan and sarcospan (SG-SSPN) subcomplexes. It may act as a scaffolding protein that stabilizes lipid microdomains and clusters mechanosensitive channel subunits, and link the F-actin cytoskeleton to the cell membrane via the associated glycoprotein complex. Like dystrophin, utrophin has a compact cluster of domains comprising four EF-hand-like motifs and a ZZ-domain, followed by a looser region with two coiled-coils. These domains are believed to be involved in protein-protein interactions. In addition, it contains two syntrophin binding sites (SBSs) and a long N-terminal extension that comprises two actin-binding calponin homology (CH) domains, up to 24 spectrin repeats (SRs), and a WW domain. However, utrophin lacks the intrinsic microtubule binding activity of dystrophin SRs. This model corresponds to the second CH domain.
Pssm-ID: 409083 [Multi-domain] Cd Length: 104 Bit Score: 117.75 E-value: 1.80e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 167 EKLLL-WSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 245
Cdd:cd21234 2 EKILLsWVRQSTRPYSQVNVLNFTTSWTDGLAFNAVLHRHKPDLFSWDKVVKMSPVERLEHAFSKAKNHLGIEKLLDPED 81
|
90 100
....*....|....*....|...
gi 1920237968 246 VDVPQPDEKSIITYVSSLYDAMP 268
Cdd:cd21234 82 VAVQLPDKKSIIMYLTSLFEVLP 104
|
|
| CH_CLMN_rpt1 |
cd21191 |
first calponin homology (CH) domain found in calmin and similar proteins; Calmin, also called ... |
39-153 |
4.03e-30 |
|
first calponin homology (CH) domain found in calmin and similar proteins; Calmin, also called calponin-like transmembrane domain protein, is a protein with calponin homology (CH) and transmembrane domains expressed in maturing spermatogenic cells. It may be involved in the development and/or maintenance of neuronal functions. Calmin contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409040 Cd Length: 114 Bit Score: 117.30 E-value: 4.03e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 39 ERDRVQKKTFTKWVNKHLIKhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRE--KGRMRFHKLQNVQIALDYLR 116
Cdd:cd21191 1 ERENVQKRTFTRWINLHLEK---CNPPLEVKDLFVDIQDGKILMALLEVLSGQNLLQEykPSSHRIFRLNNIAKALKFLE 77
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237968 117 HRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISD 153
Cdd:cd21191 78 DSNVKLVSIDAAEIADGNPSLVLGLIWNIILFFQIKE 114
|
|
| Spectrin_like |
pfam18373 |
Spectrin like domain; Desmoplakin (DP) is an integral part of desmosomes, where it links ... |
883-960 |
4.18e-30 |
|
Spectrin like domain; Desmoplakin (DP) is an integral part of desmosomes, where it links desmosomal cadherins to the intermediate filaments. The N-terminal region of DP contains a plakin domain common to members of the plakin family. Plakin domains contain multiple copies of spectrin repeats (SRs) pfam00435. Spectrin repeats (SRs) consist of three alpha-helices (A, B, and C) that form an antiparallel triple-helical bundle. This entry describes SR6 which has a divergent structure relative to the other SRs. SR6 shows significant deviations in helices A and B where they are significantly shorter than in other repeats. Structural comparison revealed that SR6 is more similar to other three-helix-bundle proteins, including target of Myb1 and the syntaxin Habc domain, than to other SR proteins. Due to these differences with other spectrin repeats, this region is termed spectrin-like repeat.
Pssm-ID: 465730 Cd Length: 78 Bit Score: 115.78 E-value: 4.18e-30
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237968 883 LAWQSLGRDMQLIRSWSLATFRTLKPEEQRQALRSLELHYQAFLRDSQDAGGFGPEDRLQAEREYGSCSRHYQQLLQS 960
Cdd:pfam18373 1 VSWQYLLKDIQRINSWTISMLKTMRPEEYRQVLKNLETHYQDFLRDSQESEMFGAEDRRQLEREVNSAQQHYQTLLVS 78
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1737-2448 |
1.02e-29 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 130.83 E-value: 1.02e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1737 QKRRELEAELAKVRAEMEvllaskaRAEEEsrsTSEKSKQ--RLEAEAgrfrELAEEAARLRALAEEAKRQRQLAEedav 1814
Cdd:COG1196 172 ERKEEAERKLEATEENLE-------RLEDI---LGELERQlePLERQA----EKAERYRELKEELKELEAELLLLK---- 233
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1815 RQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAqhkadiEARLAQLRKASESE 1894
Cdd:COG1196 234 LRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYEL------LAELARLEQDIARL 307
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1895 LERQkglvEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRRE 1974
Cdd:COG1196 308 EERR----RELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEE 383
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1975 AEERVQKSLAAEEEAARQRKAALEEVERLKAkvEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQEL 2054
Cdd:COG1196 384 LAEELLEALRAAAELAAQLEELEEAEEALLE--RLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEAL 461
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2055 QQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQE 2134
Cdd:COG1196 462 LELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALE 541
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2135 AARRAQAEQAALRQKQAAdaemekhkqfAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRgq 2214
Cdd:COG1196 542 AALAAALQNIVVEDDEVA----------AAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLR-- 609
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2215 vEEELFSLRVQMEELGklkarieaenRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQR 2294
Cdd:COG1196 610 -EADARYYVLGDTLLG----------RTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAE 678
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2295 ALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERL 2374
Cdd:COG1196 679 AELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPE 758
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2375 RLRVAEMSRAQARAEEDARRFRK-------QAEDIGERLyrTELATQekvmlVQTLETQRqqsdrdaERLREAIAELEHE 2447
Cdd:COG1196 759 PPDLEELERELERLEREIEALGPvnllaieEYEELEERY--DFLSEQ-----REDLEEAR-------ETLEEAIEEIDRE 824
|
.
gi 1920237968 2448 K 2448
Cdd:COG1196 825 T 825
|
|
| CH_DMD_rpt2 |
cd21233 |
second calponin homology (CH) domain found in dystrophin and similar proteins; Dystrophin, ... |
167-269 |
3.26e-29 |
|
second calponin homology (CH) domain found in dystrophin and similar proteins; Dystrophin, encoded by the DMD gene, is a large, submembrane cytoskeletal protein that is the main component of the dystrophin-glycoprotein complex (DGC) in skeletal muscles. It links the transmembrane DGC to the actin cytoskeleton through binding strongly to the cytoplasmic tail of beta-dystroglycan, the transmembrane subunit of a highly O-glycosylated cell-surface protein. It is involved in maintaining the structural integrity of cells, as well as in the formation of the blood-brain barrier (BBB). Mutations in dystrophin lead to Duchenne muscular dystrophy (DMD). Moreover, dystrophin deficiency is associated with abnormal cerebral diffusion and perfusion, as well as in acute Trypanosoma cruzi infection. The dystrophin subfamily has been characterized by a compact cluster of domains comprising four EF-hand-like motifs and a ZZ-domain, followed by a looser region with two coiled-coils. These domains are believed to be involved in protein-protein interactions. In addition, dystrophin contains two syntrophin binding sites (SBSs) and a long N-terminal extension that comprises two actin-binding calponin homology (CH) domains, approximately 24 spectrin repeats (SRs) and a WW domain. The model corresponds to the second CH domain.
Pssm-ID: 409082 Cd Length: 111 Bit Score: 114.64 E-value: 3.26e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 167 EKLLL-WSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTN-LENLDQAFSVAERDLGVTRLLDPE 244
Cdd:cd21233 2 EKILLsWVRQSTRNYPQVNVINFTSSWSDGLAFNALIHSHRPDLFDWNSVVSQQSaTERLDHAFNIARQHLGIEKLLDPE 81
|
90 100
....*....|....*....|....*
gi 1920237968 245 DVDVPQPDEKSIITYVSSLYDAMPR 269
Cdd:cd21233 82 DVATAHPDKKSILMYVTSLFQVLPQ 106
|
|
| CH_MICALL2 |
cd21253 |
calponin homology (CH) domain found in MICAL-like protein 2 and similar proteins; MICAL-like ... |
169-264 |
3.44e-29 |
|
calponin homology (CH) domain found in MICAL-like protein 2 and similar proteins; MICAL-like protein 2 (MICAL-L2), also called junctional Rab13-binding protein (JRAB), or molecule interacting with CasL-like 2, acts as an effector of small Rab GTPases which is involved in junctional complexes assembly through the regulation of cell adhesion molecule transport to the plasma membrane, and actin cytoskeleton reorganization. It regulates the endocytic recycling of occludins, claudins, and E-cadherin to the plasma membrane and may thereby regulate the establishment of tight junctions and adherens junctions. Members of this subfamily contain a single copy of CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409102 Cd Length: 106 Bit Score: 113.98 E-value: 3.44e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 169 LLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED-VD 247
Cdd:cd21253 6 LQQWCRQQTEGYRDVKVTNMTTSWRDGLAFCAIIHRFRPDLIDFDSLSKENVYENNKLAFTVAEKELGIPALLDAEDmVA 85
|
90
....*....|....*..
gi 1920237968 248 VPQPDEKSIITYVSSLY 264
Cdd:cd21253 86 LKVPDKLSILTYVSQYY 102
|
|
| CH_FLN-like_rpt1 |
cd21183 |
first calponin homology (CH) domain found in the filamin family; The filamin family includes ... |
42-149 |
1.45e-28 |
|
first calponin homology (CH) domain found in the filamin family; The filamin family includes filamin-A (FLN-A), filamin-B (FLN-B) and filamin-C (FLN-C). Filamins function to anchor various transmembrane proteins to the actin cytoskeleton. FLN-A is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-B is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton and may also promote orthogonal branching of actin filaments as well as link actin filaments to membrane glycoproteins. FLN-C, also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. This family also includes Drosophila melanogaster protein jitterbug (Jbug), which is an actin-meshwork organizing protein containing three copies of the CH domain. Other members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409032 Cd Length: 108 Bit Score: 112.57 E-value: 1.45e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 42 RVQKKTFTKWVNKHLikhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR---EKGRMRFHKLQNVQIALDYLRHR 118
Cdd:cd21183 3 RIQANTFTRWCNEHL-----KERGMQIHDLATDFSDGLCLIALLENLSTRPLKRsynRRPAFQQHYLENVSTALKFIEAD 77
|
90 100 110
....*....|....*....|....*....|.
gi 1920237968 119 QVKLVNIRNDDIADGNPKLTLGLIWTIILHF 149
Cdd:cd21183 78 HIKLVNIGSGDIVNGNIKLILGLIWTLILHY 108
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1183-1801 |
3.24e-28 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 125.82 E-value: 3.24e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1183 QTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVpLANSQAVREQLRQEKALLED-IERHGEKveecqrfa 1261
Cdd:COG1196 219 KEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAE-LAELEAELEELRLELEELELeLEEAQAE-------- 289
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1262 kqyinaikdyELQLVTYKAQLEPVASPAKkpkvqsgsESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEEERLAE 1341
Cdd:COG1196 290 ----------EYELLAELARLEQDIARLE--------ERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEE 351
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1342 QQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAqglqrrmqeEVARREEVAVEAQEQKRSIQEELQHLRQSSEAE 1421
Cdd:COG1196 352 ELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELL---------EALRAAAELAAQLEELEEAEEALLERLERLEEE 422
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1422 IQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRqvQDETQRKRQ 1501
Cdd:COG1196 423 LEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAA--RLLLLLEAE 500
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1502 AEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAE----K 1577
Cdd:COG1196 501 ADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATflplD 580
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1578 TAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEE 1657
Cdd:COG1196 581 KIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGG 660
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1658 AEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQ 1737
Cdd:COG1196 661 SLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEE 740
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 1738 KRRELEAELAKVRAEMEvllaskaraEEESRSTSEKSKQRLEAEAGRF--------RELAEEAARLRALAEE 1801
Cdd:COG1196 741 LLEEEELLEEEALEELP---------EPPDLEELERELERLEREIEALgpvnllaiEEYEELEERYDFLSEQ 803
|
|
| CH_ACTN4_rpt2 |
cd21290 |
second calponin homology (CH) domain found in alpha-actinin-4; Alpha-actinin-4 (ACTN4), also ... |
149-266 |
3.49e-28 |
|
second calponin homology (CH) domain found in alpha-actinin-4; Alpha-actinin-4 (ACTN4), also called non-muscle alpha-actinin 4, is an F-actin cross-linking protein which is thought to anchor actin to a variety of intracellular structures. It is associated with cell motility and cancer invasion. ACTN4 is probably involved in vesicular trafficking via its association with the CART complex, which is necessary for efficient transferrin receptor recycling but not for epidermal growth factor receptor (EGFR) degradation. It contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409139 Cd Length: 125 Bit Score: 112.10 E-value: 3.49e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 149 FQISDIQVsgqsEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAF 228
Cdd:cd21290 2 FAIQDISV----EETSAKEGLLLWCQRKTAPYKNVNVQNFHISWKDGLAFNALIHRHRPELIEYDKLRKDDPVTNLNNAF 77
|
90 100 110
....*....|....*....|....*....|....*....
gi 1920237968 229 SVAERDLGVTRLLDPED-VDVPQPDEKSIITYVSSLYDA 266
Cdd:cd21290 78 EVAEKYLDIPKMLDAEDiVNTARPDEKAIMTYVSSFYHA 116
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1497-2212 |
3.50e-28 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 125.82 E-value: 3.50e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1497 QRKRQAEAELAlrvQAEAEAAR--------EKQralqaLEELRLQAEEAERRLRQAEAERARQVQVALetaqrsaeAELQ 1568
Cdd:COG1196 172 ERKEEAERKLE---ATEENLERledilgelERQ-----LEPLERQAEKAERYRELKEELKELEAELLL--------LKLR 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1569 SEHASFAEKTAQLERTLKEEHVAVVQLREEATRRaqqqaeaeraraeaerelerwqlkanEALRLRLQAEEvaqqksltq 1648
Cdd:COG1196 236 ELEAELEELEAELEELEAELEELEAELAELEAEL--------------------------EELRLELEELE--------- 280
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1649 aeaekqkeeaerearrrgKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARL 1728
Cdd:COG1196 281 ------------------LELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEEL 342
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1729 QREAAAATQKRRELEAELAKVRAEmevLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQL 1808
Cdd:COG1196 343 EEELEEAEEELEEAEAELAEAEEA---LLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERL 419
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1809 AEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLR 1888
Cdd:COG1196 420 EEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEA 499
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1889 KASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEE 1968
Cdd:COG1196 500 EADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPL 579
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1969 ERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQ 2048
Cdd:COG1196 580 DKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAG 659
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2049 QKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLR 2128
Cdd:COG1196 660 GSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLE 739
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2129 KEAEQEAARRAQAEQAALRQKQAADAEMEKHKqfAEQALRQKAQV----EQELTALRLQLEETDHQKSILDEELQRLK-- 2202
Cdd:COG1196 740 ELLEEEELLEEEALEELPEPPDLEELERELER--LEREIEALGPVnllaIEEYEELEERYDFLSEQREDLEEARETLEea 817
|
730
....*....|.
gi 1920237968 2203 -AEVTEAARQR 2212
Cdd:COG1196 818 iEEIDRETRER 828
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1711-2601 |
4.89e-27 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 122.56 E-value: 4.89e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1711 TEQGEQQRQLLEEELARLQREAAAATqkRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFREL-- 1788
Cdd:PTZ00121 1033 TEYGNNDDVLKEKDIIDEDIDGNHEG--KAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETgk 1110
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1789 AEEAARlralAEEAKRQrqlAEEdaVRQRAEAERvlAEKLAAISEATRLKTE--AEIALKEKEAENERLRRLAEDeafQR 1866
Cdd:PTZ00121 1111 AEEARK----AEEAKKK---AED--ARKAEEARK--AEDARKAEEARKAEDAkrVEIARKAEDARKAEEARKAED---AK 1176
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1867 RLLEEQAAqhkadIEARLA-QLRKASESelerqkglvedtlrqrRQVEEeilalkgsfekaaAGKAELELELGRIRgTAE 1945
Cdd:PTZ00121 1177 KAEAARKA-----EEVRKAeELRKAEDA----------------RKAEA-------------ARKAEEERKAEEAR-KAE 1221
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1946 DTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERlkaKVEEARRLRERAEQESARQL 2025
Cdd:PTZ00121 1222 DAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEAR---KADELKKAEEKKKADEAKKA 1298
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2026 QLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERlrseaeaarraaeeaeaareraereaaqsRRQVEEAER 2105
Cdd:PTZ00121 1299 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEE-----------------------------AKKAAEAAK 1349
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2106 LKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQaLRQKAQVEQELTALRLQLE 2185
Cdd:PTZ00121 1350 AEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE-LKKAAAAKKKADEAKKKAE 1428
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2186 ETDHQksildEELQRLKAEVTEAARQRGQVEEelfslRVQMEELGKlkaRIEAENRALVLRDKDSAQRLLQE---EAEKM 2262
Cdd:PTZ00121 1429 EKKKA-----DEAKKKAEEAKKADEAKKKAEE-----AKKAEEAKK---KAEEAKKADEAKKKAEEAKKADEakkKAEEA 1495
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2263 KQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLK----EKMQAVQEATRLKAEAELLQQQKELAQEQARRLQED 2338
Cdd:PTZ00121 1496 KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKadeaKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEED 1575
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2339 KeQMAQQLAQETQgfqktlETERQRQLEMSAEAERLRLRVAEmsraQARAEEDARrfrKQAEDIGErlyrtelaTQEKVM 2418
Cdd:PTZ00121 1576 K-NMALRKAEEAK------KAEEARIEEVMKLYEEEKKMKAE----EAKKAEEAK---IKAEELKK--------AEEEKK 1633
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2419 LVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQalqqsflsekdslLQRErci 2498
Cdd:PTZ00121 1634 KVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA-------------LKKE--- 1697
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2499 EQEKAKLEQL---FQDEVAKAQALREEQQRQQQqmqqekqqlaaSMEEARRRQHE----AEEgVRRQQEELQRLAQQQQQ 2571
Cdd:PTZ00121 1698 AEEAKKAEELkkkEAEEKKKAEELKKAEEENKI-----------KAEEAKKEAEEdkkkAEE-AKKDEEEKKKIAHLKKE 1765
|
890 900 910
....*....|....*....|....*....|....
gi 1920237968 2572 QEKLLAEENQR----LRERLQHLEEERRAALARS 2601
Cdd:PTZ00121 1766 EEKKAEEIRKEkeavIEEELDEEDEKRRMEVDKK 1799
|
|
| CH_MICAL_EHBP-like |
cd22198 |
calponin homology (CH) domain found in the MICAL and EHBP families; This group is composed of ... |
167-266 |
1.13e-26 |
|
calponin homology (CH) domain found in the MICAL and EHBP families; This group is composed of the molecule interacting with CasL protein (MICAL) and EH domain-binding protein (EHBP) families. MICAL is a large, multidomain, cytosolic protein with a single LIM domain, a calponin homology (CH) domain and a flavoprotein monooxygenase (MO) domain. In Drosophila, MICAL is expressed in axons, interacts with the neuronal A (PlexA) receptor and is required for Semaphorin 1a (Sema-1a)-PlexA-mediated repulsive axon guidance. The LIM and CH domains mediate interactions with the cytoskeleton, cytoskeletal adaptor proteins, and other signaling proteins. The flavoprotein MO is required for semaphorin-plexin repulsive axon guidance during axonal pathfinding in the Drosophila neuromuscular system. The EHBP family includes EHBP1 and EHBP1-like protein (EHBP1L1). EHBP1 is a regulator of endocytic recycling and may play a role in actin reorganization by linking clathrin-mediated endocytosis to the actin cytoskeleton. It may act as an effector of small GTPases, including RAB-10 (Rab10), and play a role in vesicle trafficking. EHBP proteins contain a single CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409188 Cd Length: 105 Bit Score: 106.99 E-value: 1.13e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 167 EKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED- 245
Cdd:cd22198 3 EELLSWCQEQTEGYRGVKVTDLTSSWRSGLALCAIIHRFRPDLIDFSSLDPENIAENNQLAFDVAEQELGIPPVMTGQEm 82
|
90 100
....*....|....*....|.
gi 1920237968 246 VDVPQPDEKSIITYVSSLYDA 266
Cdd:cd22198 83 ASLAVPDKLSMVSYLSQFYEA 103
|
|
| CH_ACTN1_rpt2 |
cd21287 |
second calponin homology (CH) domain found in alpha-actinin-1; Alpha-actinin-1 (ACTN1), also ... |
151-266 |
1.64e-26 |
|
second calponin homology (CH) domain found in alpha-actinin-1; Alpha-actinin-1 (ACTN1), also called alpha-actinin cytoskeletal isoform, or non-muscle alpha-actinin-1, is an F-actin cross-linking protein which is thought to anchor actin to a variety of intracellular structures. ACTN1 is a bundling protein. Its mutations cause congenital macrothrombocytopenia. It contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409136 Cd Length: 124 Bit Score: 107.09 E-value: 1.64e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 151 ISDIQVsgqsEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSV 230
Cdd:cd21287 1 IQDISV----EETSAKEGLLLWCQRKTAPYKNVNIQNFHISWKDGLGFCALIHRHRPELIDYGKLRKDDPLTNLNTAFDV 76
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237968 231 AERDLGVTRLLDPED-VDVPQPDEKSIITYVSSLYDA 266
Cdd:cd21287 77 AEKYLDIPKMLDAEDiVGTARPDEKAIMTYVSSFYHA 113
|
|
| CH_SPTBN5_rpt1 |
cd21247 |
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 5 (SPTBN5) ... |
32-151 |
3.10e-26 |
|
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 5 (SPTBN5) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN5, also called beta-V spectrin, is a mammalian ortholog of Drosophila beta H spectrin that may play a crucial role as a longer actin-membrane cross-linker or to fulfill the need for greater extensible flexibility than can be provided by the other smaller conventional spectrins. SPTBN5 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409096 Cd Length: 125 Bit Score: 106.38 E-value: 3.10e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 32 RVQDEQDERDRVQKKTFTKWVNKHLIKHwraEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR-EKGRMRFHKLQNVQI 110
Cdd:cd21247 9 HIRKLQEQRMTMQKKTFTKWMNNVFSKN---GAKIEITDIYTELKDGIHLLRLLELISGEQLPRpSRGKMRVHFLENNSK 85
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 1920237968 111 ALDYLRHR-QVKLVNIRNddIADGNPKLTLGLIWTIILHFQI 151
Cdd:cd21247 86 AITFLKTKvPVKLIGPEN--IVDGDRTLILGLIWIIILRFQI 125
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1877-2605 |
5.82e-26 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 119.09 E-value: 5.82e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1877 KADIEARLAQLRKASESELERQKGLVEDTlRQRRQVEEEilalKGSFE---KAAAGKAELELELGRIRGTAEDTLRSKE- 1952
Cdd:PTZ00121 1059 KAEAKAHVGQDEGLKPSYKDFDFDAKEDN-RADEATEEA----FGKAEeakKTETGKAEEARKAEEAKKKAEDARKAEEa 1133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1953 -QAE-----QEAARQRQLAAEEERRRREAEERVQKSLAAEE----EAARQ----RKAA----LEEVERLKA--KVEEARR 2012
Cdd:PTZ00121 1134 rKAEdarkaEEARKAEDAKRVEIARKAEDARKAEEARKAEDakkaEAARKaeevRKAEelrkAEDARKAEAarKAEEERK 1213
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2013 LRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAERE 2092
Cdd:PTZ00121 1214 AEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKAD 1293
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2093 AAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADA--EMEKHKQFAEQALRQK 2170
Cdd:PTZ00121 1294 EAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAadEAEAAEEKAEAAEKKK 1373
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2171 AQVEQELTALRLQLEETDHQksildEELQRLKAEVTEAARQRGQVEEElfslrvqmeelgKLKARiEAENRALVLRDKDS 2250
Cdd:PTZ00121 1374 EEAKKKADAAKKKAEEKKKA-----DEAKKKAEEDKKKADELKKAAAA------------KKKAD-EAKKKAEEKKKADE 1435
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2251 AQRLLQE--EAEKMKQVAEEAARLSVAAQEAARLRQlAEEdlAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELA 2328
Cdd:PTZ00121 1436 AKKKAEEakKADEAKKKAEEAKKAEEAKKKAEEAKK-ADE--AKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKA 1512
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2329 QEqARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRlRVAEMSRAQA--RAEEDARRFRKQAEDigerL 2406
Cdd:PTZ00121 1513 DE-AKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELK-KAEEKKKAEEakKAEEDKNMALRKAEE----A 1586
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2407 YRTELATQEKVM-LVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFL 2485
Cdd:PTZ00121 1587 KKAEEARIEEVMkLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEE 1666
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2486 SEKDsllqrerciEQEKAKLEQLFQDEVAKAQAlrEEQQRQQQQMQQEKQQLAASMEEARRRQHE---AEEGVRRQQEEL 2562
Cdd:PTZ00121 1667 AKKA---------EEDKKKAEEAKKAEEDEKKA--AEALKKEAEEAKKAEELKKKEAEEKKKAEElkkAEEENKIKAEEA 1735
|
730 740 750 760
....*....|....*....|....*....|....*....|....*.
gi 1920237968 2563 QRLAQQ-QQQQEKLLAEENQrlRERLQHL--EEERRAALARSEEIA 2605
Cdd:PTZ00121 1736 KKEAEEdKKKAEEAKKDEEE--KKKIAHLkkEEEKKAEEIRKEKEA 1779
|
|
| CH_ACTN3_rpt2 |
cd21289 |
second calponin homology (CH) domain found in alpha-actinin-3; Alpha-actinin-3 (ACTN3), also ... |
151-266 |
7.69e-26 |
|
second calponin homology (CH) domain found in alpha-actinin-3; Alpha-actinin-3 (ACTN3), also called alpha-actinin skeletal muscle isoform 3, is an F-actin cross-linking protein which is thought to anchor actin to a variety of intracellular structures. ACTN3 is a bundling protein. It is critical in anchoring the myofibrillar actin filaments and plays a key role in muscle contraction. It contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409138 Cd Length: 124 Bit Score: 105.19 E-value: 7.69e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 151 ISDIQVsgqsEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSV 230
Cdd:cd21289 1 IQDISV----EETSAKEGLLLWCQRKTAPYRNVNVQNFHTSWKDGLALCALIHRHRPDLIDYAKLRKDDPIGNLNTAFEV 76
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237968 231 AERDLGVTRLLDPED-VDVPQPDEKSIITYVSSLYDA 266
Cdd:cd21289 77 AEKYLDIPKMLDAEDiVNTPKPDEKAIMTYVSCFYHA 113
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1403-2205 |
8.42e-26 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 118.71 E-value: 8.42e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1403 QKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEaEAQKRQAQ 1482
Cdd:PTZ00121 1043 KEKDIIDEDIDGNHEGKAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEE-ARKAEEAK 1121
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1483 EEAERLR-----RQVQD--ETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAE----AERARQ 1551
Cdd:PTZ00121 1122 KKAEDARkaeeaRKAEDarKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEelrkAEDARK 1201
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1552 VQVAletaqRSAEAELQSEHASFAEKTAQLERTLKEEHVAvvQLREEATRRAQQQAEAERARAEAERELERWQLKANEAL 1631
Cdd:PTZ00121 1202 AEAA-----RKAEEERKAEEARKAEDAKKAEAVKKAEEAK--KDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKA 1274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1632 RLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQavRQRELAEQELEKQRQLAEGTAQQrlaAEQEliRLRAET 1711
Cdd:PTZ00121 1275 EEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEA--KKADEAKKKAEEAKKKADAAKKK---AEEA--KKAAEA 1347
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1712 EQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEmEVLLA--SKARAEEESRSTSEKSKQrlEAEAGRFRELA 1789
Cdd:PTZ00121 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAE-EKKKAdeAKKKAEEDKKKADELKKA--AAAKKKADEAK 1424
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1790 EEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVlAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLL 1869
Cdd:PTZ00121 1425 KKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK-AEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAK 1503
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1870 EEQAAQHKADiEARLAQLRKASEsELERQKglvedtlrQRRQVEEeilaLKGSFEKAAAGKAELELELGRirgtAEDTlR 1949
Cdd:PTZ00121 1504 KAAEAKKKAD-EAKKAEEAKKAD-EAKKAE--------EAKKADE----AKKAEEKKKADELKKAEELKK----AEEK-K 1564
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1950 SKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRlrerAEQESARQLQLaq 2029
Cdd:PTZ00121 1565 KAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKK----AEEEKKKVEQL-- 1638
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2030 eaaqKRLQAEEKAHAFAVQQKEQELQQTLqqeqsvlERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQs 2109
Cdd:PTZ00121 1639 ----KKKEAEEKKKAEELKKAEEENKIKA-------AEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEE- 1706
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2110 aeeQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADaemEKHKQFAEQALRQKAQVEQELTALRLQLEETDH 2189
Cdd:PTZ00121 1707 ---LKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAE---EAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAV 1780
|
810
....*....|....*.
gi 1920237968 2190 QKSILDEELQRLKAEV 2205
Cdd:PTZ00121 1781 IEEELDEEDEKRRMEV 1796
|
|
| CH_FLN_rpt1 |
cd21228 |
first calponin homology (CH) domain found in filamins; The filamin family includes filamin-A ... |
42-149 |
2.20e-25 |
|
first calponin homology (CH) domain found in filamins; The filamin family includes filamin-A (FLN-A), filamin-B (FLN-B) and filamin-C (FLN-C). Filamins function to anchor various transmembrane proteins to the actin cytoskeleton. FLN-A is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-B is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton and may also promote orthogonal branching of actin filaments as well as link actin filaments to membrane glycoproteins. FLN-C, also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. Members of this family contain two copies of the CH domain. The model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409077 Cd Length: 108 Bit Score: 103.34 E-value: 2.20e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 42 RVQKKTFTKWVNKHLikhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR---EKGRMRFHKLQNVQIALDYLRHR 118
Cdd:cd21228 3 KIQQNTFTRWCNEHL-----KCVNKRIYNLETDLSDGLRLIALLEVLSQKRMYKkynKRPTFRQMKLENVSVALEFLERE 77
|
90 100 110
....*....|....*....|....*....|.
gi 1920237968 119 QVKLVNIRNDDIADGNPKLTLGLIWTIILHF 149
Cdd:cd21228 78 SIKLVSIDSSAIVDGNLKLILGLIWTLILHY 108
|
|
| CH_FLNC_rpt1 |
cd21310 |
first calponin homology (CH) domain found in filamin-C (FLN-C) and similar proteins; Filamin-C ... |
42-152 |
1.60e-24 |
|
first calponin homology (CH) domain found in filamin-C (FLN-C) and similar proteins; Filamin-C (FLN-C), also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. FLN-C contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409159 Cd Length: 125 Bit Score: 101.65 E-value: 1.60e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 42 RVQKKTFTKWVNKHLikhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRE---KGRMRFHKLQNVQIALDYLRHR 118
Cdd:cd21310 15 KIQQNTFTRWCNEHL-----KCVQKRLNDLQKDLSDGLRLIALLEVLSQKKMYRKyhpRPNFRQMKLENVSVALEFLDRE 89
|
90 100 110
....*....|....*....|....*....|....
gi 1920237968 119 QVKLVNIRNDDIADGNPKLTLGLIWTIILHFQIS 152
Cdd:cd21310 90 HIKLVSIDSKAIVDGNLKLILGLIWTLILHYSIS 123
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1340-2551 |
1.62e-24 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 114.15 E-value: 1.62e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1340 AEQQRAEERERLAEVEAalEKQRQLAE-AHAQAKAQAEREAQGLQRRMQ--EEVARREEvAVEAQEQKRSIQEELQHLRQ 1416
Cdd:NF041483 85 ADQLRADAERELRDARA--QTQRILQEhAEHQARLQAELHTEAVQRRQQldQELAERRQ-TVESHVNENVAWAEQLRART 161
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1417 SSEA-----EIQAKARQVEAAERSRlrieeeirVVRLQLEAteRQRGGAEGElqalRARAEeAEAQKRQAQEEAERLRRQ 1491
Cdd:NF041483 162 ESQArrlldESRAEAEQALAAARAE--------AERLAEEA--RQRLGSEAE----SARAE-AEAILRRARKDAERLLNA 226
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1492 VQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQaeEAERRLRQAEAERARQVQVALETA-QRSAEAELQSE 1570
Cdd:NF041483 227 ASTQAQEATDHAEQLRSSTAAESDQARRQAAELSRAAEQRMQ--EAEEALREARAEAEKVVAEAKEAAaKQLASAESANE 304
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1571 hasfaektaQLERTLKEEhvaVVQLREEATRRAQQQAEAERARAEAERELERWQL-KANEALRLRLQAEEVAQQKSLTQA 1649
Cdd:NF041483 305 ---------QRTRTAKEE---IARLVGEATKEAEALKAEAEQALADARAEAEKLVaEAAEKARTVAAEDTAAQLAKAART 372
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1650 EAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQ-RLAAEQELIRLRAETEQgeqqrqlLEEELARL 1728
Cdd:NF041483 373 AEEVLTKASEDAKATTRAAAEEAERIRREAEAEADRLRGEAADQAEQlKGAAKDDTKEYRAKTVE-------LQEEARRL 445
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1729 QREAAaatQKRRELEAELAKVRAEmevllaskARAEeesrstsekSKQRLEAEAGRFRELAEEAarlRALAEEAKRQrql 1808
Cdd:NF041483 446 RGEAE---QLRAEAVAEGERIRGE--------ARRE---------AVQQIEEAARTAEELLTKA---KADADELRST--- 499
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1809 aeedavrQRAEAERVLAEklaAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAA-QHKADIEARLAQL 1887
Cdd:NF041483 500 -------ATAESERVRTE---AIERATTLRRQAEETLERTRAEAERLRAEAEEQAEEVRAAAERAArELREETERAIAAR 569
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1888 RKASESELERQKGLVEdtlrQRRQVEEEilALKGSFEKAAAGKAELELELGRIRGTAEDTLRS-KEQAEQEAARQRqlaa 1966
Cdd:NF041483 570 QAEAAEELTRLHTEAE----ERLTAAEE--ALADARAEAERIRREAAEETERLRTEAAERIRTlQAQAEQEAERLR---- 639
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1967 eeerrRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEE-ARRLR-------ERAEQESARQLQLAQ-EAAQKRLQ 2037
Cdd:NF041483 640 -----TEAAADASAARAEGENVAVRLRSEAAAEAERLKSEAQEsADRVRaeaaaaaERVGTEAAEALAAAQeEAARRRRE 714
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2038 AEE---KAHAFAVQQKEQELQQTLQQEQSVLERLrseaeaarraaeeaeaareraEREAAQSRRQVEEAERlkqsaeeqa 2114
Cdd:NF041483 715 AEEtlgSARAEADQERERAREQSEELLASARKRV---------------------EEAQAEAQRLVEEADR--------- 764
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2115 qaqaqaqaaaeklrkeaeqeaarraqaeqaalRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEET-DHQKSI 2193
Cdd:NF041483 765 --------------------------------RATELVSAAEQTAQQVRDSVAGLQEQAEEEIAGLRSAAEHAaERTRTE 812
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2194 LDEELQRLKAEvteAARQRGQVEEELFSLRVQ-MEELGKLKARIEAEnralVLRDKDSAQRLLQEEAEKMKQVAEEAA-R 2271
Cdd:NF041483 813 AQEEADRVRSD---AYAERERASEDANRLRREaQEETEAAKALAERT----VSEAIAEAERLRSDASEYAQRVRTEASdT 885
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2272 LSVAAQEAARLRQLAEEDLAQQRALA---EKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAq 2348
Cdd:NF041483 886 LASAEQDAARTRADAREDANRIRSDAaaqADRLIGEATSEAERLTAEARAEAERLRDEARAEAERVRADAAAQAEQLIA- 964
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2349 etqgfqktleterqrqlEMSAEAERLRLRVAE-MSRAQARAE---EDARRFRKQAEDIGERLyRTELATQEKVMLVQTLE 2424
Cdd:NF041483 965 -----------------EATGEAERLRAEAAEtVGSAQQHAErirTEAERVKAEAAAEAERL-RTEAREEADRTLDEARK 1026
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2425 TQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLL----QETQALQQSFLSEKDSLLQRERcieq 2500
Cdd:NF041483 1027 DANKRRSEAAEQADTLITEAAAEADQLTAKAQEEALRTTTEAEAQADTMVgaarKEAERIVAEATVEGNSLVEKAR---- 1102
|
1210 1220 1230 1240 1250
....*....|....*....|....*....|....*....|....*....|.
gi 1920237968 2501 ekAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLaasMEEARRRQHEA 2551
Cdd:NF041483 1103 --TDADELLVGARRDATAIRERAEELRDRITGEIEEL---HERARRESAEQ 1148
|
|
| CH_ACTN2_rpt2 |
cd21288 |
second calponin homology (CH) domain found in alpha-actinin-2; Alpha-actinin-2 (ACTN2), also ... |
151-266 |
2.24e-24 |
|
second calponin homology (CH) domain found in alpha-actinin-2; Alpha-actinin-2 (ACTN2), also called alpha-actinin skeletal muscle isoform 2, is an F-actin cross-linking protein which is thought to anchor actin to a variety of intracellular structures. ACTN2 is a bundling protein. Its mutations are associated with cardiomyopathies, as well as skeletal muscle disorder. It contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409137 Cd Length: 124 Bit Score: 101.30 E-value: 2.24e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 151 ISDIQVsgqsEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSV 230
Cdd:cd21288 1 IQDISV----EETSAKEGLLLWCQRKTAPYRNVNIQNFHTSWKDGLGLCALIHRHRPDLIDYSKLNKDDPIGNINLAMEI 76
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237968 231 AERDLGVTRLLDPED-VDVPQPDEKSIITYVSSLYDA 266
Cdd:cd21288 77 AEKHLDIPKMLDAEDiVNTPKPDERAIMTYVSCFYHA 113
|
|
| CH |
smart00033 |
Calponin homology domain; Actin binding domains present in duplicate at the N-termini of ... |
46-148 |
3.19e-24 |
|
Calponin homology domain; Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
Pssm-ID: 214479 [Multi-domain] Cd Length: 101 Bit Score: 99.70 E-value: 3.19e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 46 KTFTKWVNKHLIKhwraEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREK---GRMRFHKLQNVQIALDYLRHRQVKL 122
Cdd:smart00033 1 KTLLRWVNSLLAE----YDKPPVTNFSSDLKDGVALCALLNSLSPGLVDKKKvaaSLSRFKKIENINLALSFAEKLGGKV 76
|
90 100
....*....|....*....|....*.
gi 1920237968 123 VNIRNDDIADGnPKLTLGLIWTIILH 148
Cdd:smart00033 77 VLFEPEDLVEG-PKLILGVIWTLISL 101
|
|
| CH_MICALL |
cd21197 |
calponin homology (CH) domain found in the MICAL-like protein family; The MICAL-L family ... |
169-264 |
6.22e-24 |
|
calponin homology (CH) domain found in the MICAL-like protein family; The MICAL-L family includes MICAL-L1 and MICAL-L2. MICAL-L1, also called molecule interacting with Rab13 (MIRab13), is a probable lipid-binding protein with higher affinity for phosphatidic acid, a lipid enriched in recycling endosome membranes. It is a tubular endosomal membrane hub that connects Rab35 and Arf6 with Rab8a. It may be involved in a late step of receptor-mediated endocytosis regulating endocytosed-EGF receptor trafficking. Alternatively, it may regulate slow endocytic recycling of endocytosed proteins back to the plasma membrane. MICAL-L1 may indirectly play a role in neurite outgrowth. MICAL-L2, also called junctional Rab13-binding protein (JRAB), or molecule interacting with CasL-like 2, acts as an effector of small Rab GTPases which is involved in junctional complexes assembly through the regulation of cell adhesion molecule transport to the plasma membrane, and actin cytoskeleton reorganization. It regulates the endocytic recycling of occludins, claudins, and E-cadherin to the plasma membrane and may thereby regulate the establishment of tight junctions and adherens junctions. Members of this family contain a single copy of CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409046 Cd Length: 105 Bit Score: 99.15 E-value: 6.22e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 169 LLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED-VD 247
Cdd:cd21197 5 LLRWCRRQCEGYPGVNITNLTSSFRDGLAFCAILHRHRPELIDFHSLKKDNWLENNRLAFRVAETSLGIPALLDAEDmVT 84
|
90
....*....|....*..
gi 1920237968 248 VPQPDEKSIITYVSSLY 264
Cdd:cd21197 85 MHVPDRLSIITYVSQYY 101
|
|
| CH |
pfam00307 |
Calponin homology (CH) domain; The CH domain is found in both cytoskeletal proteins and signal ... |
163-269 |
6.88e-24 |
|
Calponin homology (CH) domain; The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most member proteins have from two to four copies of the CH domain, however some proteins such as calponin have only a single copy.
Pssm-ID: 425596 [Multi-domain] Cd Length: 109 Bit Score: 99.28 E-value: 6.88e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 163 MTAKEKLLLWSQRMVEGC-QGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVY--RQTNLENLDQAFSVAERDLGVTR 239
Cdd:pfam00307 1 LELEKELLRWINSHLAEYgPGVRVTNFTTDLRDGLALCALLNKLAPGLVDKKKLNksEFDKLENINLALDVAEKKLGVPK 80
|
90 100 110
....*....|....*....|....*....|.
gi 1920237968 240 -LLDPEDVDvpQPDEKSIITYVSSLYDAMPR 269
Cdd:pfam00307 81 vLIEPEDLV--EGDNKSVLTYLASLFRRFQA 109
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1377-2237 |
7.42e-24 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 111.69 E-value: 7.42e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1377 REAQGLQRRMQEEVARREEVAVEAQEQKRSIQ------EELQHLR-QSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQ 1449
Cdd:TIGR02168 175 KETERKLERTRENLDRLEDILNELERQLKSLErqaekaERYKELKaELRELELALLVLRLEELREELEELQEELKEAEEE 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1450 LEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEE 1529
Cdd:TIGR02168 255 LEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDE 334
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1530 LRLQAEEAERRLrqaeaerarqvqvaletaqrsaeAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATrraqqqaea 1609
Cdd:TIGR02168 335 LAEELAELEEKL-----------------------EELKEELESLEAELEELEAELEELESRLEELEEQLE--------- 382
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1610 eraraeaerelerwQLKANEALRLRLQAEEVAQQKSLTQAEaekqkeeaerearrrgkaeEQAVRQRELAEQELEKQRQL 1689
Cdd:TIGR02168 383 --------------TLRSKVAQLELQIASLNNEIERLEARL-------------------ERLEDRRERLQQEIEELLKK 429
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1690 AEgtAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRS 1769
Cdd:TIGR02168 430 LE--EAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEG 507
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1770 TSE--KSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQlaeEDAVRQRAEAERVLAEKLAAiSEATRLKTEAEIALKE 1847
Cdd:TIGR02168 508 VKAllKNQSGLSGILGVLSELISVDEGYEAAIEAALGGRL---QAVVVENLNAAKKAIAFLKQ-NELGRVTFLPLDSIKG 583
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1848 KEAENERLRRLAEDEAFQRRL--LEEQAAQHKADIEARLAQLRKASEselerqkglVEDTLRQRRQVEEEIL-------- 1917
Cdd:TIGR02168 584 TEIQGNDREILKNIEGFLGVAkdLVKFDPKLRKALSYLLGGVLVVDD---------LDNALELAKKLRPGYRivtldgdl 654
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1918 -----ALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQlaaeeerRRREAEERVQKSLAAEEEAARQ 1992
Cdd:TIGR02168 655 vrpggVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRK-------ELEELEEELEQLRKELEELSRQ 727
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1993 RKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAfaVQQKEQELQQTLQQEQSVLERLRSEA 2072
Cdd:TIGR02168 728 ISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAE--AEAEIEELEAQIEQLKEELKALREAL 805
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2073 EAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAA 2152
Cdd:TIGR02168 806 DELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASL 885
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2153 DAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLK---AEVTEAARQRGQVEEELFSLRVQMEEL 2229
Cdd:TIGR02168 886 EEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEvriDNLQERLSEEYSLTLEEAEALENKIED 965
|
....*...
gi 1920237968 2230 GKLKARIE 2237
Cdd:TIGR02168 966 DEEEARRR 973
|
|
| CH |
pfam00307 |
Calponin homology (CH) domain; The CH domain is found in both cytoskeletal proteins and signal ... |
43-151 |
1.07e-23 |
|
Calponin homology (CH) domain; The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most member proteins have from two to four copies of the CH domain, however some proteins such as calponin have only a single copy.
Pssm-ID: 425596 [Multi-domain] Cd Length: 109 Bit Score: 98.51 E-value: 1.07e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 43 VQKKTFTKWVNKHLIKhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGDSLP-REKGRMRFHKLQNVQIALDYLRHRQ-V 120
Cdd:pfam00307 2 ELEKELLRWINSHLAE---YGPGVRVTNFTTDLRDGLALCALLNKLAPGLVDkKKLNKSEFDKLENINLALDVAEKKLgV 78
|
90 100 110
....*....|....*....|....*....|.
gi 1920237968 121 KLVNIRNDDIADGNPKLTLGLIWTIILHFQI 151
Cdd:pfam00307 79 PKVLIEPEDLVEGDNKSVLTYLASLFRRFQA 109
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1303-2039 |
4.68e-23 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 108.99 E-value: 4.68e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1303 QEYVDLRTRYSELS-TLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAAL--------EKQRQLAEA------ 1367
Cdd:TIGR02168 213 ERYKELKAELRELElALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLeelrlevsELEEEIEELqkelya 292
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1368 HAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQE------ELQHLRQSSEAEIQAKARQVEAAERSRLRIEE 1441
Cdd:TIGR02168 293 LANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDElaeelaELEEKLEELKEELESLEAELEELEAELEELES 372
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1442 EIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVqdETQRKRQAEAELALRVQAEAEAAREKQ 1521
Cdd:TIGR02168 373 RLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEI--EELLKKLEEAELKELQAELEELEEELE 450
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1522 RALQALEELRLQAEEAERRLRQAEAERarqvqvaletaqRSAEAELQsEHASFAEKTAQLERTLKEEHVAVVQLREEAtr 1601
Cdd:TIGR02168 451 ELQEELERLEEALEELREELEEAEQAL------------DAAERELA-QLQARLDSLERLQENLEGFSEGVKALLKNQ-- 515
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1602 raQQQAEAERARAEAERELERWQLKANEALRLRLQA---EEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQREL 1678
Cdd:TIGR02168 516 --SGLSGILGVLSELISVDEGYEAAIEAALGGRLQAvvvENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREI 593
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1679 AEQELEKQRQLA---EGTAQQRLAAEQELIRLR-AETEQG--EQQRQLLEEELA------------RLQREAAAATQKRR 1740
Cdd:TIGR02168 594 LKNIEGFLGVAKdlvKFDPKLRKALSYLLGGVLvVDDLDNalELAKKLRPGYRIvtldgdlvrpggVITGGSAKTNSSIL 673
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1741 ELEAELAKVRAEMEvLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEA 1820
Cdd:TIGR02168 674 ERRREIEELEEKIE-ELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQL 752
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1821 ERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAED-----EAFQRRLLEEQAAQHKADIEARLAQLRKAS-ESE 1894
Cdd:TIGR02168 753 SKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQlkeelKALREALDELRAELTLLNEEAANLRERLESlERR 832
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1895 LERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELgrirgtaEDTLRSKEQAEQEAARQRQLAAEEERRRRE 1974
Cdd:TIGR02168 833 IAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESEL-------EALLNERASLEEALALLRSELEELSEELRE 905
|
730 740 750 760 770 780
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237968 1975 AEERVQKSLAAEEEAARQRKAALEEVERLKAKVEE-ARRLRERA--EQESARQLQLAQEAAQKRLQAE 2039
Cdd:TIGR02168 906 LESKRSELRRELEELREKLAQLELRLEGLEVRIDNlQERLSEEYslTLEEAEALENKIEDDEEEARRR 973
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1695-2487 |
5.83e-23 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 108.99 E-value: 5.83e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1695 QQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAAtQKRRELEAELAKVRAEmevLLASKARAEEESRSTSEKS 1774
Cdd:TIGR02168 172 ERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKA-ERYKELKAELRELELA---LLVLRLEELREELEELQEE 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1775 KQRLEAEagrFRELAEEAARLRALAEEAKRQRQLAEEDAvrqrAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENER 1854
Cdd:TIGR02168 248 LKEAEEE---LEELTAELQELEEKLEELRLEVSELEEEI----EELQKELYALANEISRLEQQKQILRERLANLERQLEE 320
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1855 LRRLAEDEAFQRRLLEEQAAQHKADIEaRLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFekaaagkAELE 1934
Cdd:TIGR02168 321 LEAQLEELESKLDELAEELAELEEKLE-ELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKV-------AQLE 392
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1935 LELGRIRGTAEDTLRSKEQAEQEAARQRQ-LAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRL 2013
Cdd:TIGR02168 393 LQIASLNNEIERLEARLERLEDRRERLQQeIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEE 472
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2014 RERAEQESARQLQLAQE--AAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAER 2091
Cdd:TIGR02168 473 AEQALDAAERELAQLQArlDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSELISVDEGYEAAIEAALGGRLQAVVV 552
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2092 EAAQSRRQVEEAerLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAaDAEMEKHKQFAeqaLRQKA 2171
Cdd:TIGR02168 553 ENLNAAKKAIAF--LKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKF-DPKLRKALSYL---LGGVL 626
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2172 QVEQELTALRLQlEETDHQKSI--LDEELQRLKAEVTEAARQRGQVeeeLFSLRVQMEELGKLKARIEAENRAL--VLRD 2247
Cdd:TIGR02168 627 VVDDLDNALELA-KKLRPGYRIvtLDGDLVRPGGVITGGSAKTNSS---ILERRREIEELEEKIEELEEKIAELekALAE 702
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2248 KDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKM-----------QAVQEATRLKA 2316
Cdd:TIGR02168 703 LRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEaeieeleerleEAEEELAEAEA 782
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2317 EAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRlrvaEMSRAQARAEEDARRFR 2396
Cdd:TIGR02168 783 EIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLE----DLEEQIEELSEDIESLA 858
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2397 KQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTvRQEQLLQE 2476
Cdd:TIGR02168 859 AEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL-RLEGLEVR 937
|
810
....*....|.
gi 1920237968 2477 TQALQQSFLSE 2487
Cdd:TIGR02168 938 IDNLQERLSEE 948
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1331-2042 |
8.84e-23 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 108.37 E-value: 8.84e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1331 RRMEEEERLAEQQRAEERERLAEVEAALEKQRQLA-EAHAQAKAQAEREAQGLQRRMQEEVARreeVAVEAQEQKRSIQE 1409
Cdd:NF041483 254 RQAAELSRAAEQRMQEAEEALREARAEAEKVVAEAkEAAAKQLASAESANEQRTRTAKEEIAR---LVGEATKEAEALKA 330
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1410 ELQHLRQSSEAEiqAKARQVEAAERSRLRIEEEirvvrlqlEATERQRGGAEGElQALRARAEEAEAQKRQAQEEAERLR 1489
Cdd:NF041483 331 EAEQALADARAE--AEKLVAEAAEKARTVAAED--------TAAQLAKAARTAE-EVLTKASEDAKATTRAAAEEAERIR 399
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1490 RQVQDETQRKRQAEAELALRVQAEAEAAREKQRAlqalEELRLQaEEAeRRLRqAEAERARqvqvaletaqrsaeaelqs 1569
Cdd:NF041483 400 REAEAEADRLRGEAADQAEQLKGAAKDDTKEYRA----KTVELQ-EEA-RRLR-GEAEQLR------------------- 453
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1570 ehasfAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELErwqlkANEALRLRLQAEEVAqqKSLTQA 1649
Cdd:NF041483 454 -----AEAVAEGERIRGEARREAVQQIEEAARTAEELLTKAKADADELRSTA-----TAESERVRTEAIERA--TTLRRQ 521
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1650 EAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLA-AEQELIRLRAETEQ----GEQQRQLLEEE 1724
Cdd:NF041483 522 AEETLERTRAEAERLRAEAEEQAEEVRAAAERAARELREETERAIAARQAeAAEELTRLHTEAEErltaAEEALADARAE 601
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1725 LARLQREAAAATQKRRELEAE-----LAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfrelAEEAARLRALA 1799
Cdd:NF041483 602 AERIRREAAEETERLRTEAAErirtlQAQAEQEAERLRTEAAADASAARAEGENVAVRLRSEA------AAEAERLKSEA 675
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1800 EE-AKRQRQLAEEDAVRQRAEAERVLAeklAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAfqrrllEEQAAQHKA 1878
Cdd:NF041483 676 QEsADRVRAEAAAAAERVGTEAAEALA---AAQEEAARRRREAEETLGSARAEADQERERAREQS------EELLASARK 746
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1879 DIEARLAQLRKASESELERQKGLV---EDTLRQRR--------QVEEEILALKGSFEKAAA-GKAELELELGRIRGtaeD 1946
Cdd:NF041483 747 RVEEAQAEAQRLVEEADRRATELVsaaEQTAQQVRdsvaglqeQAEEEIAGLRSAAEHAAErTRTEAQEEADRVRS---D 823
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1947 TLRSKEQAEQEAARQRQlaaeeerrrreaeERVQKSLAAEEEAARQRKAALEEVERLKAKVEE-ARRLRERAeqeSARQL 2025
Cdd:NF041483 824 AYAERERASEDANRLRR-------------EAQEETEAAKALAERTVSEAIAEAERLRSDASEyAQRVRTEA---SDTLA 887
|
730
....*....|....*..
gi 1920237968 2026 QLAQEAAQKRLQAEEKA 2042
Cdd:NF041483 888 SAEQDAARTRADAREDA 904
|
|
| CH_MICALL1 |
cd21252 |
calponin homology (CH) domain found in MICAL-like protein 1; MICAL-like protein 1 (MICAL-L1), ... |
165-264 |
9.40e-23 |
|
calponin homology (CH) domain found in MICAL-like protein 1; MICAL-like protein 1 (MICAL-L1), also called molecule interacting with Rab13 (MIRab13), is a probable lipid-binding protein with higher affinity for phosphatidic acid, a lipid enriched in recycling endosome membranes. It is a tubular endosomal membrane hub that connects Rab35 and Arf6 with Rab8a. It may be involved in a late step of receptor-mediated endocytosis regulating endocytosed-EGF receptor trafficking. Alternatively, it may regulate slow endocytic recycling of endocytosed proteins back to the plasma membrane. MICAL-L1 may indirectly play a role in neurite outgrowth. It contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409101 Cd Length: 107 Bit Score: 95.71 E-value: 9.40e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 165 AKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPE 244
Cdd:cd21252 1 ARRALQAWCRRQCEGYPGVEIRDLSSSFRDGLAFCAILHRHRPDLIDFDSLSKDNVYENNRLAFEVAERELGIPALLDPE 80
|
90 100
....*....|....*....|.
gi 1920237968 245 D-VDVPQPDEKSIITYVSSLY 264
Cdd:cd21252 81 DmVSMKVPDCLSIMTYVSQYY 101
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1866-2507 |
1.01e-22 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 107.72 E-value: 1.01e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1866 RRLLEEQA--AQHKADIEARLAQLRKASE---------SELERQKglveDTLRQRRQVEEEILALKGSFEKAaagkaELE 1934
Cdd:COG1196 158 RAIIEEAAgiSKYKERKEEAERKLEATEEnlerledilGELERQL----EPLERQAEKAERYRELKEELKEL-----EAE 228
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1935 LELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLR 2014
Cdd:COG1196 229 LLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLE 308
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2015 ERAEQESARQLQLAQEAAQKRLQAEEKAhafavqqkeqelqqtlqqeqsvlERLRSEAEAARRAAEEAEAARERAEREAA 2094
Cdd:COG1196 309 ERRRELEERLEELEEELAELEEELEELE-----------------------EELEELEEELEEAEEELEEAEAELAEAEE 365
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2095 QSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVE 2174
Cdd:COG1196 366 ALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALE 445
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2175 QELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAenrALVLRDKDSAQRL 2254
Cdd:COG1196 446 EAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEG---VKAALLLAGLRGL 522
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2255 LQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARR 2334
Cdd:COG1196 523 AGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVD 602
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2335 LQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQ 2414
Cdd:COG1196 603 LVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELE 682
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2415 EKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQR 2494
Cdd:COG1196 683 ELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDL 762
|
650
....*....|...
gi 1920237968 2495 ERcIEQEKAKLEQ 2507
Cdd:COG1196 763 EE-LERELERLER 774
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1674-2444 |
1.62e-22 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 107.45 E-value: 1.62e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1674 RQRELAEQELEKQRQLAEgtaqqrlaAEQELIRLRAETeqgeqqrqlLEEELARLQREAAAATQKRRELEAELAKVRAEM 1753
Cdd:TIGR02168 207 RQAEKAERYKELKAELRE--------LELALLVLRLEE---------LREELEELQEELKEAEEELEELTAELQELEEKL 269
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1754 EVLLASKARAEEEsrstsekskqrLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISE 1833
Cdd:TIGR02168 270 EELRLEVSELEEE-----------IEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEE 338
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1834 ATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQH---KADIEARLAQLRKASeSELERQKGLVEDTLRQRR 1910
Cdd:TIGR02168 339 LAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLetlRSKVAQLELQIASLN-NEIERLEARLERLEDRRE 417
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1911 QVEEEILALKGSFEKAAagKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRrreaeervQKSLAAEEEAA 1990
Cdd:TIGR02168 418 RLQQEIEELLKKLEEAE--LKELQAELEELEEELEELQEELERLEEALEELREELEEAEQA--------LDAAERELAQL 487
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1991 RQRKAALEEV-ERLKAKVEEARRLRERAEQ------------ESARQLQLAQEAA-QKRLQA------EEKAHAFAVQQK 2050
Cdd:TIGR02168 488 QARLDSLERLqENLEGFSEGVKALLKNQSGlsgilgvlseliSVDEGYEAAIEAAlGGRLQAvvvenlNAAKKAIAFLKQ 567
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2051 EQELQQTLQQEQSVLER-LRSEAEAARRAAEEAEAARERAEREAAQSRRQVEE-------AERLKQSAEEQAQAQAQAQA 2122
Cdd:TIGR02168 568 NELGRVTFLPLDSIKGTeIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYllggvlvVDDLDNALELAKKLRPGYRI 647
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2123 AAEKLRKEAEQEAARRAQAEQAALRQKQaaDAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLK 2202
Cdd:TIGR02168 648 VTLDGDLVRPGGVITGGSAKTNSSILER--RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELS 725
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2203 AEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAEnRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQ----- 2277
Cdd:TIGR02168 726 RQISALRKDLARLEAEVEQLEERIAQLSKELTELEAE-IEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKalrea 804
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2278 ------------EAARLRQLAEEDLAQQRALAEKML----KEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQ 2341
Cdd:TIGR02168 805 ldelraeltllnEEAANLRERLESLERRIAATERRLedleEQIEELSEDIESLAAEIEELEELIEELESELEALLNERAS 884
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2342 MAQQLAQ---ETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDigerLYRTELATQEKvm 2418
Cdd:TIGR02168 885 LEEALALlrsELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSE----EYSLTLEEAEA-- 958
|
810 820
....*....|....*....|....*.
gi 1920237968 2419 LVQTLETQRQQSDRDAERLREAIAEL 2444
Cdd:TIGR02168 959 LENKIEDDEEEARRRLKRLENKIKEL 984
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1327-2043 |
1.66e-22 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 107.60 E-value: 1.66e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1327 SETLRRMEEEERL---AEQQRAE---ERERLaEVEAALEKQRQLAEAHAQAK---AQAEREAQGLQRRMQEEVAR-REEV 1396
Cdd:NF041483 433 AKTVELQEEARRLrgeAEQLRAEavaEGERI-RGEARREAVQQIEEAARTAEellTKAKADADELRSTATAESERvRTEA 511
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1397 AVEAQEQKRSIQEELQHLRqsSEAEiQAKARQVEAAERSRLRIEEEIRVVRLQLE-ATERQRGGAEGELQALRARAEE-- 1473
Cdd:NF041483 512 IERATTLRRQAEETLERTR--AEAE-RLRAEAEEQAEEVRAAAERAARELREETErAIAARQAEAAEELTRLHTEAEErl 588
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1474 --AEAQKRQAQEEAERLRRQVQDETQRKRQAEAE--LALRVQAEAEAAREKQRALQALEELRLQAEEAERRLR-QAEAER 1548
Cdd:NF041483 589 taAEEALADARAEAERIRREAAEETERLRTEAAEriRTLQAQAEQEAERLRTEAAADASAARAEGENVAVRLRsEAAAEA 668
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1549 ARQVQVALETAQRsaeaeLQSEHASFAEKTAQlertlkeehvavvqlreeatrraqqqaeaeraraeaerelerwqlKAN 1628
Cdd:NF041483 669 ERLKSEAQESADR-----VRAEAAAAAERVGT---------------------------------------------EAA 698
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1629 EALrlrlqaeevaqqksltqaeaekqkeeaerearrrGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELI--- 1705
Cdd:NF041483 699 EAL----------------------------------AAAQEEAARRRREAEETLGSARAEADQERERAREQSEELLasa 744
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1706 RLRAETEQGEQQRqLLEEELARLQREAAAATQKRRELEAELAKV--RAEMEV--LLASKARAEEESRSTSEKSKQRLEAE 1781
Cdd:NF041483 745 RKRVEEAQAEAQR-LVEEADRRATELVSAAEQTAQQVRDSVAGLqeQAEEEIagLRSAAEHAAERTRTEAQEEADRVRSD 823
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1782 AGRFRELA-EEAARLRALA-EEAKRQRQLAEEDAVRQRAEAERVLAEklaAISEATRLKTEAEIALKEKEAENERLRRLA 1859
Cdd:NF041483 824 AYAERERAsEDANRLRREAqEETEAAKALAERTVSEAIAEAERLRSD---ASEYAQRVRTEASDTLASAEQDAARTRADA 900
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1860 EDEAFQRRllEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAElelelgR 1939
Cdd:NF041483 901 REDANRIR--SDAAAQADRLIGEATSEAERLTAEARAEAERLRDEARAEAERVRADAAAQAEQLIAEATGEAE------R 972
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1940 IRGTAEDTLRSkeqAEQEAARQRQLAAEEERRrreaeervqkslaAEEEAARQRKAALEEVERL--KAKVEEARRLRERA 2017
Cdd:NF041483 973 LRAEAAETVGS---AQQHAERIRTEAERVKAE-------------AAAEAERLRTEAREEADRTldEARKDANKRRSEAA 1036
|
730 740
....*....|....*....|....*.
gi 1920237968 2018 EQESARQLQLAQEAAQKRLQAEEKAH 2043
Cdd:NF041483 1037 EQADTLITEAAAEADQLTAKAQEEAL 1062
|
|
| CH_CTX_rpt2 |
cd21226 |
second calponin homology (CH) domain found in cortexillin; Cortexillins are actin-bundling ... |
167-267 |
1.69e-22 |
|
second calponin homology (CH) domain found in cortexillin; Cortexillins are actin-bundling proteins that play a critical role in regulating cell morphology and actin cytoskeleton reorganization. They play a major role in cytokinesis and contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409075 Cd Length: 103 Bit Score: 94.84 E-value: 1.69e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 167 EKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPEDV 246
Cdd:cd21226 3 DGLLAWCRQTTEGYDGVNITSFKSSFNDGRAFLALLHAYDPELFKQAAIEQMDAEARLNLAFDFAEKKLGIPKLLEAEDV 82
|
90 100
....*....|....*....|.
gi 1920237968 247 DVPQPDEKSIITYVSSLYDAM 267
Cdd:cd21226 83 MTGNPDERSIVLYTSLFYHAF 103
|
|
| CH_FLN-like_rpt2 |
cd21184 |
second calponin homology (CH) domain found in the filamin family; The filamin family includes ... |
164-262 |
2.08e-22 |
|
second calponin homology (CH) domain found in the filamin family; The filamin family includes filamin-A (FLN-A), filamin-B (FLN-B) and filamin-C (FLN-C). Filamins function to anchor various transmembrane proteins to the actin cytoskeleton. FLN-A is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-B is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton and may also promote orthogonal branching of actin filaments as well as link actin filaments to membrane glycoproteins. FLN-C, also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. This family also includes Drosophila melanogaster protein jitterbug (Jbug), which is an actin-meshwork organizing protein containing three copies of the CH domain. Other members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409033 Cd Length: 103 Bit Score: 94.61 E-value: 2.08e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 164 TAKEKLLLWSQRMVEGCqglRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVY-RQTNLENLDQAFSVAERDLGVTRLLD 242
Cdd:cd21184 1 SGKSLLLEWVNSKIPEY---KVKNFTTDWNDGKALAALVDALKPGLIPDNESLdKENPLENATKAMDIAEEELGIPKIIT 77
|
90 100
....*....|....*....|
gi 1920237968 243 PEDVDVPQPDEKSIITYVSS 262
Cdd:cd21184 78 PEDMVSPNVDELSVMTYLSY 97
|
|
| CH_EHBP |
cd21198 |
calponin homology (CH) domain found in the EH domain-binding protein (EHBP) family; The EHBP ... |
164-264 |
2.98e-22 |
|
calponin homology (CH) domain found in the EH domain-binding protein (EHBP) family; The EHBP family includes EHBP1 and EHBP1-like protein (EHBP1L1). EHBP1 is a regulator of endocytic recycling and may play a role in actin reorganization by linking clathrin-mediated endocytosis to the actin cytoskeleton. It may act as an effector of small GTPases, including RAB-10 (Rab10), and play a role in vesicle trafficking. EHBP1 is associated with aggressive prostate cancer and insulin-stimulated trafficking and cell migration. EHBP1L1 may also act as Rab effector protein and play a role in vesicle trafficking. It coordinates Rab8 and Bin1 to regulate apical-directed transport in polarized epithelial cells. Members of this family contain a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409047 Cd Length: 105 Bit Score: 94.41 E-value: 2.98e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 164 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLLDP 243
Cdd:cd21198 1 SSGQDLLEWCQEVTKGYRGVKITNLTTSWRNGLAFCAILHHFRPDLIDFSSLSPHDIKENCKLAFDAAAK-LGIPRLLDP 79
|
90 100
....*....|....*....|..
gi 1920237968 244 EDVDVPQ-PDEKSIITYVSSLY 264
Cdd:cd21198 80 ADMVLLSvPDKLSVMTYLHQIR 101
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1981-2596 |
3.92e-22 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 105.79 E-value: 3.92e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1981 KSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQ 2060
Cdd:COG1196 203 EPLERQAEKAERYRELKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELE 282
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2061 EQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQ 2140
Cdd:COG1196 283 LEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAE 362
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2141 AEQAALRQKQAADAEMEKHKQFAEQALRQkaqvEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELF 2220
Cdd:COG1196 363 AEEALLEAEAELAEAEEELEELAEELLEA----LRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEE 438
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2221 SLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKM 2300
Cdd:COG1196 439 EEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAG 518
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2301 LKEKMQAVQEATRLKAEAELLQQQKELA--QEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRV 2378
Cdd:COG1196 519 LRGLAGAVAVLIGVEAAYEAALEAALAAalQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIG 598
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2379 AEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLL 2458
Cdd:COG1196 599 AAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAE 678
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2459 QLKSEEMQTVRQEQLLQETQALQQsflsekdsLLQRERCIEQEKAKLEQlfqdevakaqalreeqqrqqqqmqqekqqlA 2538
Cdd:COG1196 679 AELEELAERLAEEELELEEALLAE--------EEEERELAEAEEERLEE------------------------------E 720
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237968 2539 ASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRA 2596
Cdd:COG1196 721 LEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLEREIEA 778
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1778-2593 |
4.46e-22 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 105.91 E-value: 4.46e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1778 LEAEAGRFRELAEEAA---RLRALAEEAKRQRQLAEEDAVR------------QRAEAERVLAEKLAAISEATRlKTEAE 1842
Cdd:TIGR02168 150 IEAKPEERRAIFEEAAgisKYKERRKETERKLERTRENLDRledilnelerqlKSLERQAEKAERYKELKAELR-ELELA 228
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1843 IALKEKEAENERLRRLAEDEAFQRRLLEEQAAQhKADIEARLAQLRKAS---ESELERQKGLVEDTLRQRRQVEEEILAL 1919
Cdd:TIGR02168 229 LLVLRLEELREELEELQEELKEAEEELEELTAE-LQELEEKLEELRLEVselEEEIEELQKELYALANEISRLEQQKQIL 307
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1920 KGSFEKAAAGKAELELELgrirgtaEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEE 1999
Cdd:TIGR02168 308 RERLANLERQLEELEAQL-------EELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQ 380
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2000 VERLKAKVEEARR--LRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQ-QKEQELQQTLQQEQSVLERLRSEAEAAR 2076
Cdd:TIGR02168 381 LETLRSKVAQLELqiASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEeAELKELQAELEELEEELEELQEELERLE 460
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2077 RAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQaeqaaLRQKQAADAEM 2156
Cdd:TIGR02168 461 EALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGV-----LSELISVDEGY 535
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2157 EKHKQFAEQALRQKAQVEQELTALRLQleetDHQKsilDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARI 2236
Cdd:TIGR02168 536 EAAIEAALGGRLQAVVVENLNAAKKAI----AFLK---QNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLV 608
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2237 EAENRA-----------LVLRDKDSAQRLLQEEAEKMKQVAEEAARLS---VAAQEAARLRQLAeedLAQQRALAEkmLK 2302
Cdd:TIGR02168 609 KFDPKLrkalsyllggvLVVDDLDNALELAKKLRPGYRIVTLDGDLVRpggVITGGSAKTNSSI---LERRREIEE--LE 683
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2303 EKMQAVQEAtrlkaEAELLQQQKELAQEQarrlqedkeqmaQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMS 2382
Cdd:TIGR02168 684 EKIEELEEK-----IAELEKALAELRKEL------------EELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLE 746
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2383 RAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKS 2462
Cdd:TIGR02168 747 ERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERL 826
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2463 EEMQTvRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLfQDEVAKAQALREEQQRQQQQMQQEKQQLAASME 2542
Cdd:TIGR02168 827 ESLER-RIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEEL-ESELEALLNERASLEEALALLRSELEELSEELR 904
|
810 820 830 840 850
....*....|....*....|....*....|....*....|....*....|.
gi 1920237968 2543 EARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEE 2593
Cdd:TIGR02168 905 ELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEE 955
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1005-1569 |
5.59e-22 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 105.40 E-value: 5.59e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1005 ARECAQRITEQQKAQAEVDGLGKGVARLSAEAEKvlalpepspaaptLRSELELTLGKLEQVRSLSAIYLEKLKTISLVI 1084
Cdd:COG1196 252 EAELEELEAELAELEAELEELRLELEELELELEE-------------AQAEEYELLAELARLEQDIARLEERRRELEERL 318
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1085 RSTQEAEEVLRAHEEQLKEAQAvpATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVE 1164
Cdd:COG1196 319 EELEEELAELEEELEELEEELE--ELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAA 396
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1165 RWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALL 1244
Cdd:COG1196 397 ELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLE 476
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1245 EDIERhgekveecqrfAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLtsqyir 1324
Cdd:COG1196 477 AALAE-----------LLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAA------ 539
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1325 fisetlrrMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEA----HAQAKAQAEREAQGLQRRMQEEVARREEVAVEA 1400
Cdd:COG1196 540 --------LEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRAtflpLDKIRARAALAAALARGAIGAAVDLVASDLREA 611
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1401 QEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQ 1480
Cdd:COG1196 612 DARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEE 691
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1481 AQEEAERLRRQvqdETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVA-LETA 1559
Cdd:COG1196 692 ELELEEALLAE---EEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEeLERE 768
|
570
....*....|
gi 1920237968 1560 QRSAEAELQS 1569
Cdd:COG1196 769 LERLEREIEA 778
|
|
| CH_CLMN_rpt2 |
cd21245 |
second calponin homology (CH) domain found in calmin and similar proteins; Calmin, also called ... |
164-268 |
7.13e-22 |
|
second calponin homology (CH) domain found in calmin and similar proteins; Calmin, also called calponin-like transmembrane domain protein, is a protein with calponin homology (CH) and transmembrane domains expressed in maturing spermatogenic cells. It may be involved in the development and/or maintenance of neuronal functions. Calmin contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409094 Cd Length: 106 Bit Score: 93.32 E-value: 7.13e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 164 TAKEKLLLWSQRMVEGcQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 243
Cdd:cd21245 3 KAIKALLNWVQRRTRK-YGVAVQDFGSSWRSGLAFLALIKAIDPSLVDMRQALEKSPRENLEDAFRIAQESLGIPPLLEP 81
|
90 100
....*....|....*....|....*
gi 1920237968 244 EDVDVPQPDEKSIITYVSSLYDAMP 268
Cdd:cd21245 82 EDVMVDSPDEQSIMTYVAQFLEHFP 106
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1474-2608 |
4.73e-21 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 102.60 E-value: 4.73e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1474 AEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAE--AEAAREKQRALQALEELRLQAE-EAERRLRQAEAERAR 1550
Cdd:NF041483 81 AQIQADQLRADAERELRDARAQTQRILQEHAEHQARLQAElhTEAVQRRQQLDQELAERRQTVEsHVNENVAWAEQLRAR 160
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1551 QVQVA---LETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVvqlREEATRRAQQQAEAERARAEAERELERWQLKA 1627
Cdd:NF041483 161 TESQArrlLDESRAEAEQALAAARAEAERLAEEARQRLGSEAESA---RAEAEAILRRARKDAERLLNAASTQAQEATDH 237
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1628 NEALRLRLQAE-EVAQQKSltqaeaekqkeeaereaRRRGKAEEQAVRQrelAEQELEKQRQLAEGTAQQrlAAEQELIR 1706
Cdd:NF041483 238 AEQLRSSTAAEsDQARRQA-----------------AELSRAAEQRMQE---AEEALREARAEAEKVVAE--AKEAAAKQ 295
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1707 LRAETEQGEQQRQLLEEELARLQREAAA-ATQKRRELEAELAKVRAEMEVLLAS---KARAEEESRSTSEKSKQRLEAEA 1782
Cdd:NF041483 296 LASAESANEQRTRTAKEEIARLVGEATKeAEALKAEAEQALADARAEAEKLVAEaaeKARTVAAEDTAAQLAKAARTAEE 375
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1783 GRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAI---------------SEATRLKTEAEIALKE 1847
Cdd:NF041483 376 VLTKASEDAKATTRAAAEEAERIRREAEAEADRLRGEAADQAEQLKGAAkddtkeyraktvelqEEARRLRGEAEQLRAE 455
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1848 KEAENERLRRLAEDEAFQR-----RLLEEQAAQHKADIEarlaQLRKASESELERQKG-LVEDTLRQRRQVEEeilALKG 1921
Cdd:NF041483 456 AVAEGERIRGEARREAVQQieeaaRTAEELLTKAKADAD----ELRSTATAESERVRTeAIERATTLRRQAEE---TLER 528
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1922 SFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQ------LAAEEERRRREAEERVQKSLAAEEEAARQRKA 1995
Cdd:NF041483 529 TRAEAERLRAEAEEQAEEVRAAAERAARELREETERAIAARQaeaaeeLTRLHTEAEERLTAAEEALADARAEAERIRRE 608
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1996 ALEEVERLKAKV-EEARRLRERAEQESARqlqLAQEAAQKRLQAEEKAHAFAVqqkeqelqqtlqqeqsvleRLRSEAEa 2074
Cdd:NF041483 609 AAEETERLRTEAaERIRTLQAQAEQEAER---LRTEAAADASAARAEGENVAV-------------------RLRSEAA- 665
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2075 arraaeeaeaareraereaaqsrrqvEEAERLKqsaeeqaqaqAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQaada 2154
Cdd:NF041483 666 --------------------------AEAERLK----------SEAQESADRVRAEAAAAAERVGTEAAEALAAAQ---- 705
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2155 emekhkqfaEQALRQKAQVEQELTALRLQLEEtdhqksildeelqrlkaevtEAARQRGQVEEELFSLRVQMEELGKLKA 2234
Cdd:NF041483 706 ---------EEAARRRREAEETLGSARAEADQ--------------------ERERAREQSEELLASARKRVEEAQAEAQ 756
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2235 RI--EAENRA--LVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAA-RLRQLAEEDLAQQRALAekmLKEKMQAVQ 2309
Cdd:NF041483 757 RLveEADRRAteLVSAAEQTAQQVRDSVAGLQEQAEEEIAGLRSAAEHAAeRTRTEAQEEADRVRSDA---YAERERASE 833
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2310 EATRLKAEAellQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLemsAEAERlrlrvaEMSRAQARAE 2389
Cdd:NF041483 834 DANRLRREA---QEETEAAKALAERTVSEAIAEAERLRSDASEYAQRVRTEASDTL---ASAEQ------DAARTRADAR 901
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2390 EDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREaiaelehEKDKLKQEAQLLQLKSEEMQTVR 2469
Cdd:NF041483 902 EDANRIRSDAAAQADRLIGEATSEAERLTAEARAEAERLRDEARAEAERV-------RADAAAQAEQLIAEATGEAERLR 974
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2470 QEQLLQETQALQQSflsekdsllqrerciEQEKAKLEQLFQDEVAKAQALReeqqrqqqqmqqekqqlAASMEEARRRQH 2549
Cdd:NF041483 975 AEAAETVGSAQQHA---------------ERIRTEAERVKAEAAAEAERLR-----------------TEAREEADRTLD 1022
|
1130 1140 1150 1160 1170
....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 2550 EAeegvrrQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEEIAPSR 2608
Cdd:NF041483 1023 EA------RKDANKRRSEAAEQADTLITEAAAEADQLTAKAQEEALRTTTEAEAQADTM 1075
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1086-1890 |
9.07e-21 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 101.67 E-value: 9.07e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1086 STQEAEEVLRAHEEQLKEAQAvpatlpELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVER 1165
Cdd:TIGR02168 247 ELKEAEEELEELTAELQELEE------KLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEE 320
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1166 WRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLE 1245
Cdd:TIGR02168 321 LEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNN 400
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1246 DIERhgekveecqrfAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLTSQyirf 1325
Cdd:TIGR02168 401 EIER-----------LEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEE---- 465
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1326 ISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQE-EVARREEVAVEAQ--- 1401
Cdd:TIGR02168 466 LREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSELiSVDEGYEAAIEAAlgg 545
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1402 -------EQKRSIQEELQHLRQSSE------AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALR 1468
Cdd:TIGR02168 546 rlqavvvENLNAAKKAIAFLKQNELgrvtflPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGV 625
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1469 ARAEEAEAQKRQAQEEAERLR-------------------RQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEE 1529
Cdd:TIGR02168 626 LVVDDLDNALELAKKLRPGYRivtldgdlvrpggvitggsAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRK 705
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1530 LRLQAEEAERRLRQAEAERARQVqvaleTAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQqqaea 1609
Cdd:TIGR02168 706 ELEELEEELEQLRKELEELSRQI-----SALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEE----- 775
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1610 eraraeaerelerwQLKANEALRLRLQAeEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELaEQELEKQRQL 1689
Cdd:TIGR02168 776 --------------ELAEAEAEIEELEA-QIEQLKEELKALREALDELRAELTLLNEEAANLRERLESL-ERRIAATERR 839
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1690 AEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAE---EE 1766
Cdd:TIGR02168 840 LEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRrelEE 919
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1767 SRSTSEKSKQRLEAEAGRFRELAEE-AARLRALAEEAKRQRQLAEEDAVRQRAEAERvLAEKLAAISEATRLkteaeiAL 1845
Cdd:TIGR02168 920 LREKLAQLELRLEGLEVRIDNLQERlSEEYSLTLEEAEALENKIEDDEEEARRRLKR-LENKIKELGPVNLA------AI 992
|
810 820 830 840
....*....|....*....|....*....|....*....|....*
gi 1920237968 1846 KEKEAENERLRRLaedeafqrrlleeqaAQHKADIEARLAQLRKA 1890
Cdd:TIGR02168 993 EEYEELKERYDFL---------------TAQKEDLTEAKETLEEA 1022
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1987-2606 |
2.68e-20 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 100.01 E-value: 2.68e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1987 EEAARQRKAALEEVERLKAKVEEARR----LRERAEQ-ESARQLQlaqeAAQKRLQAEEKAHAFAVQQKEqelqqtlqqe 2061
Cdd:COG1196 175 EEAERKLEATEENLERLEDILGELERqlepLERQAEKaERYRELK----EELKELEAELLLLKLRELEAE---------- 240
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2062 qsvLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQsaeeqaqaqaqaqAAAEKLRKEAEQEAARRAQA 2141
Cdd:COG1196 241 ---LEELEAELEELEAELEELEAELAELEAELEELRLELEELELELE-------------EAQAEEYELLAELARLEQDI 304
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2142 EQAALRQKQAADAEMEKHKQfAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFS 2221
Cdd:COG1196 305 ARLEERRRELEERLEELEEE-LAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEE 383
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2222 LRVQMEELgklkARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKML 2301
Cdd:COG1196 384 LAEELLEA----LRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEE 459
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2302 KEKmQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAER---LRLRV 2378
Cdd:COG1196 460 ALL-ELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIgveAAYEA 538
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2379 AEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLL 2458
Cdd:COG1196 539 ALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVL 618
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2459 Q--LKSEEMQTVRQEQLLQETQALQQSFLS---EKDSLLQRERCIEQEKAKLEQlfqdEVAKAQALREEQQRQQQQMQQE 2533
Cdd:COG1196 619 GdtLLGRTLVAARLEAALRRAVTLAGRLREvtlEGEGGSAGGSLTGGSRRELLA----ALLEAEAELEELAERLAEEELE 694
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237968 2534 KQQLAASMEEARRRQHEAEEGVRRQQEELQRLAqqqqqqEKLLAEENQRLRERLQHLEEERRAALARSEEIAP 2606
Cdd:COG1196 695 LEEALLAEEEEERELAEAEEERLEEELEEEALE------EQLEAEREELLEELLEEEELLEEEALEELPEPPD 761
|
|
| CH_FLNB_rpt1 |
cd21309 |
first calponin homology (CH) domain found in filamin-B (FLN-B) and similar proteins; Filamin-B ... |
42-152 |
4.89e-20 |
|
first calponin homology (CH) domain found in filamin-B (FLN-B) and similar proteins; Filamin-B (FLN-B) is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton. It may promote orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It anchors various transmembrane proteins to the actin cytoskeleton. FLN-B contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409158 Cd Length: 131 Bit Score: 88.98 E-value: 4.89e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 42 RVQKKTFTKWVNKHLikhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR---EKGRMRFHKLQNVQIALDYLRHR 118
Cdd:cd21309 16 KIQQNTFTRWCNEHL-----KCVNKRIGNLQTDLSDGLRLIALLEVLSQKRMYRkyhQRPTFRQMQLENVSVALEFLDRE 90
|
90 100 110
....*....|....*....|....*....|....
gi 1920237968 119 QVKLVNIRNDDIADGNPKLTLGLIWTIILHFQIS 152
Cdd:cd21309 91 SIKLVSIDSKAIVDGNLKLILGLVWTLILHYSIS 124
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1088-1888 |
5.66e-20 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 98.98 E-value: 5.66e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1088 QEAEEVLRAHEEQLKEAQAVpatlpeLEATKAALKKLRAQAEAQQPvFDALRDELRgaqevgerlqqrHGERDVEVERWR 1167
Cdd:TIGR02168 175 KETERKLERTRENLDRLEDI------LNELERQLKSLERQAEKAER-YKELKAELR------------ELELALLVLRLE 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1168 ErvtlLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVRE---QLRQEKALL 1244
Cdd:TIGR02168 236 E----LREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRleqQKQILRERL 311
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1245 EDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPV-----ASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLT 1319
Cdd:TIGR02168 312 ANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEElesleAELEELEAELEELESRLEELEEQLETLRSKVAQL 391
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1320 SQYIRFISETLRRMEEE-ERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAV 1398
Cdd:TIGR02168 392 ELQIASLNNEIERLEARlERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELE 471
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1399 EAQEQKRSIQEELQHLR------QSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAE----GELQALR 1468
Cdd:TIGR02168 472 EAEQALDAAERELAQLQarldslERLQENLEGFSEGVKALLKNQSGLSGILGVLSELISVDEGYEAAIEaalgGRLQAVV 551
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1469 ARAEEAEAQKRQAQEEAERLRR----------QVQDETQRKRQAEAELALRVQAEAEAAREK-QRALQALEELRLQAEEA 1537
Cdd:TIGR02168 552 VENLNAAKKAIAFLKQNELGRVtflpldsikgTEIQGNDREILKNIEGFLGVAKDLVKFDPKlRKALSYLLGGVLVVDDL 631
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1538 ERRLRQAEAERARQVQVALE---------TAQRSAEAEL-----QSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRA 1603
Cdd:TIGR02168 632 DNALELAKKLRPGYRIVTLDgdlvrpggvITGGSAKTNSsilerRREIEELEEKIEELEEKIAELEKALAELRKELEELE 711
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1604 QQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQkeeaerearrrgkaeeqavrqRELAEQEL 1683
Cdd:TIGR02168 712 EELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAE---------------------IEELEERL 770
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1684 EKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARA 1763
Cdd:TIGR02168 771 EEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEEL 850
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1764 EEEsRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEI 1843
Cdd:TIGR02168 851 SED-IESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL 929
|
810 820 830 840
....*....|....*....|....*....|....*....|....*.
gi 1920237968 1844 ALKEKEAENERLR-RLAEDEafqrRLLEEQAAQHKADIEARLAQLR 1888
Cdd:TIGR02168 930 RLEGLEVRIDNLQeRLSEEY----SLTLEEAEALENKIEDDEEEAR 971
|
|
| CH_FLNA_rpt1 |
cd21308 |
first calponin homology (CH) domain found in filamin-A (FLN-A) and similar proteins; Filamin-A ... |
42-152 |
6.77e-20 |
|
first calponin homology (CH) domain found in filamin-A (FLN-A) and similar proteins; Filamin-A (FLN-A) is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also anchors various transmembrane proteins to the actin cytoskeleton and serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-A contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409157 Cd Length: 129 Bit Score: 88.60 E-value: 6.77e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 42 RVQKKTFTKWVNKHLikhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR---EKGRMRFHKLQNVQIALDYLRHR 118
Cdd:cd21308 19 KIQQNTFTRWCNEHL-----KCVSKRIANLQTDLSDGLRLIALLEVLSQKKMHRkhnQRPTFRQMQLENVSVALEFLDRE 93
|
90 100 110
....*....|....*....|....*....|....
gi 1920237968 119 QVKLVNIRNDDIADGNPKLTLGLIWTIILHFQIS 152
Cdd:cd21308 94 SIKLVSIDSKAIVDGNLKLILGLIWTLILHYSIS 127
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1334-2070 |
1.33e-19 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 97.97 E-value: 1.33e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1334 EEEERLAEQQRAEERERLAEVEAALEKQRQLAEahaQAKAQAEREAQGLQRRMQEEVARreeVAVEAQEQKRSIQEELQh 1413
Cdd:NF041483 560 EETERAIAARQAEAAEELTRLHTEAEERLTAAE---EALADARAEAERIRREAAEETER---LRTEAAERIRTLQAQAE- 632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1414 lrqsSEAEiqaKARQVEAAERSRLRIEEEIRVVRLQLEA-TERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQV 1492
Cdd:NF041483 633 ----QEAE---RLRTEAAADASAARAEGENVAVRLRSEAaAEAERLKSEAQESADRVRAEAAAAAERVGTEAAEALAAAQ 705
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1493 QDETQRKRQAEAELAlrvQAEAEAAREKQRALQALEELrlqAEEAERRLRQAEAERARQVqvalETAQRSAeaelqSEHA 1572
Cdd:NF041483 706 EEAARRRREAEETLG---SARAEADQERERAREQSEEL---LASARKRVEEAQAEAQRLV----EEADRRA-----TELV 770
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1573 SFAEKTAQLERTlkeehvAVVQLREEATRraqqqaeaeraraeaerelerwqlkanEALRLRLQAEEVAQQksLTQAEAE 1652
Cdd:NF041483 771 SAAEQTAQQVRD------SVAGLQEQAEE---------------------------EIAGLRSAAEHAAER--TRTEAQE 815
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1653 KQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQrlaAEQELIRLRAETEQGEQQ--------------- 1717
Cdd:NF041483 816 EADRVRSDAYAERERASEDANRLRREAQEETEAAKALAERTVSE---AIAEAERLRSDASEYAQRvrteasdtlasaeqd 892
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1718 ----RQLLEEELARLQREAAA-----ATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfrel 1788
Cdd:NF041483 893 aartRADAREDANRIRSDAAAqadrlIGEATSEAERLTAEARAEAERLRDEARAEAERVRADAAAQAEQLIAEA------ 966
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1789 AEEAARLRALAEEAKRQrqlAEEDAVRQRAEAERVLAEklaAISEATRLKTEAEialkekeAENERLRRLAEDEAFQRR- 1867
Cdd:NF041483 967 TGEAERLRAEAAETVGS---AQQHAERIRTEAERVKAE---AAAEAERLRTEAR-------EEADRTLDEARKDANKRRs 1033
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1868 ---------LLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELG 1938
Cdd:NF041483 1034 eaaeqadtlITEAAAEADQLTAKAQEEALRTTTEAEAQADTMVGAARKEAERIVAEATVEGNSLVEKARTDADELLVGAR 1113
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1939 R----IRGTAEDtLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLK--------AK 2006
Cdd:NF041483 1114 RdataIRERAEE-LRDRITGEIEELHERARRESAEQMKSAGERCDALVKAAEEQLAEAEAKAKELVSDANseaskvriAA 1192
|
730 740 750 760 770 780 790
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2007 VEEARRLRERAEQESARQLQLAQ------EAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRS 2070
Cdd:NF041483 1193 VKKAEGLLKEAEQKKAELVREAEkikaeaEAEAKRTVEEGKRELDVLVRRREDINAEISRVQDVLEALES 1262
|
|
| CH_MICAL2_3-like |
cd21195 |
calponin homology (CH) domain found in molecule interacting with CasL protein 2 (MICAL-2), ... |
168-265 |
1.75e-19 |
|
calponin homology (CH) domain found in molecule interacting with CasL protein 2 (MICAL-2), MICAL-3, and similar proteins; Molecule interacting with CasL protein (MICAL) is a large, multidomain, cytosolic protein with a single LIM domain, a calponin homology (CH) domain and a flavoprotein monooxygenase (MO) domain. In Drosophila, MICAL is expressed in axons, interacts with the neuronal A (PlexA) receptor and is required for Semaphorin 1a (Sema-1a)-PlexA-mediated repulsive axon guidance. The LIM and CH domains mediate interactions with the cytoskeleton, cytoskeletal adaptor proteins, and other signaling proteins. The flavoprotein MO is required for semaphorin-plexin repulsive axon guidance during axonal pathfinding in the Drosophila neuromuscular system. In addition, MICAL functions to interact with Rab13 and Rab8 to coordinate the assembly of tight junctions and adherens junctions in epithelial cells. Thus, MICAL is also called junctional Rab13-binding protein (JRAB). Members of this family, which includes MICAL-2, MICAL-3, and similar proteins, contain one CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409044 [Multi-domain] Cd Length: 110 Bit Score: 86.63 E-value: 1.75e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 168 KLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD-PEDV 246
Cdd:cd21195 8 KLLTWCQQQTEGYQHVNVTDLTTSWRSGLALCAIIHRFRPELINFDSLNEDDAVENNQLAFDVAEREFGIPPVTTgKEMA 87
|
90
....*....|....*....
gi 1920237968 247 DVPQPDEKSIITYVSSLYD 265
Cdd:cd21195 88 SAQEPDKLSMVMYLSKFYE 106
|
|
| CH_EHBP1L1 |
cd21255 |
calponin homology (CH) domain found in EH domain-binding protein 1-like protein 1 and similar ... |
164-263 |
1.51e-18 |
|
calponin homology (CH) domain found in EH domain-binding protein 1-like protein 1 and similar proteins; EHBP1L1 may act as Rab effector protein and play a role in vesicle trafficking. It coordinates Rab8 and Bin1 to regulate apical-directed transport in polarized epithelial cells. Members of this subfamily contain a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409104 Cd Length: 105 Bit Score: 83.68 E-value: 1.51e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 164 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLLDP 243
Cdd:cd21255 1 SSSQSLLEWCQEVTAGYRGVRVTNFTTSWRNGLAFCAILHHFHPDLVDYESLDPLDIKENNKKAFEAFAS-LGVPRLLEP 79
|
90 100
....*....|....*....|.
gi 1920237968 244 ED-VDVPQPDEKSIITYVSSL 263
Cdd:cd21255 80 ADmVLLPIPDKLIVMTYLCQL 100
|
|
| SH3_10 |
pfam17902 |
SH3 domain; This entry represents an SH3 domain. |
782-848 |
3.39e-18 |
|
SH3 domain; This entry represents an SH3 domain.
Pssm-ID: 407754 Cd Length: 65 Bit Score: 81.15 E-value: 3.39e-18
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237968 782 QLKPRSpaHPMRGRVPLLAVCDYKQVEVTVHKGDECQMVGPAQPFYWKVLGSSCSEAAMPSVCFLVP 848
Cdd:pfam17902 1 PLKQRR--SPVTRPIPVKALCDYKQGEVTVEKGEECTLLDNSDREKWKVQTSSGVEKLVPSVCFLIP 65
|
|
| CH_MICAL3 |
cd21251 |
calponin homology (CH) domain found in molecule interacting with CasL protein 3; MICAL-3 is a ... |
160-265 |
3.91e-18 |
|
calponin homology (CH) domain found in molecule interacting with CasL protein 3; MICAL-3 is a [F-actin]-monooxygenase that promotes depolymerization of F-actin by mediating oxidation of specific methionine residues on actin to form methionine-sulfoxide, resulting in actin filament disassembly and preventing repolymerization. In the absence of actin, it also functions as a NADPH oxidase producing H(2)O(2). MICAL-3 seems to act as a Rab effector protein and plays a role in vesicle trafficking. It is involved in exocytic vesicle tethering and fusion. MICAL3 contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409100 [Multi-domain] Cd Length: 111 Bit Score: 82.69 E-value: 3.91e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 160 SEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTR 239
Cdd:cd21251 1 NESVARSSKLLGWCQRQTEGYAGVNVTDLTMSWKSGLALCAIIHRYRPDLIDFDSLDEQDVEKNNQLAFDIAEKEFGISP 80
|
90 100
....*....|....*....|....*..
gi 1920237968 240 LLDPEDV-DVPQPDEKSIITYVSSLYD 265
Cdd:cd21251 81 IMTGKEMaSVGEPDKLSMVMYLTQFYE 107
|
|
| CH_SMTN-like |
cd21200 |
calponin homology (CH) domain found in the smoothelin family; The smoothelin family includes ... |
164-264 |
4.07e-18 |
|
calponin homology (CH) domain found in the smoothelin family; The smoothelin family includes smoothelin and smoothelin-like proteins. Smoothelins are actin-binding cytoskeletal proteins that are abundantly expressed in healthy visceral (smoothelin-A) and vascular (smoothelin-B) smooth muscle. SMTNL1, also called calponin homology-associated smooth muscle protein (CHASM), plays a role in the regulation of contractile properties of both striated and smooth muscles. It can bind to calmodulin and tropomyosin. When it is unphosphorylated, SMTNL1 may inhibit myosin dephosphorylation. SMTNL2 is highly expressed in skeletal muscle and could be associated with differentiating myocytes. Members of this family contain a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409049 Cd Length: 107 Bit Score: 82.78 E-value: 4.07e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 164 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 243
Cdd:cd21200 1 SIKQMLLEWCQAKTRGYEHVDITNFSSSWSDGMAFCALIHHFFPDAFDYSSLDPKNRRKNFELAFSTAEELADIAPLLEV 80
|
90 100
....*....|....*....|...
gi 1920237968 244 EDVDV--PQPDEKSIITYVSSLY 264
Cdd:cd21200 81 EDMVRmgNRPDWKCVFTYVQSLY 103
|
|
| CH_NAV2-like |
cd21212 |
calponin homology (CH) domain found in neuron navigator (NAV) 2, NAV3, and similar proteins; ... |
44-149 |
4.09e-18 |
|
calponin homology (CH) domain found in neuron navigator (NAV) 2, NAV3, and similar proteins; This family includes neuron navigator 2 (NAV2) and NAV3, both of which contain a single copy of the CH domain at the N-terminus. CH domains are actin filament (F-actin) binding motifs. NAV2, also called helicase APC down-regulated 1 (HELAD1), pore membrane and/or filament-interacting-like protein 2 (POMFIL2), retinoic acid inducible in neuroblastoma 1 (RAINB1), Steerin-2 (STEERIN2), or Unc-53 homolog 2 (unc53H2), possesses 3' to 5' helicase activity and exonuclease activity. It is involved in neuronal development, specifically in the development of different sensory organs. NAV3, also called pore membrane and/or filament-interacting-like protein 1 (POMFIL1), Steerin-3 (STEERIN3), or Unc-53 homolog 3 (unc53H3), may regulate IL2 production by T-cells. It may be involved in neuron regeneration.
Pssm-ID: 409061 Cd Length: 105 Bit Score: 82.63 E-value: 4.09e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 44 QKKTFTKWVNKHLIKHwraEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGR--MRFHKLQNVQIALDYLRHRQVK 121
Cdd:cd21212 1 EIEIYTDWANHYLEKG---GHKRIITDLQKDLGDGLTLVNLIEAVAGEKVPGIHSRpkTRAQKLENIQACLQFLAALGVD 77
|
90 100
....*....|....*....|....*...
gi 1920237968 122 LVNIRNDDIADGNPKLTLGLIWTIILHF 149
Cdd:cd21212 78 VQGITAEDIVDGNLKAILGLFFSLSRYK 105
|
|
| CH |
smart00033 |
Calponin homology domain; Actin binding domains present in duplicate at the N-termini of ... |
167-263 |
4.61e-18 |
|
Calponin homology domain; Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
Pssm-ID: 214479 [Multi-domain] Cd Length: 101 Bit Score: 82.36 E-value: 4.61e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 167 EKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTN----LENLDQAFSVAERDLGVTRLLD 242
Cdd:smart00033 1 KTLLRWVNSLLAEYDKPPVTNFSSDLKDGVALCALLNSLSPGLVDKKKVAASLSrfkkIENINLALSFAEKLGGKVVLFE 80
|
90 100
....*....|....*....|.
gi 1920237968 243 PEDVDVPQPDEKSIITYVSSL 263
Cdd:smart00033 81 PEDLVEGPKLILGVIWTLISL 101
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1328-1916 |
4.92e-18 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 92.67 E-value: 4.92e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1328 ETLRRMEEEERLAEQQR------AEERERLAEVEAALEKQRQLAEAHAQAKAQAERE-AQGLQRRMQEEVARREEVAVEA 1400
Cdd:COG4913 235 DDLERAHEALEDAREQIellepiRELAERYAAARERLAELEYLRAALRLWFAQRRLElLEAELEELRAELARLEAELERL 314
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1401 QEQKRSIQEELQHLRQ-----------SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATE-----------RQRG 1458
Cdd:COG4913 315 EARLDALREELDELEAqirgnggdrleQLEREIERLERELEERERRRARLEALLAALGLPLPASAeefaalraeaaALLE 394
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1459 GAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELrLQAEEAE 1538
Cdd:COG4913 395 ALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIPARLLALRDALAEALGLDEAELPFVGEL-IEVRPEE 473
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1539 RRLRQAeAERarqvqvALETAQRSaeaeLQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAER 1618
Cdd:COG4913 474 ERWRGA-IER------VLGGFALT----LLVPPEHYAAALRWVNRLHLRGRLVYERVRTGLPDPERPRLDPDSLAGKLDF 542
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1619 ELERWQLKANEALRLR---LQAEEVAQ----QKSLTQAEAEKQKEEAEREARRRGKAEE-----QAVRQRELAEQELEK- 1685
Cdd:COG4913 543 KPHPFRAWLEAELGRRfdyVCVDSPEElrrhPRAITRAGQVKGNGTRHEKDDRRRIRSRyvlgfDNRAKLAALEAELAEl 622
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1686 QRQLAEGTAQ-QRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEA---ELAKVRAEMEVLLASKA 1761
Cdd:COG4913 623 EEELAEAEERlEALEAELDALQERREALQRLAEYSWDEIDVASAEREIAELEAELERLDAssdDLAALEEQLEELEAELE 702
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1762 RAEEESRSTSEKsKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRlKTEA 1841
Cdd:COG4913 703 ELEEELDELKGE-IGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRENLEERID-ALRA 780
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1842 EIALKEKEAENERLRRLAEDEAFQ-------------RRLLEEQAAQHKADIEARLAQLRKasESELERQKGLVEDTLRQ 1908
Cdd:COG4913 781 RLNRAEEELERAMRAFNREWPAETadldadleslpeyLALLDRLEEDGLPEYEERFKELLN--ENSIEFVADLLSKLRRA 858
|
....*...
gi 1920237968 1909 RRQVEEEI 1916
Cdd:COG4913 859 IREIKERI 866
|
|
| CH_SF |
cd00014 |
calponin homology (CH) domain superfamily; CH domains are actin filament (F-actin) binding ... |
45-147 |
1.47e-17 |
|
calponin homology (CH) domain superfamily; CH domains are actin filament (F-actin) binding motifs, which may be present as a single copy or in tandem repeats (which increase binding affinity). They either function as autonomous actin binding motifs or serve a regulatory function. CH domains are found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, as well as proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
Pssm-ID: 409031 [Multi-domain] Cd Length: 103 Bit Score: 80.85 E-value: 1.47e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 45 KKTFTKWVNKHLIKhwraEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRE--KGRMRFHKLQNVQIALDYLRHRQV-K 121
Cdd:cd00014 1 EEELLKWINEVLGE----ELPVSITDLFESLRDGVLLCKLINKLSPGSIPKInkKPKSPFKKRENINLFLNACKKLGLpE 76
|
90 100
....*....|....*....|....*..
gi 1920237968 122 LVNIRNDDI-ADGNPKLTLGLIWTIIL 147
Cdd:cd00014 77 LDLFEPEDLyEKGNLKKVLGTLWALAL 103
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
1326-1897 |
1.71e-17 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 90.48 E-value: 1.71e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1326 ISETLRRMEEEERLAEQQR----------AEERERLAEVEAALEKQRQlaeahaqAKAQAEREAQGLQRRMQEEVARREE 1395
Cdd:PRK02224 218 LDEEIERYEEQREQARETRdeadevleehEERREELETLEAEIEDLRE-------TIAETEREREELAEEVRDLRERLEE 290
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1396 vaveaqeqkrsIQEELQHLRQSSE---AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAE 1472
Cdd:PRK02224 291 -----------LEEERDDLLAEAGlddADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAE 359
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1473 EAEAQKRQAQEEAERLRRQVqdETQRKRQAEAELALRVQAEAEAAREKQRalQALEELRLQAEEAERRLRQAEAErarqv 1552
Cdd:PRK02224 360 ELREEAAELESELEEAREAV--EDRREEIEELEEEIEELRERFGDAPVDL--GNAEDFLEELREERDELREREAE----- 430
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1553 qvaLETAQRSAEAELQSEHASFAE-KTAQLERTLKE-EHVAVVQLREEatrraqqqaeaeraraeaerelerwQLKANEA 1630
Cdd:PRK02224 431 ---LEATLRTARERVEEAEALLEAgKCPECGQPVEGsPHVETIEEDRE-------------------------RVEELEA 482
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1631 LRLRLQAEEVAQQKSLTQAEAEKqkeeaerearrrgKAEEQAVR---QRELAEQELEKQRQLAEGTAQQRLAAEQELIRL 1707
Cdd:PRK02224 483 ELEDLEEEVEEVEERLERAEDLV-------------EAEDRIERleeRREDLEELIAERRETIEEKRERAEELRERAAEL 549
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1708 RAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAeMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRE 1787
Cdd:PRK02224 550 EAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLER-IRTLLAAIADAEDEIERLREKREALAELNDERRER 628
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1788 LAEEAARLRALAEEAKRQRqLAEEDAVRQRAEA--ERVlAEKLAAISEAtRLKTEAEIALKEKEAEN-ERLR-RLAEDEA 1863
Cdd:PRK02224 629 LAEKRERKRELEAEFDEAR-IEEAREDKERAEEylEQV-EEKLDELREE-RDDLQAEIGAVENELEElEELReRREALEN 705
|
570 580 590
....*....|....*....|....*....|....*.
gi 1920237968 1864 FQRRL--LEEQAAQHKADIEARLAQLRKASESELER 1897
Cdd:PRK02224 706 RVEALeaLYDEAEELESMYGDLRAELRQRNVETLER 741
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
955-1864 |
3.07e-17 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 90.13 E-value: 3.07e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 955 QQLLQSLEQGEQEESRCQRCISELKDiRLQLEACETRTVHRLRLPLDKEPARECAQRITEQQKAQAEVDGLGKGVARLSA 1034
Cdd:TIGR02169 173 EKALEELEEVEENIERLDLIIDEKRQ-QLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEE 251
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1035 EAEKVLALpepspaaptlRSELELTLGKLEQVRSLSAIYLEKL---------KTISLVIRSTQEAEEVLRAHEEQLKEAQ 1105
Cdd:TIGR02169 252 ELEKLTEE----------ISELEKRLEEIEQLLEELNKKIKDLgeeeqlrvkEKIGELEAEIASLERSIAEKERELEDAE 321
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1106 AVPATL-PELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRERvtlllerwqavlaqT 1184
Cdd:TIGR02169 322 ERLAKLeAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDE--------------L 387
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1185 DVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAvplansqavreqlrqekalleDIERHGEKVEECQRFAKQY 1264
Cdd:TIGR02169 388 KDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNA---------------------AIAGIEAKINELEEEKEDK 446
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1265 INAIKDYELQLVTYKAQLepvaspakkpkvqsgsESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQR 1344
Cdd:TIGR02169 447 ALEIKKQEWKLEQLAADL----------------SKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGR 510
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1345 AEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGlqrRMQEEVARREEVAVEAQEQKRS-------------IQEEL 1411
Cdd:TIGR02169 511 AVEEVLKASIQGVHGTVAQLGSVGERYATAIEVAAGN---RLNNVVVEDDAVAKEAIELLKRrkagratflplnkMRDER 587
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1412 QHLRQSSEA--------------EIQAKARQ-------VEAAERSRlRIEEEIRVVRLQLEATERQ---RGGA-EGELQA 1466
Cdd:TIGR02169 588 RDLSILSEDgvigfavdlvefdpKYEPAFKYvfgdtlvVEDIEAAR-RLMGKYRMVTLEGELFEKSgamTGGSrAPRGGI 666
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1467 LRARAEEAEAQkrQAQEEAERLRRQVQDETQRKRQAEAELalrvqaeAEAAREKQRALQALEELRLQAEEAERRlRQAEA 1546
Cdd:TIGR02169 667 LFSRSEPAELQ--RLRERLEGLKRELSSLQSELRRIENRL-------DELSQELSDASRKIGEIEKEIEQLEQE-EEKLK 736
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1547 ERARQVQVALETAQRSAEAElQSEHASFAEKTAQLERTLKEEHVAVVQLREEatrraqqqaeaeraraeaeRELERWQLK 1626
Cdd:TIGR02169 737 ERLEELEEDLSSLEQEIENV-KSELKELEARIEELEEDLHKLEEALNDLEAR-------------------LSHSRIPEI 796
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1627 ANEalrLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRlAAEQELIR 1706
Cdd:TIGR02169 797 QAE---LSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKE-ELEEELEE 872
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1707 LRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRStsekskqrLEAEAGRFR 1786
Cdd:TIGR02169 873 LEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSE--------IEDPKGEDE 944
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1787 ELAEEAARLRALAEEAKR-QRQLAEEDAVRQRA--EAERVLAEKLAAISEATRLKTEAEiALKEKEAENERLRRLAEDEA 1863
Cdd:TIGR02169 945 EIPEEELSLEDVQAELQRvEEEIRALEPVNMLAiqEYEEVLKRLDELKEKRAKLEEERK-AILERIEEYEKKKREVFMEA 1023
|
.
gi 1920237968 1864 F 1864
Cdd:TIGR02169 1024 F 1024
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1333-2204 |
3.15e-17 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 90.03 E-value: 3.15e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1333 MEEEERLAEQQRAEERERLAEVEaaLEKQRQLAEAHAQAKAQAEREAQGLqrRMQEEVARREEVAVEAQEQKRSIQEELQ 1412
Cdd:pfam02463 179 IEETENLAELIIDLEELKLQELK--LKEQAKKALEYYQLKEKLELEEEYL--LYLDYLKLNEERIDLLQELLRDEQEEIE 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1413 HLRQSSEAEIQakarqvEAAERSRLRIEEE--IRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRR 1490
Cdd:pfam02463 255 SSKQEIEKEEE------KLAQVLKENKEEEkeKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEK 328
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1491 QVQDETQRKRQAEAELAL--RVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERA--RQVQVALETAQRSAEAE 1566
Cdd:pfam02463 329 ELKKEKEEIEELEKELKEleIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAakLKEEELELKSEEEKEAQ 408
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1567 LQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSL 1646
Cdd:pfam02463 409 LLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLEL 488
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1647 TQAEAEKQKEEAEREARRRGKAEEQAVRQRE---LAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEE 1723
Cdd:pfam02463 489 LLSRQKLEERSQKESKARSGLKVLLALIKDGvggRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVR 568
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1724 ELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEaeagrfRELAEEAARLRALAEEAK 1803
Cdd:pfam02463 569 ALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDKRAKV------VEGILKDTELTKLKESAK 642
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1804 RQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEAR 1883
Cdd:pfam02463 643 AKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEEL 722
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1884 LAQLRKaseSELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQ 1963
Cdd:pfam02463 723 LADRVQ---EAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQ 799
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1964 laaeeerrrreaeervQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAH 2043
Cdd:pfam02463 800 ----------------EEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEI 863
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2044 AFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAA 2123
Cdd:pfam02463 864 TKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLE 943
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2124 AEKLRKEAEQEAARRAQAEQAALRQKQAadAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKA 2203
Cdd:pfam02463 944 EADEKEKEENNKEEEEERNKRLLLAKEE--LGKVNLMAIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQRLKE 1021
|
.
gi 1920237968 2204 E 2204
Cdd:pfam02463 1022 F 1022
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1386-2291 |
3.15e-17 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 90.03 E-value: 3.15e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1386 MQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERsrlriEEEIRVVRLQLEATERQRGGAEGELQ 1465
Cdd:pfam02463 158 IEEEAAGSRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQA-----KKALEYYQLKEKLELEEEYLLYLDYL 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1466 ALRARAEEAEAQKRQAQEEAERLRRQVQDetqrKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAE 1545
Cdd:pfam02463 233 KLNEERIDLLQELLRDEQEEIESSKQEIE----KEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERR 308
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1546 AERARQVQVALETAQRSAEAELQSEHASFAEKtaqleRTLKEEHVAVVQLREEatrraqqqaeaeraraeaerelerwql 1625
Cdd:pfam02463 309 KVDDEEKLKESEKEKKKAEKELKKEKEEIEEL-----EKELKELEIKREAEEE--------------------------- 356
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1626 kANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELI 1705
Cdd:pfam02463 357 -EEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEE 435
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1706 RLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEA----- 1780
Cdd:pfam02463 436 EESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKvllal 515
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1781 -EAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLA 1859
Cdd:pfam02463 516 iKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIA 595
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1860 EDEAFQRRLLeeqAAQHKADIEARLAQ---------LRKASESELERQKGLVEDTLRQRRQVEEEILALKG---SFEKAA 1927
Cdd:pfam02463 596 VLEIDPILNL---AQLDKATLEADEDDkrakvvegiLKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEvkaSLSELT 672
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1928 AGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKV 2007
Cdd:pfam02463 673 KELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEE 752
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2008 EEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAhafAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARE 2087
Cdd:pfam02463 753 EKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLK---VEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEK 829
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2088 RAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQaadAEMEKHKQFAEQAL 2167
Cdd:pfam02463 830 IKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKE---EKEKEEKKELEEES 906
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2168 RQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEvteaarQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRD 2247
Cdd:pfam02463 907 QKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLE------EADEKEKEENNKEEEEERNKRLLLAKEELGKVNLMAI 980
|
890 900 910 920
....*....|....*....|....*....|....*....|....
gi 1920237968 2248 KDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLA 2291
Cdd:pfam02463 981 EEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQRLKEFLE 1024
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1304-1957 |
6.40e-17 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 88.97 E-value: 6.40e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1304 EYVDLRTR-----YSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKqrqLAEAHAQAKAQAERE 1378
Cdd:TIGR02169 212 RYQALLKEkreyeGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQL---LEELNKKIKDLGEEE 288
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1379 AQGLQRRMQEEVARREEvAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAaersrlrIEEEIRVVRLQLEATERQRG 1458
Cdd:TIGR02169 289 QLRVKEKIGELEAEIAS-LERSIAEKERELEDAEERLAKLEAEIDKLLAEIEE-------LEREIEEERKRRDKLTEEYA 360
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1459 GAEGELQALRARAEEAEAQKR--------------QAQEEAERLRRQVQDETQRKRQAEAELAlRVQAEAEAAREKQRAL 1524
Cdd:TIGR02169 361 ELKEELEDLRAELEEVDKEFAetrdelkdyrekleKLKREINELKRELDRLQEELQRLSEELA-DLNAAIAGIEAKINEL 439
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1525 QA-LEELRLQAEEAERRLRQAEAER--ARQVQVALETAQRSAEAELQS--EHASFAEKTAQLERTLKEEHVAVVQLREEa 1599
Cdd:TIGR02169 440 EEeKEDKALEIKKQEWKLEQLAADLskYEQELYDLKEEYDRVEKELSKlqRELAEAEAQARASEERVRGGRAVEEVLKA- 518
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1600 trraqQQAEAERARAEAERELERWQLKANEALRLRLQA-----EEVAQQK---------------SLTQAEAEKQKEEAE 1659
Cdd:TIGR02169 519 -----SIQGVHGTVAQLGSVGERYATAIEVAAGNRLNNvvvedDAVAKEAiellkrrkagratflPLNKMRDERRDLSIL 593
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1660 REARRRGKA---------EEQAVR---QRELAEQELEKQRQL--------------------------AEGTAQQRLAAE 1701
Cdd:TIGR02169 594 SEDGVIGFAvdlvefdpkYEPAFKyvfGDTLVVEDIEAARRLmgkyrmvtlegelfeksgamtggsraPRGGILFSRSEP 673
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1702 QELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLL----ASKARAEEESRSTSEKSKQR 1777
Cdd:TIGR02169 674 AELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEqeeeKLKERLEELEEDLSSLEQEI 753
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1778 LEAEAgrfrELAEEAARLRALaEEAKRQRQLAEED--------AVRQRAEAERVLAEKLAAISEATRlktEAEIALKEKE 1849
Cdd:TIGR02169 754 ENVKS----ELKELEARIEEL-EEDLHKLEEALNDlearlshsRIPEIQAELSKLEEEVSRIEARLR---EIEQKLNRLT 825
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1850 AENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKAsESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAG 1929
Cdd:TIGR02169 826 LEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEEL-EEELEELEAALRDLESRLGDLKKERDELEAQLRELERK 904
|
730 740
....*....|....*....|....*...
gi 1920237968 1930 KAELELELGRIRGTAEDTLRSKEQAEQE 1957
Cdd:TIGR02169 905 IEELEAQIEKKRKRLSELKAKLEALEEE 932
|
|
| CH_EHBP1 |
cd21254 |
calponin homology (CH) domain found in EH domain-binding protein 1 and similar proteins; EHBP1 ... |
164-263 |
1.21e-16 |
|
calponin homology (CH) domain found in EH domain-binding protein 1 and similar proteins; EHBP1 is a regulator of endocytic recycling and may play a role in actin reorganization by linking clathrin-mediated endocytosis to the actin cytoskeleton. It may act as an effector of small GTPases, including RAB-10 (Rab10), and play a role in vesicle trafficking. EHBP1 is associated with aggressive prostate cancer and insulin-stimulated trafficking and cell migration. Members of this subfamily contain a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409103 Cd Length: 107 Bit Score: 78.36 E-value: 1.21e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 164 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLLDP 243
Cdd:cd21254 1 NASQSLLAWCKEVTKGYRGVKITNFTTSWRNGLAFCAILHHFRPDLIDYKSLNPHDIKENNKKAYDGFAS-LGISRLLEP 79
|
90 100
....*....|....*....|.
gi 1920237968 244 ED-VDVPQPDEKSIITYVSSL 263
Cdd:cd21254 80 SDmVLLAVPDKLTVMTYLYQI 100
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1113-2039 |
2.97e-16 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 86.95 E-value: 2.97e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1113 ELEATKAALKKLRAQAEAQQPVFDALRDELRGaqevgerlqqrhgerdveverwrervtlllerwqavlaqtdvRQRELE 1192
Cdd:pfam02463 167 LKRKKKEALKKLIEETENLAELIIDLEELKLQ------------------------------------------ELKLKE 204
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1193 QLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALL-EDIERHGEKVEECQRFAKQYINAIKDY 1271
Cdd:pfam02463 205 QAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEIESSkQEIEKEEEKLAQVLKENKEEEKEKKLQ 284
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1272 ELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRtryselstltsqyIRFISETLRRMEEEERLAEQQRAEERERL 1351
Cdd:pfam02463 285 EEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKE-------------KKKAEKELKKEKEEIEELEKELKELEIKR 351
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1352 AEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQsseaeiQAKARQVEA 1431
Cdd:pfam02463 352 EAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQ------LEDLLKEEK 425
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1432 AERSRLRIEEEIRVVRLQLEATERQrggaegelqaLRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQ 1511
Cdd:pfam02463 426 KEELEILEEEEESIELKQGKLTEEK----------EELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKL 495
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1512 AEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQ-LERTLKEEHV 1590
Cdd:pfam02463 496 EERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQkLVRALTELPL 575
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1591 AVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGK--A 1668
Cdd:pfam02463 576 GARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVslE 655
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1669 EEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAK 1748
Cdd:pfam02463 656 EGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKIN 735
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1749 VRAEMEVLLASKARAEEESRSTSEKSKQRLEAEagrfRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKL 1828
Cdd:pfam02463 736 EELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSE----LSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELK 811
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1829 AAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDT-LR 1907
Cdd:pfam02463 812 EEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELeSK 891
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1908 QRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAArQRQLAAEEERRRREAEERVQKSLAAEE 1987
Cdd:pfam02463 892 EEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLE-EADEKEKEENNKEEEEERNKRLLLAKE 970
|
890 900 910 920 930
....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 1988 EAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAE 2039
Cdd:pfam02463 971 ELGKVNLMAIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQRLKEF 1022
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1419-2345 |
3.48e-16 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 86.66 E-value: 3.48e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1419 EAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEgELQALRARAEEAEA-----QKRQAQEEAERLRRQVQ 1493
Cdd:TIGR02169 169 DRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAE-RYQALLKEKREYEGyellkEKEALERQKEAIERQLA 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1494 DETQRKRQAEAELALRVQAEAEAAR------EKQRALQALEELRLQAEEAE-----RRLRQAEAERARQVQVALETaqrs 1562
Cdd:TIGR02169 248 SLEEELEKLTEEISELEKRLEEIEQlleelnKKIKDLGEEEQLRVKEKIGEleaeiASLERSIAEKERELEDAEER---- 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1563 aEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQaeaeraraeaerelerwqlkanEALRLRLQAEEVAQ 1642
Cdd:TIGR02169 324 -LAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEEL----------------------EDLRAELEEVDKEF 380
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1643 QKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQR-ELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLL 1721
Cdd:TIGR02169 381 AETRDELKDYREKLEKLKREINELKRELDRLQEElQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQL 460
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1722 EEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEE---SRSTSEKSKQRLEAEAGRFRELAEeaarlral 1798
Cdd:TIGR02169 461 AADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERvrgGRAVEEVLKASIQGVHGTVAQLGS-------- 532
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1799 aeeAKRQRQLAEEDAVRQRAEAERVLAEKLAAiseatrlktEAEIALKEKEAENER---LRRLAEDEAFQRRLLEEQAAQ 1875
Cdd:TIGR02169 533 ---VGERYATAIEVAAGNRLNNVVVEDDAVAK---------EAIELLKRRKAGRATflpLNKMRDERRDLSILSEDGVIG 600
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1876 HKADIEARLAQLRKASESELeRQKGLVEDTLRQRRQVEE-EILALKGS-FEKAAA--GKAElelelgRIRGTAEDTLRSK 1951
Cdd:TIGR02169 601 FAVDLVEFDPKYEPAFKYVF-GDTLVVEDIEAARRLMGKyRMVTLEGElFEKSGAmtGGSR------APRGGILFSRSEP 673
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1952 EQAEQEAARqrqlaaeeerrrreaeervqkslaaEEEAARQRKAALEEVERLKAKVEEARRLRERAEQ-----ESARQLQ 2026
Cdd:TIGR02169 674 AELQRLRER-------------------------LEGLKRELSSLQSELRRIENRLDELSQELSDASRkigeiEKEIEQL 728
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2027 LAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRseaeaarraAEEAEAARERAEREAAQSRRQVEEAERL 2106
Cdd:TIGR02169 729 EQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELE---------EDLHKLEEALNDLEARLSHSRIPEIQAE 799
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2107 KQSAEeqaqaqaqaqaaaeKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEE 2186
Cdd:TIGR02169 800 LSKLE--------------EEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEE 865
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2187 tdhqksiLDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRlLQEEAEKMKQVA 2266
Cdd:TIGR02169 866 -------LEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAK-LEALEEELSEIE 937
|
890 900 910 920 930 940 950
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 2267 EEAARLSVAAQEAARLRQLAEEDLAQQRALaEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQ 2345
Cdd:TIGR02169 938 DPKGEDEEIPEEELSLEDVQAELQRVEEEI-RALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKK 1015
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1131-2041 |
6.60e-16 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 85.61 E-value: 6.60e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1131 QQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTLLLERWQA----------VLAQTDVRQRELEQLGRQLRY 1200
Cdd:pfam01576 3 QEEEMQAKEEELQKVKERQQKAESELKELEKKHQQLCEEKNALQEQLQAetelcaeaeeMRARLAARKQELEEILHELES 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1201 YRESADPLGAWLRDAKQR-QEQIQAVP--LANSQAVREQLRQEKALLE-DIERHGEKVEECQRFAKQYINAIKDYELQLV 1276
Cdd:pfam01576 83 RLEEEEERSQQLQNEKKKmQQHIQDLEeqLDEEEAARQKLQLEKVTTEaKIKKLEEDILLLEDQNSKLSKERKLLEERIS 162
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1277 TYKAQLEPVASPAKK-PKVQSGSESIIQEyVDLRTRYSElstltsqyirfisETLRRMEEEERLAEQQRAEERERLAEVe 1355
Cdd:pfam01576 163 EFTSNLAEEEEKAKSlSKLKNKHEAMISD-LEERLKKEE-------------KGRQELEKAKRKLEGESTDLQEQIAEL- 227
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1356 aalekQRQLAEAHAQAkAQAEREAQGLQRRMQEEVARREevavEAQEQKRSIQEELQHLRQSSEAEIQAKARqveaAERS 1435
Cdd:pfam01576 228 -----QAQIAELRAQL-AKKEEELQAALARLEEETAQKN----NALKKIRELEAQISELQEDLESERAARNK----AEKQ 293
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1436 RLRIEEEIRVVRLQLEATErqrgGAEGELQALRARAE-EAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELalrvqaeA 1514
Cdd:pfam01576 294 RRDLGEELEALKTELEDTL----DTTAAQQELRSKREqEVTELKKALEEETRSHEAQLQEMRQKHTQALEEL-------T 362
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1515 EAAREKQRALQALEELRlQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEktAQLERTLKEEHVAVVQ 1594
Cdd:pfam01576 363 EQLEQAKRNKANLEKAK-QALESENAELQAELRTLQQAKQDSEHKRKKLEGQLQELQARLSE--SERQRAELAEKLSKLQ 439
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1595 LREEATRRAQQQAEAERARAEAERELERWQLKANEALRlrlqAEEVAQQKSLTQAEAEKqkeeaerearrrgkaEEQAVR 1674
Cdd:pfam01576 440 SELESVSSLLNEAEGKNIKLSKDVSSLESQLQDTQELL----QEETRQKLNLSTRLRQL---------------EDERNS 500
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1675 QRELAEQELEKQRQLaegtAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAK------ 1748
Cdd:pfam01576 501 LQEQLEEEEEAKRNV----ERQLSTLQAQLSDMKKKLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKlektkn 576
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1749 -VRAEMEVLLAS-------------------KARAEEESRSTS-EKSKQRLEAEAgrfRELAEEAARLRALAEEAKRQRQ 1807
Cdd:pfam01576 577 rLQQELDDLLVDldhqrqlvsnlekkqkkfdQMLAEEKAISARyAEERDRAEAEA---REKETRALSLARALEEALEAKE 653
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1808 LAEEDAVRQRAEAERVLAEKLAA---ISEATRLKTEAEIALKEKEAENERLR-RLAEDEAFQRRL---LEEQAAQHKADI 1880
Cdd:pfam01576 654 ELERTNKQLRAEMEDLVSSKDDVgknVHELERSKRALEQQVEEMKTQLEELEdELQATEDAKLRLevnMQALKAQFERDL 733
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1881 EARlaqlrkaSESELERQKGLVedtlRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAAR 1960
Cdd:pfam01576 734 QAR-------DEQGEEKRRQLV----KQVRELEAELEDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAVKQLKK 802
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1961 QRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESAR-QLQLAQEAAQKRLQAE 2039
Cdd:pfam01576 803 LQAQMKDLQRELEEARASRDEILAQSKESEKKLKNLEAELLQLQEDLAASERARRQAQQERDElADEIASGASGKSALQD 882
|
..
gi 1920237968 2040 EK 2041
Cdd:pfam01576 883 EK 884
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
910-1598 |
7.87e-16 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 85.49 E-value: 7.87e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 910 EQRQALRSLELHYQA----FLRDSQDAGgfgPEDRLQAEREYGSCSRHYQQLLQSLEQGEQE----ESRCQRCISELKDI 981
Cdd:TIGR02168 217 ELKAELRELELALLVlrleELREELEEL---QEELKEAEEELEELTAELQELEEKLEELRLEvselEEEIEELQKELYAL 293
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 982 RLQLEACETRTVH---RLRlPLDKEPARECAQRITEQQKAQAEVDGLGKGVARLSAEAEKVLALPEPSPAAPTLRSELEL 1058
Cdd:TIGR02168 294 ANEISRLEQQKQIlreRLA-NLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELES 372
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1059 TLGKL-EQVRSLSAIYLEKLKTISLvIRSTQEaeeVLRAHEEQLKEAQAVPATlpelEATKAALKKLRAQAEAQQPVFDA 1137
Cdd:TIGR02168 373 RLEELeEQLETLRSKVAQLELQIAS-LNNEIE---RLEARLERLEDRRERLQQ----EIEELLKKLEEAELKELQAELEE 444
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1138 LRDELRGAQEVGERLQQRHGERDVEVERWRERVTLL---LERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRD 1214
Cdd:TIGR02168 445 LEEELEELQEELERLEEALEELREELEEAEQALDAAereLAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGV 524
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1215 AKQR---------------QEQIQAVPLANSQAVR---EQLRQEK----ALLEDIERHGEKVEECQRFAKQYINAIKDYE 1272
Cdd:TIGR02168 525 LSELisvdegyeaaieaalGGRLQAVVVENLNAAKkaiAFLKQNElgrvTFLPLDSIKGTEIQGNDREILKNIEGFLGVA 604
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1273 LQLVTYKAQLEPVASP---------------AKKPKVQSGSESIIQEYVDLRTRYS--------ELSTL-TSQYIRFISE 1328
Cdd:TIGR02168 605 KDLVKFDPKLRKALSYllggvlvvddldnalELAKKLRPGYRIVTLDGDLVRPGGVitggsaktNSSILeRRREIEELEE 684
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1329 TLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQ 1408
Cdd:TIGR02168 685 KIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIE 764
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1409 EELQHLRQSSEAEiqakarqvEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERL 1488
Cdd:TIGR02168 765 ELEERLEEAEEEL--------AEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAAT 836
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1489 RRQVQDETQRKRQAEAELAlrvqaeaeaarekqRALQALEELRLQAEEAERRLRQAEAERArQVQVALETAqRSAEAELQ 1568
Cdd:TIGR02168 837 ERRLEDLEEQIEELSEDIE--------------SLAAEIEELEELIEELESELEALLNERA-SLEEALALL-RSELEELS 900
|
730 740 750
....*....|....*....|....*....|..
gi 1920237968 1569 SEHASFAEKTAQLERTLKE--EHVAVVQLREE 1598
Cdd:TIGR02168 901 EELRELESKRSELRRELEElrEKLAQLELRLE 932
|
|
| CH_MICAL2 |
cd21250 |
calponin homology (CH) domain found in molecule interacting with CasL protein 2; MICAL-2 is a ... |
168-265 |
9.93e-16 |
|
calponin homology (CH) domain found in molecule interacting with CasL protein 2; MICAL-2 is a nuclear [F-actin]-monooxygenase that promotes depolymerization of F-actin by mediating oxidation of specific methionine residues on actin to form methionine-sulfoxide, resulting in actin filament disassembly and preventing repolymerization. In the absence of actin, it also functions as a NADPH oxidase producing H(2)O(2). MICAL-2 acts as a key regulator of the serum response factor (SRF) signaling pathway elicited by nerve growth factor and serum. It mediates oxidation and subsequent depolymerization of nuclear actin, leading to the increased MKL1/MRTF-A presence in the nucleus, promoting SRF:MKL1/MRTF-A-dependent gene transcription. MICAL-2 contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409099 [Multi-domain] Cd Length: 110 Bit Score: 76.07 E-value: 9.93e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 168 KLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD-PEDV 246
Cdd:cd21250 8 KLLTWCQKQTEGYQNVNVTDLTTSWKSGLALCAIIHRFRPELIDFDSLNEDDAVKNNQLAFDVAEREFGIPPVTTgKEMA 87
|
90
....*....|....*....
gi 1920237968 247 DVPQPDEKSIITYVSSLYD 265
Cdd:cd21250 88 SAEEPDKLSMVMYLSKFYE 106
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1346-1908 |
1.02e-15 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 84.97 E-value: 1.02e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1346 EERERLAEVEAALEKQRQLAEAHAQAKAQAEreaqglQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRqsseaeIQAK 1425
Cdd:COG4913 219 EEPDTFEAADALVEHFDDLERAHEALEDARE------QIELLEPIRELAERYAAARERLAELEYLRAALR------LWFA 286
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1426 ARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAE-AQKRQAQEEAERLRRQvQDETQRKRQAEA 1504
Cdd:COG4913 287 QRRLELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIRGNGgDRLEQLEREIERLERE-LEERERRRARLE 365
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1505 ELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQS-EH------ASFAEK 1577
Cdd:COG4913 366 ALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASlERrksnipARLLAL 445
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1578 TAQLERTLKEEHVAV------VQLREEAtrraqqqaeaeraraeaerelERWQLKANEAL---RLRL--------QAEEV 1640
Cdd:COG4913 446 RDALAEALGLDEAELpfvgelIEVRPEE---------------------ERWRGAIERVLggfALTLlvppehyaAALRW 504
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1641 AQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEK--QRQLAEGTAQQRLAAEQELIRL-RAETEQG--- 1714
Cdd:COG4913 505 VNRLHLRGRLVYERVRTGLPDPERPRLDPDSLAGKLDFKPHPFRAwlEAELGRRFDYVCVDSPEELRRHpRAITRAGqvk 584
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1715 ------------------------EQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRST 1770
Cdd:COG4913 585 gngtrhekddrrrirsryvlgfdnRAKLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVA 664
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1771 S-EKSKQRLEAEagrFRELAEEAARLRALAEEAKRQRQlAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKE 1849
Cdd:COG4913 665 SaEREIAELEAE---LERLDASSDDLAALEEQLEELEA-ELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAE 740
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 1850 AENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRkaseSELERQKGLVEDTLRQ 1908
Cdd:COG4913 741 DLARLELRALLEERFAAALGDAVERELRENLEERIDALR----ARLNRAEEELERAMRA 795
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
1303-1959 |
1.75e-15 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 84.12 E-value: 1.75e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1303 QEYVDLRTRYSELSTLTSQYIRFISETLRRMEE-EERLAE--QQRAEERERLAEVEAALEKQRQLAEAhAQAKAQAEREA 1379
Cdd:pfam12128 248 QEFNTLESAELRLSHLHFGYKSDETLIASRQEErQETSAElnQLLRTLDDQWKEKRDELNGELSAADA-AVAKDRSELEA 326
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1380 QGLQRRMQEEVarREEVAVEAQEQKRSIQEELQHLRQSSEAeIQAKARQVEAA-ERSRLRIEEEIRVV--------RLQL 1450
Cdd:pfam12128 327 LEDQHGAFLDA--DIETAAADQEQLPSWQSELENLEERLKA-LTGKHQDVTAKyNRRRSKIKEQNNRDiagikdklAKIR 403
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1451 EATERQRGGAEGELQALRAR-AEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEA-----EAAREKQ--- 1521
Cdd:pfam12128 404 EARDRQLAVAEDDLQALESElREQLEAGKLEFNEEEYRLKSRLGELKLRLNQATATPELLLQLENfderiERAREEQeaa 483
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1522 -----RALQALEELRLQAEEAERRLRQAEAeRARQVQVALETAQR-------------SAEAELQSEHASFAEKTAQLER 1583
Cdd:pfam12128 484 naeveRLQSELRQARKRRDQASEALRQASR-RLEERQSALDELELqlfpqagtllhflRKEAPDWEQSIGKVISPELLHR 562
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1584 TLKEEHVAVVQLREEATRRaqqqaeaeraraeaerelerwqlkaneALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREAR 1663
Cdd:pfam12128 563 TDLDPEVWDGSVGGELNLY---------------------------GVKLDLKRIDVPEWAASEEELRERLDKAEEALQS 615
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1664 RRGKAEEQAvRQRELAEQELEKQrQLAEGTAQQRLA-AEQELIRLraeTEQGEQQRQLLEEELARLQREaaaATQKRREL 1742
Cdd:pfam12128 616 AREKQAAAE-EQLVQANGELEKA-SREETFARTALKnARLDLRRL---FDEKQSEKDKKNKALAERKDS---ANERLNSL 687
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1743 EAELAKVraEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQraeaer 1822
Cdd:pfam12128 688 EAQLKQL--DKKHQAWLEEQKEQKREARTEKQAYWQVVEGALDAQLALLKAAIAARRSGAKAELKALETWYKRD------ 759
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1823 vLAEKlaAISEATRLKTEAEIalKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASeSELERQKG-L 1901
Cdd:pfam12128 760 -LASL--GVDPDVIAKLKREI--RTLERKIERIAVRRQEVLRYFDWYQETWLQRRPRLATQLSNIERAI-SELQQQLArL 833
|
650 660 670 680 690
....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237968 1902 VEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTlrSKEQAEQEAA 1959
Cdd:pfam12128 834 IADTKLRRAKLEMERKASEKQQVRLSENLRGLRCEMSKLATLKEDA--NSEQAQGSIG 889
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
2794-2832 |
2.40e-15 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 72.36 E-value: 2.40e-15
10 20 30
....*....|....*....|....*....|....*....
gi 1920237968 2794 LLEAQIATGGIIDPVHSHRLPVDVAYQRGYFDEEMNRVL 2832
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1737-2604 |
5.13e-15 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 82.81 E-value: 5.13e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1737 QKRRELEAELAKVrAEMEVLLAsKARAEEEsrsTSEKSKQRLEAeagRFRELAEEAARLRALAEEAKRQRQLAEEdavRQ 1816
Cdd:TIGR02169 153 VERRKIIDEIAGV-AEFDRKKE-KALEELE---EVEENIERLDL---IIDEKRQQLERLRREREKAERYQALLKE---KR 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1817 RAEAERVLAEKLAAisEATRLKTEAEIAlkEKEAENERLRRLAEDeafqrrlLEEQAAQHKADIEARLAQLRKASESELE 1896
Cdd:TIGR02169 222 EYEGYELLKEKEAL--ERQKEAIERQLA--SLEEELEKLTEEISE-------LEKRLEEIEQLLEELNKKIKDLGEEEQL 290
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1897 RQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDtlrSKEQAEQEAARQRQLAAEEERRRREAE 1976
Cdd:TIGR02169 291 RVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEE---LEREIEEERKRRDKLTEEYAELKEELE 367
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1977 ERVQKSLAAEEEAARQR---KAALEEVERLKAKVEE----ARRLRERAEQESARQLQLAQ-----EAAQKRLQAEEKAHA 2044
Cdd:TIGR02169 368 DLRAELEEVDKEFAETRdelKDYREKLEKLKREINElkreLDRLQEELQRLSEELADLNAaiagiEAKINELEEEKEDKA 447
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2045 FAVQQKEQELQQTLQQEQSVLERLRseaeaarraaeeaeaareraerEAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAA 2124
Cdd:TIGR02169 448 LEIKKQEWKLEQLAADLSKYEQELY----------------------DLKEEYDRVEKELSKLQRELAEAEAQARASEER 505
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2125 EKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTAlrlqleetdhQKSIldEELQRLKA- 2203
Cdd:TIGR02169 506 VRGGRAVEEVLKASIQGVHGTVAQLGSVGERYATAIEVAAGNRLNNVVVEDDAVA----------KEAI--ELLKRRKAg 573
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2204 EVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENR-----------ALVLRDKDSAQRLL---------QEEAEKMK 2263
Cdd:TIGR02169 574 RATFLPLNKMRDERRDLSILSEDGVIGFAVDLVEFDPKyepafkyvfgdTLVVEDIEAARRLMgkyrmvtleGELFEKSG 653
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2264 QVA----EEAARLSVAAQEAARLRQLAEE---------DLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQE 2330
Cdd:TIGR02169 654 AMTggsrAPRGGILFSRSEPAELQRLRERleglkrelsSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEE 733
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2331 QARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARaeEDARRFRKQAEDIGERLYRTE 2410
Cdd:TIGR02169 734 KLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSH--SRIPEIQAELSKLEEEVSRIE 811
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2411 LATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQalqqsfLSEKDS 2490
Cdd:TIGR02169 812 ARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRD------LESRLG 885
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2491 LLQRERC-IEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEG---------VRRQQE 2560
Cdd:TIGR02169 886 DLKKERDeLEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEElsledvqaeLQRVEE 965
|
890 900 910 920
....*....|....*....|....*....|....*....|....*..
gi 1920237968 2561 ELQRLAQQQQQQEKLLAEENQR---LRERLQHLEEERRAALARSEEI 2604
Cdd:TIGR02169 966 EIRALEPVNMLAIQEYEEVLKRldeLKEKRAKLEEERKAILERIEEY 1012
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3453-3491 |
5.45e-15 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 71.20 E-value: 5.45e-15
10 20 30
....*....|....*....|....*....|....*....
gi 1920237968 3453 LLEAQIATGGIIDPVHSHRVPVDVAYQRGYFDEEMNRVL 3491
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
4031-4069 |
5.45e-15 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 71.20 E-value: 5.45e-15
10 20 30
....*....|....*....|....*....|....*....
gi 1920237968 4031 LLEAQIATGGIIDPEESHRLPVDVAYQRGLFDEEMNEIL 4069
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
852-1582 |
7.93e-15 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 82.03 E-value: 7.93e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 852 QEAQEAIARLEAQHQALVALWHQLHTEMKSLLAWQSLGRDMQLIRSWSLATFRTLKpEEQRQALRSLELHYQAFLRDSQD 931
Cdd:TIGR02168 291 YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKL-EELKEELESLEAELEELEAELEE 369
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 932 AggfgpEDRLQA-EREYGSCSRHYQQLLQSLEQGEQEESRCQRCISELKDIRLQLEACETRTVHRLRLPLDKEPARECAQ 1010
Cdd:TIGR02168 370 L-----ESRLEElEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEE 444
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1011 RITEQQKAQAEVDGLGKGVARLSAE-AEKVLALPEPSPAAPTLRSELELTLGKLEQVRSLS---AIYLEKLKTISLVIRS 1086
Cdd:TIGR02168 445 LEEELEELQEELERLEEALEELREElEEAEQALDAAERELAQLQARLDSLERLQENLEGFSegvKALLKNQSGLSGILGV 524
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1087 TQEAEEVlrahEEQLKeaQAVPATLPE---------LEATKAALKKLrAQAEAQQPVFDALrDELRGAQEVGERLQQRHG 1157
Cdd:TIGR02168 525 LSELISV----DEGYE--AAIEAALGGrlqavvvenLNAAKKAIAFL-KQNELGRVTFLPL-DSIKGTEIQGNDREILKN 596
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1158 ERDV-----EVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYR------ESADPLGAWLRDAKQRqeqiQAVP 1226
Cdd:TIGR02168 597 IEGFlgvakDLVKFDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYRivtldgDLVRPGGVITGGSAKT----NSSI 672
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1227 LANSQAVREqLRQEKALLEDIERHGEKveecqrfakqyinAIKDYELQLVTYKAQLEpvaspakkpKVQSGSESIIQEYV 1306
Cdd:TIGR02168 673 LERRREIEE-LEEKIEELEEKIAELEK-------------ALAELRKELEELEEELE---------QLRKELEELSRQIS 729
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1307 DLRTRYSELST---LTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQ 1383
Cdd:TIGR02168 730 ALRKDLARLEAeveQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELR 809
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1384 ---RRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSE---AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQR 1457
Cdd:TIGR02168 810 aelTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEelsEDIESLAAEIEELEELIEELESELEALLNERASLEEAL 889
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1458 GGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELA-LRVQAEAEAAREKQRALQALEELRLQAEE 1536
Cdd:TIGR02168 890 ALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDnLQERLSEEYSLTLEEAEALENKIEDDEEE 969
|
730 740 750 760
....*....|....*....|....*....|....*....|....*.
gi 1920237968 1537 AERRLRQAEAERARQVQVALEtaqrsAEAELQSEHASFAEKTAQLE 1582
Cdd:TIGR02168 970 ARRRLKRLENKIKELGPVNLA-----AIEEYEELKERYDFLTAQKE 1010
|
|
| CH_SMTNB |
cd21259 |
calponin homology (CH) domain found in smoothelin-B and similar proteins; Smoothelins are ... |
166-277 |
8.76e-15 |
|
calponin homology (CH) domain found in smoothelin-B and similar proteins; Smoothelins are actin-binding cytoskeletal proteins that are abundantly expressed in healthy visceral (smoothelin-A) and vascular (smoothelin-B) smooth muscle. The human SMTN gene encodes smoothelin-A and smoothelin-B. This model corresponds to the single CH domain of smoothelin-B. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409108 Cd Length: 112 Bit Score: 73.49 E-value: 8.76e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 166 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 245
Cdd:cd21259 3 KQMLLDWCRAKTRGYENVDIQNFSSSWSDGMAFCALVHNFFPEAFDYSQLSPQNRRHNFEVAFSSAEKHADCPQLLDVED 82
|
90 100 110
....*....|....*....|....*....|...
gi 1920237968 246 -VDVPQPDEKSIITYVSSLYDAMprvpdVQDGV 277
Cdd:cd21259 83 mVRMREPDWKCVYTYIQEFYRCL-----VQKGL 110
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
1088-1588 |
9.67e-15 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 81.62 E-value: 9.67e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1088 QEAEEVLRAHEEQLKEAQAVPATLPELEATKAALKKLRaqaeaqqpvfDALRDELRGAQEVGERLQQRHGERDVEVERWR 1167
Cdd:PRK02224 237 DEADEVLEEHEERREELETLEAEIEDLRETIAETERER----------EELAEEVRDLRERLEELEEERDDLLAEAGLDD 306
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1168 ERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQ---AVPLANSQAVREQLRQEKALL 1244
Cdd:PRK02224 307 ADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELReeaAELESELEEAREAVEDRREEI 386
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1245 EDIERHGEKVEEcqRFAKqyinaikdyelqlvtykaqlepvaSPAKKPKVQSGSESIIQEYVDLRTRYSELSTLtsqyir 1324
Cdd:PRK02224 387 EELEEEIEELRE--RFGD------------------------APVDLGNAEDFLEELREERDELREREAELEAT------ 434
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1325 fISETLRRMEEEERLAEQQR-----------------AEERERLAEVEAALE----KQRQLAEAHAQAK--AQAEREAQG 1381
Cdd:PRK02224 435 -LRTARERVEEAEALLEAGKcpecgqpvegsphvetiEEDRERVEELEAELEdleeEVEEVEERLERAEdlVEAEDRIER 513
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1382 LQRR---MQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRvvRLQLEATERQRG 1458
Cdd:PRK02224 514 LEERredLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNS--KLAELKERIESL 591
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1459 GAEGELQALRARAE---EAEAQKRQAQEEAERLRR-QVQDETQRKRQAEAELAlrvQAEAEAARE-KQRALQALeelrlq 1533
Cdd:PRK02224 592 ERIRTLLAAIADAEdeiERLREKREALAELNDERReRLAEKRERKRELEAEFD---EARIEEAREdKERAEEYL------ 662
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237968 1534 aEEAERRLRQAEAERARqVQVALETAQRSAE--AELQSEHASFAEKTAQLErTLKEE 1588
Cdd:PRK02224 663 -EQVEEKLDELREERDD-LQAEIGAVENELEelEELRERREALENRVEALE-ALYDE 716
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1233-1897 |
1.11e-14 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 81.50 E-value: 1.11e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1233 VREQLRQEKALLEDIERhgekveecqrfAKQYINAIKDYELQLVTYKAQ---LEPVaspakkpkvqsgsESIIQEYVDLR 1309
Cdd:COG4913 213 VREYMLEEPDTFEAADA-----------LVEHFDDLERAHEALEDAREQielLEPI-------------RELAERYAAAR 268
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1310 TRYSELSTLTSQY-IRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQ----LAEAHAQAKAQAEREAQGLQR 1384
Cdd:COG4913 269 ERLAELEYLRAALrLWFAQRRLELLEAELEELRAELARLEAELERLEARLDALREeldeLEAQIRGNGGDRLEQLEREIE 348
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1385 RMQEEVARREEVAVEAQEQKRSI--------------QEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQL 1450
Cdd:COG4913 349 RLERELEERERRRARLEALLAALglplpasaeefaalRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEI 428
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1451 EATERQRGGAEGELQALRARAEEAeaqkrqAQEEAERLR-----RQVQDETQRKRQAeAELALRVQA-----EAEAAREK 1520
Cdd:COG4913 429 ASLERRKSNIPARLLALRDALAEA------LGLDEAELPfvgelIEVRPEEERWRGA-IERVLGGFAltllvPPEHYAAA 501
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1521 QRALQALE-ELRLQAEEAERRLRQAEAER------ARQVQVALETAQRSAEAELQSehaSFAEKTAQLERTLKEEHVAV- 1592
Cdd:COG4913 502 LRWVNRLHlRGRLVYERVRTGLPDPERPRldpdslAGKLDFKPHPFRAWLEAELGR---RFDYVCVDSPEELRRHPRAIt 578
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1593 --VQLREEATrraqqqaeaERARAEAERELERWQL-KANEALRLRLQAEEVAQQKSLTQAEAEKQKEeaerearrrgKAE 1669
Cdd:COG4913 579 raGQVKGNGT---------RHEKDDRRRIRSRYVLgFDNRAKLAALEAELAELEEELAEAEERLEAL----------EAE 639
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1670 EQAVRQRELAEQELEKQRQL---AEGTAQQRLAAEQELIRLRAETEQGEQqrqlLEEELARLQREAAAATQKRRELEAEL 1746
Cdd:COG4913 640 LDALQERREALQRLAEYSWDeidVASAEREIAELEAELERLDASSDDLAA----LEEQLEELEAELEELEEELDELKGEI 715
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1747 AKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAE 1826
Cdd:COG4913 716 GRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRENLEERIDALRARLNRAEEELERAMRA 795
|
650 660 670 680 690 700 710
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 1827 -KLAAISEATRLKTEAEiALKEKEAENERLR--RLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELER 1897
Cdd:COG4913 796 fNREWPAETADLDADLE-SLPEYLALLDRLEedGLPEYEERFKELLNENSIEFVADLLSKLRRAIREIKERIDP 868
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
1048-1801 |
1.31e-14 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 81.17 E-value: 1.31e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1048 AAPTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAHEEQLkeaqavPATLPELEATKAALKKLRAQ 1127
Cdd:TIGR00618 160 AKSKEKKELLMNLFPLDQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCM------PDTYHERKQVLEKELKHLRE 233
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1128 AEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEverwrervtllLERWQAVLAQTDVRQRELEQLGRQLRYYRESAdp 1207
Cdd:TIGR00618 234 ALQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRAR-----------IEELRAQEAVLEETQERINRARKAAPLAAHIK-- 300
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1208 lgAWLRDAKQRQEQIQAvpLANSQAVREQLRQEKALLEDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVAS 1287
Cdd:TIGR00618 301 --AVTQIEQQAQRIHTE--LQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHT 376
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1288 PAKKPKVQSGSESIIQEyvDLRTRYSELSTLTSQYIRFISETLRRMEEEERL--AEQQRAEERERLAEVEAALEKQRQ-- 1363
Cdd:TIGR00618 377 LTQHIHTLQQQKTTLTQ--KLQSLCKELDILQREQATIDTRTSAFRDLQGQLahAKKQQELQQRYAELCAAAITCTAQce 454
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1364 -LAEAHAQAKAQAERE-------AQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEA--EIQAKARQVEAAE 1433
Cdd:TIGR00618 455 kLEKIHLQESAQSLKEreqqlqtKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDidNPGPLTRRMQRGE 534
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1434 RSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRaRAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAE 1513
Cdd:TIGR00618 535 QTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQ-QSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACE 613
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1514 AEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVV 1593
Cdd:TIGR00618 614 QHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQLT 693
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1594 QLREEATRRAQQQAEAERARAEAERELERWQLkANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREArrrgkaEEQAV 1673
Cdd:TIGR00618 694 YWKEMLAQCQTLLRELETHIEEYDREFNEIEN-ASSSLGSDLAAREDALNQSLKELMHQARTVLKARTE------AHFNN 766
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1674 RQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQR-----QLLEEELARLQREAAAATQKRRELEAELAK 1748
Cdd:TIGR00618 767 NEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEipsdeDILNLQCETLVQEEEQFLSRLEEKSATLGE 846
|
730 740 750 760 770
....*....|....*....|....*....|....*....|....*....|...
gi 1920237968 1749 VRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEE 1801
Cdd:TIGR00618 847 ITHQLLKYEECSKQLAQLTQEQAKIIQLSDKLNGINQIKIQFDGDALIKFLHE 899
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1328-1819 |
1.52e-14 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 80.58 E-value: 1.52e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1328 ETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHA--QAKAQAEREAQGLQRRMQEEVARREEVAvEAQEQKR 1405
Cdd:COG4717 88 EEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPlyQELEALEAELAELPERLEELEERLEELR-ELEEELE 166
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1406 SIQEELQHLRQSSEaeiqakarqvEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEA 1485
Cdd:COG4717 167 ELEAELAELQEELE----------ELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENEL 236
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1486 ER--LRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSA 1563
Cdd:COG4717 237 EAaaLEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGKEAEELQALPALEEL 316
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1564 EAElqsehasfaektaQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAqq 1643
Cdd:COG4717 317 EEE-------------ELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAEAGV-- 381
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1644 ksltqaeaekqkeeaerearrrgKAEEQAVRQRELAEQELEKQRQLAEgtAQQRLAAEQELIRLRAETEQGEQqrqlLEE 1723
Cdd:COG4717 382 -----------------------EDEEELRAALEQAEEYQELKEELEE--LEEQLEELLGELEELLEALDEEE----LEE 432
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1724 ELARLQREAAAATQKRRELEAELAKVRAEMEVLlaskaraeeESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAK 1803
Cdd:COG4717 433 ELEELEEELEELEEELEELREELAELEAELEQL---------EEDGELAELLQELEELKAELRELAEEWAALKLALELLE 503
|
490
....*....|....*....
gi 1920237968 1804 RQRQLAEED---AVRQRAE 1819
Cdd:COG4717 504 EAREEYREErlpPVLERAS 522
|
|
| CH_SMTNL2 |
cd21261 |
calponin homology (CH) domain found in smoothelin-like protein 2; Smoothelin-like protein 2 ... |
166-265 |
1.95e-14 |
|
calponin homology (CH) domain found in smoothelin-like protein 2; Smoothelin-like protein 2 (SMTNL2) is highly expressed in skeletal muscle and could be associated with differentiating myocytes. It contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409110 Cd Length: 107 Bit Score: 72.31 E-value: 1.95e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 166 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 245
Cdd:cd21261 3 KQILLEWCRSKTIGYKNIDLQNFSSSWSDGMAFCALVHSFFPEAFDYDSLSPSNRKHNFELAFSMAEKLANCDRLIEVED 82
|
90 100
....*....|....*....|..
gi 1920237968 246 VDV--PQPDEKSIITYVSSLYD 265
Cdd:cd21261 83 MMVmgRKPDPMCVFTYVQSLYN 104
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
1322-2041 |
2.64e-14 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 80.40 E-value: 2.64e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1322 YIRFISETLRRMEEEERLAEQQRAEERERLAEVEAaLEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQ 1401
Cdd:TIGR00618 140 YKTFTRVVLLPQGEFAQFLKAKSKEKKELLMNLFP-LDQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYH 218
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1402 EQKRSIQEELQHLR--QSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQAL---RARAEEAEA 1476
Cdd:TIGR00618 219 ERKQVLEKELKHLReaLQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERInraRKAAPLAAH 298
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1477 QKRQAQEEAERLRRQVQ-DETQRKRQAEAELALRVQAEAEAAREKQRALQAL--EELRLQAEEAERRLRQAEAERARQVQ 1553
Cdd:TIGR00618 299 IKAVTQIEQQAQRIHTElQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLhsQEIHIRDAHEVATSIREISCQQHTLT 378
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1554 VALETAQRSAEAELQSEHASFAEKTaqlertlkeehvavvQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRL 1633
Cdd:TIGR00618 379 QHIHTLQQQKTTLTQKLQSLCKELD---------------ILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELC 443
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1634 RLQAEEVAQQKSLTQAEAEKQKEEAerearrrgKAEEQAVRQRELAEQELEKQRQLAEgtaqQRLAAEQELIRLRAETEQ 1713
Cdd:TIGR00618 444 AAAITCTAQCEKLEKIHLQESAQSL--------KEREQQLQTKEQIHLQETRKKAVVL----ARLLELQEEPCPLCGSCI 511
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1714 GEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLAS----KARAEEESRSTSEKSKQR---------LEA 1780
Cdd:TIGR00618 512 HPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQraslKEQMQEIQQSFSILTQCDnrskedipnLQN 591
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1781 EAGRFRELAEEAARLR-ALAEEAKRQ-RQLAEEDAVRQRAEAERVLAEKLAaiSEATRLKTEAEIALKEKEAENERLRRL 1858
Cdd:TIGR00618 592 ITVRLQDLTEKLSEAEdMLACEQHALlRKLQPEQDLQDVRLHLQQCSQELA--LKLTALHALQLTLTQERVREHALSIRV 669
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1859 AEDEAFQRRLLEEQAAQHKADieaRLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELG 1938
Cdd:TIGR00618 670 LPKELLASRQLALQKMQSEKE---QLTYWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLK 746
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1939 RIRGTAEDTLRSKEQAEQEAARQrqlaaeeerrRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAE 2018
Cdd:TIGR00618 747 ELMHQARTVLKARTEAHFNNNEE----------VTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDE 816
|
730 740
....*....|....*....|....
gi 1920237968 2019 QE-SARQLQLAQEAAQKRLQAEEK 2041
Cdd:TIGR00618 817 DIlNLQCETLVQEEEQFLSRLEEK 840
|
|
| CH_PLS_FIM_rpt3 |
cd21219 |
third calponin homology (CH) domain found in the plastin/fimbrin family; This family includes ... |
35-148 |
2.83e-14 |
|
third calponin homology (CH) domain found in the plastin/fimbrin family; This family includes plastin and fimbrin. Plastin has three isoforms, plastin-1, -2, and -3. Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. Plastin-2, also called L-plastin, or LC64P, or lymphocyte cytosolic protein 1 (LCP-1), is an actin-binding protein that plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-3, also called T-plastin, is an actin-bundling protein found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Fimbrin has been found in plants and fungi. Arabidopsis thaliana fimbrin (AtFIM) includes fimbrin-1, -2, -3, -4, and -5, which cross-link actin filaments (F-actin) in a calcium independent manner. They stabilize and prevent F-actin depolymerization mediated by profilin. They act as key regulators of actin cytoarchitecture, probably involved in cell cycle, cell division, cell elongation and cytoplasmic tractus. AtFIM5 is an actin bundling factor that is required for pollen germination and pollen tube growth. Fungal fimbrin binds to actin, and functionally associates with actin structures involved in the development and maintenance of cell polarity. Members of this family contain four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409068 Cd Length: 113 Bit Score: 71.93 E-value: 2.83e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 35 DEQDERdrvqkkTFTKWVNKHLIKHwraeaqrHISDLYEDLRDGhnlISLLEVLsgDSL-P---------REKGRMRFHK 104
Cdd:cd21219 2 GSREER------AFRMWLNSLGLDP-------LINNLYEDLRDG---LVLLQVL--DKIqPgcvnwkkvnKPKPLNKFKK 63
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 1920237968 105 LQNVQIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILH 148
Cdd:cd21219 64 VENCNYAVDLAKKLGFSLVGIGGKDIADGNRKLTLALVWQLMRY 107
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
938-1477 |
3.09e-14 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 79.98 E-value: 3.09e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 938 EDRLQAEREYGSCSRHYQQLLQSLEQGEQEESRCQRCISELKDIRLQLEACETRTVHRLrlpldkeparecAQRITEQQK 1017
Cdd:COG1196 274 LELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEEL------------EELEEELEE 341
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1018 AQAEVDGLGKGVARLSAEAEKVLAlpepspaapTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAH 1097
Cdd:COG1196 342 LEEELEEAEEELEEAEAELAEAEE---------ALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEAL 412
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1098 EEQLKEAQAvpatlpELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTLLLERw 1177
Cdd:COG1196 413 LERLERLEE------ELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEE- 485
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1178 qavlaqtdvRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEkALLEDIERHGEKVEEC 1257
Cdd:COG1196 486 ---------LAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAA-LEAALAAALQNIVVED 555
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1258 QRFAKQYINAIKDYELQLVT-YKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEE 1336
Cdd:COG1196 556 DEVAAAAIEYLKAAKAGRATfLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAA 635
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1337 ERLAEQQRAEERERLAEVEAALEKQRqLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQ 1416
Cdd:COG1196 636 LRRAVTLAGRLREVTLEGEGGSAGGS-LTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEE 714
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237968 1417 SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQ 1477
Cdd:COG1196 715 ERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLERE 775
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1140-1919 |
3.20e-14 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 80.11 E-value: 3.20e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1140 DELRGAQEVGERLQQRHGERDvEVERWRERVTLLLERwqaVLAQTDVRQRELEQLGRqlryYREsadplgawLRDAKQRQ 1219
Cdd:TIGR02169 160 DEIAGVAEFDRKKEKALEELE-EVEENIERLDLIIDE---KRQQLERLRREREKAER----YQA--------LLKEKREY 223
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1220 EQiqAVPLANSQAVREQLRQEKALLEDIERHGEKVEEcqrfakqyinAIKDYELQLVTYKAQLEPVASPAKKpkvqSGSE 1299
Cdd:TIGR02169 224 EG--YELLKEKEALERQKEAIERQLASLEEELEKLTE----------EISELEKRLEEIEQLLEELNKKIKD----LGEE 287
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1300 SIIQEYVDLRTRYSELSTLTSQyIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREA 1379
Cdd:TIGR02169 288 EQLRVKEKIGELEAEIASLERS-IAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEEL 366
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1380 QGLQRRMQEEVARREEVAVEAQEQKRSIqEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGG 1459
Cdd:TIGR02169 367 EDLRAELEEVDKEFAETRDELKDYREKL-EKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKED 445
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1460 AEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAlRVQAEAEAAREKQRALQALEEL---RLQA-- 1534
Cdd:TIGR02169 446 KALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELA-EAEAQARASEERVRGGRAVEEVlkaSIQGvh 524
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1535 ----------EEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFA-----EKTAQLERTLKEEHVA-------- 1591
Cdd:TIGR02169 525 gtvaqlgsvgERYATAIEVAAGNRLNNVVVEDDAVAKEAIELLKRRKAGRAtflplNKMRDERRDLSILSEDgvigfavd 604
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1592 --------------------VVQLREEATRRAQQQAEAERA--------------RAEAERELERWQLKAnEALRLRLQA 1637
Cdd:TIGR02169 605 lvefdpkyepafkyvfgdtlVVEDIEAARRLMGKYRMVTLEgelfeksgamtggsRAPRGGILFSRSEPA-ELQRLRERL 683
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1638 EEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAV---RQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQG 1714
Cdd:TIGR02169 684 EGLKRELSSLQSELRRIENRLDELSQELSDASRKIGeieKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKEL 763
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1715 EQQRQLLEEELARLQreAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTsEKSKQRLEAEAGRFRELAEEAAR 1794
Cdd:TIGR02169 764 EARIEELEEDLHKLE--EALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREI-EQKLNRLTLEKEYLEKEIQELQE 840
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1795 LRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAA 1874
Cdd:TIGR02169 841 QRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLS 920
|
810 820 830 840 850
....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1875 QHKADIEA---RLAQLRKASESELERQKGL--VEDTLRQRRQVEEEILAL 1919
Cdd:TIGR02169 921 ELKAKLEAleeELSEIEDPKGEDEEIPEEElsLEDVQAELQRVEEEIRAL 970
|
|
| CH_FLN_rpt2 |
cd21230 |
second calponin homology (CH) domain found in filamins; The filamin family includes filamin-A ... |
164-261 |
5.10e-14 |
|
second calponin homology (CH) domain found in filamins; The filamin family includes filamin-A (FLN-A), filamin-B (FLN-B) and filamin-C (FLN-C). Filamins function to anchor various transmembrane proteins to the actin cytoskeleton. FLN-A is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-B is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton and may also promote orthogonal branching of actin filaments as well as link actin filaments to membrane glycoproteins. FLN-C, also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. Members of this family contain two copies of the CH domain. The model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409079 Cd Length: 103 Bit Score: 70.87 E-value: 5.10e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 164 TAKEKLLLWSQRMVEGcqgLRCDNFTTSWRDGRLFNAIIHRHKPTLI----DMNKVYRqtnLENLDQAFSVAERDLGVTR 239
Cdd:cd21230 1 TPKQRLLGWIQNKIPQ---LPITNFTTDWNDGRALGALVDSCAPGLCpdweTWDPNDA---LENATEAMQLAEDWLGVPQ 74
|
90 100
....*....|....*....|..
gi 1920237968 240 LLDPEDVDVPQPDEKSIITYVS 261
Cdd:cd21230 75 LITPEEIINPNVDEMSVMTYLS 96
|
|
| CH_SMTNA |
cd21258 |
calponin homology (CH) domain found in smoothelin-A and similar proteins; Smoothelins are ... |
166-269 |
5.18e-14 |
|
calponin homology (CH) domain found in smoothelin-A and similar proteins; Smoothelins are actin-binding cytoskeletal proteins that are abundantly expressed in healthy visceral (smoothelin-A) and vascular (smoothelin-B) smooth muscle. This model corresponds to the single CH domain of smoothelin-A. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409107 Cd Length: 111 Bit Score: 71.23 E-value: 5.18e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 166 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 245
Cdd:cd21258 3 KQMLLDWCRAKTRGYEHVDIQNFSSSWSDGMAFCALVHNFFPDAFDYSQLSPQNRRQNFEVAFSAAEMLADCVPLVEVED 82
|
90 100
....*....|....*....|....*.
gi 1920237968 246 VDV--PQPDEKSIITYVSSLYDAMPR 269
Cdd:cd21258 83 MMImgKKPDSKCVFTYVQSLYNHLRR 108
|
|
| CH_CYTS |
cd21199 |
calponin homology (CH) domain found in the cytospin family; The cytospin family includes ... |
169-264 |
6.06e-14 |
|
calponin homology (CH) domain found in the cytospin family; The cytospin family includes cytospin-A and cytospin-B. Cytospin-A, also called renal carcinoma antigen NY-REN-22, sperm antigen with calponin homology and coiled-coil domains 1-like, or SPECC1-like (SPECC1L) protein, is involved in cytokinesis and spindle organization. It may play a role in actin cytoskeleton organization and microtubule stabilization and hence, is required for proper cell adhesion and migration. Cytospin-B, also called nuclear structure protein 5 (NSP5), sperm antigen HCMOGT-1, or sperm antigen with calponin homology and coiled-coil domains 1 (SPECC1), is a novel fusion partner to PDGFRB in juvenile myelomonocytic leukemia with translocation t(5;17)(q33;p11.2). Members of this family contain a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409048 Cd Length: 112 Bit Score: 70.85 E-value: 6.06e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 169 LLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAErDLGVTRLLDPED-VD 247
Cdd:cd21199 13 LLKWCQEKTQGYKGIDITNFSSSWNDGLAFCALLHSYLPDKIPYSELNPQDKRRNFTLAFKAAE-SVGIPTTLTIDEmVS 91
|
90
....*....|....*..
gi 1920237968 248 VPQPDEKSIITYVSSLY 264
Cdd:cd21199 92 MERPDWQSVMSYVTAIY 108
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1315-2226 |
6.40e-14 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 79.06 E-value: 6.40e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1315 LSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEV----EAALEKQRQLAEAHAQAkAQAEREAQGLQRRMQEEV 1390
Cdd:pfam01576 178 LSKLKNKHEAMISDLEERLKKEEKGRQELEKAKRKLEGEStdlqEQIAELQAQIAELRAQL-AKKEEELQAALARLEEET 256
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1391 ARREEvaveAQEQKRSIQEELQHLRQSSEAEIQAKARqveaAERSRLRIEEEIRVVRLQLEATErqrgGAEGELQALRAR 1470
Cdd:pfam01576 257 AQKNN----ALKKIRELEAQISELQEDLESERAARNK----AEKQRRDLGEELEALKTELEDTL----DTTAAQQELRSK 324
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1471 AE-EAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAlrvqaeaEAAREKQRALQALEELRlQAEEAERRLRQAEAERA 1549
Cdd:pfam01576 325 REqEVTELKKALEEETRSHEAQLQEMRQKHTQALEELT-------EQLEQAKRNKANLEKAK-QALESENAELQAELRTL 396
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1550 RQVQVALETAQRSAEAELQSEHASFAEktAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANE 1629
Cdd:pfam01576 397 QQAKQDSEHKRKKLEGQLQELQARLSE--SERQRAELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLESQLQDTQ 474
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1630 ALRlrlqAEEVAQQKSLTQAEAEKqkeeaerearrrgkaEEQAVRQRELAEQELEKQRQLAegtaQQRLAAEQELIRLRA 1709
Cdd:pfam01576 475 ELL----QEETRQKLNLSTRLRQL---------------EDERNSLQEQLEEEEEAKRNVE----RQLSTLQAQLSDMKK 531
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1710 ETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAK-------VRAEMEVLLAS-------------------KARA 1763
Cdd:pfam01576 532 KLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKlektknrLQQELDDLLVDldhqrqlvsnlekkqkkfdQMLA 611
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1764 EEESRSTS-EKSKQRLEAEAgrfRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAA---ISEATRLKT 1839
Cdd:pfam01576 612 EEKAISARyAEERDRAEAEA---REKETRALSLARALEEALEAKEELERTNKQLRAEMEDLVSSKDDVgknVHELERSKR 688
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1840 EAEIALKEKEAENERLR-RLAEDEAFQRRL---LEEQAAQHKADIEARlaqlrkaSESELERQKGLVedtlRQRRQVEEE 1915
Cdd:pfam01576 689 ALEQQVEEMKTQLEELEdELQATEDAKLRLevnMQALKAQFERDLQAR-------DEQGEEKRRQLV----KQVRELEAE 757
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1916 ILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKA 1995
Cdd:pfam01576 758 LEDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAVKQLKKLQAQMKDLQRELEEARASRDEILAQSKESEKKLKN 837
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1996 ALEEVERLKAKVEEARRLRERAEQESAR-QLQLAQEAAQKRLQAEEKAHAFA-VQQKEQELQQTLQQEQSVLERLRSEAE 2073
Cdd:pfam01576 838 LEAELLQLQEDLAASERARRQAQQERDElADEIASGASGKSALQDEKRRLEArIAQLEEELEEEQSNTELLNDRLRKSTL 917
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2074 AARR---AAEEAEAARERAEREAAQSRRQVEEAeRLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQ 2150
Cdd:pfam01576 918 QVEQlttELAAERSTSQKSESARQQLERQNKEL-KAKLQEMEGTVKSKFKSSIAALEAKIAQLEEQLEQESRERQAANKL 996
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2151 AADAE---------MEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFS 2221
Cdd:pfam01576 997 VRRTEkklkevllqVEDERRHADQYKDQAEKGNSRMKQLKRQLEEAEEEASRANAARRKLQRELDDATESNESMNREVST 1076
|
....*
gi 1920237968 2222 LRVQM 2226
Cdd:pfam01576 1077 LKSKL 1081
|
|
| CH_DIXDC1 |
cd21213 |
calponin homology (CH) domain found in Dixin and similar proteins; Dixin, also called ... |
44-149 |
6.94e-14 |
|
calponin homology (CH) domain found in Dixin and similar proteins; Dixin, also called coiled-coil protein DIX1, coiled-coil-DIX1, or DIX domain-containing protein 1, is a positive effector of the Wnt signaling pathway. It activates WNT3A signaling via DVL2 and regulates JNK activation by AXIN1 and DVL2. Members of this family contain a single copy of the CH domain at the N-terminus. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409062 Cd Length: 107 Bit Score: 70.40 E-value: 6.94e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 44 QKKTFTKWVNKHLIKHwraEAQRHISDLYEDLRDGHNLISLLEVLSGDSL------PREKGRMRfhklQNVQIALDYLRH 117
Cdd:cd21213 1 QLQAYVAWVNSQLKKR---PGIRPVQDLRRDLRDGVALAQLIEILAGEKLpgidwnPTTDAERK----ENVEKVLQFMAS 73
|
90 100 110
....*....|....*....|....*....|..
gi 1920237968 118 RQVKLVNIRNDDIADGNPKLTLGLIWTIILHF 149
Cdd:cd21213 74 KRIRMHQTSAKDIVDGNLKAIMRLILALAAHF 105
|
|
| CH_CTX_rpt1 |
cd21225 |
first calponin homology (CH) domain found in cortexillin; Cortexillins are actin-bundling ... |
41-145 |
7.28e-14 |
|
first calponin homology (CH) domain found in cortexillin; Cortexillins are actin-bundling proteins that play a critical role in regulating cell morphology and actin cytoskeleton reorganization. They play a major role in cytokinesis and contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409074 Cd Length: 111 Bit Score: 70.64 E-value: 7.28e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 41 DRVQKKTFTKWVNKHLIKhwRAEAQrhISDLYEDLRDGHNLISLLEVLSGDSLPRE---KGRMRFHKLQNVQIALDYLRH 117
Cdd:cd21225 2 EKVQIKAFTAWVNSVLEK--RGIPK--ISDLATDLSDGVRLIFFLELVSGKKFPKKfdlEPKNRIQMIQNLHLAMLFIEE 77
|
90 100
....*....|....*....|....*....
gi 1920237968 118 R-QVKLVNIRNDDIADGNPKLTLGLIWTI 145
Cdd:cd21225 78 DlKIRVQGIGAEDFVDNNKKLILGLLWTL 106
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1064-1567 |
1.11e-13 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 78.42 E-value: 1.11e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1064 EQVRSLSAI--YLEKLKTISLVIRSTQEAEEVLRAHEEQLKEAQAVPatlpELEATKAALKKLRAQAEAQQPVFDALRDE 1141
Cdd:COG4913 249 EQIELLEPIreLAERYAAARERLAELEYLRAALRLWFAQRRLELLEA----ELEELRAELARLEAELERLEARLDALREE 324
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1142 LRGAQEV-----GERLQQRhgERDVE-VERWRERVTLLLERWQAVLAQTDVR---------------QRELEQLGRQLRY 1200
Cdd:COG4913 325 LDELEAQirgngGDRLEQL--EREIErLERELEERERRRARLEALLAALGLPlpasaeefaalraeaAALLEALEEELEA 402
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1201 YRESADPLGAWLRDAKQRQEQIQA-----------VPlANSQAVREQLRQEKALLED---------------------IE 1248
Cdd:COG4913 403 LEEALAEAEAALRDLRRELRELEAeiaslerrksnIP-ARLLALRDALAEALGLDEAelpfvgelievrpeeerwrgaIE 481
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1249 R--HGEK----VEEcQRFAK--QYINAIKDyELQLVTYKAQLEPVASPAKKPKVQS------GSESIIQEYVD--LRTRY 1312
Cdd:COG4913 482 RvlGGFAltllVPP-EHYAAalRWVNRLHL-RGRLVYERVRTGLPDPERPRLDPDSlagkldFKPHPFRAWLEaeLGRRF 559
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1313 S--------ELST----LTSQYIRFISETLRRMEEEERL---------AEQQRAEERERLAEVEAALEKQRQLAEAHAQA 1371
Cdd:COG4913 560 DyvcvdspeELRRhpraITRAGQVKGNGTRHEKDDRRRIrsryvlgfdNRAKLAALEAELAELEEELAEAEERLEALEAE 639
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1372 KAQAEREAQGLQRRmqEEVARREEVAVEAQEQKRSIQEELQHLRQSSeAEIQAKARQVEAAERSRLRIEEEIRVVRLQLE 1451
Cdd:COG4913 640 LDALQERREALQRL--AEYSWDEIDVASAEREIAELEAELERLDASS-DDLAALEEQLEELEAELEELEEELDELKGEIG 716
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1452 ATERQRGGAEGELQALRARAEEAE------------------AQKRQAQEEAERLRRQVQDETQRKRQAEAEL------- 1506
Cdd:COG4913 717 RLEKELEQAEEELDELQDRLEAAEdlarlelralleerfaaaLGDAVERELRENLEERIDALRARLNRAEEELeramraf 796
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237968 1507 -------ALRVQAEAEAAREKQRALQALEELRL---QAEEAERRLRQAEAERArQVQVALETAQRSAEAEL 1567
Cdd:COG4913 797 nrewpaeTADLDADLESLPEYLALLDRLEEDGLpeyEERFKELLNENSIEFVA-DLLSKLRRAIREIKERI 866
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3712-3750 |
1.12e-13 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 67.74 E-value: 1.12e-13
10 20 30
....*....|....*....|....*....|....*....
gi 1920237968 3712 LLEAQAATGFLLDPVKGERLAVDEAVRKGLVGPELHDRL 3750
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1624-2484 |
1.48e-13 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 77.80 E-value: 1.48e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1624 QLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLA---- 1699
Cdd:TIGR02169 231 EKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLersi 310
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1700 --AEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQR 1777
Cdd:TIGR02169 311 aeKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDY 390
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1778 LEAEAGRFRELAEEAARLRALAEEAKRQRQlaeedavrQRAEAERVLAEKLAAISEatrLKTEAEIALKEKEAENERLRR 1857
Cdd:TIGR02169 391 REKLEKLKREINELKRELDRLQEELQRLSE--------ELADLNAAIAGIEAKINE---LEEEKEDKALEIKKQEWKLEQ 459
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1858 LAED-EAFQRRLLEEQAAQhkADIEARLAQLRKaSESELERQKGLVEDTLRQRRQVEEeilalkgsfekaaagkaELELE 1936
Cdd:TIGR02169 460 LAADlSKYEQELYDLKEEY--DRVEKELSKLQR-ELAEAEAQARASEERVRGGRAVEE-----------------VLKAS 519
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1937 LGRIRGTAEDTLRSKEQ---AEQEAARQRqlaaeeerrrrEAEERVQKSLAAEE--EAARQRK---AALEEVERLKAKVE 2008
Cdd:TIGR02169 520 IQGVHGTVAQLGSVGERyatAIEVAAGNR-----------LNNVVVEDDAVAKEaiELLKRRKagrATFLPLNKMRDERR 588
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2009 EARRLRERAEQESARQLqlaQEAAQKRlqaeEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARER 2088
Cdd:TIGR02169 589 DLSILSEDGVIGFAVDL---VEFDPKY----EPAFKYVFGDTLVVEDIEAARRLMGKYRMVTLEGELFEKSGAMTGGSRA 661
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2089 AEREAAQSRRQVEEAERL---KQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQ 2165
Cdd:TIGR02169 662 PRGGILFSRSEPAELQRLrerLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEE 741
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2166 ALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQveEELFSLRVQMEELGKLKARIEAENRALvl 2245
Cdd:TIGR02169 742 LEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSH--SRIPEIQAELSKLEEEVSRIEARLREI-- 817
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2246 rDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLaEEDLAQQRALAEKmLKEKMQAVQEatrLKAEAELLQQQK 2325
Cdd:TIGR02169 818 -EQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEI-ENLNGKKEELEEE-LEELEAALRD---LESRLGDLKKER 891
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2326 ELAQEQARRLQEDKEQMAQQlaqetqgfqktLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRfRKQAEDIGER 2405
Cdd:TIGR02169 892 DELEAQLRELERKIEELEAQ-----------IEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEE-ELSLEDVQAE 959
|
810 820 830 840 850 860 870
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 2406 LYRTELAtqekvmlVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSF 2484
Cdd:TIGR02169 960 LQRVEEE-------IRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKKKREVFMEAFEAINENF 1031
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1055-1577 |
1.70e-13 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 77.65 E-value: 1.70e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1055 ELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEvLRAHEEQLKEAQAvpaTLPELEATKAALKKLRAQAEAQQpv 1134
Cdd:COG4913 266 AARERLAELEYLRAALRLWFAQRRLELLEAELEELRAE-LARLEAELERLEA---RLDALREELDELEAQIRGNGGDR-- 339
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1135 FDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTLLLERWQAVLAQTdvrQRELEQLGRQLRYYRESADPLGAWLRD 1214
Cdd:COG4913 340 LEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEA---AALLEALEEELEALEEALAEAEAALRD 416
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1215 AKQRQEQIQA-----------VPlANSQAVREQLRQEKALLED---------------------IER--HGEK----VEE 1256
Cdd:COG4913 417 LRRELRELEAeiaslerrksnIP-ARLLALRDALAEALGLDEAelpfvgelievrpeeerwrgaIERvlGGFAltllVPP 495
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1257 cQRFAK--QYINAIKDyELQLVTYKAQLEPVASPAKKPKVQSGSEsiiqeyvdlrtrysELSTLTSQYIRFISETLRRM- 1333
Cdd:COG4913 496 -EHYAAalRWVNRLHL-RGRLVYERVRTGLPDPERPRLDPDSLAG--------------KLDFKPHPFRAWLEAELGRRf 559
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1334 -----EEEERLAEQQRAEERERLA-EVEAALEK--QRQLAEAH------AQAKAQAEREAQGLQRRMQEEVARREEVAvE 1399
Cdd:COG4913 560 dyvcvDSPEELRRHPRAITRAGQVkGNGTRHEKddRRRIRSRYvlgfdnRAKLAALEAELAELEEELAEAEERLEALE-A 638
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1400 AQEQKRSIQEELQHLRQSSEAEIQakarqVEAAERSRLRIEEEIRvvrlQLEAterqrggAEGELQALRARAEEAEAQKR 1479
Cdd:COG4913 639 ELDALQERREALQRLAEYSWDEID-----VASAEREIAELEAELE----RLDA-------SSDDLAALEEQLEELEAELE 702
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1480 QAQEEAERLRRQVQDETQRKRQAEAEL-ALRVQAEAEAAREKQRALQALEELRlqAEEAERRLRQAEAERARQVQVALET 1558
Cdd:COG4913 703 ELEEELDELKGEIGRLEKELEQAEEELdELQDRLEAAEDLARLELRALLEERF--AAALGDAVERELRENLEERIDALRA 780
|
570
....*....|....*....
gi 1920237968 1559 AQRSAEAELQSEHASFAEK 1577
Cdd:COG4913 781 RLNRAEEELERAMRAFNRE 799
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
1705-2518 |
2.29e-13 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 77.32 E-value: 2.29e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1705 IRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAE--MEVLLASKARAEEESRSTSEKSKQRLEAea 1782
Cdd:TIGR00618 94 LRCTRSHRKTEQPEQLYLEQKKGRGRILAAKKSETEEVIHDLLKLDYKtfTRVVLLPQGEFAQFLKAKSKEKKELLMN-- 171
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1783 grfrelAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAErvlAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDE 1862
Cdd:TIGR00618 172 ------LFPLDQYTQLALMEFAKKKSLHGKAELLTLRSQ---LLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSH 242
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1863 AFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEeilalkgsfeKAAAGKAELELELGRIRG 1942
Cdd:TIGR00618 243 AYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRARKAAP----------LAAHIKAVTQIEQQAQRI 312
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1943 TAEdtLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESA 2022
Cdd:TIGR00618 313 HTE--LQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQQKTT 390
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2023 rQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEE 2102
Cdd:TIGR00618 391 -LTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLK 469
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2103 AERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRL 2182
Cdd:TIGR00618 470 EREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYH 549
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2183 QLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLkarIEAENRAlvlrdKDSAQRLLQEEAEKm 2262
Cdd:TIGR00618 550 QLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDL---TEKLSEA-----EDMLACEQHALLRK- 620
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2263 KQVAEEAARLSVAAQEAARLRQLAEEDLAQqraLAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQA--RRLQEDKE 2340
Cdd:TIGR00618 621 LQPEQDLQDVRLHLQQCSQELALKLTALHA---LQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSekEQLTYWKE 697
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2341 QMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEdigERLYRTELATQEKVMLV 2420
Cdd:TIGR00618 698 MLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELMHQARTVLK---ARTEAHFNNNEEVTAAL 774
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2421 QTLeTQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQ 2500
Cdd:TIGR00618 775 QTG-AELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLK 853
|
810
....*....|....*...
gi 1920237968 2501 EKAKLEQlfQDEVAKAQA 2518
Cdd:TIGR00618 854 YEECSKQ--LAQLTQEQA 869
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
2146-2605 |
2.83e-13 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 77.02 E-value: 2.83e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2146 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQ 2225
Cdd:TIGR02168 231 VLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRER 310
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2226 MEELGKLKARIEAEnRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKM 2305
Cdd:TIGR02168 311 LANLERQLEELEAQ-LEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVA 389
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2306 QAVQEATRLKAEAELLQQQKE-LAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRA 2384
Cdd:TIGR02168 390 QLELQIASLNNEIERLEARLErLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREE 469
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2385 QARAEEDARRFRKQAEDIGERLYRTE-----------------------------LATQEKV------------------ 2417
Cdd:TIGR02168 470 LEEAEQALDAAERELAQLQARLDSLErlqenlegfsegvkallknqsglsgilgvLSELISVdegyeaaieaalggrlqa 549
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2418 MLVQTLETQRQ--QSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEE-----------------------MQTVRQEQ 2472
Cdd:TIGR02168 550 VVVENLNAAKKaiAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEgflgvakdlvkfdpklrkalsylLGGVLVVD 629
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2473 LLQETQALQ------------------------QSFLSEKDSLLQRERCIEQEKAKLEQLfQDEVAKAQALREEQQRQQQ 2528
Cdd:TIGR02168 630 DLDNALELAkklrpgyrivtldgdlvrpggvitGGSAKTNSSILERRREIEELEEKIEEL-EEKIAELEKALAELRKELE 708
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237968 2529 QMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEEIA 2605
Cdd:TIGR02168 709 ELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIE 785
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
1345-2451 |
3.62e-13 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 76.92 E-value: 3.62e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1345 AEERERLaeVEAALEKQRQLAEAHAQAKAQAEReaqglqrrmQEEVARREEvavEAQEQKRSIQEELQ----HLRQSSEA 1420
Cdd:PRK04863 278 ANERRVH--LEEALELRRELYTSRRQLAAEQYR---------LVEMARELA---ELNEAESDLEQDYQaasdHLNLVQTA 343
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1421 EIQAKA--RQVEAAERSRLRIEEEIRVVRlqlEATERQrggaegelqalraraEEAEAQKRQAQEEAERLRRQVQDETQr 1498
Cdd:PRK04863 344 LRQQEKieRYQADLEELEERLEEQNEVVE---EADEQQ---------------EENEARAEAAEEEVDELKSQLADYQQ- 404
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1499 krqaeaelALRVQAEAeaAREKQRALQALEELR-------LQAEEAERRLRQAEAERARQVQVALETAQRSAEAElqsEH 1571
Cdd:PRK04863 405 --------ALDVQQTR--AIQYQQAVQALERAKqlcglpdLTADNAEDWLEEFQAKEQEATEELLSLEQKLSVAQ---AA 471
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1572 ASFAEKTAQLERTLKEEHVavvqlREEAtrraqqqaeaeraraeaerelerWQlKANEALRlrlqaeevaqqksltqaea 1651
Cdd:PRK04863 472 HSQFEQAYQLVRKIAGEVS-----RSEA-----------------------WD-VARELLR------------------- 503
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1652 ekqkeeaerearrrgkaeeQAVRQRELAEQELEKQRQLAEgtAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQRE 1731
Cdd:PRK04863 504 -------------------RLREQRHLAEQLQQLRMRLSE--LEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEE 562
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1732 AAAAtqkRRELEAELAKVRAEMEVLlaskaRAEEESrstSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEE 1811
Cdd:PRK04863 563 LEAR---LESLSESVSEARERRMAL-----RQQLEQ---LQARIQRLAARAPAWLAAQDALARLREQSGEEFEDSQDVTE 631
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1812 dAVRQRAEAERVLAEKLAAISEA-TRLKTEAEIALKEKEAENERLRRLAEDeaFQRRLLEEQ----AAQHKADIEARLAQ 1886
Cdd:PRK04863 632 -YMQQLLERERELTVERDELAARkQALDEEIERLSQPGGSEDPRLNALAER--FGGVLLSEIyddvSLEDAPYFSALYGP 708
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1887 LRKASeseLERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAG--KAElELELGRIRGTAEDTLR-SKEQAEQ---EAAR 1960
Cdd:PRK04863 709 ARHAI---VVPDLSDAAEQLAGLEDCPEDLYLIEGDPDSFDDSvfSVE-ELEKAVVVKIADRQWRySRFPEVPlfgRAAR 784
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1961 QRQLaaeeerrrreaeervqkslaaeEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLA---------QEA 2031
Cdd:PRK04863 785 EKRI----------------------EQLRAEREELAERYATLSFDVQKLQRLHQAFSRFIGSHLAVAfeadpeaelRQL 842
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2032 AQKRLQAEEKahafavqqkeqelqqtlqqeqsvLERLRSeaeaarraaeeaeaareraerEAAQSRRQVEEAERLKQSae 2111
Cdd:PRK04863 843 NRRRVELERA-----------------------LADHES---------------------QEQQQRSQLEQAKEGLSA-- 876
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2112 eqaqaqaqaqaaaekLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRlqleeTDhqk 2191
Cdd:PRK04863 877 ---------------LNRLLPRLNLLADETLADRVEEIREQLDEAEEAKRFVQQHGNALAQLEPIVSVLQ-----SD--- 933
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2192 silDEELQRLKAEVTEAARQRGQVEEELFSLrvqmEELGKLKARIEAENRALVLRDKDSAQRLLQ---EEAEKMKQVAEE 2268
Cdd:PRK04863 934 ---PEQFEQLKQDYQQAQQTQRDAKQQAFAL----TEVVQRRAHFSYEDAAEMLAKNSDLNEKLRqrlEQAEQERTRARE 1006
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2269 AARlsvaaQEAARLRQLAE--EDLAQQRALAEKMLKEKMQAVQEAT-RLKAEAE-LLQQQKELAQEQARRLQEDKEQMAQ 2344
Cdd:PRK04863 1007 QLR-----QAQAQLAQYNQvlASLKSSYDAKRQMLQELKQELQDLGvPADSGAEeRARARRDELHARLSANRSRRNQLEK 1081
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2345 QLaqetqgfqkTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEkvmlvqtLE 2424
Cdd:PRK04863 1082 QL---------TFCEAEMDNLTKKLRKLERDYHEMREQVVNAKAGWCAVLRLVKDNGVERRLHRRELAYLS-------AD 1145
|
1130 1140
....*....|....*....|....*..
gi 1920237968 2425 TQRQQSDRDAERLREAIAELEHEKDKL 2451
Cdd:PRK04863 1146 ELRSMSDKALGALRLAVADNEHLRDVL 1172
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1111-1570 |
4.94e-13 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 75.57 E-value: 4.94e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1111 LPELEATKAALKKLRAQAEAqqpvFDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTL--LLERWQAVLAQTDVRQ 1188
Cdd:COG4717 70 LKELKELEEELKEAEEKEEE----YAELQEELEELEEELEELEAELEELREELEKLEKLLQLlpLYQELEALEAELAELP 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1189 RELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQekaLLEDIERHGEKVEECQRFAKQYINAI 1268
Cdd:COG4717 146 ERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQD---LAEELEELQQRLAELEEELEEAQEEL 222
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1269 KDYELQLVTYKAQLEPVASPAK--KPKVQSGSESII-------QEYVDLRTRYSELSTLTSQYIRFISETLRRMEE--EE 1337
Cdd:COG4717 223 EELEEELEQLENELEAAALEERlkEARLLLLIAAALlallglgGSLLSLILTIAGVLFLVLGLLALLFLLLAREKAslGK 302
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1338 RLAEQQRAEERERL--AEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKrsIQEELQHLR 1415
Cdd:COG4717 303 EAEELQALPALEELeeEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQE--IAALLAEAG 380
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1416 QSSEAEIQAKARQVEAAErsrlRIEEEIRVVRLQLEA--TERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQ 1493
Cdd:COG4717 381 VEDEEELRAALEQAEEYQ----ELKEELEELEEQLEEllGELEELLEALDEEELEEELEELEEELEELEEELEELREELA 456
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 1494 DETQRKRQAEAElalrvqaeaeaarekqralQALEELRLQAEEAERRLRQAEAERARQ--VQVALETAQRSAEAELQSE 1570
Cdd:COG4717 457 ELEAELEQLEED-------------------GELAELLQELEELKAELRELAEEWAALklALELLEEAREEYREERLPP 516
|
|
| CH_jitterbug-like_rpt2 |
cd21229 |
second calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and ... |
166-261 |
6.82e-13 |
|
second calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and similar proteins; Protein jitterbug (Jbug) is an actin-meshwork organizing protein. It is required to maintain the shape and cell orientation of the Drosophila notum epithelium during flight muscle attachment to tendon cells. Jbug contains three copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409078 Cd Length: 105 Bit Score: 67.80 E-value: 6.82e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 166 KEKLLLWSQRMVEGCqglRCDNFTTSWRDGRLFNAIIHRHKPTLI-DMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPE 244
Cdd:cd21229 5 KKLMLAWLQAVLPEL---KITNFSTDWNDGIALSALLDYCKPGLCpNWRKLDPSNSLENCRRAMDLAKREFNIPMVLSPE 81
|
90
....*....|....*..
gi 1920237968 245 DVDVPQPDEKSIITYVS 261
Cdd:cd21229 82 DLSSPHLDELSGMTYLS 98
|
|
| CH_ASPM_rpt1 |
cd21223 |
first calponin homology (CH) domain found in abnormal spindle-like microcephaly-associated ... |
63-146 |
7.28e-13 |
|
first calponin homology (CH) domain found in abnormal spindle-like microcephaly-associated protein (ASPM) and similar proteins; ASPM, also called abnormal spindle protein homolog, or Asp homolog, is involved in mitotic spindle regulation and coordination of mitotic processes. It may also have a preferential role in regulating neurogenesis. Members of this family contain two copies of the CH domain in the middle region. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409072 Cd Length: 113 Bit Score: 68.00 E-value: 7.28e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 63 EAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRM----RFHKLQNVQIALDYLRHRQV----KLVNIRNDDIADGN 134
Cdd:cd21223 21 EFDFAVTNLAVDLRDGVRLCRLVELLTGDWSLLSKLRVpaisRLQKLHNVEVALKALKEAGVlrggDGGGITAKDIVDGH 100
|
90
....*....|..
gi 1920237968 135 PKLTLGLIWTII 146
Cdd:cd21223 101 REKTLALLWRII 112
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
2148-2678 |
7.80e-13 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 75.95 E-value: 7.80e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2148 QKQAADAEMEKHKQFAEQALRqkaqVEQELTAlrlqleETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRV-QM 2226
Cdd:PTZ00121 1121 KKKAEDARKAEEARKAEDARK----AEEARKA------EDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVrKA 1190
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2227 EELGKLKA--RIEAENRALVLRDKDSAQRllQEEAEKMKQV--AEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLK 2302
Cdd:PTZ00121 1191 EELRKAEDarKAEAARKAEEERKAEEARK--AEDAKKAEAVkkAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARR 1268
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2303 EKMQAVQEATrlKAEaELLQQQKELAQEQARRLQEDKEqmAQQLAQETQGFQKTLETERQRQlEMSAEAERLRLRVAEMS 2382
Cdd:PTZ00121 1269 QAAIKAEEAR--KAD-ELKKAEEKKKADEAKKAEEKKK--ADEAKKKAEEAKKADEAKKKAE-EAKKKADAAKKKAEEAK 1342
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2383 RAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLEtQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQlKS 2462
Cdd:PTZ00121 1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK-KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKK-KA 1420
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2463 EEMQTvRQEQLLQETQALQQSFLSEKDSLLQRErciEQEKAKLEQLFQ--DEVAKAQALREEQQRqqqqmqqekqqlAAS 2540
Cdd:PTZ00121 1421 DEAKK-KAEEKKKADEAKKKAEEAKKADEAKKK---AEEAKKAEEAKKkaEEAKKADEAKKKAEE------------AKK 1484
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2541 MEEARRRQHEAeegvRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALAR-SEEIAPSRA-AAARALPNG 2618
Cdd:PTZ00121 1485 ADEAKKKAEEA----KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKkAEEKKKADElKKAEELKKA 1560
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 2619 QDAADGPAAAAEPEHAFDGLRR-----KVPAQRLQEVGVL-------SAEELQQLAQGRTTVAELAQREDVR 2678
Cdd:PTZ00121 1561 EEKKKAEEAKKAEEDKNMALRKaeeakKAEEARIEEVMKLyeeekkmKAEEAKKAEEAKIKAEELKKAEEEK 1632
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3046-3084 |
7.91e-13 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 65.04 E-value: 7.91e-13
10 20 30
....*....|....*....|....*....|....*....
gi 1920237968 3046 LLEAQAGTGHIIDPTTSARLTVDEAVRAGLVGPELHEKL 3084
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| CH_SMTNL1 |
cd21260 |
calponin homology (CH) domain found in smoothelin-like protein 1; Smoothelin-like protein 1 ... |
166-267 |
9.25e-13 |
|
calponin homology (CH) domain found in smoothelin-like protein 1; Smoothelin-like protein 1 (SMTNL1), also called calponin homology-associated smooth muscle protein (CHASM), plays a role in the regulation of contractile properties of both striated and smooth muscles. It can bind to calmodulin and tropomyosin. When it is unphosphorylated, SMTNL1 may inhibit myosin dephosphorylation. SMTNL1 contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409109 Cd Length: 116 Bit Score: 67.80 E-value: 9.25e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 166 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 245
Cdd:cd21260 3 KNMLLEWCRAKTRGYEHVDIQNFSSSWSSGMAFCALIHKFFPDAFDYAELDPANRRHNFTLAFSTAEKHADCAPLLEVED 82
|
90 100
....*....|....*....|...
gi 1920237968 246 -VDVPQPDEKSIITYVSSLYDAM 267
Cdd:cd21260 83 mVRMSVPDSKCVYTYIQELYRSL 105
|
|
| YqiK |
COG2268 |
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown]; |
1314-1531 |
1.09e-12 |
|
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
Pssm-ID: 441869 [Multi-domain] Cd Length: 439 Bit Score: 73.75 E-value: 1.09e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1314 ELSTLTSQYIR-----FISETLRRMEEEERLAEQQRAEeRERLAEVEAAlEKQRQLAEAHAQAKAQAER----EAQGLQR 1384
Cdd:COG2268 170 ELESVAITDLEdennyLDALGRRKIAEIIRDARIAEAE-AERETEIAIA-QANREAEEAELEQEREIETariaEAEAELA 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1385 RMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEirvvrlqLEATERQRGGAEGEL 1464
Cdd:COG2268 248 KKKAEERREAETARAEAEAAYEIAEANAEREVQRQLEIAEREREIELQEKEAEREEAE-------LEADVRKPAEAEKQA 320
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237968 1465 QALRARAeEAEAQKRQAQEEAERLRRQVqdETQRKRQAEAELALRVQAEAEAAREKQRALQALEELR 1531
Cdd:COG2268 321 AEAEAEA-EAEAIRAKGLAEAEGKRALA--EAWNKLGDAAILLMLIEKLPEIAEAAAKPLEKIDKIT 384
|
|
| CH_CYTSA |
cd21256 |
calponin homology (CH) domain found in cytospin-A; Cytospin-A, also called renal carcinoma ... |
164-264 |
1.19e-12 |
|
calponin homology (CH) domain found in cytospin-A; Cytospin-A, also called renal carcinoma antigen NY-REN-22, or sperm antigen with calponin homology and coiled-coil domains 1-like, or SPECC1-like protein (SPECC1L), is involved in cytokinesis and spindle organization. It may play a role in actin cytoskeleton organization and microtubule stabilization and hence, is required for proper cell adhesion and migration. Cytospin-A contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409105 Cd Length: 119 Bit Score: 67.41 E-value: 1.19e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 164 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAErDLGVTRLLDP 243
Cdd:cd21256 14 SKRNALLKWCQKKTEGYQNIDITNFSSSWNDGLAFCALLHTYLPAHIPYQELNSQDKRRNFTLAFQAAE-SVGIKSTLDI 92
|
90 100
....*....|....*....|..
gi 1920237968 244 ED-VDVPQPDEKSIITYVSSLY 264
Cdd:cd21256 93 NEmVRTERPDWQSVMTYVTAIY 114
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1089-1585 |
1.26e-12 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 74.95 E-value: 1.26e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1089 EAEEVLRAHEEQLKEAQAVPATLPELEATKAALKKLRAQAEAQQPVFDALRDELrgAQEVGERLQQRHGERDVEVERWRE 1168
Cdd:COG4913 239 RAHEALEDAREQIELLEPIRELAERYAAARERLAELEYLRAALRLWFAQRRLEL--LEAELEELRAELARLEAELERLEA 316
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1169 RVTLLLERWQAVLAQ-TDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLEDI 1247
Cdd:COG4913 317 RLDALREELDELEAQiRGNGGDRLEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEAL 396
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1248 ERHGEKVEECQRFAKQyinAIKDYELQLVTYKAQLEpvaspakkpKVQSGSESIIQEYVDLRTRYSELSTLTSQYIRFIS 1327
Cdd:COG4913 397 EEELEALEEALAEAEA---ALRDLRRELRELEAEIA---------SLERRKSNIPARLLALRDALAEALGLDEAELPFVG 464
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1328 ETLRRMEEEERLaeqQRAEER-------------ERLAEVEAALEKQR-------QLAEAHAQAKAQAEREAQGLQRRM- 1386
Cdd:COG4913 465 ELIEVRPEEERW---RGAIERvlggfaltllvppEHYAAALRWVNRLHlrgrlvyERVRTGLPDPERPRLDPDSLAGKLd 541
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1387 ----------QEEVARREEVA-VEAQEQ----KRSIQEELQ--------------HLRQ------SSEAEIQAKARQVEA 1431
Cdd:COG4913 542 fkphpfrawlEAELGRRFDYVcVDSPEElrrhPRAITRAGQvkgngtrhekddrrRIRSryvlgfDNRAKLAALEAELAE 621
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1432 AERSRLRIEEEIRVVRLQLEATERQRGGAEG---------ELQALRARAEEAEAQKRQAQE---EAERLRRQVQDETQRK 1499
Cdd:COG4913 622 LEEELAEAEERLEALEAELDALQERREALQRlaeyswdeiDVASAEREIAELEAELERLDAssdDLAALEEQLEELEAEL 701
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1500 RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQV--QVALETAQRSAEAELQSEHASFAEK 1577
Cdd:COG4913 702 EELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERfaAALGDAVERELRENLEERIDALRAR 781
|
....*...
gi 1920237968 1578 TAQLERTL 1585
Cdd:COG4913 782 LNRAEEEL 789
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
1331-1833 |
1.41e-12 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 74.99 E-value: 1.41e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1331 RRMEEEERLAEQQRAEERERLAE----VEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRS 1406
Cdd:COG3096 242 RMTLEAIRVTQSDRDLFKHLITEatnyVAADYMRHANERRELSERALELRRELFGARRQLAEEQYRLVEMARELEELSAR 321
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1407 iQEELQHLRQSSE---AEIQAKARQVEAAERSRLRIEEeirvVRLQLEATERQRGGAEGELqalraraEEAEAQKRQAQE 1483
Cdd:COG3096 322 -ESDLEQDYQAASdhlNLVQTALRQQEKIERYQEDLEE----LTERLEEQEEVVEEAAEQL-------AEAEARLEAAEE 389
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1484 EAERLRRQVQDETQrkrqaeaelALRVQAEAeaAREKQRALQALEELR-------LQAEEAERRLRQAEAERARQVQVAL 1556
Cdd:COG3096 390 EVDSLKSQLADYQQ---------ALDVQQTR--AIQYQQAVQALEKARalcglpdLTPENAEDYLAAFRAKEQQATEEVL 458
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1557 ETAQRSAEAElqsEHASFAEKTAQLERTLKEEHVavvqlREEAtrraqqqaeaeraraeaerelerWQlKANEALR---- 1632
Cdd:COG3096 459 ELEQKLSVAD---AARRQFEKAYELVCKIAGEVE-----RSQA-----------------------WQ-TARELLRryrs 506
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1633 LRLQAEEVAQQKSltqaeaekqkeeaerearRRGKAEEQAVRQRELAEQelekQRQLAEGTAQQRLAAEQelirLRAETE 1712
Cdd:COG3096 507 QQALAQRLQQLRA------------------QLAELEQRLRQQQNAERL----LEEFCQRIGQQLDAAEE----LEELLA 560
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1713 QGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAE----------EESRSTSEKSKQRLEAEA 1782
Cdd:COG3096 561 ELEAQLEELEEQAAEAVEQRSELRQQLEQLRARIKELAARAPAWLAAQDALErlreqsgealADSQEVTAAMQQLLERER 640
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|.
gi 1920237968 1783 GRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISE 1833
Cdd:COG3096 641 EATVERDELAARKQALESQIERLSQPGGAEDPRLLALAERLGGVLLSEIYD 691
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1332-2456 |
1.50e-12 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 74.44 E-value: 1.50e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1332 RMEEEERLAEQQRAEERERLAEVEAAL----EKQRQLAEAHAQAKAQAEREAqglqrrmqEEVARREEVAVEAQEQKRSI 1407
Cdd:pfam01576 2 RQEEEMQAKEEELQKVKERQQKAESELkeleKKHQQLCEEKNALQEQLQAET--------ELCAEAEEMRARLAARKQEL 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1408 QEELQHLrqssEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRggaegelQALRARAEEAEAQKRQAQEEAER 1487
Cdd:pfam01576 74 EEILHEL----ESRLEEEEERSQQLQNEKKKMQQHIQDLEEQLDEEEAAR-------QKLQLEKVTTEAKIKKLEEDILL 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1488 LRRQVQDETQRKRQAE---AELALRVQAEAEAA------REKQRALQALEELRLQAEEAERRlrqaEAERARQVQVALET 1558
Cdd:pfam01576 143 LEDQNSKLSKERKLLEeriSEFTSNLAEEEEKAkslsklKNKHEAMISDLEERLKKEEKGRQ----ELEKAKRKLEGEST 218
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1559 AQRSAEAELQsehASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAeaeraraeaerelerwQLKANEALRLRLQaE 1638
Cdd:pfam01576 219 DLQEQIAELQ---AQIAELRAQLAKKEEELQAALARLEEETAQKNNALK----------------KIRELEAQISELQ-E 278
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1639 EVAQQKSltqaeaekqkeeaerearrrgkAEEQAVRQRELAEQELEKQRQLAEGT-----AQQRLAA--EQELIRLR--- 1708
Cdd:pfam01576 279 DLESERA----------------------ARNKAEKQRRDLGEELEALKTELEDTldttaAQQELRSkrEQEVTELKkal 336
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1709 -AETEQGEQQRQ-----------LLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEEsRSTSEKSKQ 1776
Cdd:pfam01576 337 eEETRSHEAQLQemrqkhtqaleELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQAKQDSEHK-RKKLEGQLQ 415
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1777 RLEA---EAGRFR-ELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERV---LAEKLAAISEATRLKTEAEIALKEKE 1849
Cdd:pfam01576 416 ELQArlsESERQRaELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLesqLQDTQELLQEETRQKLNLSTRLRQLE 495
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1850 AENERLRRLAEDEAFQRRLLEEQAAQHkadiEARLAQLRKASESELERQKGLVEDtlrqRRQVEEEILALKGSFEKAAAG 1929
Cdd:pfam01576 496 DERNSLQEQLEEEEEAKRNVERQLSTL----QAQLSDMKKKLEEDAGTLEALEEG----KKRLQRELEALTQQLEEKAAA 567
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1930 KAELELELGRIRGTAEDTLRSKEQaeqeaarQRQLAAEEERRRREAeervqKSLAAEEEAARQRKAALEEVERLKAKVEE 2009
Cdd:pfam01576 568 YDKLEKTKNRLQQELDDLLVDLDH-------QRQLVSNLEKKQKKF-----DQMLAEEKAISARYAEERDRAEAEAREKE 635
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2010 ARRLRERAEQESARQLQLAQEAAQKRLQAEekahafavqqkeqelqqtlqqeqsvLERLRSEAEAARRAAEEAEAARERA 2089
Cdd:pfam01576 636 TRALSLARALEEALEAKEELERTNKQLRAE-------------------------MEDLVSSKDDVGKNVHELERSKRAL 690
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2090 EREAAQSRRQVEEAERLKQSaeeqaqaqaqaqAAAEKLRKEAEQEAARRAQAeqaalRQKQAADAEMEKHKQfaeQALRQ 2169
Cdd:pfam01576 691 EQQVEEMKTQLEELEDELQA------------TEDAKLRLEVNMQALKAQFE-----RDLQARDEQGEEKRR---QLVKQ 750
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2170 KAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKD 2249
Cdd:pfam01576 751 VRELEAELEDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAVKQLKKLQAQMKDLQRELEEARASRDEILAQSKE 830
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2250 SAQRLLQEEAEKMkQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQ 2329
Cdd:pfam01576 831 SEKKLKNLEAELL-QLQEDLAASERARRQAQQERDELADEIASGASGKSALQDEKRRLEARIAQLEEELEEEQSNTELLN 909
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2330 EQARRLQEDKEQMAQQLAQETQGFQKtLETERQrqlEMSAEAERLRLRVAEMSRAQ---------------ARAEEDARR 2394
Cdd:pfam01576 910 DRLRKSTLQVEQLTTELAAERSTSQK-SESARQ---QLERQNKELKAKLQEMEGTVkskfkssiaaleakiAQLEEQLEQ 985
|
1130 1140 1150 1160 1170 1180
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 2395 FRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQ 2456
Cdd:pfam01576 986 ESRERQAANKLVRRTEKKLKEVLLQVEDERRHADQYKDQAEKGNSRMKQLKRQLEEAEEEAS 1047
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
1462-2040 |
2.00e-12 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 73.92 E-value: 2.00e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1462 GELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRqaEAELALRVQAEAEAAREKQRALQALEELRLQAEEA--ER 1539
Cdd:PRK02224 162 GKLEEYRERASDARLGVERVLSDQRGSLDQLKAQIEEKE--EKDLHERLNGLESELAELDEEIERYEEQREQARETrdEA 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1540 RLRQAEAERARQVQVALETA---QRSAEAELQSEHASFAEKTAQLERTLKEehvavvqLREEATRRAQQQAEAERARAEA 1616
Cdd:PRK02224 240 DEVLEEHEERREELETLEAEiedLRETIAETEREREELAEEVRDLRERLEE-------LEEERDDLLAEAGLDDADAEAV 312
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1617 ERELERWQlKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEaerearrrgKAEEQAVRQRELA---EQELEKQRQLAEGT 1693
Cdd:PRK02224 313 EARREELE-DRDEELRDRLEECRVAAQAHNEEAESLREDAD---------DLEERAEELREEAaelESELEEAREAVEDR 382
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1694 AQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVR---AEMEVLLA------------ 1758
Cdd:PRK02224 383 REEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARervEEAEALLEagkcpecgqpve 462
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1759 --SKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAE--EDAVRQRAEAERVLAEKLAAISEA 1834
Cdd:PRK02224 463 gsPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEErrEDLEELIAERRETIEEKRERAEEL 542
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1835 TRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIE---------ARLAQLRKASESELERQKGLVE-- 1903
Cdd:PRK02224 543 RERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIEslerirtllAAIADAEDEIERLREKREALAEln 622
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1904 ----DTLRQRRqveEEILALKGSFEKAAAGKAELELElgrirgTAEDTLRSKEQAEQEAARQRqlaaeeerrrreaeERV 1979
Cdd:PRK02224 623 derrERLAEKR---ERKRELEAEFDEARIEEAREDKE------RAEEYLEQVEEKLDELREER--------------DDL 679
|
570 580 590 600 610 620
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237968 1980 QKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEE 2040
Cdd:PRK02224 680 QAEIGAVENELEELEELRERREALENRVEALEALYDEAEELESMYGDLRAELRQRNVETLE 740
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3788-3826 |
3.10e-12 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 63.50 E-value: 3.10e-12
10 20 30
....*....|....*....|....*....|....*....
gi 1920237968 3788 LLDAQLATGGIVDPRLGFHLPLDVAYQRGYLDKDTHDQL 3826
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3377-3415 |
4.67e-12 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 63.12 E-value: 4.67e-12
10 20 30
....*....|....*....|....*....|....*....
gi 1920237968 3377 LLEAQAATGFLVDPVRNQRLYVHEAVKAGVVGPELHEKL 3415
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
4300-4338 |
4.86e-12 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 63.12 E-value: 4.86e-12
10 20 30
....*....|....*....|....*....|....*....
gi 1920237968 4300 LLEAQACTGGIIDPSTGERFPVTDAVNKGLVDKIMVDRI 4338
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| CH_SF |
cd00014 |
calponin homology (CH) domain superfamily; CH domains are actin filament (F-actin) binding ... |
166-265 |
5.67e-12 |
|
calponin homology (CH) domain superfamily; CH domains are actin filament (F-actin) binding motifs, which may be present as a single copy or in tandem repeats (which increase binding affinity). They either function as autonomous actin binding motifs or serve a regulatory function. CH domains are found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, as well as proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
Pssm-ID: 409031 [Multi-domain] Cd Length: 103 Bit Score: 65.05 E-value: 5.67e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 166 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTN---LENLDQAFSVAER-DLGVTRLL 241
Cdd:cd00014 1 EEELLKWINEVLGEELPVSITDLFESLRDGVLLCKLINKLSPGSIPKINKKPKSPfkkRENINLFLNACKKlGLPELDLF 80
|
90 100
....*....|....*....|....
gi 1920237968 242 DPEDVdVPQPDEKSIITYVSSLYD 265
Cdd:cd00014 81 EPEDL-YEKGNLKKVLGTLWALAL 103
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1667-1888 |
6.09e-12 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 70.95 E-value: 6.09e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1667 KAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAEL 1746
Cdd:COG4942 20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAEL 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1747 AKVRAE----------------MEVLLASKARAEEESRSTSEKS-KQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLA 1809
Cdd:COG4942 100 EAQKEElaellralyrlgrqppLALLLSPEDFLDAVRRLQYLKYlAPARREQAEELRADLAELAALRAELEAERAELEAL 179
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 1810 EEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEdeafqrRLLEEQAAQHKADIEARLAQLR 1888
Cdd:COG4942 180 LAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIA------RLEAEAAAAAERTPAAGFAALK 252
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
2164-2488 |
7.13e-12 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 72.08 E-value: 7.13e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2164 EQALRQKAQVEQELTALRLQLEETDHQKSIlDEELQRLKAEVTEAARQRGQVEEELFSL--RVQMEELGKLK-------A 2234
Cdd:pfam17380 255 EYTVRYNGQTMTENEFLNQLLHIVQHQKAV-SERQQQEKFEKMEQERLRQEKEEKAREVerRRKLEEAEKARqaemdrqA 333
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2235 RIEAENRALVLRDKDSAQRLLQEEaekmKQVAEEAARLSVAAQEAARLRQLaeEDLAQQRALAEKMLKEKMQAVQEATRL 2314
Cdd:pfam17380 334 AIYAEQERMAMERERELERIRQEE----RKRELERIRQEEIAMEISRMREL--ERLQMERQQKNERVRQELEAARKVKIL 407
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2315 KAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQgfQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARR 2394
Cdd:pfam17380 408 EEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEER--AREMERVRLEEQERQQQVERLRQQEEERKRKKLELEKEKRD 485
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2395 fRKQAEDIGERLYRTELATQEKVM--------LVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQ 2466
Cdd:pfam17380 486 -RKRAEEQRRKILEKELEERKQAMieeerkrkLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRRIQEQMRKATEE 564
|
330 340
....*....|....*....|..
gi 1920237968 2467 TVRQEQLLQETQALQQSFLSEK 2488
Cdd:pfam17380 565 RSRLEAMEREREMMRQIVESEK 586
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1865-2605 |
7.23e-12 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 72.31 E-value: 7.23e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1865 QRRLLEEQAA---QHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIR 1941
Cdd:pfam02463 153 ERRLEIEEEAagsRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYL 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1942 GTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQES 2021
Cdd:pfam02463 233 KLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDD 312
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2022 ARQLQLAQE---AAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRR 2098
Cdd:pfam02463 313 EEKLKESEKekkKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKL 392
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2099 QVEEAErlKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELT 2178
Cdd:pfam02463 393 KEEELE--LKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKS 470
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2179 ALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEE 2258
Cdd:pfam02463 471 EDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVI 550
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2259 AEKMKQVAEEAARLSVAAQEAARLRQLAEedlaqqralaekmlkekmqavqeaTRLKAEAELLQQQKELAQEQARRLQED 2338
Cdd:pfam02463 551 VEVSATADEVEERQKLVRALTELPLGARK------------------------LRLLIPKLKLPLKSIAVLEIDPILNLA 606
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2339 KEQMAQQLAQETQGFQKTLETERQRQLEMSaeaERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVM 2418
Cdd:pfam02463 607 QLDKATLEADEDDKRAKVVEGILKDTELTK---LKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQE 683
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2419 LVQTLETQRQQSdrdaERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCI 2498
Cdd:pfam02463 684 KAESELAKEEIL----RRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKK 759
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2499 EQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGvrrQQEELQRLAQQQQQQEKLLAE 2578
Cdd:pfam02463 760 EEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAE---LLEEEQLLIEQEEKIKEEELE 836
|
730 740
....*....|....*....|....*..
gi 1920237968 2579 ENQRLRERLQHLEEERRAALARSEEIA 2605
Cdd:pfam02463 837 ELALELKEEQKLEKLAEEELERLEEEI 863
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
1006-1591 |
8.92e-12 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 72.29 E-value: 8.92e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1006 RECAQRITEQQKAQAEVDGLGKGVARLSAEAEkvlalpepspaaptlrsELELTLGKLEqvrslsaiylEKLKTISLVIR 1085
Cdd:COG3096 522 AELEQRLRQQQNAERLLEEFCQRIGQQLDAAE-----------------ELEELLAELE----------AQLEELEEQAA 574
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1086 STQEAEEVLRAHEEQLKE-AQAVPATLPELEATKAALKKLRAQAEAQqpvFDALRDELRGAQEVGERLQQRHGERDvEVE 1164
Cdd:COG3096 575 EAVEQRSELRQQLEQLRArIKELAARAPAWLAAQDALERLREQSGEA---LADSQEVTAAMQQLLEREREATVERD-ELA 650
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1165 RWRERVTLLLERWQAVLAQTDVRQREL-EQLGRQL--RYYR----ESADPLGAWLRDAKQrqeqiqAVPLANSQAVREQL 1237
Cdd:COG3096 651 ARKQALESQIERLSQPGGAEDPRLLALaERLGGVLlsEIYDdvtlEDAPYFSALYGPARH------AIVVPDLSAVKEQL 724
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1238 RQEKALLED---IERHGEKVEECQRFAKQYINAI--KDYELQLVTYKAQLEPV----ASPAKKPKVQSGSESIIQEYVDL 1308
Cdd:COG3096 725 AGLEDCPEDlylIEGDPDSFDDSVFDAEELEDAVvvKLSDRQWRYSRFPEVPLfgraAREKRLEELRAERDELAEQYAKA 804
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1309 RTRYSELSTLTSQYIRFISETLRRMEEEErlAEQQRAEERERLAEVEAALEKQRQlAEAHAQAKAQAEREAQGLQRRMQE 1388
Cdd:COG3096 805 SFDVQKLQRLHQAFSQFVGGHLAVAFAPD--PEAELAALRQRRSELERELAQHRA-QEQQLRQQLDQLKEQLQLLNKLLP 881
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1389 EV---------ARREEVAVE---AQEQKRSIQEELQHLRQSSE--AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATE 1454
Cdd:COG3096 882 QAnlladetlaDRLEELREEldaAQEAQAFIQQHGKALAQLEPlvAVLQSDPEQFEQLQADYLQAKEQQRRLKQQIFALS 961
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1455 --RQR------GGAEGEL-------QALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAARE 1519
Cdd:COG3096 962 evVQRrphfsyEDAVGLLgensdlnEKLRARLEQAEEARREAREQLRQAQAQYSQYNQVLASLKSSRDAKQQTLQELEQE 1041
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237968 1520 -KQRALQALEELRLQAEEaERRLRQAEAERARQVQVALETAQRSAEAELQSEHASF--AEKTAQLERTLKEEHVA 1591
Cdd:COG3096 1042 lEELGVQADAEAEERARI-RRDELHEELSQNRSRRSQLEKQLTRCEAEMDSLQKRLrkAERDYKQEREQVVQAKA 1115
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3955-3993 |
1.12e-11 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 61.96 E-value: 1.12e-11
10 20 30
....*....|....*....|....*....|....*....
gi 1920237968 3955 LLEAQAATGYVIDPIKGLKLTVEEAVRMGIVGPEFKDKL 3993
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1363-1733 |
1.17e-11 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 71.31 E-value: 1.17e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1363 QLAEAHAQAKAQAEREAQGLQRRMQEEVARREevaveaqeqKRSIQEELQHLRQSSEAEiqaKARQVEA-------AERS 1435
Cdd:pfam17380 273 QLLHIVQHQKAVSERQQQEKFEKMEQERLRQE---------KEEKAREVERRRKLEEAE---KARQAEMdrqaaiyAEQE 340
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1436 RLRIEEEIRVVRLQLEatERQRggaegELQALRaraEEAEAQKRQAQEEAERLRRQVQDETQRKRQaEAELALRVQ-AEA 1514
Cdd:pfam17380 341 RMAMERERELERIRQE--ERKR-----ELERIR---QEEIAMEISRMRELERLQMERQQKNERVRQ-ELEAARKVKiLEE 409
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1515 EAAREKQRALQALEELRLQAEEA-ERRLRQAEAERARQVQ-VALETAQRSAEAE-LQSEHASFAEKTAQLERTLKEEHVA 1591
Cdd:pfam17380 410 ERQRKIQQQKVEMEQIRAEQEEArQREVRRLEEERAREMErVRLEEQERQQQVErLRQQEEERKRKKLELEKEKRDRKRA 489
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1592 VVQLREeatrraqqqaeaeraraeaereLERWQLKANEalrlRLQAEEVAQQKSLTQAEAEKQKEEAEREARRrgKAEEQ 1671
Cdd:pfam17380 490 EEQRRK----------------------ILEKELEERK----QAMIEEERKRKLLEKEMEERQKAIYEEERRR--EAEEE 541
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 1672 AVRQrelaeQELEKQRQLAEgtaqQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAA 1733
Cdd:pfam17380 542 RRKQ-----QEMEERRRIQE----QMRKATEERSRLEAMEREREMMRQIVESEKARAEYEAT 594
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3122-3160 |
1.24e-11 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 61.96 E-value: 1.24e-11
10 20 30
....*....|....*....|....*....|....*....
gi 1920237968 3122 LLDAQLSTGGIVDPSKSHRVPLDVACARGYLDKETSAAL 3160
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1075-1960 |
1.38e-11 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 71.54 E-value: 1.38e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1075 EKLKTISLVIRSTQEAEEVLRAHEEQLKEAQAVPATLPELEATKAALKKLRaqaeaqqpvfdalrdelrgaqevgerlqq 1154
Cdd:pfam02463 180 EETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLD----------------------------- 230
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1155 rhgERDVEVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESAdplgawlrDAKQRQEQIQAVPLANSQAVR 1234
Cdd:pfam02463 231 ---YLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEE--------KEKKLQEEELKLLAKEEEELK 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1235 EQLRQEKALLEDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSgsesiIQEYVDLRTRYSE 1314
Cdd:pfam02463 300 SELLKLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEEL-----EKLQEKLEQLEEE 374
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1315 LSTLTSQYIRFISETLRR--MEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGL---QRRMQEE 1389
Cdd:pfam02463 375 LLAKKKLESERLSSAAKLkeEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGkltEEKEELE 454
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1390 VARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLR-IEEEIRVVRLQLEATERQRGGAEGELQALR 1468
Cdd:pfam02463 455 KQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESkARSGLKVLLALIKDGVGGRIISAHGRLGDL 534
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1469 ARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQ-AEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAE 1547
Cdd:pfam02463 535 GVAVENYKVAISTAVIVEVSATADEVEERQKLVrALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLE 614
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1548 RARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHvavvQLREEATRRAQQQAEAERARAEAERELERWQLKA 1627
Cdd:pfam02463 615 ADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEE----GLAEKSEVKASLSELTKELLEIQELQEKAESELA 690
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1628 NEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRL 1707
Cdd:pfam02463 691 KEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELS 770
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1708 RAETEQGEQQRQLLEEELARLQREAAAATQKrrELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRE 1787
Cdd:pfam02463 771 LKEKELAEEREKTEKLKVEEEKEEKLKAQEE--ELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKL 848
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1788 LAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRR 1867
Cdd:pfam02463 849 EKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAE 928
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1868 LLEEQAAQHKADIEARLAQLRKASESELERQKglvEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDT 1947
Cdd:pfam02463 929 ILLKYEEEPEELLLEEADEKEKEENNKEEEEE---RNKRLLLAKEELGKVNLMAIEEFEEKEERYNKDELEKERLEEEKK 1005
|
890
....*....|...
gi 1920237968 1948 LRSKEQAEQEAAR 1960
Cdd:pfam02463 1006 KLIRAIIEETCQR 1018
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
2190-2597 |
1.42e-11 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 70.95 E-value: 1.42e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2190 QKSILDEELQRLKAEVTEAARQRG---QVEEELFSLRVQMEELGKLKARIEAENRAL--------VLRDKDSAQRLLQEE 2258
Cdd:COG4717 65 KPELNLKELKELEEELKEAEEKEEeyaELQEELEELEEELEELEAELEELREELEKLekllqllpLYQELEALEAELAEL 144
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2259 AEKMKQVAEEAARLSVAAQEAARLRQLAEEdlaQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQED 2338
Cdd:COG4717 145 PERLEELEERLEELRELEEELEELEAELAE---LQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEE 221
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2339 KEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEED---------------ARRFRKQAEDIG 2403
Cdd:COG4717 222 LEELEEELEQLENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTiagvlflvlgllallFLLLAREKASLG 301
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2404 ERLYRTELATQEkvmlvQTLETQRQQSDRDAERLREAI--AELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQAL- 2480
Cdd:COG4717 302 KEAEELQALPAL-----EELEEEELEELLAALGLPPDLspEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALl 376
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2481 QQSFLSEKDSLLQRERCIEQEKAKLEQLfqdEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQE 2560
Cdd:COG4717 377 AEAGVEDEEELRAALEQAEEYQELKEEL---EELEEQLEELLGELEELLEALDEEELEEELEELEEELEELEEELEELRE 453
|
410 420 430 440
....*....|....*....|....*....|....*....|..
gi 1920237968 2561 ELQRLAQQQQQQEK-----LLAEENQRLRERLQHLEEERRAA 2597
Cdd:COG4717 454 ELAELEAELEQLEEdgelaELLQELEELKAELRELAEEWAAL 495
|
|
| CH_CYTSB |
cd21257 |
calponin homology (CH) domain found in cytospin-B; Cytospin-B, also called nuclear structure ... |
164-264 |
1.64e-11 |
|
calponin homology (CH) domain found in cytospin-B; Cytospin-B, also called nuclear structure protein 5 (NSP5), or sperm antigen HCMOGT-1, or sperm antigen with calponin homology and coiled-coil domains 1 (SPECC1), is a novel fusion Cytospin-B that contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409106 Cd Length: 112 Bit Score: 63.90 E-value: 1.64e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 164 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAErDLGVTRLLDP 243
Cdd:cd21257 8 SKRNALLKWCQKKTEGYPNIDITNFSSSWSDGLAFCALLHTYLPAHIPYQELSSQDKKRNLLLAFQAAE-SVGIKPSLEL 86
|
90 100
....*....|....*....|..
gi 1920237968 244 ED-VDVPQPDEKSIITYVSSLY 264
Cdd:cd21257 87 SEmMYTDRPDWQSVMQYVAQIY 108
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1396-1599 |
1.71e-11 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 69.41 E-value: 1.71e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1396 VAVEAQEQKRSIQEELQHLRQsseaEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAE 1475
Cdd:COG4942 14 AAAAQADAAAEAEAELEQLQQ----EIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELE 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1476 AQKRQAQEEAERLRRQVQDET----QRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRL-----RQAEA 1546
Cdd:COG4942 90 KEIAELRAELEAQKEELAELLralyRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLaelaaLRAEL 169
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|...
gi 1920237968 1547 ERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEA 1599
Cdd:COG4942 170 EAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEA 222
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
1302-1800 |
1.76e-11 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 71.14 E-value: 1.76e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1302 IQEYVDLRTRYSELSTLTSQYirFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAqg 1381
Cdd:COG3096 248 IRVTQSDRDLFKHLITEATNY--VAADYMRHANERRELSERALELRRELFGARRQLAEEQYRLVEMARELEELSARES-- 323
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1382 lqrrmqeevarreevAVEAQEQKRSiqeelQHLrqsseAEIQAKARQVEAAERSRLRIEEeirvVRLQLEATERQRGGAE 1461
Cdd:COG3096 324 ---------------DLEQDYQAAS-----DHL-----NLVQTALRQQEKIERYQEDLEE----LTERLEEQEEVVEEAA 374
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1462 GELqalraraEEAEAQKRQAQEEAERLRRQVQDETQrkrqaeaelALRVQaeAEAAREKQRALQALEELR-------LQA 1534
Cdd:COG3096 375 EQL-------AEAEARLEAAEEEVDSLKSQLADYQQ---------ALDVQ--QTRAIQYQQAVQALEKARalcglpdLTP 436
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1535 EEAERRLRQAEAERARQVQVALETAQRSAEAElqsEHASFAEKTAQLERTLKEEHVavvqlREEATRRAQQQAEAERARA 1614
Cdd:COG3096 437 ENAEDYLAAFRAKEQQATEEVLELEQKLSVAD---AARRQFEKAYELVCKIAGEVE-----RSQAWQTARELLRRYRSQQ 508
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1615 EAERELERWQLKANEA-LRLRLQAEEVAQQKSLtqaeaekqkeeaerearrrGKAEEQAVRQRELAEQELEKQRQLAEGT 1693
Cdd:COG3096 509 ALAQRLQQLRAQLAELeQRLRQQQNAERLLEEF-------------------CQRIGQQLDAAEELEELLAELEAQLEEL 569
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1694 AQQRLAAEQELIRLRAETEQGEQQRQLLeEELARLQREAAAATQKRRELEAE----LAKVRAEMEVLLaSKARAEEESRS 1769
Cdd:COG3096 570 EEQAAEAVEQRSELRQQLEQLRARIKEL-AARAPAWLAAQDALERLREQSGEaladSQEVTAAMQQLL-EREREATVERD 647
|
490 500 510
....*....|....*....|....*....|....*
gi 1920237968 1770 TSEKSKQRLEAEAgrfRELA----EEAARLRALAE 1800
Cdd:COG3096 648 ELAARKQALESQI---ERLSqpggAEDPRLLALAE 679
|
|
| SPEC |
cd00176 |
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ... |
607-785 |
1.86e-11 |
|
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here
Pssm-ID: 238103 [Multi-domain] Cd Length: 213 Bit Score: 66.70 E-value: 1.86e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 607 LHGFVAAATKELMWLSDREEEEVGFDWSDRNTNMAAKKEGYSALMHELELKEKKIKEIQSTGDRLLREDHPARPTAESFQ 686
Cdd:cd00176 2 LQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAEEIQERL 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 687 AALQTQWSWMLQLCCCIEAHLKENTAYFQFFSDVREAEEQLRKLQETLRRKYTCDrsiTATRLEDLLQDAQDEKEQLSEY 766
Cdd:cd00176 82 EELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALASEDLGK---DLESVEELLKKHKELEEELEAH 158
|
170
....*....|....*....
gi 1920237968 767 RGHLSGLAKRAKAIVQLKP 785
Cdd:cd00176 159 EPRLKSLNELAEELLEEGH 177
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1677-2597 |
2.07e-11 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 70.97 E-value: 2.07e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1677 ELAEQELEKQR-QLAEGTAQQRLAA-EQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAeME 1754
Cdd:pfam01576 111 QLDEEEAARQKlQLEKVTTEAKIKKlEEDILLLEDQNSKLSKERKLLEERISEFTSNLAEEEEKAKSLSKLKNKHEA-MI 189
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1755 VLLASKARAEEESRSTSEKSKQRLEAEAGrfrELAEEAARLRALAEEAKRQRQLAEEDavrqraeaervLAEKLAAISEA 1834
Cdd:pfam01576 190 SDLEERLKKEEKGRQELEKAKRKLEGEST---DLQEQIAELQAQIAELRAQLAKKEEE-----------LQAALARLEEE 255
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1835 TRLKTEAEIALKEKEAENERLRRLAEDEAFQRrlleEQAAQHKADIEARLAQLRKASESELERQKGLVEdtLRQRRqvEE 1914
Cdd:pfam01576 256 TAQKNNALKKIRELEAQISELQEDLESERAAR----NKAEKQRRDLGEELEALKTELEDTLDTTAAQQE--LRSKR--EQ 327
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1915 EILALKGSFEKAAAGKAELELELGRIRGTAEDTLrsKEQAEQeAARQRQLAAEEERRRREAEERVQKSL----AAEEEAA 1990
Cdd:pfam01576 328 EVTELKKALEEETRSHEAQLQEMRQKHTQALEEL--TEQLEQ-AKRNKANLEKAKQALESENAELQAELrtlqQAKQDSE 404
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1991 RQRKAALEEVERLKAKVEEARRLRERAEQESARqLQLAQEAAQKRLQAEEKahafavqqKEQELQQTLQQEQSVLERLRS 2070
Cdd:pfam01576 405 HKRKKLEGQLQELQARLSESERQRAELAEKLSK-LQSELESVSSLLNEAEG--------KNIKLSKDVSSLESQLQDTQE 475
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2071 EAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQ 2150
Cdd:pfam01576 476 LLQEETRQKLNLSTRLRQLEDERNSLQEQLEEEEEAKRNVERQLSTLQAQLSDMKKKLEEDAGTLEALEEGKKRLQRELE 555
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2151 AADAEMEKHKQFAEQALRQKAQVEQELTALRLQLeetDHQKSILDEELQRLKAEVTEAArqrgqvEEELFSLRVQMEelg 2230
Cdd:pfam01576 556 ALTQQLEEKAAAYDKLEKTKNRLQQELDDLLVDL---DHQRQLVSNLEKKQKKFDQMLA------EEKAISARYAEE--- 623
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2231 klKARIEAENRalvlrDKDSAQRLLQEEAEKMKQVAEEAARlsvaaqeAARLRQLAEEDLAQQRALAEKMLKE----KMQ 2306
Cdd:pfam01576 624 --RDRAEAEAR-----EKETRALSLARALEEALEAKEELER-------TNKQLRAEMEDLVSSKDDVGKNVHElersKRA 689
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2307 AVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLEtERQRQL-----EMSAEAERLRLRVAEM 2381
Cdd:pfam01576 690 LEQQVEEMKTQLEELEDELQATEDAKLRLEVNMQALKAQFERDLQARDEQGE-EKRRQLvkqvrELEAELEDERKQRAQA 768
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2382 SRAQARAEEDARRFRKQAEDIGERlyRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLK 2461
Cdd:pfam01576 769 VAAKKKLELDLKELEAQIDAANKG--REEAVKQLKKLQAQMKDLQRELEEARASRDEILAQSKESEKKLKNLEAELLQLQ 846
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2462 SEEMQTVRQE-QLLQETQALQQ---SFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQL 2537
Cdd:pfam01576 847 EDLAASERARrQAQQERDELADeiaSGASGKSALQDEKRRLEARIAQLEEELEEEQSNTELLNDRLRKSTLQVEQLTTEL 926
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2538 AAS---------------------------MEEARRRQHEA-----EEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRE 2585
Cdd:pfam01576 927 AAErstsqksesarqqlerqnkelkaklqeMEGTVKSKFKSsiaalEAKIAQLEEQLEQESRERQAANKLVRRTEKKLKE 1006
|
970
....*....|..
gi 1920237968 2586 RLQHLEEERRAA 2597
Cdd:pfam01576 1007 VLLQVEDERRHA 1018
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1708-2024 |
2.13e-11 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 70.54 E-value: 2.13e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1708 RAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAElAKVRAEMEVLLASKARAEEESRSTSEKSkqrlEAEAGRFRE 1787
Cdd:pfam17380 295 KMEQERLRQEKEEKAREVERRRKLEEAEKARQAEMDRQ-AAIYAEQERMAMERERELERIRQEERKR----ELERIRQEE 369
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1788 LAEEAARLRALaEEAKRQRQLAEEdAVRQRAEAERV--LAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEA-- 1863
Cdd:pfam17380 370 IAMEISRMREL-ERLQMERQQKNE-RVRQELEAARKvkILEEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAre 447
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1864 FQRRLLEEQAAQHKADIEARLAQLRKASESELERQKglvedtlRQRRQVEEEilaLKGSFEKAAAGKAELELELGRIRGT 1943
Cdd:pfam17380 448 MERVRLEEQERQQQVERLRQQEEERKRKKLELEKEK-------RDRKRAEEQ---RRKILEKELEERKQAMIEEERKRKL 517
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1944 AEDTLRSKEQAEQEAARQRQlaaeeerrrREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESAR 2023
Cdd:pfam17380 518 LEKEMEERQKAIYEEERRRE---------AEEERRKQQEMEERRRIQEQMRKATEERSRLEAMEREREMMRQIVESEKAR 588
|
.
gi 1920237968 2024 Q 2024
Cdd:pfam17380 589 A 589
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
1415-2040 |
2.62e-11 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 70.64 E-value: 2.62e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1415 RQSSEAEIQAKARQVEA--AERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAE----RL 1488
Cdd:pfam12128 209 DGVVPPKSRLNRQQVEHwiRDIQAIAGIMKIRPEFTKLQQEFNTLESAELRLSHLHFGYKSDETLIASRQEERQetsaEL 288
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1489 RRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAeLQ 1568
Cdd:pfam12128 289 NQLLRTLDDQWKEKRDELNGELSAADAAVAKDRSELEALEDQHGAFLDADIETAAADQEQLPSWQSELENLEERLKA-LT 367
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1569 SEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAerelerwqlkANEALRLRLQAEEVAQQKSLTQ 1648
Cdd:pfam12128 368 GKHQDVTAKYNRRRSKIKEQNNRDIAGIKDKLAKIREARDRQLAVAED----------DLQALESELREQLEAGKLEFNE 437
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1649 AEAEKQKEEAEREARRRG-KAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELAR 1727
Cdd:pfam12128 438 EEYRLKSRLGELKLRLNQaTATPELLLQLENFDERIERAREEQEAANAEVERLQSELRQARKRRDQASEALRQASRRLEE 517
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1728 LQREAAAATQ----KRRELEAELAKVRAEMEVLLASKARAEEESRS------TSEKSKQRLEAEAGRFRELAEEAARLRA 1797
Cdd:pfam12128 518 RQSALDELELqlfpQAGTLLHFLRKEAPDWEQSIGKVISPELLHRTdldpevWDGSVGGELNLYGVKLDLKRIDVPEWAA 597
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1798 LAEEAKRQRQLAEEDAVRQR---AEAERVLAEKLAAISEATRLKTEAEIALKEKEaenERLRRLAEDEAFQRRLLEEQAA 1874
Cdd:pfam12128 598 SEEELRERLDKAEEALQSARekqAAAEEQLVQANGELEKASREETFARTALKNAR---LDLRRLFDEKQSEKDKKNKALA 674
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1875 QHKA-------DIEARLAQLRKASESELERQKG---------------LVEDTLRQRRQVEEEILALKGSFE-------- 1924
Cdd:pfam12128 675 ERKDsanerlnSLEAQLKQLDKKHQAWLEEQKEqkreartekqaywqvVEGALDAQLALLKAAIAARRSGAKaelkalet 754
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1925 -----------------KAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEE 1987
Cdd:pfam12128 755 wykrdlaslgvdpdviaKLKREIRTLERKIERIAVRRQEVLRYFDWYQETWLQRRPRLATQLSNIERAISELQQQLARLI 834
|
650 660 670 680 690
....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237968 1988 EAARQRKAALEevERLKAKVEEARRLRE-----RAEQESARQLQLAQEAAQKRLQAEE 2040
Cdd:pfam12128 835 ADTKLRRAKLE--MERKASEKQQVRLSEnlrglRCEMSKLATLKEDANSEQAQGSIGE 890
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1385-2483 |
2.66e-11 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 70.59 E-value: 2.66e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1385 RMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEA--AERSRLRIEEEIRvVRLQLEATErqrggAEG 1462
Cdd:pfam01576 2 RQEEEMQAKEEELQKVKERQQKAESELKELEKKHQQLCEEKNALQEQlqAETELCAEAEEMR-ARLAARKQE-----LEE 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1463 ELQALRARAEEAEAQKRQAQEEAERLRRQVQDetqrkrqAEAELalrvqAEAEAAREKqralqaleeLRLQAEEAERRLR 1542
Cdd:pfam01576 76 ILHELESRLEEEEERSQQLQNEKKKMQQHIQD-------LEEQL-----DEEEAARQK---------LQLEKVTTEAKIK 134
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1543 QAEAErarqvqVALETAQRSaeaELQSEHASFAEKTAQLERTLKEEHVAVVQL-----REEATRRAQQQAEAERARAEAE 1617
Cdd:pfam01576 135 KLEED------ILLLEDQNS---KLSKERKLLEERISEFTSNLAEEEEKAKSLsklknKHEAMISDLEERLKKEEKGRQE 205
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1618 RELERWQLKAnEALRLRlqaEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEgtAQQR 1697
Cdd:pfam01576 206 LEKAKRKLEG-ESTDLQ---EQIAELQAQIAELRAQLAKKEEELQAALARLEEETAQKNNALKKIRELEAQISE--LQED 279
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1698 LAAEqelirlRAETEQGEQQRQLLEEELARLQRE---AAAATQKRRELEAelakvRAEMEVLLASKArAEEESRSTSEKS 1774
Cdd:pfam01576 280 LESE------RAARNKAEKQRRDLGEELEALKTEledTLDTTAAQQELRS-----KREQEVTELKKA-LEEETRSHEAQL 347
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1775 KQRLEAEAGRFRELAEE---AARLRALAEEAK--------------RQRQLAEEDAVRQRAEAERVLAEKLAAISEATRL 1837
Cdd:pfam01576 348 QEMRQKHTQALEELTEQleqAKRNKANLEKAKqalesenaelqaelRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQ 427
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1838 KTEAEIALKEKEAENERLRRLAEDeafqrrlLEEQAAQHKADIEARLAQLRKASE--SELERQKGLVEDTLrqrRQVEEE 1915
Cdd:pfam01576 428 RAELAEKLSKLQSELESVSSLLNE-------AEGKNIKLSKDVSSLESQLQDTQEllQEETRQKLNLSTRL---RQLEDE 497
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1916 ILALKGSFEKAAAGKAELELELGRIRGTAEDTlrsKEQAEQEAARQRQLaaeeerrrreaeERVQKSLAAEEEAARQRka 1995
Cdd:pfam01576 498 RNSLQEQLEEEEEAKRNVERQLSTLQAQLSDM---KKKLEEDAGTLEAL------------EEGKKRLQRELEALTQQ-- 560
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1996 aLEEVERLKAKVEEAR-RLRERAE-----QESARQLQLAQEAAQKR---LQAEEKA-HAFAVQQKEQELQQTLQQEQSVL 2065
Cdd:pfam01576 561 -LEEKAAAYDKLEKTKnRLQQELDdllvdLDHQRQLVSNLEKKQKKfdqMLAEEKAiSARYAEERDRAEAEAREKETRAL 639
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2066 ERLRSeaeaarraaeeaeaareraereAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAA 2145
Cdd:pfam01576 640 SLARA----------------------LEEALEAKEELERTNKQLRAEMEDLVSSKDDVGKNVHELERSKRALEQQVEEM 697
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2146 LRQKQaadaEMEKHKQFAEQA-LR-------QKAQVEQELTALRLQLEEtdhQKSILDEELQRLKAEVTEAARQRGQvee 2217
Cdd:pfam01576 698 KTQLE----ELEDELQATEDAkLRlevnmqaLKAQFERDLQARDEQGEE---KRRQLVKQVRELEAELEDERKQRAQ--- 767
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2218 eLFSLRVQME-ELGKLKARIEAENRAlvlrdKDSAQRLLQEEAEKMKQVAeeaarlsvaaqeaarlRQLAEEDLAQQRAL 2296
Cdd:pfam01576 768 -AVAAKKKLElDLKELEAQIDAANKG-----REEAVKQLKKLQAQMKDLQ----------------RELEEARASRDEIL 825
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2297 AEKMLKEKmqavqeatRLKA-EAELLQQQKELA-QEQARR-LQEDKEQMAQQLAQETQGfqKTLETERQRQLEmsaeaER 2373
Cdd:pfam01576 826 AQSKESEK--------KLKNlEAELLQLQEDLAaSERARRqAQQERDELADEIASGASG--KSALQDEKRRLE-----AR 890
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2374 LRLRVAEMSRAQARAEEDARRFRKQAEDIgERLyRTELATQEKvmLVQTLETQRQQSDRDAERLREAIAELEHE-KDKLK 2452
Cdd:pfam01576 891 IAQLEEELEEEQSNTELLNDRLRKSTLQV-EQL-TTELAAERS--TSQKSESARQQLERQNKELKAKLQEMEGTvKSKFK 966
|
1130 1140 1150
....*....|....*....|....*....|.
gi 1920237968 2453 QEAQLLQLKSEEMqtvrQEQLLQETQALQQS 2483
Cdd:pfam01576 967 SSIAALEAKIAQL----EEQLEQESRERQAA 993
|
|
| CH_PLS_rpt3 |
cd21298 |
third calponin homology (CH) domain found in the plastin family; The plastin family includes ... |
46-152 |
2.70e-11 |
|
third calponin homology (CH) domain found in the plastin family; The plastin family includes plastin-1, -2, and -3. Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. Plastin-2, also called L-plastin, or LC64P, or lymphocyte cytosolic protein 1 (LCP-1), is an actin-binding protein that plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-3, also called T-plastin, is an actin-bundling protein found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Members of this family contain four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409147 Cd Length: 117 Bit Score: 63.41 E-value: 2.70e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 46 KTFTKWVNKhlikhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGDSL-------PREKGRMRFHKLQNVQIALDYLRHR 118
Cdd:cd21298 9 KTYRNWMNS-------LGVNPFVNHLYSDLRDGLVLLQLYDKIKPGVVdwsrvnkPFKKLGANMKKIENCNYAVELGKKL 81
|
90 100 110
....*....|....*....|....*....|....
gi 1920237968 119 QVKLVNIRNDDIADGNPKLTLGLIWTIILHFQIS 152
Cdd:cd21298 82 KFSLVGIGGKDIYDGNRTLTLALVWQLMRAYTLS 115
|
|
| COG3899 |
COG3899 |
Predicted ATPase [General function prediction only]; |
1504-2040 |
2.74e-11 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443106 [Multi-domain] Cd Length: 1244 Bit Score: 70.66 E-value: 2.74e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1504 AELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAE-RARQVQVALETAQRSAEAELQSEHAS--------F 1574
Cdd:COG3899 712 ARRALARGAYAEALRYLERALELLPPDPEEEYRLALLLELAEALyLAGRFEEAEALLERALAARALAALAAlrhgnppaS 791
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1575 AEKTAQLERTLKEEHVAVVQLREEATRRAQQ--QAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQkslTQAEAE 1652
Cdd:COG3899 792 ARAYANLGLLLLGDYEEAYEFGELALALAERlgDRRLEARALFNLGFILHWLGPLREALELLREALEAGLE---TGDAAL 868
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1653 KQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREA 1732
Cdd:COG3899 869 ALLALAAAAAAAAAAAALAAAAAAAARLLAAAAAALAAAAAAAALAAAELARLAAAAAAAAALALAAAAAAAAAAALAAA 948
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1733 AAATQKRRELEAELAKVRAEMEVLLASKARAeeesrstsekskqRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEED 1812
Cdd:COG3899 949 AAAAALAAALALAAAAAAAAAAALAAAAAAA-------------AAAAAAAAAAALEAAAAALLALLAAAAAAAAAAAAL 1015
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1813 AVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASE 1892
Cdd:COG3899 1016 AAALLAAALAALAAAAAAAALLAAAAALALLAALAAAAAAAAAAAALAAAAALLAAAAAAAAAAAAAAAAAALAAALAAA 1095
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1893 SELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRR 1972
Cdd:COG3899 1096 ALAAAAAAALALAAALAALALAAALAALALAAAARAAAALLLLAAALALALAALLLLAALLLALALLLLALAALALAAAL 1175
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237968 1973 REAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEE 2040
Cdd:COG3899 1176 AALAAALLAAAAAAAAAAALLAALLALAARLAALLALALLALEAAALLLLLLLAALALAAALLALRLL 1243
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
2152-2508 |
3.27e-11 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 70.07 E-value: 3.27e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2152 ADAEMEKHKQFAEQA--LRQKA-QVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEE 2228
Cdd:PRK02224 344 AESLREDADDLEERAeeLREEAaELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDE 423
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2229 LGKLKARIEAENRALVLRDKDsAQRLLQE-EAEKMKQVAEEAARLSVAAQEAARLRQLAEE--DLAQQRALAEKMLKEKM 2305
Cdd:PRK02224 424 LREREAELEATLRTARERVEE-AEALLEAgKCPECGQPVEGSPHVETIEEDRERVEELEAEleDLEEEVEEVEERLERAE 502
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2306 QAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQmAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSraQ 2385
Cdd:PRK02224 503 DLVEAEDRIERLEERREDLEELIAERRETIEEKRER-AEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELN--S 579
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2386 ARAEEDARrfRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERL---REAIAELEHEKDklkqEAQLLQLKS 2462
Cdd:PRK02224 580 KLAELKER--IESLERIRTLLAAIADAEDEIERLREKREALAELNDERRERLaekRERKRELEAEFD----EARIEEARE 653
|
330 340 350 360
....*....|....*....|....*....|....*....|....*..
gi 1920237968 2463 EEMQTVR-QEQLLQETQALQQsflsEKDSLLQRERCIEQEKAKLEQL 2508
Cdd:PRK02224 654 DKERAEEyLEQVEEKLDELRE----ERDDLQAEIGAVENELEELEEL 696
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
2718-2756 |
3.30e-11 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 60.42 E-value: 3.30e-11
10 20 30
....*....|....*....|....*....|....*....
gi 1920237968 2718 LLEAQAASGFLLDPVRNRRLAVNEAVKEGIVGPELHHKL 2756
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
986-1541 |
4.80e-11 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 69.78 E-value: 4.80e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 986 EACETRTVHRLRLPLDKEPARECAQRITEQQKAqaevDGLGKGV--ARLSAEAEKVLAlPEPSPAAPTLRSELELTLGKL 1063
Cdd:PTZ00121 1285 KAEEKKKADEAKKAEEKKKADEAKKKAEEAKKA----DEAKKKAeeAKKKADAAKKKA-EEAKKAAEAAKAEAEAAADEA 1359
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1064 EQVRSLS-AIYLEKLKTISLVIRSTQEAEEVLRAhEEQLKEAQAVPATLPELEATKAALKK---LRAQAEAQQPVfdalr 1139
Cdd:PTZ00121 1360 EAAEEKAeAAEKKKEEAKKKADAAKKKAEEKKKA-DEAKKKAEEDKKKADELKKAAAAKKKadeAKKKAEEKKKA----- 1433
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1140 DELRGAQEvgERLQQRHGERDVEVERWRERVTLLLERwqavlaqtdvrQRELEQLGRQLRYYREsADPLGAWLRDAKQRQ 1219
Cdd:PTZ00121 1434 DEAKKKAE--EAKKADEAKKKAEEAKKAEEAKKKAEE-----------AKKADEAKKKAEEAKK-ADEAKKKAEEAKKKA 1499
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1220 EQIQAVPLANSQAVREQLRQEKALLEDIERHGE--KVEEcqrfAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSG 1297
Cdd:PTZ00121 1500 DEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEakKADE----AKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEED 1575
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1298 SESIIQEYVDLR----TRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERlaeveaalEKQRQLAEAHAQAKA 1373
Cdd:PTZ00121 1576 KNMALRKAEEAKkaeeARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEK--------KKVEQLKKKEAEEKK 1647
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1374 QAEReaqgLQRRMQEEVARREEVAVEAQEQKRSIQEelqhLRQSSEAEiqakarqvEAAERSRLRIEEEIRVVRLQLEAT 1453
Cdd:PTZ00121 1648 KAEE----LKKAEEENKIKAAEEAKKAEEDKKKAEE----AKKAEEDE--------KKAAEALKKEAEEAKKAEELKKKE 1711
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1454 ERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAElalrvQAEAEAAREKQRALQALEELRLQ 1533
Cdd:PTZ00121 1712 AEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLK-----KEEEKKAEEIRKEKEAVIEEELD 1786
|
....*...
gi 1920237968 1534 AEEAERRL 1541
Cdd:PTZ00121 1787 EEDEKRRM 1794
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
2147-2349 |
5.49e-11 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 67.87 E-value: 5.49e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2147 RQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQM 2226
Cdd:COG4942 34 QEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAELLRAL 113
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2227 EELGK---LKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKE 2303
Cdd:COG4942 114 YRLGRqppLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALEAL 193
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 1920237968 2304 KMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQE 2349
Cdd:COG4942 194 KAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAA 239
|
|
| COG3899 |
COG3899 |
Predicted ATPase [General function prediction only]; |
1348-1873 |
5.81e-11 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443106 [Multi-domain] Cd Length: 1244 Bit Score: 69.50 E-value: 5.81e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1348 RERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKAR 1427
Cdd:COG3899 722 AEALRYLERALELLPPDPEEEYRLALLLELAEALYLAGRFEEAEALLERALAARALAALAALRHGNPPASARAYANLGLL 801
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1428 QVEAAERSRLRIEEEIRVV-RLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERL--RRQVQDETQRKRQAEA 1504
Cdd:COG3899 802 LLGDYEEAYEFGELALALAeRLGDRRLEARALFNLGFILHWLGPLREALELLREALEAGLETgdAALALLALAAAAAAAA 881
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1505 ELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERT 1584
Cdd:COG3899 882 AAAALAAAAAAAARLLAAAAAALAAAAAAAALAAAELARLAAAAAAAAALALAAAAAAAAAAALAAAAAAAALAAALALA 961
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1585 LKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARR 1664
Cdd:COG3899 962 AAAAAAAAAALAAAAAAAAAAAAAAAAAALEAAAAALLALLAAAAAAAAAAAALAAALLAAALAALAAAAAAAALLAAAA 1041
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1665 RGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEA 1744
Cdd:COG3899 1042 ALALLAALAAAAAAAAAAAALAAAAALLAAAAAAAAAAAAAAAAAALAAALAAAALAAAAAAALALAAALAALALAAALA 1121
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1745 ELAKVRAEMEVLLASKARAEEESRSTsekskqRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVL 1824
Cdd:COG3899 1122 ALALAAAARAAAALLLLAAALALALA------ALLLLAALLLALALLLLALAALALAAALAALAAALLAAAAAAAAAAAL 1195
|
490 500 510 520
....*....|....*....|....*....|....*....|....*....
gi 1920237968 1825 AEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQA 1873
Cdd:COG3899 1196 LAALLALAARLAALLALALLALEAAALLLLLLLAALALAAALLALRLLA 1244
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
1321-2038 |
1.23e-10 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 68.22 E-value: 1.23e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1321 QYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEA 1400
Cdd:pfam15921 82 EYSHQVKDLQRRLNESNELHEKQKFYLRQSVIDLQTKLQEMQMERDAMADIRRRESQSQEDLRNQLQNTVHELEAAKCLK 161
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1401 QEQKRSIQEELQHLRQ---SSEAEIQA-KARQVEAAERSRLRIEEEIRVVRLQLE----ATERQRGGAEGELQALRAR-- 1470
Cdd:pfam15921 162 EDMLEDSNTQIEQLRKmmlSHEGVLQEiRSILVDFEEASGKKIYEHDSMSTMHFRslgsAISKILRELDTEISYLKGRif 241
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1471 --AEEAEAQKRQAQEEAERLRRQVQDET-QRKRQAEAELAlRVQAEAEAAREKQRALQAleelrlQAEEAERRLRQAEAE 1547
Cdd:pfam15921 242 pvEDQLEALKSESQNKIELLLQQHQDRIeQLISEHEVEIT-GLTEKASSARSQANSIQS------QLEIIQEQARNQNSM 314
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1548 RARQVQvALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAeaerelerwQLKA 1627
Cdd:pfam15921 315 YMRQLS-DLESTVSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTERDQFSQESGNLDDQLQ---------KLLA 384
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1628 NEALRLRLQAEEVAQQKSLtqaeaekqkeeaerEARRRGKAEEQAVRQRELAEQELEKQRQLA---------EGTAQQRL 1698
Cdd:pfam15921 385 DLHKREKELSLEKEQNKRL--------------WDRDTGNSITIDHLRRELDDRNMEVQRLEAllkamksecQGQMERQM 450
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1699 AAEQ----ELIRLRAETEQGEQQRQLLEEELARLqreaaaaTQKRRELEAELAKVrAEMEVLLASKARAEEESRSTSEKS 1774
Cdd:pfam15921 451 AAIQgkneSLEKVSSLTAQLESTKEMLRKVVEEL-------TAKKMTLESSERTV-SDLTASLQEKERAIEATNAEITKL 522
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1775 KQRLEAEAGRFRELAEEAARLRALAEEAKRQR-QLAEEDAV----RQ----------------------RAEAERVLAEK 1827
Cdd:pfam15921 523 RSRVDLKLQELQHLKNEGDHLRNVQTECEALKlQMAEKDKVieilRQqienmtqlvgqhgrtagamqveKAQLEKEINDR 602
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1828 LAAISEATRLKTEAEIALKEKEAE---------------NERLRRLAEDEAFQRRLLEEQAAQHK------ADIEARLAQ 1886
Cdd:pfam15921 603 RLELQEFKILKDKKDAKIRELEARvsdlelekvklvnagSERLRAVKDIKQERDQLLNEVKTSRNelnslsEDYEVLKRN 682
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1887 LRKASE------SELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAG-KAELELELGRIrgtaeDTLRSK----EQAE 1955
Cdd:pfam15921 683 FRNKSEemetttNKLKMQLKSAQSELEQTRNTLKSMEGSDGHAMKVAMGmQKQITAKRGQI-----DALQSKiqflEEAM 757
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1956 QEAARQRQLAAEEERRRREAEERV---QKSLAAEEEAARQRKAALEE--------VERLKAKVEEARRLRERAEQESARq 2024
Cdd:pfam15921 758 TNANKEKHFLKEEKNKLSQELSTVateKNKMAGELEVLRSQERRLKEkvanmevaLDKASLQFAECQDIIQRQEQESVR- 836
|
810
....*....|....
gi 1920237968 2025 LQLAQEAAQKRLQA 2038
Cdd:pfam15921 837 LKLQHTLDVKELQG 850
|
|
| CH_MICAL1 |
cd21196 |
calponin homology (CH) domain found in molecule interacting with CasL protein 1; MICAL-1, also ... |
166-266 |
1.30e-10 |
|
calponin homology (CH) domain found in molecule interacting with CasL protein 1; MICAL-1, also called NEDD9-interacting protein with calponin homology and LIM domains, acts as a [F-actin]-monooxygenase that promotes depolymerization of F-actin by mediating oxidation of specific methionine residues on actin to form methionine-sulfoxide, resulting in actin filament disassembly and preventing repolymerization. In the absence of actin, it also functions as a NADPH oxidase producing H(2)O(2). MICAL-1 acts as a cytoskeletal regulator that connects NEDD9 to intermediate filaments. It also acts as a negative regulator of apoptosis via its interaction with STK38 and STK38L. MICAL-1 is a Rab effector protein that plays a role in vesicle trafficking. It contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409045 Cd Length: 106 Bit Score: 61.21 E-value: 1.30e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 166 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 245
Cdd:cd21196 5 QEELLRWCQEQTAGYPGVHVSDLSSSWADGLALCALVYRLQPGLLEPSELQGLGALEATAWALKVAENELGITPVVSAQA 84
|
90 100
....*....|....*....|.
gi 1920237968 246 VdVPQPDEKSIITYVSSLYDA 266
Cdd:cd21196 85 V-VAGSDPLGLIAYLSHFHSA 104
|
|
| COG3899 |
COG3899 |
Predicted ATPase [General function prediction only]; |
1334-1834 |
1.37e-10 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443106 [Multi-domain] Cd Length: 1244 Bit Score: 68.35 E-value: 1.37e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1334 EEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAER-----EAQGLQRRMQEEVARREEVAVEAQEQKRSIQ 1408
Cdd:COG3899 741 EEYRLALLLELAEALYLAGRFEEAEALLERALAARALAALAALRhgnppASARAYANLGLLLLGDYEEAYEFGELALALA 820
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1409 EELQHLRQSSEAEIqAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERL 1488
Cdd:COG3899 821 ERLGDRRLEARALF-NLGFILHWLGPLREALELLREALEAGLETGDAALALLALAAAAAAAAAAAALAAAAAAAARLLAA 899
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1489 RRQVQDETQRKRQAEAELALRVQAEAEAAREkqRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQ 1568
Cdd:COG3899 900 AAAALAAAAAAAALAAAELARLAAAAAAAAA--LALAAAAAAAAAAALAAAAAAAALAAALALAAAAAAAAAAALAAAAA 977
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1569 SEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQ 1648
Cdd:COG3899 978 AAAAAAAAAAAAALEAAAAALLALLAAAAAAAAAAAALAAALLAAALAALAAAAAAAALLAAAAALALLAALAAAAAAAA 1057
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1649 AEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARL 1728
Cdd:COG3899 1058 AAAALAAAAALLAAAAAAAAAAAAAAAAAALAAALAAAALAAAAAAALALAAALAALALAAALAALALAAAARAAAALLL 1137
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1729 QREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQL 1808
Cdd:COG3899 1138 LAAALALALAALLLLAALLLALALLLLALAALALAAALAALAAALLAAAAAAAAAAALLAALLALAARLAALLALALLAL 1217
|
490 500
....*....|....*....|....*.
gi 1920237968 1809 AEEDAVRQRAEAERVLAEKLAAISEA 1834
Cdd:COG3899 1218 EAAALLLLLLLAALALAAALLALRLL 1243
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
2244-2600 |
1.42e-10 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 68.16 E-value: 1.42e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2244 VLRDKDSAQRLLQEEA---EKMKQVAEEAARLSVAAQEA-ARLRQLAEEDLAQQRAL---AEKMlkEKMQAVQEATRlKA 2316
Cdd:TIGR02168 149 IIEAKPEERRAIFEEAagiSKYKERRKETERKLERTRENlDRLEDILNELERQLKSLerqAEKA--ERYKELKAELR-EL 225
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2317 EAELLQQQKELAQEQARRLQEDKEQMAQQLAQET---QGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDAR 2393
Cdd:TIGR02168 226 ELALLVLRLEELREELEELQEELKEAEEELEELTaelQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQ 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2394 RFRKQAEDIGERLYRtelatqekvmlvqtLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQL 2473
Cdd:TIGR02168 306 ILRERLANLERQLEE--------------LEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELE 371
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2474 LQEtQALQQSFLSEKDSLLQRERCIEQEKAKLEQLfqdeVAKAQALreeqQRQQQQMQQEKQQLAASMEEARRRQHEAE- 2552
Cdd:TIGR02168 372 SRL-EELEEQLETLRSKVAQLELQIASLNNEIERL----EARLERL----EDRRERLQQEIEELLKKLEEAELKELQAEl 442
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 1920237968 2553 EGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALAR 2600
Cdd:TIGR02168 443 EELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQAR 490
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1670-2044 |
1.58e-10 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 67.49 E-value: 1.58e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1670 EQAVRQRELAEQELEKQRQL--AEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELA 1747
Cdd:COG4717 105 EELEAELEELREELEKLEKLlqLLPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLE 184
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1748 KVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAV------------- 1814
Cdd:COG4717 185 QLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERLKEARLLlliaaallallgl 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1815 --RQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASE 1892
Cdd:COG4717 265 ggSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLD 344
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1893 SELERQKGLVE-DTLRQRRQVEEEILALKGSFEKAAAGKAElelelgRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERR 1971
Cdd:COG4717 345 RIEELQELLREaEELEEELQLEELEQEIAALLAEAGVEDEE------ELRAALEQAEEYQELKEELEELEEQLEELLGEL 418
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237968 1972 RREAEERVQKSLAAE-EEAARQRKAALEEVERLKAKVEEAR-RLRERAEQESARQLQLAQEAAQKRLQAEEKAHA 2044
Cdd:COG4717 419 EELLEALDEEELEEElEELEEELEELEEELEELREELAELEaELEQLEEDGELAELLQELEELKAELRELAEEWA 493
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1401-1590 |
1.88e-10 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 66.37 E-value: 1.88e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1401 QEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQL--EATERQRGGAEGELQALRARAEEAEAQK 1478
Cdd:PRK09510 67 QQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQkkQAEEAAKQAALKQKQAEEAAAKAAAAAK 146
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1479 RQAQEEAERLRRQV-QDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALE 1557
Cdd:PRK09510 147 AKAEAEAKRAAAAAkKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAA 226
|
170 180 190
....*....|....*....|....*....|...
gi 1920237968 1558 TAQRSAEAELQSEHASFAEKTAQLERTLKEEHV 1590
Cdd:PRK09510 227 AAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEV 259
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
2244-2599 |
2.35e-10 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 67.39 E-value: 2.35e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2244 VLRDKDSAQRLLQEEAEKMKQ-----VAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKM--LKEKMQAVQEA-TRLK 2315
Cdd:TIGR02168 194 ILNELERQLKSLERQAEKAERykelkAELRELELALLVLRLEELREELEELQEELKEAEEELeeLTAELQELEEKlEELR 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2316 AEAELLQQQKELAQE---QARRLQEDKEQMAQQLAqetqgfqktletERQRQLEmsAEAERLRLRVAEMSRAQARAEEDA 2392
Cdd:TIGR02168 274 LEVSELEEEIEELQKelyALANEISRLEQQKQILR------------ERLANLE--RQLEELEAQLEELESKLDELAEEL 339
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2393 RRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKqeAQLLQLKSE-EMQTVRQE 2471
Cdd:TIGR02168 340 AELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLN--NEIERLEARlERLEDRRE 417
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2472 QLLQETQALQQSFLSEKDSLLQRErcIEQEKAKLEQLfQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHeA 2551
Cdd:TIGR02168 418 RLQQEIEELLKKLEEAELKELQAE--LEELEEELEEL-QEELERLEEALEELREELEEAEQALDAAERELAQLQARLD-S 493
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 1920237968 2552 EEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLqHLEEERRAALA 2599
Cdd:TIGR02168 494 LERLQENLEGFSEGVKALLKNQSGLSGILGVLSELI-SVDEGYEAAIE 540
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1670-2416 |
3.01e-10 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 66.86 E-value: 3.01e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1670 EQAVRQRELAEQELEKQRQLAEGTAQqrlAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV 1749
Cdd:COG4913 245 EDAREQIELLEPIRELAERYAAARER---LAELEYLRAALRLWFAQRRLELLEAELEELRAELARLEAELERLEARLDAL 321
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1750 RAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLA 1829
Cdd:COG4913 322 REELDELEAQIRGNGGDRLEQLEREIERLEREL---EERERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEE 398
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1830 AISEATRLKTEAEIALKEKEAENERLRrlAEDEAFQRRlleeqaaqhKADIEARLAQLRKAseseLERQKGLVEDTLR-- 1907
Cdd:COG4913 399 ELEALEEALAEAEAALRDLRRELRELE--AEIASLERR---------KSNIPARLLALRDA----LAEALGLDEAELPfv 463
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1908 -QRRQVEEEILALKGSFEKAaagkaelelelgrIRGTAEDTLRSKEQAEQ--EAARQRQLAAEeerrrreaeerVQKSLA 1984
Cdd:COG4913 464 gELIEVRPEEERWRGAIERV-------------LGGFALTLLVPPEHYAAalRWVNRLHLRGR-----------LVYERV 519
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1985 AEEEAARQRKAALEE--VERLKAKVEEARrlrERAEQESARQLQLAQEAAQKRLQAEEKA--------HAFAVQQKEQEL 2054
Cdd:COG4913 520 RTGLPDPERPRLDPDslAGKLDFKPHPFR---AWLEAELGRRFDYVCVDSPEELRRHPRAitragqvkGNGTRHEKDDRR 596
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2055 QQTLQqeqSVLerlrseaeaarraaeeaeaareraereAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEaeqe 2134
Cdd:COG4913 597 RIRSR---YVL---------------------------GFDNRAKLAALEAELAELEEELAEAEERLEALEAELDA---- 642
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2135 aarraqaeqaaLRQKQAADAEMEKHkQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQ 2214
Cdd:COG4913 643 -----------LQERREALQRLAEY-SWDEIDVASAEREIAELEAELERLDASSDDLAALEEQLEELEAELEELEEELDE 710
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2215 VEEELFSLRvqmEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQR 2294
Cdd:COG4913 711 LKGEIGRLE---KELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRENLEERIDALRARLNRAEE 787
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2295 ALAEKMLKEKMQAVQEATRLKAEAELLQQ-QKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLeterqrQLEMSAEAER 2373
Cdd:COG4913 788 ELERAMRAFNREWPAETADLDADLESLPEyLALLDRLEEDGLPEYEERFKELLNENSIEFVADL------LSKLRRAIRE 861
|
730 740 750 760 770
....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 2374 LRLRVAEMSRA----------------QARAEEDARRFRKQAEDIGERLYRTELATQEK 2416
Cdd:COG4913 862 IKERIDPLNDSlkripfgpgrylrleaRPRPDPEVREFRQELRAVTSGASLFDEELSEA 920
|
|
| CH_PLS_FIM_rpt1 |
cd21217 |
first calponin homology (CH) domain found in the plastin/fimbrin family; This family includes ... |
45-146 |
3.55e-10 |
|
first calponin homology (CH) domain found in the plastin/fimbrin family; This family includes plastin and fimbrin. Plastin has three isoforms, plastin-1, -2, and -3, which are all actin-bundling proteins. Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. Plastin-2, also called L-plastin, LC64P, or lymphocyte cytosolic protein 1 (LCP-1), plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-3, also called T-plastin, is found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Fimbrin has been found in plants and fungi. Arabidopsis thaliana fimbrin (AtFIM) includes fimbrin-1, -2, -3, -4, and -5; they cross-link actin filaments (F-actin) in a calcium independent manner. They stabilize and prevent F-actin depolymerization mediated by profilin. They act as key regulators of actin cytoarchitecture, probably involved in cell cycle, cell division, cell elongation and cytoplasmic tractus. AtFIM5 is an actin bundling factor that is required for pollen germination and pollen tube growth. Fungal fimbrin binds to actin, and functionally associates with actin structures involved in the development and maintenance of cell polarity. Members of this family contain four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409066 [Multi-domain] Cd Length: 114 Bit Score: 60.28 E-value: 3.55e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 45 KKTFTKWVNKHL-----IKHWRAEAQRHiSDLYEDLRDGHNLISLLEVLSGDSLPREKGRMR-----FHKLQNVQIALDY 114
Cdd:cd21217 3 KEAFVEHINSLLaddpdLKHLLPIDPDG-DDLFEALRDGVLLCKLINKIVPGTIDERKLNKKkpkniFEATENLNLALNA 81
|
90 100 110
....*....|....*....|....*....|..
gi 1920237968 115 LRHRQVKLVNIRNDDIADGNPKLTLGLIWTII 146
Cdd:cd21217 82 AKKIGCKVVNIGPQDILDGNPHLVLGLLWQII 113
|
|
| CH_FIMB_rpt3 |
cd21300 |
third calponin homology (CH) domain found in Saccharomyces cerevisiae fimbrin and similar ... |
36-143 |
3.61e-10 |
|
third calponin homology (CH) domain found in Saccharomyces cerevisiae fimbrin and similar proteins; Fimbrin binds to actin, and functionally associates with actin structures involved in the development and maintenance of cell polarity. Members of this family contain four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409149 Cd Length: 119 Bit Score: 60.52 E-value: 3.61e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 36 EQDERDRVqkktFTKWVNKhlikhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGDS--------LPREKGRMRFHKLQN 107
Cdd:cd21300 4 EGEREARV----FTLWLNS-------LDVEPAVNDLFEDLRDGLILLQAYDKVIPGSvnwkkvnkAPASAEISRFKAVEN 72
|
90 100 110
....*....|....*....|....*....|....*.
gi 1920237968 108 VQIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIW 143
Cdd:cd21300 73 TNYAVELGKQLGFSLVGIQGADITDGSRTLTLALVW 108
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
1006-1594 |
4.09e-10 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 66.52 E-value: 4.09e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1006 RECAQRITEQQKAQaevdglgkgvaRLSAEAEKVLALPEPSPA-APTLRSELEltlgkleqvrslsaiylEKLKTISLVI 1084
Cdd:PRK04863 523 SELEQRLRQQQRAE-----------RLLAEFCKRLGKNLDDEDeLEQLQEELE-----------------ARLESLSESV 574
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1085 RSTQEAEEVLRAHEEQLK-EAQAVPATLPELEATKAALKKLRAQAEaqqpvfdalrDELRGAQEVGERLQQrHGERDVEV 1163
Cdd:PRK04863 575 SEARERRMALRQQLEQLQaRIQRLAARAPAWLAAQDALARLREQSG----------EEFEDSQDVTEYMQQ-LLEREREL 643
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1164 ERWRERVTlllERWQAVLAQTD-VRQRELEQLGRQLRYyresADPLGAWL----------RDAKQRQ----EQIQAVPLA 1228
Cdd:PRK04863 644 TVERDELA---ARKQALDEEIErLSQPGGSEDPRLNAL----AERFGGVLlseiyddvslEDAPYFSalygPARHAIVVP 716
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1229 NSQAVREQLRQEKALLEDI-------ERHGEKVEECQRFAKQYINAIKDYELQLVTYKAqlEPVASPAKKPK----VQSG 1297
Cdd:PRK04863 717 DLSDAAEQLAGLEDCPEDLyliegdpDSFDDSVFSVEELEKAVVVKIADRQWRYSRFPE--VPLFGRAAREKrieqLRAE 794
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1298 SESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEEErlAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAER 1377
Cdd:PRK04863 795 REELAERYATLSFDVQKLQRLHQAFSRFIGSHLAVAFEAD--PEAELRQLNRRRVELERALADHESQEQQQRSQLEQAKE 872
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1378 EAQGLQR-----------RMQEEVARREEVAVEAQEQKRSIQ---------EELQHLRQSSEAEIQAKARQVEAAE---- 1433
Cdd:PRK04863 873 GLSALNRllprlnlladeTLADRVEEIREQLDEAEEAKRFVQqhgnalaqlEPIVSVLQSDPEQFEQLKQDYQQAQqtqr 952
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1434 --RSRLRIEEEIRVVRLQLEATERQR-GGAEGELQ-ALRARAEEAEAQKRQAQEEAerlrRQVQDETQRKRQAEAELALR 1509
Cdd:PRK04863 953 daKQQAFALTEVVQRRAHFSYEDAAEmLAKNSDLNeKLRQRLEQAEQERTRAREQL----RQAQAQLAQYNQVLASLKSS 1028
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1510 VQAEAEAAREKQRALQAL---------EELRLQAEEAERRLRQAEAERArqvqvALETAQRSAEAELQsehaSFAEKTAQ 1580
Cdd:PRK04863 1029 YDAKRQMLQELKQELQDLgvpadsgaeERARARRDELHARLSANRSRRN-----QLEKQLTFCEAEMD----NLTKKLRK 1099
|
650
....*....|....
gi 1920237968 1581 LERTLKEEHVAVVQ 1594
Cdd:PRK04863 1100 LERDYHEMREQVVN 1113
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1088-1539 |
4.51e-10 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 66.70 E-value: 4.51e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1088 QEAEEVLRAHEEQLK--EAQAVPATLPELEATKAALKKLRAQAEAQQPVFDAlrDELRGAQEVGERLQQRHGErdvEVER 1165
Cdd:PTZ00121 1497 KKADEAKKAAEAKKKadEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKA--DELKKAEELKKAEEKKKAE---EAKK 1571
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1166 WRERVTLLLERWQavlaqtDVRQRELEQLGRQLRYYRESADPLGAWLRdaKQRQEQIQAvplansqavrEQLRQEKALLE 1245
Cdd:PTZ00121 1572 AEEDKNMALRKAE------EAKKAEEARIEEVMKLYEEEKKMKAEEAK--KAEEAKIKA----------EELKKAEEEKK 1633
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1246 DIERHGEKVEECQRFAKQyinaIKDYELQLVTYKAQLEPVASPAKKPkvqsgSESIIQEYVDLRtRYSELSTLTSQYIRF 1325
Cdd:PTZ00121 1634 KVEQLKKKEAEEKKKAEE----LKKAEEENKIKAAEEAKKAEEDKKK-----AEEAKKAEEDEK-KAAEALKKEAEEAKK 1703
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1326 ISETLRRMEEEERLAEQQRAEERERLAEVEaalekqrqlaeahaQAKAQAEREaqglqRRMQEEVARREEVAVEAQEQKR 1405
Cdd:PTZ00121 1704 AEELKKKEAEEKKKAEELKKAEEENKIKAE--------------EAKKEAEED-----KKKAEEAKKDEEEKKKIAHLKK 1764
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1406 SIQEELQHLRQSSEAEIQAKARqvEAAERSRLRIEEEIRVVRLQLEATerQRGGAEGELQALRARAEEAEAQKRQAqEEA 1485
Cdd:PTZ00121 1765 EEEKKAEEIRKEKEAVIEEELD--EEDEKRRMEVDKKIKDIFDNFANI--IEGGKEGNLVINDSKEMEDSAIKEVA-DSK 1839
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 1486 ERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRaLQALEELRLQAEEAER 1539
Cdd:PTZ00121 1840 NMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDL-KEDDEEEIEEADEIEK 1892
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
1524-2019 |
4.58e-10 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 66.22 E-value: 4.58e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1524 LQALEELRLQAEEAE---RRLRQAEAERARQVQVALE-TAQRSAEAELQSEHASFAEKTAQLERtLKEEHVAVVQLREEA 1599
Cdd:PRK02224 161 LGKLEEYRERASDARlgvERVLSDQRGSLDQLKAQIEeKEEKDLHERLNGLESELAELDEEIER-YEEQREQARETRDEA 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1600 TRRAQQQAEAERARAEAERELERWQLKANEALRLRLQ-AEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQREL 1678
Cdd:PRK02224 240 DEVLEEHEERREELETLEAEIEDLRETIAETEREREElAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREEL 319
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1679 AEQELEKQRQLAEGTAQQRlAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLA 1758
Cdd:PRK02224 320 EDRDEELRDRLEECRVAAQ-AHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRE 398
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1759 SKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEE----------------DAVRQRAEAER 1822
Cdd:PRK02224 399 RFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALLEAgkcpecgqpvegsphvETIEEDRERVE 478
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1823 VLAEKLAAIsEATRLKTEAEI----ALKEKEAENERLRRlaedeafQRRLLEEQAAQHKADIEA---RLAQLRKAS---E 1892
Cdd:PRK02224 479 ELEAELEDL-EEEVEEVEERLeraeDLVEAEDRIERLEE-------RREDLEELIAERRETIEEkreRAEELRERAaelE 550
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1893 SELERQKglvEDTLRQRRQVEEEILALKGSFEKAAAGKAELElELGRIRgtaeDTLRSKEQAEQEAARQRQLAAEEERRR 1972
Cdd:PRK02224 551 AEAEEKR---EAAAEAEEEAEEAREEVAELNSKLAELKERIE-SLERIR----TLLAAIADAEDEIERLREKREALAELN 622
|
490 500 510 520
....*....|....*....|....*....|....*....|....*..
gi 1920237968 1973 REAEERVQkslaaeeeAARQRKAALEEvERLKAKVEEARRLRERAEQ 2019
Cdd:PRK02224 623 DERRERLA--------EKRERKRELEA-EFDEARIEEAREDKERAEE 660
|
|
| PRK05035 |
PRK05035 |
electron transport complex protein RnfC; Provisional |
1759-2030 |
5.11e-10 |
|
electron transport complex protein RnfC; Provisional
Pssm-ID: 235334 [Multi-domain] Cd Length: 695 Bit Score: 66.12 E-value: 5.11e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1759 SKARAEEESRSTSEKSKQRLEAEAGRF-RELAEEAARlralAEEAKRQRQLAEEDAVrqrAEAERVLAEKLAAISEATRL 1837
Cdd:PRK05035 436 AEIRAIEQEKKKAEEAKARFEARQARLeREKAAREAR----HKKAAEARAAKDKDAV---AAALARVKAKKAAATQPIVI 508
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1838 KTEAEIALKEKEAEnERLRRLAEDEAFQRRLLEEQAAQHKADIEARL--AQLRKASESELERQKGLVEDTlrQRRQVEEE 1915
Cdd:PRK05035 509 KAGARPDNSAVIAA-REARKAQARARQAEKQAAAAADPKKAAVAAAIarAKAKKAAQQAANAEAEEEVDP--KKAAVAAA 585
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1916 ILALKGSFEKAAAGKAELELELgrirgTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKA 1995
Cdd:PRK05035 586 IARAKAKKAAQQAASAEPEEQV-----AEVDPKKAAVAAAIARAKAKKAEQQANAEPEEPVDPRKAAVAAAIARAKARKA 660
|
250 260 270
....*....|....*....|....*....|....*
gi 1920237968 1996 ALEEVERLKAKVEEARRLRERAEQESARQLQLAQE 2030
Cdd:PRK05035 661 AQQQANAEPEEAEDPKKAAVAAAIARAKAKKAAQQ 695
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1681-2042 |
6.24e-10 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 65.56 E-value: 6.24e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1681 QELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQR--EAAAATQKRRELEAELAKVRAEMEvlla 1758
Cdd:COG4717 74 KELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKllQLLPLYQELEALEAELAELPERLE---- 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1759 sKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLK 1838
Cdd:COG4717 150 -ELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEE 228
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1839 TEAEIALKEKEAENERLRRL---------------AEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVE 1903
Cdd:COG4717 229 LEQLENELEAAALEERLKEArlllliaaallallgLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGKEAEELQ 308
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1904 DTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERrrreaeerVQKSL 1983
Cdd:COG4717 309 ALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAAL--------LAEAG 380
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1984 AAEEEAARQRKAALEEVERLKAKVEEA-RRLRERAEQESARQLQLAQEAAQKRLQAEEKA 2042
Cdd:COG4717 381 VEDEEELRAALEQAEEYQELKEELEELeEQLEELLGELEELLEALDEEELEEELEELEEE 440
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
1092-1911 |
6.60e-10 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 66.13 E-value: 6.60e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1092 EVLRAHEEQLKEAQAVPATLPELEATKAALKKLRAQAEAqqpvfdaLRDELRGAQEvGERLQQRHGERDVEVERWRERvt 1171
Cdd:PRK04863 294 ELYTSRRQLAAEQYRLVEMARELAELNEAESDLEQDYQA-------ASDHLNLVQT-ALRQQEKIERYQADLEELEER-- 363
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1172 llLERWQAVLAQTDVRQRELEqlgRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVReQLRQEKALLE----DI 1247
Cdd:PRK04863 364 --LEEQNEVVEEADEQQEENE---ARAEAAEEEVDELKSQLADYQQALDVQQTRAIQYQQAVQ-ALERAKQLCGlpdlTA 437
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1248 ERHGEKVEECQRFAKQYINAIKDYELQLVTYKA---QLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLTSQY-- 1322
Cdd:PRK04863 438 DNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAahsQFEQAYQLVRKIAGEVSRSEAWDVARELLRRLREQRHLAEQLqq 517
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1323 IRFISETLRRMEEEERLAEQQRAEERERL-------AEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREE 1395
Cdd:PRK04863 518 LRMRLSELEQRLRQQQRAERLLAEFCKRLgknlddeDELEQLQEELEARLESLSESVSEARERRMALRQQLEQLQARIQR 597
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1396 VAVEAQEQkRSIQEELQHLRQSSEAE----------IQAKARQVEAAERSRLRIEEEIRVVRLQLEATErQRGGAEGE-L 1464
Cdd:PRK04863 598 LAARAPAW-LAAQDALARLREQSGEEfedsqdvteyMQQLLERERELTVERDELAARKQALDEEIERLS-QPGGSEDPrL 675
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1465 QALRAR-----------------AEEAEA---QKRQA--QEEAERLRRQVQDET-------------QRKRQA-----EA 1504
Cdd:PRK04863 676 NALAERfggvllseiyddvsledAPYFSAlygPARHAivVPDLSDAAEQLAGLEdcpedlyliegdpDSFDDSvfsveEL 755
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1505 ELALRVQ-AEAE--------------AAREKQralqaLEELRLQAEEAERRLRQAEAERaRQVQVALETAQR-------- 1561
Cdd:PRK04863 756 EKAVVVKiADRQwrysrfpevplfgrAAREKR-----IEQLRAEREELAERYATLSFDV-QKLQRLHQAFSRfigshlav 829
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1562 SAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVA 1641
Cdd:PRK04863 830 AFEADPEAELRQLNRRRVELERALADHESQEQQQRSQLEQAKEGLSALNRLLPRLNLLADETLADRVEEIREQLDEAEEA 909
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1642 QQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQ--------------LAEGTAQQRLAAEQEL-IR 1706
Cdd:PRK04863 910 KRFVQQHGNALAQLEPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQqafaltevvqrrahFSYEDAAEMLAKNSDLnEK 989
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1707 LRAETEQGEQQRQLLEEEL--------------ARLQREAAAATQKRRELEAELAK--VRAEMEVLLASKARAEE----- 1765
Cdd:PRK04863 990 LRQRLEQAEQERTRAREQLrqaqaqlaqynqvlASLKSSYDAKRQMLQELKQELQDlgVPADSGAEERARARRDElharl 1069
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1766 ----ESRSTSEKSKQRLEAE----AGRFRELAEEAARLRALAEEAKRQRQLAeEDAVRQRAEAERVLAEKLAAISeATRL 1837
Cdd:PRK04863 1070 sanrSRRNQLEKQLTFCEAEmdnlTKKLRKLERDYHEMREQVVNAKAGWCAV-LRLVKDNGVERRLHRRELAYLS-ADEL 1147
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1838 KTEAEI---ALKEKEAENERLR---RLAEDEAFQ----------RRLLEEQAAQhkaDIeARLAQLRKASEsELERQKGL 1901
Cdd:PRK04863 1148 RSMSDKalgALRLAVADNEHLRdvlRLSEDPKRPerkvqfyiavYQHLRERIRQ---DI-IRTDDPVEAIE-QMEIELSR 1222
|
970
....*....|
gi 1920237968 1902 VEDTLRQRRQ 1911
Cdd:PRK04863 1223 LTEELTSREQ 1232
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1196-1866 |
6.64e-10 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 65.86 E-value: 6.64e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1196 RQLRYYRESADPLGAWLRDAKQRQEQIQAvpLANSQAVREQLRQEKALLEDIERHGEKVEECQRFAKQYINAIKDY---- 1271
Cdd:PRK03918 111 SSVREWVERLIPYHVFLNAIYIRQGEIDA--ILESDESREKVVRQILGLDDYENAYKNLGEVIKEIKRRIERLEKFikrt 188
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1272 ---ELQLVTYKAQLEPVASPAKK-----PKVQSGSESIIQEYVDLRTRYSELSTLTSQyIRFISETLRRMEEEERLAEQQ 1343
Cdd:PRK03918 189 eniEELIKEKEKELEEVLREINEisselPELREELEKLEKEVKELEELKEEIEELEKE-LESLEGSKRKLEEKIRELEER 267
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1344 RAEERERLAEVE---AALEKQRQLAEAHAQAK-----------------AQAEREAQGLQRRMQEEVARREEVAvEAQEQ 1403
Cdd:PRK03918 268 IEELKKEIEELEekvKELKELKEKAEEYIKLSefyeeyldelreiekrlSRLEEEINGIEERIKELEEKEERLE-ELKKK 346
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1404 KRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQE 1483
Cdd:PRK03918 347 LKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKK 426
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1484 EAERLRRQVQDETQRKRQAEAELALRVQAEAEAarEKQRALQALEELRLQAEEAERRLRQAEAERARQ--VQVALETAQ- 1560
Cdd:PRK03918 427 AIEELKKAKGKCPVCGRELTEEHRKELLEEYTA--ELKRIEKELKEIEEKERKLRKELRELEKVLKKEseLIKLKELAEq 504
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1561 -RSAEAELQSEHASFAEKTAQLERTLKEEhvaVVQLREEATRRAQqqaeaeraraeaerelerwQLKANEALRLRLQAEE 1639
Cdd:PRK03918 505 lKELEEKLKKYNLEELEKKAEEYEKLKEK---LIKLKGEIKSLKK-------------------ELEKLEELKKKLAELE 562
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1640 VAQQKsltqaeaekqkeeaerearrrgKAEEQAVRQRELAEQELEKQRQLaEGTAQQRLAAEQELIRLRaeteQGEQQRQ 1719
Cdd:PRK03918 563 KKLDE----------------------LEEELAELLKELEELGFESVEEL-EERLKELEPFYNEYLELK----DAEKELE 615
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1720 LLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLasKARAEEESRSTSEKskqrleaeagrFRELAEEAARLRALA 1799
Cdd:PRK03918 616 REEKELKKLEEELDKAFEELAETEKRLEELRKELEELE--KKYSEEEYEELREE-----------YLELSRELAGLRAEL 682
|
650 660 670 680 690 700 710
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1800 EEAKRQRQLAEEDAvrqraeaeRVLAEKLAAISEATRLKTEAEIALKEKEAENERLRR---LAEDEAFQR 1866
Cdd:PRK03918 683 EELEKRREEIKKTL--------EKLKEELEEREKAKKELEKLEKALERVEELREKVKKykaLLKERALSK 744
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
2257-2678 |
7.16e-10 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 65.93 E-value: 7.16e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2257 EEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLK-AEAELLQQQKELAQ--EQAR 2333
Cdd:PTZ00121 1091 EATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEdAKRVEIARKAEDARkaEEAR 1170
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2334 RLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRlRVAEMSRAQ-ARAEEDARRF---RKQAEDIGERLYRT 2409
Cdd:PTZ00121 1171 KAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEER-KAEEARKAEdAKKAEAVKKAeeaKKDAEEAKKAEEER 1249
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2410 ELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQllqlKSEEMQTVrqEQLLQETQALQQSFLSEKD 2489
Cdd:PTZ00121 1250 NNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAK----KAEEKKKA--DEAKKKAEEAKKADEAKKK 1323
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2490 SLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASME-----------EARRRQHEAE---EGV 2555
Cdd:PTZ00121 1324 AEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAkkkadaakkkaEEKKKADEAKkkaEED 1403
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2556 RRQQEELQRLAQQQQQQEKLL--AEENQRLRERLQHLEEERRAalarseEIAPSRAAAARALPNGQDAADGPAAAAEPEH 2633
Cdd:PTZ00121 1404 KKKADELKKAAAAKKKADEAKkkAEEKKKADEAKKKAEEAKKA------DEAKKKAEEAKKAEEAKKKAEEAKKADEAKK 1477
|
410 420 430 440
....*....|....*....|....*....|....*....|....*
gi 1920237968 2634 AFDGLRRKVPAQRLQEVGVLSAEELQQLAQGRTTVAELAQREDVR 2678
Cdd:PTZ00121 1478 KAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAK 1522
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
939-1588 |
1.05e-09 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 65.09 E-value: 1.05e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 939 DRLQAEREYgscSRHYQQLLQSLEQGEQEESrcqrcISELKDIRLQLEACEtrtvhrlrlpldkepaRECAQRITEQQKA 1018
Cdd:TIGR02169 201 ERLRREREK---AERYQALLKEKREYEGYEL-----LKEKEALERQKEAIE----------------RQLASLEEELEKL 256
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1019 QAEVDGLGKGVA----RLSAEAEKVLALPEPSPAAptLRSELELTLGKLEQVRSLSAIYLEKL--------KTISLVIRS 1086
Cdd:TIGR02169 257 TEEISELEKRLEeieqLLEELNKKIKDLGEEEQLR--VKEKIGELEAEIASLERSIAEKERELedaeerlaKLEAEIDKL 334
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1087 TQEAEEVLRAHEEQLKEAQAVPAtlpELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGE-------- 1158
Cdd:TIGR02169 335 LAEIEELEREIEEERKRRDKLTE---EYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINElkreldrl 411
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1159 ------RDVEVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQ--------- 1223
Cdd:TIGR02169 412 qeelqrLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEkelsklqre 491
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1224 -----AVPLANSQAVREQLRQEKALLEDI--------------ERHGEKVE-------------------ECQRFAKQY- 1264
Cdd:TIGR02169 492 laeaeAQARASEERVRGGRAVEEVLKASIqgvhgtvaqlgsvgERYATAIEvaagnrlnnvvveddavakEAIELLKRRk 571
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1265 --------INAIK---------------DYELQLVTYKAQLEPVASPAKKPKV-----QSGSESIIQ-----------EY 1305
Cdd:TIGR02169 572 agratflpLNKMRderrdlsilsedgviGFAVDLVEFDPKYEPAFKYVFGDTLvvediEAARRLMGKyrmvtlegelfEK 651
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1306 VDLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRR 1385
Cdd:TIGR02169 652 SGAMTGGSRAPRGGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQE 731
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1386 MQEEVARREEVAVEAQEQKRSIQEELQHLrQSSEAEIQAKARQVEAAERSRLRIE-----EEIRVVRLQLEATERQRGGA 1460
Cdd:TIGR02169 732 EEKLKERLEELEEDLSSLEQEIENVKSEL-KELEARIEELEEDLHKLEEALNDLEarlshSRIPEIQAELSKLEEEVSRI 810
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1461 EGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRK---RQAEAELALRVQAEAEAAREKQRALQALEE----LRLQ 1533
Cdd:TIGR02169 811 EARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIksiEKEIENLNGKKEELEEELEELEAALRDLESrlgdLKKE 890
|
730 740 750 760 770
....*....|....*....|....*....|....*....|....*....|....*
gi 1920237968 1534 AEEAERRLRQAEaERARQVQVALETAqRSAEAELQSEHASFAEKTAQLERTLKEE 1588
Cdd:TIGR02169 891 RDELEAQLRELE-RKIEELEAQIEKK-RKRLSELKAKLEALEEELSEIEDPKGED 943
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1501-1868 |
1.29e-09 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 64.76 E-value: 1.29e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1501 QAEAELALRVQAEAEAAREKQRALQALEELrlqAEEAERRLRQAEAERARQvqvaletaqrsaeAELQSEHASFAEktaq 1580
Cdd:pfam17380 279 QHQKAVSERQQQEKFEKMEQERLRQEKEEK---AREVERRRKLEEAEKARQ-------------AEMDRQAAIYAE---- 338
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1581 lertlkEEHVAVVQLREeatrraqqqaeaeraraeaereLERWQLKANEALRLRLQAEEVAQQksLTQAEAEKQKEEAER 1660
Cdd:pfam17380 339 ------QERMAMERERE----------------------LERIRQEERKRELERIRQEEIAME--ISRMRELERLQMERQ 388
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1661 EARRRGKAEEQAVRQRELAEQE-----LEKQRQLAEGTAQQRLAAEQELIRLraETEQGEQQRQLLEEELARLQReaaaa 1735
Cdd:pfam17380 389 QKNERVRQELEAARKVKILEEErqrkiQQQKVEMEQIRAEQEEARQREVRRL--EEERAREMERVRLEEQERQQQ----- 461
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1736 TQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKskqrlEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAvR 1815
Cdd:pfam17380 462 VERLRQQEEERKRKKLELEKEKRDRKRAEEQRRKILEK-----ELEERKQAMIEEERKRKLLEKEMEERQKAIYEEER-R 535
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 1816 QRAEAER---VLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRL 1868
Cdd:pfam17380 536 REAEEERrkqQEMEERRRIQEQMRKATEERSRLEAMEREREMMRQIVESEKARAEY 591
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1233-1823 |
1.55e-09 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 64.70 E-value: 1.55e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1233 VREQLRQEKALLEDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDlrtRY 1312
Cdd:PRK03918 219 LREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAE---EY 295
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1313 SELSTLTSQYIrfisETLRRMEEEERLAEQQRAEERERLAEVEaalEKQRQLaeahaqakaqaeREAQGLQRRMQEEVAR 1392
Cdd:PRK03918 296 IKLSEFYEEYL----DELREIEKRLSRLEEEINGIEERIKELE---EKEERL------------EELKKKLKELEKRLEE 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1393 REEvAVEAQEQKRSIQEELQHLRQS-SEAEIQAKARQVEAAERSRLRIEEEIRVV---RLQLEATERQRGGAEGELQALR 1468
Cdd:PRK03918 357 LEE-RHELYEEAKAKKEELERLKKRlTGLTPEKLEKELEELEKAKEEIEEEISKItarIGELKKEIKELKKAIEELKKAK 435
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1469 ARA---------EEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAlrvqaEAEAAREKQRALQALEELRLQAEEAER 1539
Cdd:PRK03918 436 GKCpvcgrelteEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELR-----ELEKVLKKESELIKLKELAEQLKELEE 510
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1540 RLRQAEAERarqvqvaLETAQRSAEaELQSEHASFAEKTAQLERTLKEEHvavvQLREEATRRAQQQAEAERARAEAERE 1619
Cdd:PRK03918 511 KLKKYNLEE-------LEKKAEEYE-KLKEKLIKLKGEIKSLKKELEKLE----ELKKKLAELEKKLDELEEELAELLKE 578
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1620 LERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEgtAQQRLA 1699
Cdd:PRK03918 579 LEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELE--ELEKKY 656
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1700 AEQELIRLRAETEQgeqqrqlLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLlaSKARAEEESrstSEKSKQRLE 1779
Cdd:PRK03918 657 SEEEYEELREEYLE-------LSRELAGLRAELEELEKRREEIKKTLEKLKEELEER--EKAKKELEK---LEKALERVE 724
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1780 AEAGRFRELAEEAARlRALAEEAKRQRQLAEE------DAVRQRAEAERV 1823
Cdd:PRK03918 725 ELREKVKKYKALLKE-RALSKVGEIASEIFEEltegkySGVRVKAEENKV 773
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1686-1892 |
1.93e-09 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 63.24 E-value: 1.93e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1686 QRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEE 1765
Cdd:COG4942 18 QADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRA 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1766 ESRSTSEKSKQRLEA--------------EAGRFRELAEEAARLRALAEEAKRQ-----RQLAEEDAVRQRAEAERvlAE 1826
Cdd:COG4942 98 ELEAQKEELAELLRAlyrlgrqpplalllSPEDFLDAVRRLQYLKYLAPARREQaeelrADLAELAALRAELEAER--AE 175
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 1827 KLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASE 1892
Cdd:COG4942 176 LEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAE 241
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1340-1538 |
2.08e-09 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 62.90 E-value: 2.08e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1340 AEQQRAEE-RERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARReevavEAQEQKrsiQEELQHLRQSS 1418
Cdd:PRK09510 72 KSAKRAEEqRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQ-----AALKQK---QAEEAAAKAAA 143
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1419 EAEIQAKARQVEAAERSRlRIEEEIRvVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERlrrqVQDETQR 1498
Cdd:PRK09510 144 AAKAKAEAEAKRAAAAAK-KAAAEAK-KKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKK----AAAEAKK 217
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 1920237968 1499 KRQAEAELAL-RVQAEAEAAREKQRALQALEELRLQAEEAE 1538
Cdd:PRK09510 218 KAAAEAKAAAaKAAAEAKAAAEKAAAAKAAEKAAAAKAAAE 258
|
|
| SPEC |
cd00176 |
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ... |
1085-1281 |
2.48e-09 |
|
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here
Pssm-ID: 238103 [Multi-domain] Cd Length: 213 Bit Score: 60.54 E-value: 2.48e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1085 RSTQEAEEVLRAHEEQLKEAQaVPATLPELEATKAALKKLRAQAEAQQPVFDALrdelrgaQEVGERLQQRHGERDVEVe 1164
Cdd:cd00176 7 RDADELEAWLSEKEELLSSTD-YGDDLESVEALLKKHEALEAELAAHEERVEAL-------NELGEQLIEEGHPDAEEI- 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1165 rwRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADpLGAWLRDAKQRQEQIQavPLANSQAVREQLRQEKALL 1244
Cdd:cd00176 78 --QERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADD-LEQWLEEKEAALASED--LGKDLESVEELLKKHKELE 152
|
170 180 190
....*....|....*....|....*....|....*..
gi 1920237968 1245 EDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQ 1281
Cdd:cd00176 153 EELEAHEPRLKSLNELAEELLEEGHPDADEEIEEKLE 189
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1709-2298 |
2.57e-09 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 63.93 E-value: 2.57e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1709 AETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTS--EKSKQRLEAEAG--- 1783
Cdd:PRK03918 186 KRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELEslEGSKRKLEEKIRele 265
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1784 --------RFRELAEEAARLRALAEEAKRQRQLAEEdaVRQRAEAERVLAEKLAAISEATRlktEAEIALKEKEAENERL 1855
Cdd:PRK03918 266 erieelkkEIEELEEKVKELKELKEKAEEYIKLSEF--YEEYLDELREIEKRLSRLEEEIN---GIEERIKELEEKEERL 340
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1856 RRLAEDEAFQRRLLEEQAAQHKA--DIEARLAQLRKASES----ELERQKGLVEDTLRQRRQVEEEILAL---KGSFEKA 1926
Cdd:PRK03918 341 EELKKKLKELEKRLEELEERHELyeEAKAKKEELERLKKRltglTPEKLEKELEELEKAKEEIEEEISKItarIGELKKE 420
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1927 AAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLaaeeerrrrEAEERVQKSLAAEEEAARQRKAALEEVERLKAK 2006
Cdd:PRK03918 421 IKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYT---------AELKRIEKELKEIEEKERKLRKELRELEKVLKK 491
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2007 VEEARRLRERAEQESARQLQLAQEAAQKrlqAEEKAHAF-AVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAeeaeaa 2085
Cdd:PRK03918 492 ESELIKLKELAEQLKELEEKLKKYNLEE---LEKKAEEYeKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELE------ 562
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2086 reraereaaqsrRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQ 2165
Cdd:PRK03918 563 ------------KKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDK 630
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2166 ALRQKAQVEQELTALRLQLEETdhQKSILDEELQRLKAEVTEaarqrgqVEEELFSLRVQMEELGKLKARIEAEnralvL 2245
Cdd:PRK03918 631 AFEELAETEKRLEELRKELEEL--EKKYSEEEYEELREEYLE-------LSRELAGLRAELEELEKRREEIKKT-----L 696
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|....*
gi 1920237968 2246 RDkdsaqrlLQEEAEKMKQVAEEAARLSVAAQEAARLRQ--LAEEDLAQQRALAE 2298
Cdd:PRK03918 697 EK-------LKEELEEREKAKKELEKLEKALERVEELREkvKKYKALLKERALSK 744
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
2146-2604 |
2.77e-09 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 63.83 E-value: 2.77e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2146 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILdeelqRLKAEVTEAARQRGQVEEELFSLRVQ 2225
Cdd:TIGR00618 245 LTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRARKAA-----PLAAHIKAVTQIEQQAQRIHTELQSK 319
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2226 MEELGKLKARIEA--ENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAA---RLRQLAEE---DLAQQRALA 2297
Cdd:TIGR00618 320 MRSRAKLLMKRAAhvKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTltqHIHTLQQQkttLTQKLQSLC 399
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2298 EKMLKEKMQAVQEATRLKAEAELLQQ------QKELAQEQARRLQEDKEQMAQQLAQETQGFQK-------------TLE 2358
Cdd:TIGR00618 400 KELDILQREQATIDTRTSAFRDLQGQlahakkQQELQQRYAELCAAAITCTAQCEKLEKIHLQEsaqslkereqqlqTKE 479
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2359 TERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLR 2438
Cdd:TIGR00618 480 QIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRA 559
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2439 EAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEqlLQETQALQQSFLSEKDSLLQRERCIE---------QEKAKLEQLF 2509
Cdd:TIGR00618 560 SLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNI--TVRLQDLTEKLSEAEDMLACEQHALLrklqpeqdlQDVRLHLQQC 637
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2510 QDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEAR---------------------------------RRQHEAEEGVR 2556
Cdd:TIGR00618 638 SQELALKLTALHALQLTLTQERVREHALSIRVLPKEllasrqlalqkmqsekeqltywkemlaqcqtllRELETHIEEYD 717
|
490 500 510 520
....*....|....*....|....*....|....*....|....*...
gi 1920237968 2557 RQQEELQRLAQQQQQQeklLAEENQRLRERLQHLEEERRAALARSEEI 2604
Cdd:TIGR00618 718 REFNEIENASSSLGSD---LAAREDALNQSLKELMHQARTVLKARTEA 762
|
|
| CH_FLNC_rpt2 |
cd21314 |
second calponin homology (CH) domain found in filamin-C (FLN-C) and similar proteins; ... |
164-266 |
2.80e-09 |
|
second calponin homology (CH) domain found in filamin-C (FLN-C) and similar proteins; Filamin-C (FLN-C), also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. FLN-C contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409163 Cd Length: 115 Bit Score: 57.77 E-value: 2.80e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 164 TAKEKLLLWSQRMVEGcqgLRCDNFTTSWRDGRLFNAIIHRHKPTLI-DMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 242
Cdd:cd21314 11 TPKQRLLGWIQNKVPQ---LPITNFNRDWQDGKALGALVDNCAPGLCpDWESWDPNQPVQNAREAMQQADDWLGVPQVIA 87
|
90 100
....*....|....*....|....
gi 1920237968 243 PEDVDVPQPDEKSIITYVSSLYDA 266
Cdd:cd21314 88 PEEIVDPNVDEHSVMTYLSQFPKA 111
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
2243-2594 |
2.94e-09 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 63.60 E-value: 2.94e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2243 LVLRDKDSAQRLLQEEAEKM-----KQVAEEAARlsvaaqEAARLRQLAEEDLAQQRALAEkmlkekmqavqeatrlkaE 2317
Cdd:pfam17380 277 IVQHQKAVSERQQQEKFEKMeqerlRQEKEEKAR------EVERRRKLEEAEKARQAEMDR------------------Q 332
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2318 AELLQQQKELAQEQARRL----QEDKEQMAQQLAQETQGFQKTLETERQR-QLEMSAEAERLRLRVAEMSRAQARAEEDA 2392
Cdd:pfam17380 333 AAIYAEQERMAMERERELerirQEERKRELERIRQEEIAMEISRMRELERlQMERQQKNERVRQELEAARKVKILEEERQ 412
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2393 RRFRKQAEDIGERLYRTELATQEKVmlvQTLETQRQqsdRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQ 2472
Cdd:pfam17380 413 RKIQQQKVEMEQIRAEQEEARQREV---RRLEEERA---REMERVRLEEQERQQQVERLRQQEEERKRKKLELEKEKRDR 486
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2473 LLQETQalqqsflsekdsllqRERCIEQEKAKLEQLFQDEVAKAQALREEqqrqqqqmqqekqqlaasMEEarRRQHEAE 2552
Cdd:pfam17380 487 KRAEEQ---------------RRKILEKELEERKQAMIEEERKRKLLEKE------------------MEE--RQKAIYE 531
|
330 340 350 360
....*....|....*....|....*....|....*....|..
gi 1920237968 2553 EGVRRQQEELQRlaqqqqqqEKLLAEENQRLRERLQHLEEER 2594
Cdd:pfam17380 532 EERRREAEEERR--------KQQEMEERRRIQEQMRKATEER 565
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
1188-1566 |
3.40e-09 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 63.82 E-value: 3.40e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1188 QRELEQLGRQLRYYRESADplgawlrDAKQRQEQIQA-VPLANSQAvREQLRQekaLLEDIERHGEKVEECQRFAKQYIN 1266
Cdd:COG3096 849 ERELAQHRAQEQQLRQQLD-------QLKEQLQLLNKlLPQANLLA-DETLAD---RLEELREELDAAQEAQAFIQQHGK 917
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1267 AIkdyelqlvtykAQLEPVASPAKKPKVQSgsESIIQEYVDLRTRYSELStltsQYIRFISETLRRM------EEEERLA 1340
Cdd:COG3096 918 AL-----------AQLEPLVAVLQSDPEQF--EQLQADYLQAKEQQRRLK----QQIFALSEVVQRRphfsyeDAVGLLG 980
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1341 EQQRAEE--RERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEevarreevaveAQEQKRSIQEELQHLrqss 1418
Cdd:COG3096 981 ENSDLNEklRARLEQAEEARREAREQLRQAQAQYSQYNQVLASLKSSRDA-----------KQQTLQELEQELEEL---- 1045
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1419 eaEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAE-------RLRRQ 1491
Cdd:COG3096 1046 --GVQADAEAEERARIRRDELHEELSQNRSRRSQLEKQLTRCEAEMDSLQKRLRKAERDYKQEREQVVqakagwcAVLRL 1123
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237968 1492 VQDETQRKRQAEAELalrvqaeaeaarekqrALQALEELRLQAEEAERRLRQAEAERArQVQVALETAQRSAEAE 1566
Cdd:COG3096 1124 ARDNDVERRLHRREL----------------AYLSADELRSMSDKALGALRLAVADNE-HLRDALRLSEDPRRPE 1181
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1102-1523 |
3.63e-09 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 63.22 E-value: 3.63e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1102 KEAQAVPATLPELEA-----------------TKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERdVEVE 1164
Cdd:pfam17380 221 KEVQGMPHTLAPYEKmerrkesfnlaedvttmTPEYTVRYNGQTMTENEFLNQLLHIVQHQKAVSERQQQEKFEK-MEQE 299
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1165 RWR---ERVTLLLERWQAVLAQTDVRQRELEqlgRQLRYYRESAdplgawlRDAKQRQEQIQAVPLANSQAVREQLRQEK 1241
Cdd:pfam17380 300 RLRqekEEKAREVERRRKLEEAEKARQAEMD---RQAAIYAEQE-------RMAMERERELERIRQEERKRELERIRQEE 369
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1242 ALLEDierhgEKVEECQRFakqyinaikdyelqlvtykaQLEPvaspakkpkvQSGSESIIQEYVDLRtrysELSTLTSQ 1321
Cdd:pfam17380 370 IAMEI-----SRMRELERL--------------------QMER----------QQKNERVRQELEAAR----KVKILEEE 410
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1322 YIRFISETLRRMEEEERLAEQQRAEERERLAEveaalEKQRQLAEAHAQakaQAEREAQGLQRRMQEEVARREEVAVEAQ 1401
Cdd:pfam17380 411 RQRKIQQQKVEMEQIRAEQEEARQREVRRLEE-----ERAREMERVRLE---EQERQQQVERLRQQEEERKRKKLELEKE 482
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1402 EQKRSIQEELQhlRQSSEAEIQAKARQVEAAERSRLRIEEEIRvvrlqleatERQRGGAEGElqalRARAEEAEAQKRQA 1481
Cdd:pfam17380 483 KRDRKRAEEQR--RKILEKELEERKQAMIEEERKRKLLEKEME---------ERQKAIYEEE----RRREAEEERRKQQE 547
|
410 420 430 440
....*....|....*....|....*....|....*....|..
gi 1920237968 1482 QEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRA 1523
Cdd:pfam17380 548 MEERRRIQEQMRKATEERSRLEAMEREREMMRQIVESEKARA 589
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
1678-1909 |
3.67e-09 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 61.79 E-value: 3.67e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1678 LAEQELEKQRQLAEGTAQqrlAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAelakvraemevll 1757
Cdd:TIGR02794 47 AVAQQANRIQQQKKPAAK---KEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQA------------- 110
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1758 askARAEEESRSTSEKSKQRLEAEAGRfrelAEEAARLRALAEEAKRQrqlAEEDAVRQRAEAervlAEKLAaisEATRL 1837
Cdd:TIGR02794 111 ---AKQAEEKQKQAEEAKAKQAAEAKA----KAEAEAERKAKEEAAKQ---AEEEAKAKAAAE----AKKKA---EEAKK 173
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 1838 KTEAEiALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIE----ARLAQLRKASESELERQKGLVEDTLRQR 1909
Cdd:TIGR02794 174 KAEAE-AKAKAEAEAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEaaaaAAAEAERKADEAELGDIFGLASGSNAEK 248
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1482-1963 |
4.06e-09 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 62.86 E-value: 4.06e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1482 QEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQR 1561
Cdd:COG4717 52 EKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLY 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1562 SAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREeatrraqqqaeaeraraeaerelerwqlKANEALRLRLQAEEVA 1641
Cdd:COG4717 132 QELEALEAELAELPERLEELEERLEELRELEEELEE----------------------------LEAELAELQEELEELL 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1642 QQKSLTQAeaekqkeeaerearrrgKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLL 1721
Cdd:COG4717 184 EQLSLATE-----------------EELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERLK 246
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1722 EEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAeeeSRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEE 1801
Cdd:COG4717 247 EARLLLLIAAALLALLGLGGSLLSLILTIAGVLFLVLGLLAL---LFLLLAREKASLGKEAEELQALPALEELEEEELEE 323
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1802 AKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLE--EQAAQHKAD 1879
Cdd:COG4717 324 LLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAEAGVEDEEELRAALEqaEEYQELKEE 403
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1880 IEARLAQLRKASESELERQKGLVEDTLRQR-RQVEEEILALKGSFEKAAAGKAELELELGRIRGtaEDTLRSKEQAEQEA 1958
Cdd:COG4717 404 LEELEEQLEELLGELEELLEALDEEELEEElEELEEELEELEEELEELREELAELEAELEQLEE--DGELAELLQELEEL 481
|
....*
gi 1920237968 1959 ARQRQ 1963
Cdd:COG4717 482 KAELR 486
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1365-1588 |
4.12e-09 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 62.09 E-value: 4.12e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1365 AEAHAQAKAQAEREAQGLQRRMQEEVARREEvaveAQEQKRSIQEELQHLRQ---SSEAEIQAKARQVEAAERSRLRIEE 1441
Cdd:COG4942 15 AAAQADAAAEAEAELEQLQQEIAELEKELAA----LKKEEKALLKQLAALERriaALARRIRALEQELAALEAELAELEK 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1442 EIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQ 1521
Cdd:COG4942 91 EIAELRAELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELE 170
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 1522 RALQALEELRLQAEEAERRLRQAEAERARQVQV--ALETAQRSAEAELQSEHASFAEKTAQLERTLKEE 1588
Cdd:COG4942 171 AERAELEALLAELEEERAALEALKAERQKLLARleKELAELAAELAELQQEAEELEALIARLEAEAAAA 239
|
|
| CH_dFLNA-like_rpt2 |
cd21315 |
second calponin homology (CH) domain found in Drosophila melanogaster filamin-A (dFLNA) and ... |
159-261 |
4.35e-09 |
|
second calponin homology (CH) domain found in Drosophila melanogaster filamin-A (dFLNA) and similar proteins; Drosophila melanogaster filamin-A (dFLNA or dFLN-A), also called actin-binding protein 280 (ABP-280) or filamin-1, is involved in germline ring canal formation. It may tether actin microfilaments within the ovarian ring canal to the cell membrane and contributes to actin microfilament organization. dFLNA contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409164 Cd Length: 118 Bit Score: 57.10 E-value: 4.35e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 159 QSEDMTAKEKLLLWSQRMVEGcqgLRCDNFTTSWRDGRLFNAIIHRHKPTLI-DMNKVYRQTNLENLDQAFSVAERDLGV 237
Cdd:cd21315 11 DGKGPTPKQRLLGWIQSKVPD---LPITNFTNDWNDGKAIGALVDALAPGLCpDWEDWDPKDAVKNAKEAMDLAEDWLDV 87
|
90 100
....*....|....*....|....
gi 1920237968 238 TRLLDPEDVDVPQPDEKSIITYVS 261
Cdd:cd21315 88 PQLIKPEEMVNPKVDELSMMTYLS 111
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
4376-4414 |
4.51e-09 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 54.64 E-value: 4.51e-09
10 20 30
....*....|....*....|....*....|....*....
gi 1920237968 4376 FLEVQYLTGGLIEPDTPGRVALDEALQRGTVDARTAQKL 4414
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| PLEC |
smart00250 |
Plectin repeat; |
4298-4335 |
6.46e-09 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 54.03 E-value: 6.46e-09
10 20 30
....*....|....*....|....*....|....*...
gi 1920237968 4298 QRLLEAQACTGGIIDPSTGERFPVTDAVNKGLVDKIMV 4335
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1809-2342 |
6.57e-09 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 62.39 E-value: 6.57e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1809 AEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEdEAFQRRLLEEQAAQHKADIEARLAQLR 1888
Cdd:PRK03918 187 RTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKE-EIEELEKELESLEGSKRKLEEKIRELE 265
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1889 KASESELERQKGLVEDT--LRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAA 1966
Cdd:PRK03918 266 ERIEELKKEIEELEEKVkeLKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKK 345
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1967 EEERrrreaeerVQKSLAAEEEAArqrkaalEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAfA 2046
Cdd:PRK03918 346 KLKE--------LEKRLEELEERH-------ELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEIS-K 409
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2047 VQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEReaaqsRRQVEEAERLKQSAEEQAQAQAQAQAAAEK 2126
Cdd:PRK03918 410 ITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELL-----EEYTAELKRIEKELKEIEEKERKLRKELRE 484
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2127 LRKEAEQEAARraqaeqaaLRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKA--- 2203
Cdd:PRK03918 485 LEKVLKKESEL--------IKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEElkk 556
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2204 EVTEAARQRGQVEEELFSLRVQMEELG---------KLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEA-ARLS 2273
Cdd:PRK03918 557 KLAELEKKLDELEEELAELLKELEELGfesveeleeRLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAfEELA 636
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2274 VAAQEAARLR-QLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQM 2342
Cdd:PRK03918 637 ETEKRLEELRkELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEER 706
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1356-1580 |
6.76e-09 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 61.36 E-value: 6.76e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1356 AALEKQRQLAEAHAQAKAQAEREAQGLQRrmQEEVarREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAK---ARQVEAA 1432
Cdd:PRK09510 60 VVEQYNRQQQQQKSAKRAEEQRKKKEQQQ--AEEL--QQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKqaaLKQKQAE 135
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1433 ERsrlrieeeirvvrlQLEATERQRGGAEGELQALRARAEEAEAQ-KRQAQEEAerlrrQVQDETQRKRQAEAELALRVQ 1511
Cdd:PRK09510 136 EA--------------AAKAAAAAKAKAEAEAKRAAAAAKKAAAEaKKKAEAEA-----AKKAAAEAKKKAEAEAAAKAA 196
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 1512 AEAEAAREKQRALQAleelrlqAEEAErrlRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQ 1580
Cdd:PRK09510 197 AEAKKKAEAEAKKKA-------AAEAK---KKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKA 255
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
2153-2603 |
7.25e-09 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 62.29 E-value: 7.25e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2153 DAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKaevtEAARQRGQVEEELFSLRVQMEELGKL 2232
Cdd:TIGR00618 200 TLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKR----EAQEEQLKKQQLLKQLRARIEELRAQ 275
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2233 KARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEA-ARLSVAAQEAARLRQLA------EEDLAQQRALAEKMLKEKM 2305
Cdd:TIGR00618 276 EAVLEETQERINRARKAAPLAAHIKAVTQIEQQAQRIhTELQSKMRSRAKLLMKRaahvkqQSSIEEQRRLLQTLHSQEI 355
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2306 QAVQEATRLKAEAELLQQQKELAQeQARRLQEDKEQMAQQLaqetQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQ 2385
Cdd:TIGR00618 356 HIRDAHEVATSIREISCQQHTLTQ-HIHTLQQQKTTLTQKL----QSLCKELDILQREQATIDTRTSAFRDLQGQLAHAK 430
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2386 ARAEEDARRFRKQAEDIGER----------LYRTELATQEKVMLVQTLETQRQQSDR-----DAERLREAIAELEHEKDK 2450
Cdd:TIGR00618 431 KQQELQQRYAELCAAAITCTaqceklekihLQESAQSLKEREQQLQTKEQIHLQETRkkavvLARLLELQEEPCPLCGSC 510
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2451 LKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQM 2530
Cdd:TIGR00618 511 IHPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQ 590
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 2531 QQEKQQLAASMEEARRR------QHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEE 2603
Cdd:TIGR00618 591 NITVRLQDLTEKLSEAEdmlaceQHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRV 669
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1330-1550 |
7.37e-09 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 61.32 E-value: 7.37e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1330 LRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQE 1409
Cdd:COG4942 29 LEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAE 108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1410 ---ELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAE 1486
Cdd:COG4942 109 llrALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERA 188
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 1487 RLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERAR 1550
Cdd:COG4942 189 ALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAAGFAALK 252
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
1336-1550 |
9.02e-09 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 60.63 E-value: 9.02e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1336 EERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSiQEELQHLR 1415
Cdd:TIGR02794 49 AQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQK-QAEEAKAK 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1416 QSSEAEIQAKA-RQVEAAERSRLRIEEEirvvRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEE----AERLRR 1490
Cdd:TIGR02794 128 QAAEAKAKAEAeAERKAKEEAAKQAEEE----AKAKAAAEAKKKAEEAKKKAEAEAKAKAEAEAKAKAEEakakAEAAKA 203
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1491 QVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERAR 1550
Cdd:TIGR02794 204 KAAAEAAAKAEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEKQGGARGAAAGSEVDK 263
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
1084-2021 |
1.12e-08 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 61.89 E-value: 1.12e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1084 IRSTQEAEEVLRAHEEQLKEAQAvpatlpeleatkaALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQ--RHGERdv 1161
Cdd:COG3096 284 SERALELRRELFGARRQLAEEQY-------------RLVEMARELEELSARESDLEQDYQAASDHLNLVQTalRQQEK-- 348
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1162 eVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVReqlrqek 1241
Cdd:COG3096 349 -IERYQEDLEELTERLEEQEEVVEEAAEQLAEAEARLEAAEEEVDSLKSQLADYQQALDVQQTRAIQYQQAVQ------- 420
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1242 ALledierhgEKVEECQRFAKQYINAIKDYelqLVTYKAQLEpvaspakkpkvqsgseSIIQEYVDLRTRYSELSTLTSQ 1321
Cdd:COG3096 421 AL--------EKARALCGLPDLTPENAEDY---LAAFRAKEQ----------------QATEEVLELEQKLSVADAARRQ 473
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1322 YIRFIsETLRRMEEE-ERLAEQQRAEERER-------LAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARR 1393
Cdd:COG3096 474 FEKAY-ELVCKIAGEvERSQAWQTARELLRryrsqqaLAQRLQQLRAQLAELEQRLRQQQNAERLLEEFCQRIGQQLDAA 552
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1394 EEVAVEAQEQKRSIQEELQHLRQSSEAEIQakarqveaaersrlrieeeirvvrlqleaTERQRGGAEGELQALRARAEE 1473
Cdd:COG3096 553 EELEELLAELEAQLEELEEQAAEAVEQRSE-----------------------------LRQQLEQLRARIKELAARAPA 603
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1474 AeaqkRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAeeaeRRLRQAE-AERARQV 1552
Cdd:COG3096 604 W----LAAQDALERLREQSGEALADSQEVTAAMQQLLEREREATVERDELAARKQALESQI----ERLSQPGgAEDPRLL 675
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1553 Q----------------VALETAQR-------SAEAELQSEHASFAEKTAQLERTLkeEHVAVVQLREEATRRAQQQAEA 1609
Cdd:COG3096 676 AlaerlggvllseiyddVTLEDAPYfsalygpARHAIVVPDLSAVKEQLAGLEDCP--EDLYLIEGDPDSFDDSVFDAEE 753
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1610 ERARAEAERELERWQL-------------KANEALRLRLQAEEVAQQKSltqaeaekqkeeaerearrrgkaeEQAVRQR 1676
Cdd:COG3096 754 LEDAVVVKLSDRQWRYsrfpevplfgraaREKRLEELRAERDELAEQYA------------------------KASFDVQ 809
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1677 ELaeQELEKQ-RQLAEGTAQQRLAAEQElirlrAETEQGEQQRQLLEEELARL----QREAAAATQKRRELEAeLAKVRA 1751
Cdd:COG3096 810 KL--QRLHQAfSQFVGGHLAVAFAPDPE-----AELAALRQRRSELERELAQHraqeQQLRQQLDQLKEQLQL-LNKLLP 881
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1752 EMEVL----LASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRA--LAEEAKRQRQL---AEEDAVRQRAEAER 1822
Cdd:COG3096 882 QANLLadetLADRLEELREELDAAQEAQAFIQQHGKALAQLEPLVAVLQSdpEQFEQLQADYLqakEQQRRLKQQIFALS 961
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1823 VLAEKLAAISEAtrlktEAEIALKEKEAENERLR---RLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQK 1899
Cdd:COG3096 962 EVVQRRPHFSYE-----DAVGLLGENSDLNEKLRarlEQAEEARREAREQLRQAQAQYSQYNQVLASLKSSRDAKQQTLQ 1036
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1900 GLvedtlrQRRQVEEEILALKGSFEKAAAGKAELELELGRIRG--TAEDTLRSKEQAEQEAARQRqlaaeeerrrreaEE 1977
Cdd:COG3096 1037 EL------EQELEELGVQADAEAEERARIRRDELHEELSQNRSrrSQLEKQLTRCEAEMDSLQKR-------------LR 1097
|
970 980 990 1000
....*....|....*....|....*....|....*....|....*.
gi 1920237968 1978 RVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRL--RERAEQES 2021
Cdd:COG3096 1098 KAERDYKQEREQVVQAKAGWCAVLRLARDNDVERRLhrRELAYLSA 1143
|
|
| CH_FLNB_rpt2 |
cd21313 |
second calponin homology (CH) domain found in filamin-B (FLN-B) and similar proteins; ... |
159-266 |
1.31e-08 |
|
second calponin homology (CH) domain found in filamin-B (FLN-B) and similar proteins; Filamin-B (FLN-B) is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton. It may promote orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It anchors various transmembrane proteins to the actin cytoskeleton. FLN-B contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409162 Cd Length: 110 Bit Score: 55.48 E-value: 1.31e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 159 QSEDMTAKEKLLLWSQRMVEGcqgLRCDNFTTSWRDGRLFNAIIHRHKPTLI-DMNKVYRQTNLENLDQAFSVAERDLGV 237
Cdd:cd21313 3 DAKKQTPKQRLLGWIQNKIPY---LPITNFNQNWQDGKALGALVDSCAPGLCpDWESWDPQKPVDNAREAMQQADDWLGV 79
|
90 100
....*....|....*....|....*....
gi 1920237968 238 TRLLDPEDVDVPQPDEKSIITYVSSLYDA 266
Cdd:cd21313 80 PQVITPEEIIHPDVDEHSVMTYLSQFPKA 108
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
2146-2602 |
1.34e-08 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 61.47 E-value: 1.34e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2146 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEET-----DHQKSILDEELQRLKAEVTEAARQRGQVEEELF 2220
Cdd:COG4913 247 AREQIELLEPIRELAERYAAARERLAELEYLRAALRLWFAQRrlellEAELEELRAELARLEAELERLEARLDALREELD 326
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2221 SLRVQMEELG-----KLKARIEAENRALVLRDKDSAQrlLQEEAEKMK-QVAEEAARLSVAAQEAARLRQLAEEDLAQQR 2294
Cdd:COG4913 327 ELEAQIRGNGgdrleQLEREIERLERELEERERRRAR--LEALLAALGlPLPASAEEFAALRAEAAALLEALEEELEALE 404
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2295 ALAEKMLKEKMQAVQEATRLKAEAELLQQQK---ELAQEQARR-----LQEDKEQM------------------------ 2342
Cdd:COG4913 405 EALAEAEAALRDLRRELRELEAEIASLERRKsniPARLLALRDalaeaLGLDEAELpfvgelievrpeeerwrgaiervl 484
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2343 ---AQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLyRTELATQEKVML 2419
Cdd:COG4913 485 ggfALTLLVPPEHYAAALRWVNRLHLRGRLVYERVRTGLPDPERPRLDPDSLAGKLDFKPHPFRAWL-EAELGRRFDYVC 563
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2420 VQTLE-------------------TQRQQSDRDAERL--------REAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQ-- 2470
Cdd:COG4913 564 VDSPEelrrhpraitragqvkgngTRHEKDDRRRIRSryvlgfdnRAKLAALEAELAELEEELAEAEERLEALEAELDal 643
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2471 EQLLQETQALQQSFLSEKDsLLQRERCIEQEKAKLEQL--FQDEVAKAQALREEqqrqqqqmqqekqqLAASMEEARRRQ 2548
Cdd:COG4913 644 QERREALQRLAEYSWDEID-VASAEREIAELEAELERLdaSSDDLAALEEQLEE--------------LEAELEELEEEL 708
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 2549 HEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSE 2602
Cdd:COG4913 709 DELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDA 762
|
|
| COG3903 |
COG3903 |
Predicted ATPase [General function prediction only]; |
1367-1826 |
1.45e-08 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443109 [Multi-domain] Cd Length: 933 Bit Score: 61.57 E-value: 1.45e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1367 AHAQAKAQAEREAQGLQRRMQEEVARreeVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVV 1446
Cdd:COG3903 475 EYAAERLAEAGERAAARRRHADYYLA---LAERAAAELRGPDQLAWLARLDAEHDNLRAALRWALAHGDAELALRLAAAL 551
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1447 RLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQA 1526
Cdd:COG3903 552 APFWFLRGLLREGRRWLERALAAAGEAAAALAAAAALAAAAAAARAAAAAAAAAAAAAAAAAAAAAAAAAALLLLAALAA 631
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1527 LEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQ 1606
Cdd:COG3903 632 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAALAAAAAALAAAAAAAALAAAAAAAL 711
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1607 AEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQ 1686
Cdd:COG3903 712 AAAAAAAAAAAAAAALLAAAAAAALAAAAAAAALALAAAAAAAAAAAAAAALAAAAAAAALAALLLALAAAAAALAAAAA 791
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1687 RQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEE 1766
Cdd:COG3903 792 AAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAALAAALAAAAAAAAAAAAAAAAAAALAAALAAAAAAAAAAALA 871
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1767 SRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAE 1826
Cdd:COG3903 872 AAAAAAAAAAAALLAAAAAAAAAAAAAAAAAAALAAAAAAAAAAALAAAAAAAAAAAAAA 931
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1479-1999 |
1.64e-08 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 61.47 E-value: 1.64e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1479 RQAQEEAERLRRQVQD----ETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARqvqv 1554
Cdd:COG4913 238 ERAHEALEDAREQIELlepiRELAERYAAARERLAELEYLRAALRLWFAQRRLELLEAELEELRAELARLEAELER---- 313
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1555 aLETAQRSAEAELQSEHASFAEKTAQLERTLKEEhvavvqlreeatrraqqqaeaeraraeaereLERWQLKANEALRLR 1634
Cdd:COG4913 314 -LEARLDALREELDELEAQIRGNGGDRLEQLERE-------------------------------IERLERELEERERRR 361
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1635 LQAEEVAQQKSLTQAEAEKQKEeaerearrrgKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQG 1714
Cdd:COG4913 362 ARLEALLAALGLPLPASAEEFA----------ALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASL 431
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1715 EQQRQLLEEELARLQREAAAATQ-KRRELE--AELAKVRAE-------MEVLLASKAR---------------------- 1762
Cdd:COG4913 432 ERRKSNIPARLLALRDALAEALGlDEAELPfvGELIEVRPEeerwrgaIERVLGGFALtllvppehyaaalrwvnrlhlr 511
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1763 ------------AEEESRSTSEKS-KQRLEAEAGRFR-----ELAEEAARLRALAEEA-----------------KRQRQ 1807
Cdd:COG4913 512 grlvyervrtglPDPERPRLDPDSlAGKLDFKPHPFRawleaELGRRFDYVCVDSPEElrrhpraitragqvkgnGTRHE 591
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1808 LAEEDAVRQR----AEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEE-----QAAQHKA 1878
Cdd:COG4913 592 KDDRRRIRSRyvlgFDNRAKLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDeidvaSAEREIA 671
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1879 DIEARLAQLRKASeSELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRirgtAEDTLRSKEQAEQEA 1958
Cdd:COG4913 672 ELEAELERLDASS-DDLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDE----LQDRLEAAEDLARLE 746
|
570 580 590 600
....*....|....*....|....*....|....*....|.
gi 1920237968 1959 ARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEE 1999
Cdd:COG4913 747 LRALLEERFAAALGDAVERELRENLEERIDALRARLNRAEE 787
|
|
| CH_PARV_rpt2 |
cd21222 |
second calponin homology (CH) domain found in the parvin family; The parvin family includes ... |
35-149 |
2.04e-08 |
|
second calponin homology (CH) domain found in the parvin family; The parvin family includes alpha-parvin, beta-parvin, and gamma-parvin. Alpha-parvin, also called actopaxin, calponin-like integrin-linked kinase-binding protein (CH-ILKBP), or matrix-remodeling-associated protein 2, plays a role in sarcomere organization and in smooth muscle cell contraction. It is required for normal development of the embryonic cardiovascular system, and for normal septation of the heart outflow tract. Beta-parvin, also called affixin, is an adapter protein that plays a role in integrin signaling via ILK and in activation of the GTPases Cdc42 and Rac1 by guanine exchange factors, such as ARHGEF6. Both alpha-parvin and beta-parvin are involved in the reorganization of the actin cytoskeleton and the formation of lamellipodia, and both play roles in cell adhesion, cell spreading, establishment or maintenance of cell polarity, and cell migration. Gamma-parvin probably plays a role in the regulation of cell adhesion and cytoskeleton organization. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409071 Cd Length: 121 Bit Score: 55.29 E-value: 2.04e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 35 DEQDERDRVQKKTFTKWVNKHLikhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGDSLP----REKGRMRFHKLQNVQI 110
Cdd:cd21222 8 DEAPEKLAEVKELLLQFVNKHL-----AKLNIEVTDLATQFHDGVYLILLIGLLEGFFVPlheyHLTPSTDDEKLHNVKL 82
|
90 100 110
....*....|....*....|....*....|....*....
gi 1920237968 111 ALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHF 149
Cdd:cd21222 83 ALELMEDAGISTPKIRPEDIVNGDLKSILRVLYSLFSKY 121
|
|
| EntF |
COG1020 |
EntF, seryl-AMP synthase component of non-ribosomal peptide synthetase [Secondary metabolites ... |
1351-1763 |
2.05e-08 |
|
EntF, seryl-AMP synthase component of non-ribosomal peptide synthetase [Secondary metabolites biosynthesis, transport and catabolism];
Pssm-ID: 440643 [Multi-domain] Cd Length: 1329 Bit Score: 61.03 E-value: 2.05e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1351 LAEVEAALEKQRQLAEAHAQAkaqaeREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSE----------- 1419
Cdd:COG1020 885 LGEIEAALLQHPGVREAVVVA-----REDAPGDKRLVAYVVPEAGAAAAAALLRLALALLLPPYMVPAAvvlllplpltg 959
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1420 --------AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQ 1491
Cdd:COG1020 960 ngkldrlaLPAPAAAAAAAAAAPPAEEEEEEAALALLLLLVVVVGDDDFFFFGGGLGLLLLLALARAARLLLLLLLLLLL 1039
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1492 VQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEH 1571
Cdd:COG1020 1040 FLAAAAAAAAAAAAAAAAAAAAPLAAAAAPLPLPPLLLSLLALLLALLLLLALLALLALLLLLLLLLLLLALLLLLALLL 1119
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1572 ASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEA 1651
Cdd:COG1020 1120 ALLAALRARRAVRQEGPRLRLLVALAAALALAALLALLLAAAAAAAELLAAAALLLLLALLLLALLLLLLLLLLLLLLLL 1199
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1652 EKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQRE 1731
Cdd:COG1020 1200 LLLLLLLLLLLLLLLLLLLLLLLLLAAAAAALLALALLLALLALAALLALAALAALAAALLALALALLALALLLLALALL 1279
|
410 420 430
....*....|....*....|....*....|..
gi 1920237968 1732 AAAATQKRRELEAELAKVRAEMEVLLASKARA 1763
Cdd:COG1020 1280 LPALARARAARTARALALLLLLALLLLLALAL 1311
|
|
| CH_AtFIM_like_rpt3 |
cd21299 |
third calponin homology (CH) domain found in the Arabidopsis thaliana fimbrin family; The ... |
34-148 |
2.41e-08 |
|
third calponin homology (CH) domain found in the Arabidopsis thaliana fimbrin family; The Arabidopsis thaliana fimbrin (AtFIM) family includes Fimbrin-1, -2, -3, -4, and -5, which cross-link actin filaments (F-actin) in a calcium independent manner. They stabilize and prevent F-actin depolymerization mediated by profilin. They act as key regulators of actin cytoarchitecture, probably involved in cell cycle, cell division, cell elongation and cytoplasmic tractus. AtFIM5 is an actin bundling factor that is required for pollen germination and pollen tube growth. Members of this family contain four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409148 Cd Length: 114 Bit Score: 54.82 E-value: 2.41e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 34 QDEQDERdrvqkkTFTKWVNKhlikhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKG-----RMRFHKLQNV 108
Cdd:cd21299 1 ETSREER------CFRLWINS-------LGIDTYVNNVFEDVRDGWVLLEVLDKVSPGSVNWKHAnkppiKMPFKKVENC 67
|
90 100 110 120
....*....|....*....|....*....|....*....|
gi 1920237968 109 QIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILH 148
Cdd:cd21299 68 NQVVKIGKQLKFSLVNVAGNDIVQGNKKLILALLWQLMRY 107
|
|
| SPEC |
cd00176 |
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ... |
511-700 |
3.49e-08 |
|
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here
Pssm-ID: 238103 [Multi-domain] Cd Length: 213 Bit Score: 57.07 E-value: 3.49e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 511 LRYLQDLLAWVEENQRRLDSAEWGVDLPSVEAQLGSHRGLHQSVEEFRTKIERARTDEGQLSPATRGAY---RDCLGRLD 587
Cdd:cd00176 6 LRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAeeiQERLEELN 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 588 LQYAKLLSSSKARLRSLE---SLHGFVAAATKELMWLSDREEEEVGFDWSDRNTNMAAKKEGYSALMHELELKEKKIKEI 664
Cdd:cd00176 86 QRWEELRELAEERRQRLEealDLQQFFRDADDLEQWLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSL 165
|
170 180 190
....*....|....*....|....*....|....*..
gi 1920237968 665 QSTGDRLLREDHP-ARPTAESFQAALQTQWSWMLQLC 700
Cdd:cd00176 166 NELAEELLEEGHPdADEEIEEKLEELNERWEELLELA 202
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1089-1444 |
3.94e-08 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 60.07 E-value: 3.94e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1089 EAEEVLRAHEEQLKEAQAVPATL-PELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWR 1167
Cdd:TIGR02168 681 ELEEKIEELEEKIAELEKALAELrKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELE 760
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1168 ERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLrDAKQRQEQIQAVPLANSQAVREQLRQEKALLED- 1246
Cdd:TIGR02168 761 AEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREAL-DELRAELTLLNEEAANLRERLESLERRIAATERr 839
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1247 IERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEpvaspakkpKVQSGSESIIQEYVDLRTRYSELstltsqyirfi 1326
Cdd:TIGR02168 840 LEDLEEQIEELSEDIESLAAEIEELEELIEELESELE---------ALLNERASLEEALALLRSELEEL----------- 899
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1327 SETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQ------------------LAEAHAQAKAQAEREAQGLQRRMQE 1388
Cdd:TIGR02168 900 SEELRELESKRSELRRELEELREKLAQLELRLEGLEVridnlqerlseeysltleEAEALENKIEDDEEEARRRLKRLEN 979
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 1389 EVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAErsrlRIEEEIR 1444
Cdd:TIGR02168 980 KIKELGPVNLAAIEEYEELKERYDFLTAQKEDLTEAKETLEEAIE----EIDREAR 1031
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1679-1866 |
4.15e-08 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 59.05 E-value: 4.15e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1679 AEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAA--AATQKRRELEAELAKVRAEMEVL 1756
Cdd:PRK09510 77 AEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQkqAEEAAAKAAAAAKAKAEAEAKRA 156
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1757 LASKARAEEESrstseksKQRLEAEAgrfRELAEEAARLRALAEEAKrqrQLAEEDAVRQRAEAERVlAEKLAAISEATR 1836
Cdd:PRK09510 157 AAAAKKAAAEA-------KKKAEAEA---AKKAAAEAKKKAEAEAAA---KAAAEAKKKAEAEAKKK-AAAEAKKKAAAE 222
|
170 180 190
....*....|....*....|....*....|
gi 1920237968 1837 LKTEAEIALKEKEAENERLRRLAEDEAFQR 1866
Cdd:PRK09510 223 AKAAAAKAAAEAKAAAEKAAAAKAAEKAAA 252
|
|
| WEMBL |
pfam05701 |
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required ... |
1401-1941 |
4.65e-08 |
|
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required for the chloroplast avoidance response under high intensity blue light. This avoidance response consists in the relocation of chloroplasts on the anticlinal side of exposed cells. Acts in association with PMI2 to maintain the velocity of chloroplast photo-relocation movement via the regulation of cp-actin filaments. Thus several member-sequences are described as "myosin heavy chain-like".
Pssm-ID: 461718 [Multi-domain] Cd Length: 562 Bit Score: 59.27 E-value: 4.65e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1401 QEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEeirvVRLQLE--ATERQRGGAEGELQALRARaeeaEAQK 1478
Cdd:pfam05701 41 ELELEKVQEEIPEYKKQSEAAEAAKAQVLEELESTKRLIEE----LKLNLEraQTEEAQAKQDSELAKLRVE----EMEQ 112
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1479 RQAQEEAERLRRQVQDETQRKRQAEAELALrVQAE--------AEAAREKQRALQALEELRLQAEEAERRLRQAEAERAr 1550
Cdd:pfam05701 113 GIADEASVAAKAQLEVAKARHAAAVAELKS-VKEEleslrkeyASLVSERDIAIKRAEEAVSASKEIEKTVEELTIELI- 190
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1551 QVQVALETAQRS-AEAELQSEHASFA--EKTAQLERTLKEEHVAVVQLREEATRRAQQQAeaeraraeaerelerwQLKA 1627
Cdd:pfam05701 191 ATKESLESAHAAhLEAEEHRIGAALAreQDKLNWEKELKQAEEELQRLNQQLLSAKDLKS----------------KLET 254
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1628 NEALRLRLQAEEVAQQKSltqaeaekqkeEAEREARRRGKAEEQAVRQRE---LAEQELEKQRQLAEgtaqqRLAAEQEL 1704
Cdd:pfam05701 255 ASALLLDLKAELAAYMES-----------KLKEEADGEGNEKKTSTSIQAalaSAKKELEEVKANIE-----KAKDEVNC 318
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1705 IRLRAETEQGEQQRQllEEELARLQREAAAATQKRRELEAELAKVRAEMEVLlasKARAEEESRSTSEKSKQRLEAeagr 1784
Cdd:pfam05701 319 LRVAAASLRSELEKE--KAELASLRQREGMASIAVSSLEAELNRTKSEIALV---QAKEKEAREKMVELPKQLQQA---- 389
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1785 frelAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEA--ENERLRRLAEDE 1862
Cdd:pfam05701 390 ----AQEAEEAKSLAQAAREELRKAKEEAEQAKAAASTVESRLEAVLKEIEAAKASEKLALAAIKAlqESESSAESTNQE 465
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1863 AFQR----------------RLLEEQAaqhKADIEARLAQLRKASESELERQKGLvEDTLRQRRQVEEEILALKGSFEKA 1926
Cdd:pfam05701 466 DSPRgvtlsleeyyelskraHEAEELA---NKRVAEAVSQIEEAKESELRSLEKL-EEVNREMEERKEALKIALEKAEKA 541
|
570
....*....|....*
gi 1920237968 1927 AAGKAELELELGRIR 1941
Cdd:pfam05701 542 KEGKLAAEQELRKWR 556
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
2164-2604 |
4.76e-08 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 59.85 E-value: 4.76e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2164 EQALRQKAQVEQELTALRLQL-EETDHQKSILDEELQRLKAEVTEAARQRGQV---EEELFSLRVQMEELGKLKARIEAE 2239
Cdd:pfam12128 404 EARDRQLAVAEDDLQALESELrEQLEAGKLEFNEEEYRLKSRLGELKLRLNQAtatPELLLQLENFDERIERAREEQEAA 483
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2240 NRAlvlrdkdsaQRLLQEEAEKMKQVAEEAARlsvaaqeaaRLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAE 2319
Cdd:pfam12128 484 NAE---------VERLQSELRQARKRRDQASE---------ALRQASRRLEERQSALDELELQLFPQAGTLLHFLRKEAP 545
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2320 LLQQQ--KELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRA-------QARAEE 2390
Cdd:pfam12128 546 DWEQSigKVISPELLHRTDLDPEVWDGSVGGELNLYGVKLDLKRIDVPEWAASEEELRERLDKAEEAlqsarekQAAAEE 625
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2391 DARRFRKQAE--DIGERLYRTELaTQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTV 2468
Cdd:pfam12128 626 QLVQANGELEkaSREETFARTAL-KNARLDLRRLFDEKQSEKDKKNKALAERKDSANERLNSLEAQLKQLDKKHQAWLEE 704
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2469 RQEQLLQETQALQQSFL---SEKDSLLQR-----ERCIEQEKAKLEQLfQDEVAKAQALREEQQRQQQQMQQEKQQLAAS 2540
Cdd:pfam12128 705 QKEQKREARTEKQAYWQvveGALDAQLALlkaaiAARRSGAKAELKAL-ETWYKRDLASLGVDPDVIAKLKREIRTLERK 783
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237968 2541 MEEARRRQHEAEEGVRRQQE----ELQRLAQQQQQQEKLLAEENQRL-------RERLQHLEEERRAALARSEEI 2604
Cdd:pfam12128 784 IERIAVRRQEVLRYFDWYQEtwlqRRPRLATQLSNIERAISELQQQLarliadtKLRRAKLEMERKASEKQQVRL 858
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
2155-2603 |
5.16e-08 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 59.69 E-value: 5.16e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2155 EMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEE------ELFSLRVQMEE 2228
Cdd:PRK03918 232 ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEyiklseFYEEYLDELRE 311
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2229 LGKLKARIEAENRAL--VLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEdlaqqralAEKMLKEKmq 2306
Cdd:PRK03918 312 IEKRLSRLEEEINGIeeRIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEE--------LERLKKRL-- 381
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2307 AVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQ-----RQLEMSAEAERLRLRVAEM 2381
Cdd:PRK03918 382 TGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKcpvcgRELTEEHRKELLEEYTAEL 461
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2382 SRAQ---ARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKL-KQEAQL 2457
Cdd:PRK03918 462 KRIEkelKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLiKLKGEI 541
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2458 LQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRER-----CIEQEKAKLEQL--FQDEVAKAQALREEQQRQQQQM 2530
Cdd:PRK03918 542 KSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEelgfeSVEELEERLKELepFYNEYLELKDAEKELEREEKEL 621
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237968 2531 QQEKQQLAASMEEARRRQHEAEEgVRRQQEELQRlaqqqqqqeKLLAEENQRLRERLQHLEEERRAALARSEE 2603
Cdd:PRK03918 622 KKLEEELDKAFEELAETEKRLEE-LRKELEELEK---------KYSEEEYEELREEYLELSRELAGLRAELEE 684
|
|
| COG3903 |
COG3903 |
Predicted ATPase [General function prediction only]; |
1485-1944 |
6.26e-08 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443109 [Multi-domain] Cd Length: 933 Bit Score: 59.26 E-value: 6.26e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1485 AERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQAleelRLQAEEAERRLRQAEAERARQVQVALETAqrSAE 1564
Cdd:COG3903 478 AERLAEAGERAAARRRHADYYLALAERAAAELRGPDQLAWLA----RLDAEHDNLRAALRWALAHGDAELALRLA--AAL 551
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1565 AELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQK 1644
Cdd:COG3903 552 APFWFLRGLLREGRRWLERALAAAGEAAAALAAAAALAAAAAAARAAAAAAAAAAAAAAAAAAAAAAAAAALLLLAALAA 631
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1645 SLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEE 1724
Cdd:COG3903 632 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAALAAAAAALAAAAAAAALAAAAAAAL 711
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1725 LARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKR 1804
Cdd:COG3903 712 AAAAAAAAAAAAAAALLAAAAAAALAAAAAAAALALAAAAAAAAAAAAAAALAAAAAAAALAALLLALAAAAAALAAAAA 791
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1805 QRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARL 1884
Cdd:COG3903 792 AAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAALAAALAAAAAAAAAAAAAAAAAAALAAALAAAAAAAAAAALA 871
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1885 AQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTA 1944
Cdd:COG3903 872 AAAAAAAAAAAALLAAAAAAAAAAAAAAAAAAALAAAAAAAAAAALAAAAAAAAAAAAAA 931
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1642-1837 |
6.66e-08 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 58.28 E-value: 6.66e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1642 QQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLL 1721
Cdd:PRK09510 70 QQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKA 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1722 EEELARLQREAAAATQKRRELEAELAKVRAEMEvllASKARAEEESRSTSEKSKQRLEAEAgrfRELAEEAARLRALAEE 1801
Cdd:PRK09510 150 EAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAE---AKKKAEAEAAAKAAAEAKKKAEAEA---KKKAAAEAKKKAAAEA 223
|
170 180 190
....*....|....*....|....*....|....*.
gi 1920237968 1802 AKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRL 1837
Cdd:PRK09510 224 KAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEV 259
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
2196-2464 |
7.28e-08 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 59.16 E-value: 7.28e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2196 EELQRLKAEVTEAARQRGQVE------EELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEkmkQVAEEA 2269
Cdd:COG4913 235 DDLERAHEALEDAREQIELLEpirelaERYAAARERLAELEYLRAALRLWFAQRRLELLEAELEELRAELA---RLEAEL 311
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2270 ARLSVAAQEAARLRQLAEEDLAQQralaekmlkekmqAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQE 2349
Cdd:COG4913 312 ERLEARLDALREELDELEAQIRGN-------------GGDRLEQLEREIERLERELEERERRRARLEALLAALGLPLPAS 378
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2350 TQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQaedigerlyrtelatqekvmlVQTLETQRQQ 2429
Cdd:COG4913 379 AEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAE---------------------IASLERRKSN 437
|
250 260 270
....*....|....*....|....*....|....*.
gi 1920237968 2430 SDRDAERLREAIAE-LEHEKDKLKQEAQLLQLKSEE 2464
Cdd:COG4913 438 IPARLLALRDALAEaLGLDEAELPFVGELIEVRPEE 473
|
|
| CH_ASPM_rpt2 |
cd21224 |
second calponin homology (CH) domain found in abnormal spindle-like microcephaly-associated ... |
168-263 |
7.45e-08 |
|
second calponin homology (CH) domain found in abnormal spindle-like microcephaly-associated protein (ASPM) and similar proteins; ASPM, also called abnormal spindle protein homolog, or Asp homolog, is involved in mitotic spindle regulation and coordination of mitotic processes. It may also have a preferential role in regulating neurogenesis. Members of this family contain two copies of CH domain in the middle region. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409073 [Multi-domain] Cd Length: 138 Bit Score: 54.23 E-value: 7.45e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 168 KLLL-WSQrMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNL-----------------------EN 223
Cdd:cd21224 3 SLLLkWCQ-AVCAHYGVKVENFTVSFADGRALCYLIHHYLPSLLPLDAIRQPTTQtvdraqdeaedfwvaefspstgdSG 81
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 224 LDQAFSVAER-----------DLG-VTRLLDPEDVDVPQPDEKSIITYVSSL 263
Cdd:cd21224 82 LSSELLANEKrnfklvqqavaELGgVPALLRASDMSNTIPDEKVVILFLSYL 133
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1469-1920 |
8.41e-08 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 58.63 E-value: 8.41e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1469 ARAEEAEAQKRQAQEEAERLRRQVQD-ETQRKRQAEAELAL----RVQAEAEAAREKQRALQALEELRLQAEEAERRLRQ 1543
Cdd:COG4717 71 KELKELEEELKEAEEKEEEYAELQEElEELEEELEELEAELeelrEELEKLEKLLQLLPLYQELEALEAELAELPERLEE 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1544 AEAERARQVQV-----ALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEaer 1618
Cdd:COG4717 151 LEERLEELRELeeeleELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEE--- 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1619 elerwQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAvrqrELAEQELEKQRQLAEGTAQQRL 1698
Cdd:COG4717 228 -----ELEQLENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIA----GVLFLVLGLLALLFLLLAREKA 298
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1699 AAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRstsEKSKQRL 1778
Cdd:COG4717 299 SLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEEL---EQEIAAL 375
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1779 EAEAGrfrelAEEAARLRALAEEAKRQRQLAEEdavrqRAEAERVLAEKLAAISEATRLKTEAEiaLKEKEAENERLRRL 1858
Cdd:COG4717 376 LAEAG-----VEDEEELRAALEQAEEYQELKEE-----LEELEEQLEELLGELEELLEALDEEE--LEEELEELEEELEE 443
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 1859 AEDEafqrrllEEQAAQHKADIEARLAQLrkASESELERQKGLVEDTLRQRRQVEEEILALK 1920
Cdd:COG4717 444 LEEE-------LEELREELAELEAELEQL--EEDGELAELLQELEELKAELRELAEEWAALK 496
|
|
| YqiK |
COG2268 |
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown]; |
1420-1572 |
8.83e-08 |
|
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
Pssm-ID: 441869 [Multi-domain] Cd Length: 439 Bit Score: 57.96 E-value: 8.83e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1420 AEIQAKARQVEA-AERsrlriEEEIRVvrlqleaTERQRGGAEGELQALRARAE----EAEAQKRQAQEEAERlrrqvqd 1494
Cdd:COG2268 195 AEIIRDARIAEAeAER-----ETEIAI-------AQANREAEEAELEQEREIETariaEAEAELAKKKAEERR------- 255
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1495 ETQRKRqAEAELALRVQaEAEAAREKQRALQALE---ELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEH 1571
Cdd:COG2268 256 EAETAR-AEAEAAYEIA-EANAEREVQRQLEIAErerEIELQEKEAEREEAELEADVRKPAEAEKQAAEAEAEAEAEAIR 333
|
.
gi 1920237968 1572 A 1572
Cdd:COG2268 334 A 334
|
|
| MutS2 |
COG1193 |
dsDNA-specific endonuclease/ATPase MutS2 [Replication, recombination and repair]; |
1389-1554 |
9.37e-08 |
|
dsDNA-specific endonuclease/ATPase MutS2 [Replication, recombination and repair];
Pssm-ID: 440806 [Multi-domain] Cd Length: 784 Bit Score: 58.61 E-value: 9.37e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1389 EVARR----EEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRggaegel 1464
Cdd:COG1193 490 EIARRlglpEEIIERARELLGEESIDVEKLIEELERERRELEEEREEAERLREELEKLREELEEKLEELEEEK------- 562
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1465 QALRARA-EEAEAQKRQAQEEAERLRRQVQDEtqrkrqaeaelalrvQAEAEAAREKQRALQALEElRLQAEEAERRLRQ 1543
Cdd:COG1193 563 EEILEKArEEAEEILREARKEAEELIRELREA---------------QAEEEELKEARKKLEELKQ-ELEEKLEKPKKKA 626
|
170
....*....|.
gi 1920237968 1544 AEAERARQVQV 1554
Cdd:COG1193 627 KPAKPPEELKV 637
|
|
| PLEC |
smart00250 |
Plectin repeat; |
4029-4065 |
1.07e-07 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 50.56 E-value: 1.07e-07
10 20 30
....*....|....*....|....*....|....*..
gi 1920237968 4029 IRLLEAQIATGGIIDPEESHRLPVDVAYQRGLFDEEM 4065
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1036-1598 |
1.14e-07 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 58.54 E-value: 1.14e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1036 AEKVLALPEPSPAAPTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAHEEQLKEaqaVPATLPELE 1115
Cdd:PRK03918 203 EEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEE---LKKEIEELE 279
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1116 ATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLqqrhgerdvevERWRERvtllLERWQAVLAQTDVRQRELEQLG 1195
Cdd:PRK03918 280 EKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRL-----------SRLEEE----INGIEERIKELEEKEERLEELK 344
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1196 RQLRYYRESADPLGAWLR---DAKQRQEQIqavplansqavrEQLRQEKALL--EDIERHGEKVEECQRFAKQYINAIKD 1270
Cdd:PRK03918 345 KKLKELEKRLEELEERHElyeEAKAKKEEL------------ERLKKRLTGLtpEKLEKELEELEKAKEEIEEEISKITA 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1271 YELQLVTYKAQLEPVASPAKKPKVQS---GSESIIQEYVDLRTRYSElstltsqYIRFISETLRRMEEEERlaeqqraEE 1347
Cdd:PRK03918 413 RIGELKKEIKELKKAIEELKKAKGKCpvcGRELTEEHRKELLEEYTA-------ELKRIEKELKEIEEKER-------KL 478
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1348 RERLAEVEAALEKQRQLAEAHAQAKaqaereaqglQRRMQEEvaRREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKAR 1427
Cdd:PRK03918 479 RKELRELEKVLKKESELIKLKELAE----------QLKELEE--KLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKK 546
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1428 QVEAAERsrlrIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELA 1507
Cdd:PRK03918 547 ELEKLEE----LKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELK 622
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1508 lRVQAEAEAAREK-QRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERtLK 1586
Cdd:PRK03918 623 -KLEEELDKAFEElAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEK-LK 700
|
570
....*....|..
gi 1920237968 1587 EEHVAVVQLREE 1598
Cdd:PRK03918 701 EELEEREKAKKE 712
|
|
| TPH |
pfam13868 |
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ... |
1328-1588 |
1.21e-07 |
|
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.
Pssm-ID: 464007 [Multi-domain] Cd Length: 341 Bit Score: 57.24 E-value: 1.21e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1328 ETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSI 1407
Cdd:pfam13868 46 DEMMEEERERALEEEEEKEEERKEERKRYRQELEEQIEEREQKRQEEYEEKLQEREQMDEIVERIQEEDQAEAEEKLEKQ 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1408 QEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAER 1487
Cdd:pfam13868 126 RQLREEIDEFNEEQAEWKELEKEEEREEDERILEYLKEKAEREEEREAEREEIEEEKEREIARLRAQQEKAQDEKAERDE 205
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1488 LR-RQVQDETQRK-RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEA 1565
Cdd:pfam13868 206 LRaKLYQEEQERKeRQKEREEAEKKARQRQELQQAREEQIELKERRLAEEAEREEEEFERMLRKQAEDEEIEQEEAEKRR 285
|
250 260
....*....|....*....|...
gi 1920237968 1566 ELQSEHASFAEKTAQLERTLKEE 1588
Cdd:pfam13868 286 MKRLEHRRELEKQIEEREEQRAA 308
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1055-1503 |
1.28e-07 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 58.24 E-value: 1.28e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1055 ELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAHEEQLKEAQAVPATLPELEATKA-------ALKKLRAQ 1127
Cdd:COG4717 75 ELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAelaelpeRLEELEER 154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1128 AEAQQPVFDALRDELRGAQEVGERLQQrhgERDVEVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADP 1207
Cdd:COG4717 155 LEELRELEEELEELEAELAELQEELEE---LLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQ 231
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1208 LGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLEDIERHGEKVeecqrfakQYINAIKDYELQLVTYKAQLEPVAS 1287
Cdd:COG4717 232 LENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGV--------LFLVLGLLALLFLLLAREKASLGKE 303
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1288 PAKKPKVQSGSESIIQEYVDLRTRYSELSTLTSQYIRfiseTLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEA 1367
Cdd:COG4717 304 AEELQALPALEELEEEELEELLAALGLPPDLSPEELL----ELLDRIEELQELLREAEELEEELQLEELEQEIAALLAEA 379
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1368 HAQAKAQAEREAQGLQRRmQEEVARREEVAVEAQEQKRSIQEELQHLRQSS-EAEIQAKARQVEAAERSRLRIEEEIRVV 1446
Cdd:COG4717 380 GVEDEEELRAALEQAEEY-QELKEELEELEEQLEELLGELEELLEALDEEElEEELEELEEELEELEEELEELREELAEL 458
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237968 1447 RLQLEATERqrggaEGELQALRARAEEAEAQKRQAQEEAERLR------RQVQDETQRKRQAE 1503
Cdd:COG4717 459 EAELEQLEE-----DGELAELLQELEELKAELRELAEEWAALKlalellEEAREEYREERLPP 516
|
|
| CH_NAV3 |
cd21286 |
calponin homology (CH) domain found in neuron navigator 3; Neuron navigator 3 (NAV3), also ... |
46-145 |
1.32e-07 |
|
calponin homology (CH) domain found in neuron navigator 3; Neuron navigator 3 (NAV3), also called pore membrane and/or filament-interacting-like protein 1 (POMFIL1), Steerin-3 (STEERIN3), or Unc-53 homolog 3 (unc53H3), may regulate IL2 production by T-cells. It may be involved in neuron regeneration. NAV3 contains a single copy of the CH domain at the N-terminus. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409135 Cd Length: 105 Bit Score: 52.72 E-value: 1.32e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 46 KTFTKWVNKHLIKhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGD------SLPREKGRMrfhkLQNVQIALDYLRHRQ 119
Cdd:cd21286 3 KIYTDWANHYLAK---SGHKRLIKDLQQDIADGVLLAEIIQIIANEkvedinGCPRSQSQM----IENVDVCLSFLAARG 75
|
90 100
....*....|....*....|....*.
gi 1920237968 120 VKLVNIRNDDIADGNPKLTLGLIWTI 145
Cdd:cd21286 76 VNVQGLSAEEIRNGNLKAILGLFFSL 101
|
|
| PRK10246 |
PRK10246 |
exonuclease subunit SbcC; Provisional |
1336-1963 |
2.04e-07 |
|
exonuclease subunit SbcC; Provisional
Pssm-ID: 182330 [Multi-domain] Cd Length: 1047 Bit Score: 57.50 E-value: 2.04e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1336 EERLAEQQRAeeRERLAEVEAALEK-QRQLA------------------EAHAQAKAQAEREAQGLQRRMQEEVARREEV 1396
Cdd:PRK10246 253 DELQQEASRR--QQALQQALAAEEKaQPQLAalslaqparqlrphweriQEQSAALAHTRQQIEEVNTRLQSTMALRARI 330
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1397 AVEAQEQKRSIQEELQHLRQ-SSEAEIQAKARQVEAAERSRL----RIEEEIRVVRLQLEATERQRGGAEGELQALRARa 1471
Cdd:PRK10246 331 RHHAAKQSAELQAQQQSLNTwLAEHDRFRQWNNELAGWRAQFsqqtSDREQLRQWQQQLTHAEQKLNALPAITLTLTAD- 409
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1472 EEAEAQKRQAQEEAER-----LRRQVQDETQRKRQAEAelalrvqAEAEAAREKQRALQALEELRLQAEEAERRLRQAEA 1546
Cdd:PRK10246 410 EVAAALAQHAEQRPLRqrlvaLHGQIVPQQKRLAQLQV-------AIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVKT 482
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1547 ERARQVQVALETAQRsaeAELQS----------EHASFAEKTAqLERTlkeehvaVVQLREEATRRAQQQAeaeraraea 1616
Cdd:PRK10246 483 ICEQEARIKDLEAQR---AQLQAgqpcplcgstSHPAVEAYQA-LEPG-------VNQSRLDALEKEVKKL--------- 542
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1617 erelerwqlkANEALRLRLQAEEVAQQKSltqaeaekqkeeaerearrrgKAEEQAVRQRElAEQELEKQRQLAEGTAQQ 1696
Cdd:PRK10246 543 ----------GEEGAALRGQLDALTKQLQ---------------------RDESEAQSLRQ-EEQALTQQWQAVCASLNI 590
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1697 RLAAEQELIRLRAETEQGEQQRQLLEEELArLQREAAAATQKRRELEAELAKVRAEMEVLLASKA------RAEEESRST 1770
Cdd:PRK10246 591 TLQPQDDIQPWLDAQEEHERQLRLLSQRHE-LQGQIAAHNQQIIQYQQQIEQRQQQLLTALAGYAltlpqeDEEASWLAT 669
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1771 SEKSKQRLEAEAGRFRELAEEAARLRALAEeakrqrQLAEEDAVrqRAEAERVLAEKLAAISEATrLKTEAEIALKEKEA 1850
Cdd:PRK10246 670 RQQEAQSWQQRQNELTALQNRIQQLTPLLE------TLPQSDDL--PHSEETVALDNWRQVHEQC-LSLHSQLQTLQQQD 740
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1851 ENERLR---------------RLAEDEAFQRRLLEEQAAQhkadieaRLAQLRKASESELERQKGLVEdtlrQRRQVEEE 1915
Cdd:PRK10246 741 VLEAQRlqkaqaqfdtalqasVFDDQQAFLAALLDEETLT-------QLEQLKQNLENQRQQAQTLVT----QTAQALAQ 809
|
650 660 670 680 690
....*....|....*....|....*....|....*....|....*....|...
gi 1920237968 1916 ILALKGSFEKAAAGKAELELELGRIRGT-AEDTLRSKE---QAEQEA-ARQRQ 1963
Cdd:PRK10246 810 HQQHRPDGLDLTVTVEQIQQELAQLAQQlRENTTRQGEirqQLKQDAdNRQQQ 862
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1700-1900 |
2.06e-07 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 56.76 E-value: 2.06e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1700 AEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLE 1779
Cdd:COG3883 14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERAR 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1780 A---------------EAGRFRELAEEAARLRALAEEAKR---QRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEA 1841
Cdd:COG3883 94 AlyrsggsvsyldvllGSESFSDFLDRLSALSKIADADADlleELKADKAELEAKKAELEAKLAELEALKAELEAAKAEL 173
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 1842 EIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKG 1900
Cdd:COG3883 174 EAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAA 232
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1679-1931 |
2.32e-07 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 56.38 E-value: 2.32e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1679 AEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLA 1758
Cdd:COG3883 14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERAR 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1759 SKARAEE---------ESRSTSE--KSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEK 1827
Cdd:COG3883 94 ALYRSGGsvsyldvllGSESFSDflDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAEL 173
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1828 LAAISEATRLKteAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLR 1907
Cdd:COG3883 174 EAQQAEQEALL--AQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGA 251
|
250 260
....*....|....*....|....
gi 1920237968 1908 QRRQVEEEILALKGSFEKAAAGKA 1931
Cdd:COG3883 252 AGAAGAAAGSAGAAGAAAGAAGAG 275
|
|
| PRK05035 |
PRK05035 |
electron transport complex protein RnfC; Provisional |
1321-1582 |
2.63e-07 |
|
electron transport complex protein RnfC; Provisional
Pssm-ID: 235334 [Multi-domain] Cd Length: 695 Bit Score: 57.27 E-value: 2.63e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1321 QYIRFISETLRRMEEEERLAEQ--QRAEER-ERLAEVEAA-LEKQRQLAEAHAQAKAQA---------EREAQGLQRRMQ 1387
Cdd:PRK05035 429 QYYRQAKAEIRAIEQEKKKAEEakARFEARqARLEREKAArEARHKKAAEARAAKDKDAvaaalarvkAKKAAATQPIVI 508
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1388 EEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAA-ERSRLRIEEeirvvrlQLEATERQRGGAEGELQA 1466
Cdd:PRK05035 509 KAGARPDNSAVIAAREARKAQARARQAEKQAAAAADPKKAAVAAAiARAKAKKAA-------QQAANAEAEEEVDPKKAA 581
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1467 LRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAelalrvqaeAEAAREKQRALQALEELRLQAEEAERRLRQAEA 1546
Cdd:PRK05035 582 VAAAIARAKAKKAAQQAASAEPEEQVAEVDPKKAAVAA---------AIARAKAKKAEQQANAEPEEPVDPRKAAVAAAI 652
|
250 260 270
....*....|....*....|....*....|....*.
gi 1920237968 1547 ERARQVQVALETAQRSAEAELQSEHASFAEKTAQLE 1582
Cdd:PRK05035 653 ARAKARKAAQQQANAEPEEAEDPKKAAVAAAIARAK 688
|
|
| EmrA |
COG1566 |
Multidrug resistance efflux pump EmrA [Defense mechanisms]; |
1432-1552 |
2.75e-07 |
|
Multidrug resistance efflux pump EmrA [Defense mechanisms];
Pssm-ID: 441174 [Multi-domain] Cd Length: 331 Bit Score: 55.82 E-value: 2.75e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1432 AERSRLRIEEEIRVVRLQLEATERQRGgAEGELQALRARAEEAEAQKRQAQEEAERLR---------RQVQDETQRKR-Q 1501
Cdd:COG1566 81 LQAALAQAEAQLAAAEAQLARLEAELG-AEAEIAAAEAQLAAAQAQLDLAQRELERYQalykkgavsQQELDEARAALdA 159
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 1920237968 1502 AEAELAlRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQV 1552
Cdd:COG1566 160 AQAQLE-AAQAQLAQAQAGLREEEELAAAQAQVAQAEAALAQAELNLARTT 209
|
|
| YqiK |
COG2268 |
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown]; |
1686-1886 |
2.84e-07 |
|
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
Pssm-ID: 441869 [Multi-domain] Cd Length: 439 Bit Score: 56.42 E-value: 2.84e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1686 QRQLAEGTAQQRLAAEQelIRLRAETEQGEQQRqllEEELARLQREAAAATQKRRELEAELAKVraemevllaskaraEE 1765
Cdd:COG2268 191 RRKIAEIIRDARIAEAE--AERETEIAIAQANR---EAEEAELEQEREIETARIAEAEAELAKK--------------KA 251
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1766 ESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEA-----ERVLAEKLAAISEAtrlKTE 1840
Cdd:COG2268 252 EERREAETARAEAEAAYEIAEANAEREVQRQLEIAEREREIELQEKEAEREEAELeadvrKPAEAEKQAAEAEA---EAE 328
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 1920237968 1841 AEIALKEKEAENERLRRLAE-DEAFQRRLLEEQAAQHKADIEARLAQ 1886
Cdd:COG2268 329 AEAIRAKGLAEAEGKRALAEaWNKLGDAAILLMLIEKLPEIAEAAAK 375
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1715-1899 |
4.06e-07 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 55.97 E-value: 4.06e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1715 EQQRQLLEEELAR-LQREAAAATQKRRELEAELAKVRAEmevllasKARAEEESRSTSEKSKQRLEAEAGrfrelAEEAA 1793
Cdd:PRK09510 78 EEQRKKKEQQQAEeLQQKQAAEQERLKQLEKERLAAQEQ-------KKQAEEAAKQAALKQKQAEEAAAK-----AAAAA 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1794 RLRALAEeakrqrQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRrlAEDEAFQRrllEEQA 1873
Cdd:PRK09510 146 KAKAEAE------AKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKK--AEAEAKKK---AAAE 214
|
170 180
....*....|....*....|....*.
gi 1920237968 1874 AQHKADIEARLAQLRKASESELERQK 1899
Cdd:PRK09510 215 AKKKAAAEAKAAAAKAAAEAKAAAEK 240
|
|
| COG4995 |
COG4995 |
Uncharacterized conserved protein, contains CHAT domain [Function unknown]; |
2264-2717 |
4.17e-07 |
|
Uncharacterized conserved protein, contains CHAT domain [Function unknown];
Pssm-ID: 444019 [Multi-domain] Cd Length: 711 Bit Score: 56.52 E-value: 4.17e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2264 QVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMA 2343
Cdd:COG4995 9 LLAALLAALALALLALALLLLLAALAAAALLLLALLALLLALAAAAAAALAAAALALALLAAAALALLLLALALAALALA 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2344 QQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTL 2423
Cdd:COG4995 89 LLAAALALALAAAALAALALLAALLALAAAAALLALLAALALLALLAALAAALAAAAAAALAAALAAAAAAAAAAALLAL 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2424 ETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKA 2503
Cdd:COG4995 169 ALALAAAALALLALLLAALAAALAAAAAALALLLALLLLAALAAALAAALAALLLALLALAAALLALLLLALLALAAAAA 248
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2504 KLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRL 2583
Cdd:COG4995 249 ALAAAAAALLALAAALLLLAALAALAAAAAAAALAALALAAALALAAAALALALLLAAAAAAALAALALLLLAALLLLLA 328
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2584 RERLQHLEEERRAALARSEEIAPSRAAAARALPNGQDAADGPAAAAEPEHAFDGLRRKVPAQRLQEVGVLSAEELQQLAQ 2663
Cdd:COG4995 329 ALALLALLLLLAAAALLAAALAAALALAAALALALLAALLLLLAALLALLLEALLLLLLALLAALLLLAAALLALAAAQL 408
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*
gi 1920237968 2664 GRTTVAELAQREDVRHYLQGRSSIAGLL-LKPADEKLTIYAALRRQLLSPGTALI 2717
Cdd:COG4995 409 LRLLLAALALLLALAAYAAARLALLALIeYIILPDRLYAFVQLYQLLIAPIEAEL 463
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
1667-1884 |
4.28e-07 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 55.62 E-value: 4.28e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1667 KAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRreleAEL 1746
Cdd:TIGR02794 72 KLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAERKA----KEE 147
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1747 AKVRAEMEvllaSKARAEEESRSTSEKSKQRLEAEAgrfreLAEEAARLRALAEEAKRQRQLAEEDA---VRQRAEAERV 1823
Cdd:TIGR02794 148 AAKQAEEE----AKAKAAAEAKKKAEEAKKKAEAEA-----KAKAEAEAKAKAEEAKAKAEAAKAKAaaeAAAKAEAEAA 218
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 1824 LAEKLAAISEATRLKTEAEIAL-KEKEAENERLRRLAEDEAFQRRLleeqAAQHKADIEARL 1884
Cdd:TIGR02794 219 AAAAAEAERKADEAELGDIFGLaSGSNAEKQGGARGAAAGSEVDKY----AAIIQQAIQQNL 276
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1739-2445 |
4.46e-07 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 56.46 E-value: 4.46e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1739 RRELEAELAKVRAEMEVLLASKARAEEESRStsEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRA 1818
Cdd:COG4913 220 EPDTFEAADALVEHFDDLERAHEALEDAREQ--IELLEPIRELAERYAAARERLAELEYLRAALRLWFAQRRLELLEAEL 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1819 EAERvlaeklaaiSEATRLKTEAEIALKEKEAENERLRRLaedeafqrrllEEQAAQHKADIEARLAQLRKASESELERQ 1898
Cdd:COG4913 298 EELR---------AELARLEAELERLEARLDALREELDEL-----------EAQIRGNGGDRLEQLEREIERLERELEER 357
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1899 KGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQlaaeeerrrreaeer 1978
Cdd:COG4913 358 ERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELR--------------- 422
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1979 vqkSLAAEEEAARQRKAAL-EEVERLKAKVEEARRLRE------------RAEQES-------------------ARQLQ 2026
Cdd:COG4913 423 ---ELEAEIASLERRKSNIpARLLALRDALAEALGLDEaelpfvgelievRPEEERwrgaiervlggfaltllvpPEHYA 499
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2027 LAQEAAQkRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAAREraereaaqSRRQVEEAERL 2106
Cdd:COG4913 500 AALRWVN-RLHLRGRLVYERVRTGLPDPERPRLDPDSLAGKLDFKPHPFRAWLEAELGRRF--------DYVCVDSPEEL 570
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2107 KQSAEEQAQAQaqaqaaaekLRKeaeqeaarraqaEQAALRQKQAADAEMEKHkQFAEQALRQKAQVEQELTALRLQLEE 2186
Cdd:COG4913 571 RRHPRAITRAG---------QVK------------GNGTRHEKDDRRRIRSRY-VLGFDNRAKLAALEAELAELEEELAE 628
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2187 TDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQmEELGKLKARIEAenralvLRDKDSAQRLLQEEAEKMKQVA 2266
Cdd:COG4913 629 AEERLEALEAELDALQERREALQRLAEYSWDEIDVASAE-REIAELEAELER------LDASSDDLAALEEQLEELEAEL 701
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2267 EEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQ-QKELAQEQARRLQEDKEQMAQQ 2345
Cdd:COG4913 702 EELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAAlGDAVERELRENLEERIDALRAR 781
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2346 LAQETQGFQKTleterqrqleMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERL------YRTELATQEKVML 2419
Cdd:COG4913 782 LNRAEEELERA----------MRAFNREWPAETADLDADLESLPEYLALLDRLEEDGLPEYeerfkeLLNENSIEFVADL 851
|
730 740
....*....|....*....|....*.
gi 1920237968 2420 VQTLETQRQQSDRDAERLREAIAELE 2445
Cdd:COG4913 852 LSKLRRAIREIKERIDPLNDSLKRIP 877
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
2152-2352 |
4.84e-07 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 55.61 E-value: 4.84e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2152 ADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEE--- 2228
Cdd:COG3883 14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGErar 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2229 --------LGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKM 2300
Cdd:COG3883 94 alyrsggsVSYLDVLLGSESFSDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAEL 173
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 2301 LKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQG 2352
Cdd:COG3883 174 EAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAA 225
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1091-1753 |
5.25e-07 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 56.34 E-value: 5.25e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1091 EEVLRAHEEQLKE-----AQAVPATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVER 1165
Cdd:pfam01576 337 EEETRSHEAQLQEmrqkhTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQAKQDSEHKRKKLEGQLQE 416
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1166 WRERV-------TLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAK-QRQEQIQAvPLANSQAVReQL 1237
Cdd:pfam01576 417 LQARLseserqrAELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLESQLQDTQeLLQEETRQ-KLNLSTRLR-QL 494
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1238 RQEKALLEdiERHGEKVEECQRFAKQyinaIKDYELQLVTYKAQLEPVASPAK-----KPKVQSGSESIIQEYVDLRTRY 1312
Cdd:pfam01576 495 EDERNSLQ--EQLEEEEEAKRNVERQ----LSTLQAQLSDMKKKLEEDAGTLEaleegKKRLQRELEALTQQLEEKAAAY 568
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1313 SELS-------------TLTSQYIRFISETLRR-------MEEEERLAEQQRAEERERlAEVEAALEKQRQLAEAHA--- 1369
Cdd:pfam01576 569 DKLEktknrlqqelddlLVDLDHQRQLVSNLEKkqkkfdqMLAEEKAISARYAEERDR-AEAEAREKETRALSLARAlee 647
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1370 --QAKAQAEREAQGLQRRMQEEVARREEV------------AVEAQEQKRSIQEE-------------------LQHLRQ 1416
Cdd:pfam01576 648 alEAKEELERTNKQLRAEMEDLVSSKDDVgknvhelerskrALEQQVEEMKTQLEeledelqatedaklrlevnMQALKA 727
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1417 SSEAEIQAKArqvEAAERSRLRIEEEIRVVRLQLEATERQRGGA-------EGELQALRARAEEAEAQKRQAQEEAERLR 1489
Cdd:pfam01576 728 QFERDLQARD---EQGEEKRRQLVKQVRELEAELEDERKQRAQAvaakkklELDLKELEAQIDAANKGREEAVKQLKKLQ 804
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1490 RQVQDetqrkRQAEAELALRVQAEAEA-AREKQRALQALEELRLQAEE----AERRLRQAEAERAR-QVQVALETAQRSA 1563
Cdd:pfam01576 805 AQMKD-----LQRELEEARASRDEILAqSKESEKKLKNLEAELLQLQEdlaaSERARRQAQQERDElADEIASGASGKSA 879
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1564 eaeLQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAE-----AERARAEAERELERWQL-KANEALRLRLQA 1637
Cdd:pfam01576 880 ---LQDEKRRLEARIAQLEEELEEEQSNTELLNDRLRKSTLQVEQlttelAAERSTSQKSESARQQLeRQNKELKAKLQE 956
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1638 EEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQA-----VRQRELAEQEL----EKQRQLAEGTAQQRLAAEQELIRLR 1708
Cdd:pfam01576 957 MEGTVKSKFKSSIAALEAKIAQLEEQLEQESRERQaanklVRRTEKKLKEVllqvEDERRHADQYKDQAEKGNSRMKQLK 1036
|
730 740 750 760
....*....|....*....|....*....|....*....|....*
gi 1920237968 1709 AETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEM 1753
Cdd:pfam01576 1037 RQLEEAEEEASRANAARRKLQRELDDATESNESMNREVSTLKSKL 1081
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
2159-2462 |
5.31e-07 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 56.50 E-value: 5.31e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2159 HKQFAEQALRQKAQVEQ---ELTALRLQLEEtdhQKSILDEelqrLKAEVTEAARQRGQVEEELFSLRVQME-------- 2227
Cdd:PRK04863 336 HLNLVQTALRQQEKIERyqaDLEELEERLEE---QNEVVEE----ADEQQEENEARAEAAEEEVDELKSQLAdyqqaldv 408
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2228 ---------------------------ELGKLKARIEAenraLVLRDKDSAQRLLQEE-----AEKMKQVAEEAARL--- 2272
Cdd:PRK04863 409 qqtraiqyqqavqalerakqlcglpdlTADNAEDWLEE----FQAKEQEATEELLSLEqklsvAQAAHSQFEQAYQLvrk 484
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2273 ---SVAAQEAARLRQLAEEDLAQQRALAEKM---------LKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKE 2340
Cdd:PRK04863 485 iagEVSRSEAWDVARELLRRLREQRHLAEQLqqlrmrlseLEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEELE 564
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2341 QMAQQLAQEtqgfqktLETERQRQLEMSAEAERLRLRVAE-MSRAQA--RAEEDARRFRKQAEDigerlyrtELATQEKV 2417
Cdd:PRK04863 565 ARLESLSES-------VSEARERRMALRQQLEQLQARIQRlAARAPAwlAAQDALARLREQSGE--------EFEDSQDV 629
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 2418 M--LVQTLETQRQQSdRDAERLREAIAELEHEKDKLKQ-----EAQLLQLKS 2462
Cdd:PRK04863 630 TeyMQQLLERERELT-VERDELAARKQALDEEIERLSQpggseDPRLNALAE 680
|
|
| YqiK |
COG2268 |
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown]; |
1471-1588 |
6.07e-07 |
|
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
Pssm-ID: 441869 [Multi-domain] Cd Length: 439 Bit Score: 55.26 E-value: 6.07e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1471 AEEAEAQKRQAQEEAErLRRQVQDETQRKRQAEAELAlRVQAEAEAAREKQRAlQALEELRLQAEEAERRLRQA--EAER 1548
Cdd:COG2268 212 TEIAIAQANREAEEAE-LEQEREIETARIAEAEAELA-KKKAEERREAETARA-EAEAAYEIAEANAEREVQRQleIAER 288
|
90 100 110 120
....*....|....*....|....*....|....*....|
gi 1920237968 1549 ARQVQVALETAQRsAEAELQSEHASFAEktAQLERTLKEE 1588
Cdd:COG2268 289 EREIELQEKEAER-EEAELEADVRKPAE--AEKQAAEAEA 325
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
1466-1864 |
6.09e-07 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 56.11 E-value: 6.09e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1466 ALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAeaelalrvqaeAEAAREKQRALQAL---------EELRLQAEE 1536
Cdd:COG3096 829 AFAPDPEAELAALRQRRSELERELAQHRAQEQQLRQQ-----------LDQLKEQLQLLNKLlpqanlladETLADRLEE 897
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1537 AERRLRQAEAERA--RQVQVALETAQRSAEAeLQSEHASFAEktaqlertLKEEHVAVVQLREEATRRAQQQAEAERARA 1614
Cdd:COG3096 898 LREELDAAQEAQAfiQQHGKALAQLEPLVAV-LQSDPEQFEQ--------LQADYLQAKEQQRRLKQQIFALSEVVQRRP 968
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1615 EAERELERWQLKANEALRLRLQAeevaqqksltqaeaekqkeeaerearrrgkaeeqavrQRELAEQELEKQRQLAEGTA 1694
Cdd:COG3096 969 HFSYEDAVGLLGENSDLNEKLRA-------------------------------------RLEQAEEARREAREQLRQAQ 1011
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1695 QQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREA-----AAATQKRRELEAELAKVRAEmevllaskaraeeesRS 1769
Cdd:COG3096 1012 AQYSQYNQVLASLKSSRDAKQQTLQELEQELEELGVQAdaeaeERARIRRDELHEELSQNRSR---------------RS 1076
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1770 TSEKSKQRLEAE----AGRFRELAEEAARLRALAEEAK----RQRQLAEEDAVRQRAEAERVL---AEKLAAISEatrlk 1838
Cdd:COG3096 1077 QLEKQLTRCEAEmdslQKRLRKAERDYKQEREQVVQAKagwcAVLRLARDNDVERRLHRRELAylsADELRSMSD----- 1151
|
410 420
....*....|....*....|....*....
gi 1920237968 1839 tEAEIALKEKEAENERLR---RLAEDEAF 1864
Cdd:COG3096 1152 -KALGALRLAVADNEHLRdalRLSEDPRR 1179
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
2147-2588 |
6.27e-07 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 55.95 E-value: 6.27e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2147 RQKQAADAEMEKHKQFAEQALRqkaqvEQELTALRLQLEE--TDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRV 2224
Cdd:pfam01576 92 QQLQNEKKKMQQHIQDLEEQLD-----EEEAARQKLQLEKvtTEAKIKKLEEDILLLEDQNSKLSKERKLLEERISEFTS 166
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2225 QMEE-------LGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMK----------QVAEEAARLS-VAAQEAARLRQLA 2286
Cdd:pfam01576 167 NLAEeeekaksLSKLKNKHEAMISDLEERLKKEEKGRQELEKAKRKlegestdlqeQIAELQAQIAeLRAQLAKKEEELQ 246
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2287 E-----EDLAQQRALAEKMLKEKMQAVQEatrLKAEAELLQQQKELAQEQARRLQEDKEQMAQQL--AQETQGFQKTLET 2359
Cdd:pfam01576 247 AalarlEEETAQKNNALKKIRELEAQISE---LQEDLESERAARNKAEKQRRDLGEELEALKTELedTLDTTAAQQELRS 323
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2360 ERQRQLEMSAEAERLRLRVAEMSRAQARaeedaRRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLRE 2439
Cdd:pfam01576 324 KREQEVTELKKALEEETRSHEAQLQEMR-----QKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQ 398
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2440 AIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQL-------------------------------------LQETQALQQ 2482
Cdd:pfam01576 399 AKQDSEHKRKKLEGQLQELQARLSESERQRAELAeklsklqselesvssllneaegkniklskdvsslesqLQDTQELLQ 478
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2483 SFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQekqqLAASMEEARRRQHEAEEGVRRQQEEL 2562
Cdd:pfam01576 479 EETRQKLNLSTRLRQLEDERNSLQEQLEEEEEAKRNVERQLSTLQAQLSD----MKKKLEEDAGTLEALEEGKKRLQREL 554
|
490 500 510
....*....|....*....|....*....|
gi 1920237968 2563 ----QRLAQQQQQQEKLlaeenQRLRERLQ 2588
Cdd:pfam01576 555 ealtQQLEEKAAAYDKL-----EKTKNRLQ 579
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
2245-2682 |
7.61e-07 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 55.75 E-value: 7.61e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2245 LRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQ 2324
Cdd:TIGR00618 158 LKAKSKEKKELLMNLFPLDQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQ 237
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2325 KELAQEQARRLQEDKEQMA--QQLAQETQGFQKTLETERQRqLEMSAEAERLRLRVAEMSRAQARAEEdarrFRKQAEDI 2402
Cdd:TIGR00618 238 TQQSHAYLTQKREAQEEQLkkQQLLKQLRARIEELRAQEAV-LEETQERINRARKAAPLAAHIKAVTQ----IEQQAQRI 312
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2403 GERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKS---EEMQTVRQEQLLQETQA 2479
Cdd:TIGR00618 313 HTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCqqhTLTQHIHTLQQQKTTLT 392
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2480 LQQSFLSEKDSLLQRErciEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQ 2559
Cdd:TIGR00618 393 QKLQSLCKELDILQRE---QATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLK 469
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2560 EELQRLAQQQQQQEKlLAEENQRLRERLQHLEEERRAALARSEEIAPSRAAAARALPNGQDAADGPAAAAEPEHAFDGLR 2639
Cdd:TIGR00618 470 EREQQLQTKEQIHLQ-ETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVY 548
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|...
gi 1920237968 2640 RKVPA-----QRLQEVGVLSAEELQQLAQGRTTVAELAQR-----EDVRHYLQ 2682
Cdd:TIGR00618 549 HQLTSerkqrASLKEQMQEIQQSFSILTQCDNRSKEDIPNlqnitVRLQDLTE 601
|
|
| PLEC |
smart00250 |
Plectin repeat; |
2792-2828 |
8.07e-07 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 48.25 E-value: 8.07e-07
10 20 30
....*....|....*....|....*....|....*..
gi 1920237968 2792 IRLLEAQIATGGIIDPVHSHRLPVDVAYQRGYFDEEM 2828
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
2250-2466 |
9.16e-07 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 54.77 E-value: 9.16e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2250 SAQRLLQEEAEKMKQVAEEAARLSVAAQEAARlrqlAEEDLAQQRALAEKMLKEKMQAVQEatrLKAEAELLQQQKELAQ 2329
Cdd:COG4942 17 AQADAAAEAEAELEQLQQEIAELEKELAALKK----EEKALLKQLAALERRIAALARRIRA---LEQELAALEAELAELE 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2330 EQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRT 2409
Cdd:COG4942 90 KEIAELRAELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAEL 169
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237968 2410 ELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQ 2466
Cdd:COG4942 170 EAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELE 226
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1863-2352 |
9.17e-07 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 55.16 E-value: 9.17e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1863 AFQRRLLEEQAAQHKADIEARLAQLRKASESELERQkglvEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRg 1942
Cdd:COG4717 41 AFIRAMLLERLEKEADELFKPQGRKPELNLKELKEL----EEELKEAEEKEEEYAELQEELEELEEELEELEAELEELR- 115
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1943 tAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESA 2022
Cdd:COG4717 116 -EELEKLEKLLQLLPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEEL 194
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2023 RQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVL-ERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVE 2101
Cdd:COG4717 195 QDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEaAALEERLKEARLLLLIAAALLALLGLGGSLLSLILT 274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2102 EAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQA------LRQKAQVEQ 2175
Cdd:COG4717 275 IAGVLFLVLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELlelldrIEELQELLR 354
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2176 ELTALRLQLEETDHQKSIlDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEEL-GKLKARIEAENRALVLRDKDSAQRL 2254
Cdd:COG4717 355 EAEELEEELQLEELEQEI-AALLAEAGVEDEEELRAALEQAEEYQELKEELEELeEQLEELLGELEELLEALDEEELEEE 433
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2255 LQEEAEKMKQVAEEAARLSVAAQEA-ARLRQLAEEDLAQQRALAEKMLKEKMQ-AVQEATRLKAEAELLQQQKELAQEqa 2332
Cdd:COG4717 434 LEELEEELEELEEELEELREELAELeAELEQLEEDGELAELLQELEELKAELReLAEEWAALKLALELLEEAREEYRE-- 511
|
490 500
....*....|....*....|
gi 1920237968 2333 RRLQEDKEQMAQQLAQETQG 2352
Cdd:COG4717 512 ERLPPVLERASEYFSRLTDG 531
|
|
| DUF4670 |
pfam15709 |
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ... |
1468-1598 |
9.62e-07 |
|
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.
Pssm-ID: 464815 [Multi-domain] Cd Length: 522 Bit Score: 54.96 E-value: 9.62e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1468 RARAEEAE------AQKRQAQEEAerlRRQVQDETQRKRQAEAELALRVQAEAEAAREKQralQALEELRLQAEEAER-- 1539
Cdd:pfam15709 337 RLRAERAEmrrlevERKRREQEEQ---RRLQQEQLERAEKMREELELEQQRRFEEIRLRK---QRLEEERQRQEEEERkq 410
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 1540 -RLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQ----LERTLKEEHVAVVQLREE 1598
Cdd:pfam15709 411 rLQLQAAQERARQQQEEFRRKLQELQRKKQQEEAERAEAEKQrqkeLEMQLAEEQKRLMEMAEE 474
|
|
| TPH |
pfam13868 |
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ... |
1331-1599 |
1.01e-06 |
|
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.
Pssm-ID: 464007 [Multi-domain] Cd Length: 341 Bit Score: 54.15 E-value: 1.01e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1331 RRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVeaQEQKRSIQEE 1410
Cdd:pfam13868 36 AEEKEEERRLDEMMEEERERALEEEEEKEEERKEERKRYRQELEEQIEEREQKRQEEYEEKLQEREQM--DEIVERIQEE 113
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1411 LQHLRQSSEAEIQAKARQVEAAERSRLRI---------EEEIRVVRLQLEATERQRggAEGELQALRARAEEAEAQKRQA 1481
Cdd:pfam13868 114 DQAEAEEKLEKQRQLREEIDEFNEEQAEWkelekeeerEEDERILEYLKEKAEREE--EREAEREEIEEEKEREIARLRA 191
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1482 QEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQR 1561
Cdd:pfam13868 192 QQEKAQDEKAERDELRAKLYQEEQERKERQKEREEAEKKARQRQELQQAREEQIELKERRLAEEAEREEEEFERMLRKQA 271
|
250 260 270
....*....|....*....|....*....|....*...
gi 1920237968 1562 SAEAELQSEhasfAEKTAQLERTLKEEHVAVVQLREEA 1599
Cdd:pfam13868 272 EDEEIEQEE----AEKRRMKRLEHRRELEKQIEEREEQ 305
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
4132-4160 |
1.12e-06 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 47.71 E-value: 1.12e-06
10 20
....*....|....*....|....*....
gi 1920237968 4132 IVDPETGKEMSVYEAYRKGLIDHQTYLEL 4160
Cdd:pfam00681 11 IIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1389-2016 |
1.22e-06 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 55.07 E-value: 1.22e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1389 EVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQ----------LEATERQRG 1458
Cdd:PRK03918 169 EVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEvkeleelkeeIEELEKELE 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1459 GAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEaEAAREKQRALQALEELRLQAEEAE 1538
Cdd:PRK03918 249 SLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYE-EYLDELREIEKRLSRLEEEINGIE 327
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1539 RRLRQAEAERARQVQValetaqRSAEAELQSEHASFaEKTAQLERTLKEEHVAVVQLREEatrraqqqaeaeraraeaer 1618
Cdd:PRK03918 328 ERIKELEEKEERLEEL------KKKLKELEKRLEEL-EERHELYEEAKAKKEELERLKKR-------------------- 380
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1619 elerwqLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREarrrgkaeEQAVRQRELAEQELEKqrqlAEGTAQ--Q 1696
Cdd:PRK03918 381 ------LTGLTPEKLEKELEELEKAKEEIEEEISKITARIGEL--------KKEIKELKKAIEELKK----AKGKCPvcG 442
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1697 RLAAEQELIRLRAEteqgeqqrqlLEEELARLQREAAAATQKRRELEAELAKVraEMEVLLASKARAE----EESRSTSE 1772
Cdd:PRK03918 443 RELTEEHRKELLEE----------YTAELKRIEKELKEIEEKERKLRKELREL--EKVLKKESELIKLkelaEQLKELEE 510
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1773 KSK----QRLEAEAGRFRELAEEAARLRalaeeaKRQRQLAEEdaVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEK 1848
Cdd:PRK03918 511 KLKkynlEELEKKAEEYEKLKEKLIKLK------GEIKSLKKE--LEKLEELKKKLAELEKKLDELEEELAELLKELEEL 582
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1849 --EAENERLRRLAEDEAFQRRLLEEQAAQHkaDIEARLAQLRKAsESELERQKGLVEDTLRQRRQVEEEILALKGSF--- 1923
Cdd:PRK03918 583 gfESVEELEERLKELEPFYNEYLELKDAEK--ELEREEKELKKL-EEELDKAFEELAETEKRLEELRKELEELEKKYsee 659
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1924 --EKAAAGKAELELELGRIRGTAEDTLRSKEQAEqeaarqrqlaaeeerrrreaeervqKSLAAEEEAARQRKAALEEVE 2001
Cdd:PRK03918 660 eyEELREEYLELSRELAGLRAELEELEKRREEIK-------------------------KTLEKLKEELEEREKAKKELE 714
|
650
....*....|....*
gi 1920237968 2002 RLKAKVEEARRLRER 2016
Cdd:PRK03918 715 KLEKALERVEELREK 729
|
|
| Caldesmon |
pfam02029 |
Caldesmon; |
1667-2040 |
1.30e-06 |
|
Caldesmon;
Pssm-ID: 460421 [Multi-domain] Cd Length: 495 Bit Score: 54.49 E-value: 1.30e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1667 KAEEQAVRQRElaEQELEKQRQLAEGTAQQRLAAEQelIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAEL 1746
Cdd:pfam02029 14 RAREERRRQKE--EEEPSGQVTESVEPNEHNSYEED--SELKPSGQGGLDEEEAFLDRTAKREERRQKRLQEALERQKEF 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1747 AKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRElAEEAARLRALAEEAKRQ--RQLAEEDAVRQRAEAERVL 1824
Cdd:pfam02029 90 DPTIADEKESVAERKENNEEEENSSWEKEEKRDSRLGRYKE-EETEIREKEYQENKWSTevRQAEEEGEEEEDKSEEAEE 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1825 AEKLAAISEatrlKTEAEIALKEKEAENERLRRLAedeafQRRLLEEQAAQhkadiearlaqlrkASESELERQKGLVED 1904
Cdd:pfam02029 169 VPTENFAKE----EVKDEKIKKEKKVKYESKVFLD-----QKRGHPEVKSQ--------------NGEEEVTKLKVTTKR 225
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1905 TLRQRRQVEEEilalkgsfEKAAAGKAELELELGRIRGTAEDtlrsKEQAEQEAARQRQLAAEEERRRREAEERVQKSLA 1984
Cdd:pfam02029 226 RQGGLSQSQER--------EEEAEVFLEAEQKLEELRRRRQE----KESEEFEKLRQKQQEAELELEELKKKREERRKLL 293
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 1985 AEEEaaRQRKAalEEVERLKAKVEEARRLRERAEQESArqlqlaqEAAQKRLQAEE 2040
Cdd:pfam02029 294 EEEE--QRRKQ--EEAERKLREEEEKRRMKEEIERRRA-------EAAEKRQKLPE 338
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
2317-2603 |
1.55e-06 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 54.74 E-value: 1.55e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2317 EAELLQQQKELAQEQA----RRLQEDKEQMAQQ-LAQETQgfQKTLETERQRQLEMS-----AEAERLRLRVAEMSRAQA 2386
Cdd:pfam17380 267 ENEFLNQLLHIVQHQKavseRQQQEKFEKMEQErLRQEKE--EKAREVERRRKLEEAekarqAEMDRQAAIYAEQERMAM 344
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2387 RAEEDARRFRKQAEDI-GERLYRTELATQ-EKVMLVQTLETQRQQSDrdaERLREAIAELEHEKDKLKQEAQLLQLKSEE 2464
Cdd:pfam17380 345 ERERELERIRQEERKReLERIRQEEIAMEiSRMRELERLQMERQQKN---ERVRQELEAARKVKILEEERQRKIQQQKVE 421
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2465 MQTVRQEQLLQETQALQQsflsekdslLQRERCIEQEKAKLEQLfqdevakaqalreeqqRqqqqMQQEKQQLAASMEEA 2544
Cdd:pfam17380 422 MEQIRAEQEEARQREVRR---------LEEERAREMERVRLEEQ----------------E----RQQQVERLRQQEEER 472
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 2545 RRRQHEAEEGVRRQQ--EELQRLAQQQQQQEKLLAE-ENQRLRERLQHLEEERRAALARSEE 2603
Cdd:pfam17380 473 KRKKLELEKEKRDRKraEEQRRKILEKELEERKQAMiEEERKRKLLEKEMEERQKAIYEEER 534
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1335-1519 |
1.58e-06 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 54.04 E-value: 1.58e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1335 EEERLAEQQRAEERERLAEveAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEvaveaqEQKRSIQEELQhl 1414
Cdd:PRK09510 107 EKERLAAQEQKKQAEEAAK--QAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAA------EAKKKAEAEAA-- 176
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1415 rQSSEAEIQAKARQVEAAersrlrieeeirvvrlQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAerlrrqvqd 1494
Cdd:PRK09510 177 -KKAAAEAKKKAEAEAAA----------------KAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKA--------- 230
|
170 180
....*....|....*....|....*
gi 1920237968 1495 ETQRKRQAEAELALRVQAEAEAARE 1519
Cdd:PRK09510 231 AAEAKAAAEKAAAAKAAEKAAAAKA 255
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
2156-2380 |
1.70e-06 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 54.25 E-value: 1.70e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2156 MEKHKQFAEQALR----QKAQVEQELTALRLQLEE--TDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEEL 2229
Cdd:COG3206 166 LELRREEARKALEfleeQLPELRKELEEAEAALEEfrQKNGLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAAL 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2230 GKLKARIEAENRALVlrDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEkmqAVQ 2309
Cdd:COG3206 246 RAQLGSGPDALPELL--QSPVIQQLRAQLAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQQEAQRILAS---LEA 320
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237968 2310 EATRLKAEAELLQQQKELAQEQARRLQEdKEQMAQQLAQETQGFQKTLET--ERQRQLEMSAEAERLRLRVAE 2380
Cdd:COG3206 321 ELEALQAREASLQAQLAQLEARLAELPE-LEAELRRLEREVEVARELYESllQRLEEARLAEALTVGNVRVID 392
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3451-3487 |
1.93e-06 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 47.09 E-value: 1.93e-06
10 20 30
....*....|....*....|....*....|....*..
gi 1920237968 3451 IRLLEAQIATGGIIDPVHSHRVPVDVAYQRGYFDEEM 3487
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
2244-2597 |
1.95e-06 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 54.28 E-value: 1.95e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2244 VLRDKDSAQRLLQEEAEKmKQVAEEAARLSVAAQEAARLRQLAE------EDLAQQRALAEKMLKEKMQAVQEATRLKAE 2317
Cdd:PRK02224 181 VLSDQRGSLDQLKAQIEE-KEEKDLHERLNGLESELAELDEEIEryeeqrEQARETRDEADEVLEEHEERREELETLEAE 259
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2318 AELLQQQKELAQ---EQARRLQEDKEQMAQQLAQETQGFQKTLETER-------QRQLEMSAEAERLRLRVAEMSRAQAR 2387
Cdd:PRK02224 260 IEDLRETIAETErerEELAEEVRDLRERLEELEEERDDLLAEAGLDDadaeaveARREELEDRDEELRDRLEECRVAAQA 339
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2388 AEEDARRFRKQAEDIGERlyrtelaTQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQT 2467
Cdd:PRK02224 340 HNEEAESLREDADDLEER-------AEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAED 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2468 VRQEqllqetqalqqsFLSEKDSLLQRERCIEqekAKLEQLfQDEVAKAQALREE--------------QQRQQQQMQQE 2533
Cdd:PRK02224 413 FLEE------------LREERDELREREAELE---ATLRTA-RERVEEAEALLEAgkcpecgqpvegspHVETIEEDRER 476
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2534 KQQLAASMEEARRRQHEAEEGVRRQQE------ELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAA 2597
Cdd:PRK02224 477 VEELEAELEDLEEEVEEVEERLERAEDlveaedRIERLEERREDLEELIAERRETIEEKRERAEELRERA 546
|
|
| CH_PLS3_rpt3 |
cd21331 |
third calponin homology (CH) domain found in plastin-3; Plastin-3, also called T-plastin, is ... |
39-151 |
2.04e-06 |
|
third calponin homology (CH) domain found in plastin-3; Plastin-3, also called T-plastin, is an actin-bundling protein found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Plastin-3 contains four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409180 Cd Length: 134 Bit Score: 50.00 E-value: 2.04e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 39 ERDRVQKKTFTKWVNKHLIkhwraeaQRHISDLYEDLRDGHNLISLLEVL-------SGDSLPREKGRMRFHKLQNVQIA 111
Cdd:cd21331 18 EGETREERTFRNWMNSLGV-------NPHVNHLYGDLQDALVILQLYEKIkvpvdwnKVNKPPYPKLGANMKKLENCNYA 90
|
90 100 110 120
....*....|....*....|....*....|....*....|.
gi 1920237968 112 LDYLRHR-QVKLVNIRNDDIADGNPKLTLGLIWTIILHFQI 151
Cdd:cd21331 91 VELGKHPaKFSLVGIGGQDLNDGNPTLTLALVWQLMRRYTL 131
|
|
| DUF4659 |
pfam15558 |
Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins ... |
1331-1592 |
2.11e-06 |
|
Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins in this family are typically between 427 and 674 amino acids in length. There are two completely conserved residues (D and I) that may be functionally important.
Pssm-ID: 464768 [Multi-domain] Cd Length: 374 Bit Score: 53.50 E-value: 2.11e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1331 RRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVAR--REEVAVEAQEQKRSIQ 1408
Cdd:pfam15558 21 QRMRELQQQAALAWEELRRRDQKRQETLERERRLLLQQSQEQWQAEKEQRKARLGREERRRAdrREKQVIEKESRWREQA 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1409 EELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRvvrlqleaterqRGGAEGELQALRARAEEAEaQKRQAQEEAERL 1488
Cdd:pfam15558 101 EDQENQRQEKLERARQEAEQRKQCQEQRLKEKEEEL------------QALREQNSLQLQERLEEAC-HKRQLKEREEQK 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1489 RRQVQDETQRKRQAEAELALRVQAEAEAAREK----QRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAE 1564
Cdd:pfam15558 168 KVQENNLSELLNHQARKVLVDCQAKAEELLRRlsleQSLQRSQENYEQLVEERHRELREKAQKEEEQFQRAKWRAEEKEE 247
|
250 260
....*....|....*....|....*...
gi 1920237968 1565 AELQSEHASFAEKTAQLERTLKEEHVAV 1592
Cdd:pfam15558 248 ERQEHKEALAELADRKIQQARQVAHKTV 275
|
|
| TPH |
pfam13868 |
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ... |
2251-2578 |
2.24e-06 |
|
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.
Pssm-ID: 464007 [Multi-domain] Cd Length: 341 Bit Score: 53.00 E-value: 2.24e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2251 AQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQE 2330
Cdd:pfam13868 33 RIKAEEKEEERRLDEMMEEERERALEEEEEKEEERKEERKRYRQELEEQIEEREQKRQEEYEEKLQEREQMDEIVERIQE 112
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2331 QARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERlrlRVAEMSRAQARAEEDARRFRKQAEDIGERLYrte 2410
Cdd:pfam13868 113 EDQAEAEEKLEKQRQLREEIDEFNEEQAEWKELEKEEEREEDE---RILEYLKEKAEREEEREAEREEIEEEKEREI--- 186
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2411 latqeKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEaqllqlksEEMQTVRQEQLLQETQALQQSFLSEKDS 2490
Cdd:pfam13868 187 -----ARLRAQQEKAQDEKAERDELRAKLYQEEQERKERQKERE--------EAEKKARQRQELQQAREEQIELKERRLA 253
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2491 LLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQ 2570
Cdd:pfam13868 254 EEAEREEEEFERMLRKQAEDEEIEQEEAEKRRMKRLEHRRELEKQIEEREEQRAAEREEELEEGERLREEEAERRERIEE 333
|
....*...
gi 1920237968 2571 QQEKLLAE 2578
Cdd:pfam13868 334 ERQKKLKE 341
|
|
| DUF4670 |
pfam15709 |
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ... |
1666-1886 |
2.26e-06 |
|
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.
Pssm-ID: 464815 [Multi-domain] Cd Length: 522 Bit Score: 53.80 E-value: 2.26e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1666 GKAEEQAVRQRELAEQELEKQRQlAEGTAQQRLAAEQ-ELIRLRAETEQGEQ--QRQLLEEELARlqreaaaATQKRREL 1742
Cdd:pfam15709 307 GNMESEEERSEEDPSKALLEKRE-QEKASRDRLRAERaEMRRLEVERKRREQeeQRRLQQEQLER-------AEKMREEL 378
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1743 EAELAKVRAEMEvlLASKARAEEESRSTSEKSKQRLEaeagrfrelaEEAARLRALAEEAKRQRQLAEEDAVRQRAEAER 1822
Cdd:pfam15709 379 ELEQQRRFEEIR--LRKQRLEEERQRQEEEERKQRLQ----------LQAAQERARQQQEEFRRKLQELQRKKQQEEAER 446
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237968 1823 VLAEKLAAISEATRLKTEAEIALkeKEAENERLRRLAE-DEAFQRRLLEEQAAQHKADIEARLAQ 1886
Cdd:pfam15709 447 AEAEKQRQKELEMQLAEEQKRLM--EMAEEERLEYQRQkQEAEEKARLEAEERRQKEEEAARLAL 509
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1464-1738 |
2.35e-06 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 53.23 E-value: 2.35e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1464 LQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAlRVQAEAEAAREKQRALQA--------LEELRLQAE 1535
Cdd:COG4942 15 AAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLA-ALERRIAALARRIRALEQelaaleaeLAELEKEIA 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1536 EAERRLRQAEAERARQVQVALETAQRSAEAELQSehasfAEKTAQLERTLKEEHVAVVQLREEATRRAQqqaeaerarae 1615
Cdd:COG4942 94 ELRAELEAQKEELAELLRALYRLGRQPPLALLLS-----PEDFLDAVRRLQYLKYLAPARREQAEELRA----------- 157
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1616 aerelerwQLKANEALRLRLQAEEVAQQKSLtqaeaekqkeeaerearrrgkaEEQAVRQRELAEQELEKQRQLAEgtaq 1695
Cdd:COG4942 158 --------DLAELAALRAELEAERAELEALL----------------------AELEEERAALEALKAERQKLLAR---- 203
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 1920237968 1696 qrlaAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQK 1738
Cdd:COG4942 204 ----LEKELAELAAELAELQQEAEELEALIARLEAEAAAAAER 242
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
2156-2459 |
2.50e-06 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 53.87 E-value: 2.50e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2156 MEKHKQFAEQALRQKAQVEQELTAlrlQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKAR 2235
Cdd:TIGR04523 389 LESQINDLESKIQNQEKLNQQKDE---QIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRES 465
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2236 IEAENRALvlrdKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAarlRQLAEE--DLAQQRALaekmLKEKMQavqeatr 2313
Cdd:TIGR04523 466 LETQLKVL----SRSINKIKQNLEQKQKELKSKEKELKKLNEEK---KELEEKvkDLTKKISS----LKEKIE------- 527
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2314 lKAEAELLQQQKELAQeqarrLQEDKEQMAQQLAQETqgfqktLETERQRQLEmsaEAERLRLRVAEMSRAQARAEEDAR 2393
Cdd:TIGR04523 528 -KLESEKKEKESKISD-----LEDELNKDDFELKKEN------LEKEIDEKNK---EIEELKQTQKSLKKKQEEKQELID 592
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 2394 RFRKQAEDIGERLyrtelatQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQ 2459
Cdd:TIGR04523 593 QKEKEKKDLIKEI-------EEKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIK 651
|
|
| HCR |
pfam07111 |
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ... |
2168-2599 |
2.59e-06 |
|
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.
Pssm-ID: 284517 [Multi-domain] Cd Length: 749 Bit Score: 53.99 E-value: 2.59e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2168 RQKAQVEQELTALRLQLEETDHQ---KSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALV 2244
Cdd:pfam07111 190 KQLAEAQKEAELLRKQLSKTQEEleaQVTLVESLRKYVGEQVPPEVHSQTWELERQELLDTMQHLQEDRADLQATVELLQ 269
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2245 LRDKDSAQRLLQEEAEKMKQVAEEAarlSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAE-AELLQQ 2323
Cdd:pfam07111 270 VRVQSLTHMLALQEEELTRKIQPSD---SLEPEFPKKCRSLLNRWREKVFALMVQLKAQDLEHRDSVKQLRGQvAELQEQ 346
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2324 QKELAQEQA--RRLQEDKEQMAQQLAQETQGFQKTL----ETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRK 2397
Cdd:pfam07111 347 VTSQSQEQAilQRALQDKAAEVEVERMSAKGLQMELsraqEARRRQQQQTASAEEQLKFVVNAMSSTQIWLETTMTRVEQ 426
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2398 QAEDIGERLYRTELATQE----------KVMLVQTLETQRQQSDRDAERLREAIAELEH---EKDKLKQEAQL-LQLKSE 2463
Cdd:pfam07111 427 AVARIPSLSNRLSYAVRKvhtikglmarKVALAQLRQESCPPPPPAPPVDADLSLELEQlreERNRLDAELQLsAHLIQQ 506
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2464 EMQTVRQE-------------QLLQETQALQQSFLS---EKDSLLQRERCIEQEKAKLEQ-LFQDEVAKAQALREEQQRQ 2526
Cdd:pfam07111 507 EVGRAREQgeaerqqlsevaqQLEQELQRAQESLASvgqQLEVARQGQQESTEEAASLRQeLTQQQEIYGQALQEKVAEV 586
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237968 2527 QQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQ----EELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALA 2599
Cdd:pfam07111 587 ETRLREQLSDTKRRLNEARREQAKAVVSLRQIQhratQEKERNQELRRLQDEARKEEGQRLARRVQELERDKNLMLA 663
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
2195-2400 |
2.94e-06 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 52.91 E-value: 2.94e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2195 DEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVlRDKDSAQRLLQEEAEKMKQVAEEAARLSV 2274
Cdd:COG3883 15 DPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQ-AEIDKLQAEIAEAEAEIEERREELGERAR 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2275 AAQEAARLRQLAE--------EDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQL 2346
Cdd:COG3883 94 ALYRSGGSVSYLDvllgsesfSDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAEL 173
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 2347 AQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAE 2400
Cdd:COG3883 174 EAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAA 227
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
1398-2233 |
3.00e-06 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 53.90 E-value: 3.00e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1398 VEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAersrLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQ 1477
Cdd:TIGR00606 185 IKALETLRQVRQTQGQKVQEHQMELKYLKQYKEKA----CEIRDQITSKEAQLESSREIVKSYENELDPLKNRLKEIEHN 260
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1478 KRQAQEEAERLRRQVQDETQRKRQaEAELALRVQAEAEAAREKqraLQALEELR-LQAEEAERRLRQAEAERARQVQVAL 1556
Cdd:TIGR00606 261 LSKIMKLDNEIKALKSRKKQMEKD-NSELELKMEKVFQGTDEQ---LNDLYHNHqRTVREKERELVDCQRELEKLNKERR 336
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1557 ETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRL--R 1634
Cdd:TIGR00606 337 LLNQEKTELLVEQGRLQLQADRHQEHIRARDSLIQSLATRLELDGFERGPFSERQIKNFHTLVIERQEDEAKTAAQLcaD 416
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1635 LQAEEVAQQKSLTQAEAEKQKEEAEREArrrgKAEEQAVRQRELaeQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQG 1714
Cdd:TIGR00606 417 LQSKERLKQEQADEIRDEKKGLGRTIEL----KKEILEKKQEEL--KFVIKELQQLEGSSDRILELDQELRKAERELSKA 490
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1715 EQQR--QLLEEELARLQREAAAATQKRRELEAELAKV------RAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFR 1786
Cdd:TIGR00606 491 EKNSltETLKKEVKSLQNEKADLDRKLRKLDQEMEQLnhhtttRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLGYFP 570
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1787 ELAEEAARLRALAEEAKRQRQ-LAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEI----ALKEKEAENERLRRLAED 1861
Cdd:TIGR00606 571 NKKQLEDWLHSKSKEINQTRDrLAKLNKELASLEQNKNHINNELESKEEQLSSYEDKLfdvcGSQDEESDLERLKEEIEK 650
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1862 EAFQRRLLEEQAAQHKADIEAR---------LAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKaaagkaE 1932
Cdd:TIGR00606 651 SSKQRAMLAGATAVYSQFITQLtdenqsccpVCQRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEK------R 724
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1933 LELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRR--EAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEA 2010
Cdd:TIGR00606 725 RDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNdiEEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDV 804
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2011 RRLRERAEQESaRQLQLAQEAAQKRLQAEEKAHAF-AVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERA 2089
Cdd:TIGR00606 805 ERKIAQQAAKL-QGSDLDRTVQQVNQEKQEKQHELdTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRR 883
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2090 EREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKL----------RKEAEQEAARRAQAEQAALRQKQAADAEMEKH 2159
Cdd:TIGR00606 884 QQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDqqekeelissKETSNKKAQDKVNDIKEKVKNIHGYMKDIENK 963
|
810 820 830 840 850 860 870
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 2160 KQfaEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVtEAARQRGQVEEELFSLRVQMEELGKLK 2233
Cdd:TIGR00606 964 IQ--DGKDDYLKQKETELNTVNAQLEECEKHQEKINEDMRLMRQDI-DTQKIQERWLQDNLTLRKRENELKEVE 1034
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
1877-2595 |
3.02e-06 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 53.69 E-value: 3.02e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1877 KADIEARLAQlRKASESELERQKGLVEDTLRQRrQVEEEILALKGSFEKAAAG-----KAELELELGRIRGTAEDTLRSK 1951
Cdd:pfam12128 199 KSMIVAILED-DGVVPPKSRLNRQQVEHWIRDI-QAIAGIMKIRPEFTKLQQEfntleSAELRLSHLHFGYKSDETLIAS 276
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1952 EQAEQEA--ARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQ 2029
Cdd:pfam12128 277 RQEERQEtsAELNQLLRTLDDQWKEKRDELNGELSAADAAVAKDRSELEALEDQHGAFLDADIETAAADQEQLPSWQSEL 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2030 EAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQS 2109
Cdd:pfam12128 357 ENLEERLKALTGKHQDVTAKYNRRRSKIKEQNNRDIAGIKDKLAKIREARDRQLAVAEDDLQALESELREQLEAGKLEFN 436
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2110 AEEQAQaqaqaqaaaeKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDh 2189
Cdd:pfam12128 437 EEEYRL----------KSRLGELKLRLNQATATPELLLQLENFDERIERAREEQEAANAEVERLQSELRQARKRRDQAS- 505
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2190 qksildEELQRLKAEVTEAARQRGQVEEELFS--------LRVQM----EELGKLKARieaenrALVLR-------DKDS 2250
Cdd:pfam12128 506 ------EALRQASRRLEERQSALDELELQLFPqagtllhfLRKEApdweQSIGKVISP------ELLHRtdldpevWDGS 573
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2251 AQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQE 2330
Cdd:pfam12128 574 VGGELNLYGVKLDLKRIDVPEWAASEEELRERLDKAEEALQSAREKQAAAEEQLVQANGELEKASREETFARTALKNARL 653
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2331 QARRLQEDKEQMAQQLAQETQGFQKTLETERQrqlEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQA--EDIGERLYR 2408
Cdd:pfam12128 654 DLRRLFDEKQSEKDKKNKALAERKDSANERLN---SLEAQLKQLDKKHQAWLEEQKEQKREARTEKQAYwqVVEGALDAQ 730
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2409 TELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEqLLQETQALQQSFLSEK 2488
Cdd:pfam12128 731 LALLKAAIAARRSGAKAELKALETWYKRDLASLGVDPDVIAKLKREIRTLERKIERIAVRRQE-VLRYFDWYQETWLQRR 809
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2489 DSLLQRERCIEQEKAKLEQlfqdevakaqalreEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQ 2568
Cdd:pfam12128 810 PRLATQLSNIERAISELQQ--------------QLARLIADTKLRRAKLEMERKASEKQQVRLSENLRGLRCEMSKLATL 875
|
730 740
....*....|....*....|....*..
gi 1920237968 2569 QQQQEKllAEENQRLRERLQHLEEERR 2595
Cdd:pfam12128 876 KEDANS--EQAQGSIGERLAQLEDLKL 900
|
|
| EmrA |
COG1566 |
Multidrug resistance efflux pump EmrA [Defense mechanisms]; |
1361-1489 |
3.08e-06 |
|
Multidrug resistance efflux pump EmrA [Defense mechanisms];
Pssm-ID: 441174 [Multi-domain] Cd Length: 331 Bit Score: 52.74 E-value: 3.08e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1361 QRQLAEAHAQ-AKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKArQVEAAERSRLRI 1439
Cdd:COG1566 82 QAALAQAEAQlAAAEAQLARLEAELGAEAEIAAAEAQLAAAQAQLDLAQRELERYQALYKKGAVSQQ-ELDEARAALDAA 160
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1440 EEEIRVVRLQLEATERQRGGAEgELQALRARAEEAEAQKRQAQEEAERLR 1489
Cdd:COG1566 161 QAQLEAAQAQLAQAQAGLREEE-ELAAAQAQVAQAEAALAQAELNLARTT 209
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
1334-1499 |
3.21e-06 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 52.54 E-value: 3.21e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1334 EEEERLAEQQRAEERERLAEVEAAleKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQH 1413
Cdd:TIGR02794 101 EKAAKQAEQAAKQAEEKQKQAEEA--KAKQAAEAKAKAEAEAERKAKEEAAKQAEEEAKAKAAAEAKKKAEEAKKKAEAE 178
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1414 LRQSSEAEIQAKARQVEA-AERSRLRIEEEI----RVVRLQLEATERQRGGAEGELQALRARAEEAEAQK------RQAQ 1482
Cdd:TIGR02794 179 AKAKAEAEAKAKAEEAKAkAEAAKAKAAAEAaakaEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEKqggargAAAG 258
|
170
....*....|....*..
gi 1920237968 1483 EEAERLRRQVQDETQRK 1499
Cdd:TIGR02794 259 SEVDKYAAIIQQAIQQN 275
|
|
| PRK10929 |
PRK10929 |
putative mechanosensitive channel protein; Provisional |
2253-2501 |
3.26e-06 |
|
putative mechanosensitive channel protein; Provisional
Pssm-ID: 236798 [Multi-domain] Cd Length: 1109 Bit Score: 53.90 E-value: 3.26e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2253 RLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQ-EATRLKA---EAELLQQQKELA 2328
Cdd:PRK10929 123 RQAQQEQDRAREISDSLSQLPQQQTEARRQLNEIERRLQTLGTPNTPLAQAQLTALQaESAALKAlvdELELAQLSANNR 202
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2329 QEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAE-AERLRLRVAEMSRAQARA----EEDARRFRKQAEDIG 2403
Cdd:PRK10929 203 QELARLRSELAKKRSQQLDAYLQALRNQLNSQRQREAERALEsTELLAEQSGDLPKSIVAQfkinRELSQALNQQAQRMD 282
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2404 ERLYRTELATQEKVMLVQTLETQRQQ------SDRDAERLREAIAELEhEKDKLKQ----EAQLL--QLKSEEMQTvRQE 2471
Cdd:PRK10929 283 LIASQQRQAASQTLQVRQALNTLREQsqwlgvSNALGEALRAQVARLP-EMPKPQQldteMAQLRvqRLRYEDLLN-KQP 360
|
250 260 270
....*....|....*....|....*....|
gi 1920237968 2472 QLLQETQALQQSFLSEKDSLLQRERCIEQE 2501
Cdd:PRK10929 361 QLRQIRQADGQPLTAEQNRILDAQLRTQRE 390
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
1315-1570 |
3.47e-06 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 53.48 E-value: 3.47e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1315 LSTLTSQYIRFISEtLRRMEEEERLA--EQQRAEERERLAEVEAALEKQRQlaeahAQAKAQAEREAQGLQRRMQEEVAR 1392
Cdd:COG3206 154 ANALAEAYLEQNLE-LRREEARKALEflEEQLPELRKELEEAEAALEEFRQ-----KNGLVDLSEEAKLLLQQLSELESQ 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1393 ReevaVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRieEEIRVVRLQLEATERQRGGAEGELQALRARAE 1472
Cdd:COG3206 228 L----AEARAELAEAEARLAALRAQLGSGPDALPELLQSPVIQQLR--AQLAELEAELAELSARYTPNHPDVIALRAQIA 301
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1473 EAEAQKRQaqeEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAeaeRARQV 1552
Cdd:COG3206 302 ALRAQLQQ---EAQRILASLEAELEALQAREASLQAQLAQLEARLAELPELEAELRRLEREVEVARELYESL---LQRLE 375
|
250
....*....|....*...
gi 1920237968 1553 QVALETAQRSAEAELQSE 1570
Cdd:COG3206 376 EARLAEALTVGNVRVIDP 393
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
1766-2607 |
3.61e-06 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 53.58 E-value: 3.61e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1766 ESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIAL 1845
Cdd:pfam15921 81 EEYSHQVKDLQRRLNESNELHEKQKFYLRQSVIDLQTKLQEMQMERDAMADIRRRESQSQEDLRNQLQNTVHELEAAKCL 160
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1846 KEKEAENERlrrlAEDEAFQRRLLEEQAAQHkaDIEARLAQLRKASESELERQKGLVEDTLRQR--------RQVEEEIL 1917
Cdd:pfam15921 161 KEDMLEDSN----TQIEQLRKMMLSHEGVLQ--EIRSILVDFEEASGKKIYEHDSMSTMHFRSLgsaiskilRELDTEIS 234
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1918 ALKGSF----EKAAAGKAELELELGRIRGTAEDTLrskEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQR 1993
Cdd:pfam15921 235 YLKGRIfpveDQLEALKSESQNKIELLLQQHQDRI---EQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQARNQ 311
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1994 KAA----LEEVE----RLKAKVEEARRLRERAEQESARQLQLAQ----EAAQKRLQAEEKAHAFAVQQKEQELQQTLQQE 2061
Cdd:pfam15921 312 NSMymrqLSDLEstvsQLRSELREAKRMYEDKIEELEKQLVLANseltEARTERDQFSQESGNLDDQLQKLLADLHKREK 391
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2062 QSVLERlrseaeaarraaeeaeaareraereaAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQA 2141
Cdd:pfam15921 392 ELSLEK--------------------------EQNKRLWDRDTGNSITIDHLRRELDDRNMEVQRLEALLKAMKSECQGQ 445
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2142 EQAALRQKQAADAEMEKHKQFAEQALRQKA---QVEQELTALRLQLEETDHQKSILDEELQR----LKAEVTEAARQRGQ 2214
Cdd:pfam15921 446 MERQMAAIQGKNESLEKVSSLTAQLESTKEmlrKVVEELTAKKMTLESSERTVSDLTASLQEkeraIEATNAEITKLRSR 525
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2215 VE---EELFSLRVQMEELGKLKAriEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQ-EAArlrQLAEEDL 2290
Cdd:pfam15921 526 VDlklQELQHLKNEGDHLRNVQT--ECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQvEKA---QLEKEIN 600
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2291 AQQRALAE-KMLKEKMQAvqEATRLKAEAELLQQQK-ELAQEQARRLQEDKEqmaqqLAQETQGFQKTLETERQRQLEMS 2368
Cdd:pfam15921 601 DRRLELQEfKILKDKKDA--KIRELEARVSDLELEKvKLVNAGSERLRAVKD-----IKQERDQLLNEVKTSRNELNSLS 673
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2369 AEAERLRlrvaemsraqaraeedaRRFRKQAEDIGERLYRTELATQE-KVMLVQTLETQRQQSDRDAERLREA------I 2441
Cdd:pfam15921 674 EDYEVLK-----------------RNFRNKSEEMETTTNKLKMQLKSaQSELEQTRNTLKSMEGSDGHAMKVAmgmqkqI 736
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2442 AELEHEKDKLKQEAQLLQlksEEMQTVRQEQ-LLQEtqalqqsflsEKDSLLQRERCIEQEKAKLEQlfQDEVAKAQALR 2520
Cdd:pfam15921 737 TAKRGQIDALQSKIQFLE---EAMTNANKEKhFLKE----------EKNKLSQELSTVATEKNKMAG--ELEVLRSQERR 801
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2521 eeqqrqqqqMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLA----EENQRLRERLQhleeeRRA 2596
Cdd:pfam15921 802 ---------LKEKVANMEVALDKASLQFAECQDIIQRQEQESVRLKLQHTLDVKELQgpgyTSNSSMKPRLL-----QPA 867
|
890
....*....|.
gi 1920237968 2597 ALARSEEIAPS 2607
Cdd:pfam15921 868 SFTRTHSNVPS 878
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
1232-1964 |
3.72e-06 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 53.57 E-value: 3.72e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1232 AVREQLRQEKALLEDierhGEKVEECQRFAKQyinaikdyELQLVTYKAQLepvaspakkpKVQSGsesiIQEYVDLRTR 1311
Cdd:pfam05483 96 SIEAELKQKENKLQE----NRKIIEAQRKAIQ--------ELQFENEKVSL----------KLEEE----IQENKDLIKE 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1312 yselSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKqrqLAEAHAQAKAQAEREAQGLQRRMQEEva 1391
Cdd:pfam05483 150 ----NNATRHLCNLLKETCARSAEKTKKYEYEREETRQVYMDLNNNIEK---MILAFEELRVQAENARLEMHFKLKED-- 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1392 rreevaveaqeqkrsiQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARA 1471
Cdd:pfam05483 221 ----------------HEKIQHLEEEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKTKLQDENL 284
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1472 EEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELR----LQAEEAERRLRQAEaE 1547
Cdd:pfam05483 285 KELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKaahsFVVTEFEATTCSLE-E 363
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1548 RARQVQVALEtaqrSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATrraqqqaeaerarAEAERELERWQLKA 1627
Cdd:pfam05483 364 LLRTEQQRLE----KNEDQLKIITMELQKKSSELEEMTKFKNNKEVELEELKK-------------ILAEDEKLLDEKKQ 426
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1628 NEALRLRLQAEEvaQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQR-QLAEGTAQ--------QRL 1698
Cdd:pfam05483 427 FEKIAEELKGKE--QELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEKlKNIELTAHcdklllenKEL 504
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1699 AAEQE--LIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEA---ELAKVRAEMEVLL---ASKARAEEESRST 1770
Cdd:pfam05483 505 TQEASdmTLELKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDELESvreEFIQKGDEVKCKLdksEENARSIEYEVLK 584
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1771 SEKSKQRLEAEAGRFRELAEEAAR-LRALAEEAKRQRQlaEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKE 1849
Cdd:pfam05483 585 KEKQMKILENKCNNLKKQIENKNKnIEELHQENKALKK--KGSAENKQLNAYEIKVNKLELELASAKQKFEEIIDNYQKE 662
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1850 AENERL--RRLAEdEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRrqvEEEILALKGSFEKAA 1927
Cdd:pfam05483 663 IEDKKIseEKLLE-EVEKAKAIADEAVKLQKEIDKRCQHKIAEMVALMEKHKHQYDKIIEER---DSELGLYKNKEQEQS 738
|
730 740 750
....*....|....*....|....*....|....*..
gi 1920237968 1928 AGKAELELELGRIRGtaeDTLRSKEQAEQEAARQRQL 1964
Cdd:pfam05483 739 SAKAALEIELSNIKA---ELLSLKKQLEIEKEEKEKL 772
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
1672-2414 |
3.84e-06 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 53.42 E-value: 3.84e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1672 AVRQRELAEQELEKQRQLAeGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEE-------LARLQrEAAAATQKrrelea 1744
Cdd:COG3096 277 ANERRELSERALELRRELF-GARRQLAEEQYRLVEMARELEELSARESDLEQDyqaasdhLNLVQ-TALRQQEK------ 348
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1745 eLAKVRAEMEVLlasKARAEEESRSTSEKSKQRLEAEAgRFRELAEEAARLRAlaEEAKRQRQLaeeDAVRQRAEAERvl 1824
Cdd:COG3096 349 -IERYQEDLEEL---TERLEEQEEVVEEAAEQLAEAEA-RLEAAEEEVDSLKS--QLADYQQAL---DVQQTRAIQYQ-- 416
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1825 aEKLAAISEATRLKTEAEIALKEKEAENERLRRlAEDEAFQRRLleeQAAQHKADIEARLAQLRKAseseLERQKGLVED 1904
Cdd:COG3096 417 -QAVQALEKARALCGLPDLTPENAEDYLAAFRA-KEQQATEEVL---ELEQKLSVADAARRQFEKA----YELVCKIAGE 487
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1905 TLRQRR-QVEEEILALKGSFEKAAAGKAELELELGrirgtaedtlrskeQAEQEAARQRQLAAEEERRRREAEERVQKSL 1983
Cdd:COG3096 488 VERSQAwQTARELLRRYRSQQALAQRLQQLRAQLA--------------ELEQRLRQQQNAERLLEEFCQRIGQQLDAAE 553
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1984 AAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEA-----AQKRLQ--AEEKAHAFAVQQkeqelqQ 2056
Cdd:COG3096 554 ELEELLAELEAQLEELEEQAAEAVEQRSELRQQLEQLRARIKELAARApawlaAQDALErlREQSGEALADSQ------E 627
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2057 TLQQEQSVLERLRseaeaarraaeeaeaareraerEAAQSRRQVEEAERlkqsaeeqaqaqaqaqaaaeklrkeaeqeaa 2136
Cdd:COG3096 628 VTAAMQQLLERER----------------------EATVERDELAARKQ------------------------------- 654
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2137 rraQAEQAALRQKQAADAEMEKHKQFAEQ----------------------AL--------------------------- 2167
Cdd:COG3096 655 ---ALESQIERLSQPGGAEDPRLLALAERlggvllseiyddvtledapyfsALygparhaivvpdlsavkeqlagledcp 731
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2168 ---------------------------------RQ-------------KAQVEQELTALRLQLEETD---HQKSILDEEL 2198
Cdd:COG3096 732 edlyliegdpdsfddsvfdaeeledavvvklsdRQwrysrfpevplfgRAAREKRLEELRAERDELAeqyAKASFDVQKL 811
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2199 QRL--------------------KAEVTEAARQRGQVEEELFSLRVQM----EELGKLKARIEAENRAL----VLRDKDS 2250
Cdd:COG3096 812 QRLhqafsqfvgghlavafapdpEAELAALRQRRSELERELAQHRAQEqqlrQQLDQLKEQLQLLNKLLpqanLLADETL 891
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2251 AQRL--LQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAekmlKEKMQAVQEATRLKAEA---------- 2318
Cdd:COG3096 892 ADRLeeLREELDAAQEAQAFIQQHGKALAQLEPLVAVLQSDPEQFEQLQ----ADYLQAKEQQRRLKQQIfalsevvqrr 967
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2319 -------------------ELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQkTLETERQRQLEMSAEAERlrlRVA 2379
Cdd:COG3096 968 phfsyedavgllgensdlnEKLRARLEQAEEARREAREQLRQAQAQYSQYNQVLA-SLKSSRDAKQQTLQELEQ---ELE 1043
|
890 900 910
....*....|....*....|....*....|....*...
gi 1920237968 2380 EMS-RAQARAEEDA--RRFRKQAEDIGERLYRTELATQ 2414
Cdd:COG3096 1044 ELGvQADAEAEERAriRRDELHEELSQNRSRRSQLEKQ 1081
|
|
| CCDC22 |
pfam05667 |
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 ... |
2148-2333 |
3.96e-06 |
|
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 (CCDC22) is involved in regulation of NF-kappa-B signalling; the function may involve association with COMMD8 and a CUL1-dependent E3 ubiquitin ligase complex. It is part of the OMMD/CCDC22/CCDC93 (CCC) complex, which interacts with the multisubunit WASH complex required for endosomal deposition of F-actin and cargo trafficking in conjunction with the retromer. This entry also includes CCDC22 homologs from animals and plants.
Pssm-ID: 461708 [Multi-domain] Cd Length: 600 Bit Score: 53.11 E-value: 3.96e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2148 QKQAADAEMEKHKQFAEQALRQKAQvEQELTALRLQLEETDHQKSILDEELQRLKAEVTeaarqrgQVEEELFSLRVQME 2227
Cdd:pfam05667 309 TNEAPAATSSPPTKVETEEELQQQR-EEELEELQEQLEDLESSIQELEKEIKKLESSIK-------QVEEELEELKEQNE 380
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2228 ELGK---LKARI-----EAENRALVLrdkdsaQRLLQEEAEKMKQVAE--EAARLSVAAQEAARLRQLAEEDLAQQRALA 2297
Cdd:pfam05667 381 ELEKqykVKKKTldllpDAEENIAKL------QALVDASAQRLVELAGqwEKHRVPLIEEYRALKEAKSNKEDESQRKLE 454
|
170 180 190
....*....|....*....|....*....|....*....
gi 1920237968 2298 E-KMLKEKMQAVQEATRLKAE--AELLQQQKELAQEQAR 2333
Cdd:pfam05667 455 EiKELREKIKEVAEEAKQKEElyKQLVAEYERLPKDVSR 493
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
2148-2332 |
4.01e-06 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 52.50 E-value: 4.01e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2148 QKQAADAEMEKHKQFAEQA--LRQKAQVEQEltalRLQLEETDHQKSildeelQRLKAEVTEAARQRGQVEEELFSLRVQ 2225
Cdd:PRK09510 71 QKSAKRAEEQRKKKEQQQAeeLQQKQAAEQE----RLKQLEKERLAA------QEQKKQAEEAAKQAALKQKQAEEAAAK 140
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2226 MEELGKLKARIEAEN-RALVLRDKDSAQRLLQEEAEK-----MKQVAEEAARLSVAAQEAARLRQLAEE---DLAQQRAL 2296
Cdd:PRK09510 141 AAAAAKAKAEAEAKRaAAAAKKAAAEAKKKAEAEAAKkaaaeAKKKAEAEAAAKAAAEAKKKAEAEAKKkaaAEAKKKAA 220
|
170 180 190
....*....|....*....|....*....|....*.
gi 1920237968 2297 AEKmlKEKMQAVQEATRLKAEAELLQQQKELAQEQA 2332
Cdd:PRK09510 221 AEA--KAAAAKAAAEAKAAAEKAAAAKAAEKAAAAK 254
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
2196-2604 |
4.04e-06 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 53.51 E-value: 4.04e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2196 EELQRLKAEVteAARQRGQVEEELFSLRVQMEELGKLKARIEaENRALVLRDKDSAQRLLQEEAEKMKQ---VAEEAARL 2272
Cdd:PRK02224 187 GSLDQLKAQI--EEKEEKDLHERLNGLESELAELDEEIERYE-EQREQARETRDEADEVLEEHEERREEletLEAEIEDL 263
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2273 SVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLaqetQG 2352
Cdd:PRK02224 264 RETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAA----QA 339
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2353 FQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLyrtelatqekvmlvQTLETQRQQSDR 2432
Cdd:PRK02224 340 HNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEI--------------EELRERFGDAPV 405
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2433 DAERLREAIAELEHEKDKLKQEAQLLQ--LKSEEMQTVRQEQLLQE------TQALQQSflSEKDSLLQRERCIEQEKAK 2504
Cdd:PRK02224 406 DLGNAEDFLEELREERDELREREAELEatLRTARERVEEAEALLEAgkcpecGQPVEGS--PHVETIEEDRERVEELEAE 483
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2505 LEQL------FQDEVAKAQALREEQQR--QQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLL 2576
Cdd:PRK02224 484 LEDLeeeveeVEERLERAEDLVEAEDRieRLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEA 563
|
410 420 430
....*....|....*....|....*....|....
gi 1920237968 2577 AEENQRLRERLQHLEE------ERRAALARSEEI 2604
Cdd:PRK02224 564 EEEAEEAREEVAELNSklaelkERIESLERIRTL 597
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
1420-1599 |
4.07e-06 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 52.54 E-value: 4.07e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1420 AEIQAKARQVEAAERSRLRIEEEirvvrlQLEATERQRGGAEGELQALRARAEEAEAqKRQAQEEAERLRRQVQDETQRK 1499
Cdd:TIGR02794 53 NRIQQQKKPAAKKEQERQKKLEQ------QAEEAEKQRAAEQARQKELEQRAAAEKA-AKQAEQAAKQAEEKQKQAEEAK 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1500 RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTA 1579
Cdd:TIGR02794 126 AKQAAEAKAKAEAEAERKAKEEAAKQAEEEAKAKAAAEAKKKAEEAKKKAEAEAKAKAEAEAKAKAEEAKAKAEAAKAKA 205
|
170 180
....*....|....*....|
gi 1920237968 1580 QLERTLKEEHVAVVQLREEA 1599
Cdd:TIGR02794 206 AAEAAAKAEAEAAAAAAAEA 225
|
|
| hsdR |
PRK11448 |
type I restriction enzyme EcoKI subunit R; Provisional |
1351-1445 |
4.12e-06 |
|
type I restriction enzyme EcoKI subunit R; Provisional
Pssm-ID: 236912 [Multi-domain] Cd Length: 1123 Bit Score: 53.42 E-value: 4.12e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1351 LAEVEAALEKQRQLAEAHAQAKAQAEREAqglqRRMQEEVARREEVAVEAQEQKRSIQEELQHLR-QSSEAEIQAKARQV 1429
Cdd:PRK11448 144 LHALQQEVLTLKQQLELQAREKAQSQALA----EAQQQELVALEGLAAELEEKQQELEAQLEQLQeKAAETSQERKQKRK 219
|
90
....*....|....*....
gi 1920237968 1430 EAAERSRLRI---EEEIRV 1445
Cdd:PRK11448 220 EITDQAAKRLelsEEETRI 238
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3710-3745 |
4.31e-06 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 45.94 E-value: 4.31e-06
10 20 30
....*....|....*....|....*....|....*.
gi 1920237968 3710 RQLLEAQAATGFLLDPVKGERLAVDEAVRKGLVGPE 3745
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE 36
|
|
| DUF4659 |
pfam15558 |
Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins ... |
1328-1561 |
5.18e-06 |
|
Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins in this family are typically between 427 and 674 amino acids in length. There are two completely conserved residues (D and I) that may be functionally important.
Pssm-ID: 464768 [Multi-domain] Cd Length: 374 Bit Score: 52.35 E-value: 5.18e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1328 ETLRRMEEEERLAEQQRAEERERLA--EVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEvavEAQEQKR 1405
Cdd:pfam15558 51 ERRLLLQQSQEQWQAEKEQRKARLGreERRRADRREKQVIEKESRWREQAEDQENQRQEKLERARQEAEQ---RKQCQEQ 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1406 SIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEA------EAQKR 1479
Cdd:pfam15558 128 RLKEKEEELQALREQNSLQLQERLEEACHKRQLKEREEQKKVQENNLSELLNHQARKVLVDCQAKAEELlrrlslEQSLQ 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1480 QAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQ----RALQALEELRL-QAEEAERRLRQAEAERARQVQV 1554
Cdd:pfam15558 208 RSQENYEQLVEERHRELREKAQKEEEQFQRAKWRAEEKEEERqehkEALAELADRKIqQARQVAHKTVQDKAQRARELNL 287
|
....*..
gi 1920237968 1555 ALETAQR 1561
Cdd:pfam15558 288 EREKNHH 294
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
1369-1591 |
5.20e-06 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 52.16 E-value: 5.20e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1369 AQAKAQAEREAQGLQ---RRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKArqveaAERSRLRIEEEIRV 1445
Cdd:TIGR02794 46 GAVAQQANRIQQQKKpaaKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQ-----AEQAAKQAEEKQKQ 120
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1446 vrlQLEATERQRggaegelqALRARAEEAEAqKRQAQEEAerlRRQVQDETQRKRQAEAelalrvQAEAEAAREKQRAL- 1524
Cdd:TIGR02794 121 ---AEEAKAKQA--------AEAKAKAEAEA-ERKAKEEA---AKQAEEEAKAKAAAEA------KKKAEEAKKKAEAEa 179
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 1525 --QALEELRLQAEEAERRLRQAE----------AERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVA 1591
Cdd:TIGR02794 180 kaKAEAEAKAKAEEAKAKAEAAKakaaaeaaakAEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEKQGGARGAAAG 258
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3915-3952 |
5.46e-06 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 45.94 E-value: 5.46e-06
10 20 30
....*....|....*....|....*....|....*...
gi 1920237968 3915 QRFLEGTSSIAGVLVDATKERLSVYQAMKKGIIRPGTA 3952
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1666-1948 |
5.85e-06 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 52.21 E-value: 5.85e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1666 GKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAE 1745
Cdd:COG4372 23 GILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEE 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1746 LAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQ----LAEEDAVRQRAEAE 1821
Cdd:COG4372 103 LESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEelaaLEQELQALSEAEAE 182
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1822 RVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGL 1901
Cdd:COG4372 183 QALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEE 262
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 1920237968 1902 VEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTL 1948
Cdd:COG4372 263 LELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSL 309
|
|
| CH_FLNA_rpt2 |
cd21312 |
second calponin homology (CH) domain found in filamin-A (FLN-A) and similar proteins; ... |
159-266 |
5.85e-06 |
|
second calponin homology (CH) domain found in filamin-A (FLN-A) and similar proteins; Filamin-A (FLN-A) is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also anchors various transmembrane proteins to the actin cytoskeleton and serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-A contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409161 Cd Length: 114 Bit Score: 48.26 E-value: 5.85e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 159 QSEDMTAKEKLLLWSQRMVEGcqgLRCDNFTTSWRDGRLFNAIIHRHKPTLI-DMNKVYRQTNLENLDQAFSVAERDLGV 237
Cdd:cd21312 7 EAKKQTPKQRLLGWIQNKLPQ---LPITNFSRDWQSGRALGALVDSCAPGLCpDWDSWDASKPVTNAREAMQQADDWLGI 83
|
90 100
....*....|....*....|....*....
gi 1920237968 238 TRLLDPEDVDVPQPDEKSIITYVSSLYDA 266
Cdd:cd21312 84 PQVITPEEIVDPNVDEHSVMTYLSQFPKA 112
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1514-2045 |
7.42e-06 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 52.37 E-value: 7.42e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1514 AEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQV--ALETAQRSAEAELQS------EHASFAEKTAQLERTL 1585
Cdd:PRK03918 168 GEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREinEISSELPELREELEKlekevkELEELKEEIEELEKEL 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1586 KEEHVAVVQLRE---EATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREA 1662
Cdd:PRK03918 248 ESLEGSKRKLEEkirELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIE 327
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1663 RRRGKAEEQAVRQRELAEQELEKQRQLAE--------GTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELAR--LQREA 1732
Cdd:PRK03918 328 ERIKELEEKEERLEELKKKLKELEKRLEEleerhelyEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKeeIEEEI 407
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1733 AAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGR-FRELAEEAARLRALAEEAKRQ-RQLAE 1810
Cdd:PRK03918 408 SKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAeLKRIEKELKEIEEKERKLRKElRELEK 487
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1811 EDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERlRRLAEDEAFQRRLLEEqaAQHKADIEARLAQLRKA 1890
Cdd:PRK03918 488 VLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLK-EKLIKLKGEIKSLKKE--LEKLEELKKKLAELEKK 564
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1891 SEsELERQKGLVEDTLRQR-----RQVEEEILALKGSFEK---AAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQR 1962
Cdd:PRK03918 565 LD-ELEEELAELLKELEELgfesvEELEERLKELEPFYNEyleLKDAEKELEREEKELKKLEEELDKAFEELAETEKRLE 643
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1963 QLAAEEERRrreaeervqKSLAAEEEAARQRKAALE---EVERLKAKVEEARRLRERAEQ--ESARQLQLAQEAAQKRLQ 2037
Cdd:PRK03918 644 ELRKELEEL---------EKKYSEEEYEELREEYLElsrELAGLRAELEELEKRREEIKKtlEKLKEELEEREKAKKELE 714
|
....*...
gi 1920237968 2038 AEEKAHAF 2045
Cdd:PRK03918 715 KLEKALER 722
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
2146-2592 |
7.64e-06 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 52.42 E-value: 7.64e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2146 LRQKQAADAEMEKHKQFAEQALRQ-----KAQVEQELTA---LRLQLEETDHQKSI-LDEELQRLKAEVTEAARQRGQVE 2216
Cdd:pfam05483 160 LKETCARSAEKTKKYEYEREETRQvymdlNNNIEKMILAfeeLRVQAENARLEMHFkLKEDHEKIQHLEEEYKKEINDKE 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2217 EELFSLRVQM-EELGKLKARI----EAENRALVLRDKDSAQ-RLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDL 2290
Cdd:pfam05483 240 KQVSLLLIQItEKENKMKDLTflleESRDKANQLEEKTKLQdENLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDL 319
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2291 AQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQ--------KELAQEQARRLQEDKEQMaQQLAQETQgfQKTLETERQ 2362
Cdd:pfam05483 320 QIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEfeattcslEELLRTEQQRLEKNEDQL-KIITMELQ--KKSSELEEM 396
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2363 RQLEMSAEAERLRLRVAeMSRAQARAEEdarrfRKQAEDIGERLYRTElatQEKVMLVQTLETQRQQSDRDAERLREAIA 2442
Cdd:pfam05483 397 TKFKNNKEVELEELKKI-LAEDEKLLDE-----KKQFEKIAEELKGKE---QELIFLLQAREKEIHDLEIQLTAIKTSEE 467
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2443 ELEHEKDKLKQEAQLLQLKSEEMqTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQdevaKAQALREE 2522
Cdd:pfam05483 468 HYLKEVEDLKTELEKEKLKNIEL-TAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQEERMLK----QIENLEEK 542
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2523 QQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEE 2592
Cdd:pfam05483 543 EMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKCNNLKKQIENKNKNIEE 612
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
2174-2603 |
7.72e-06 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 52.48 E-value: 7.72e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2174 EQELTALRLQLEETDHQKSILDEElQRLKAEvtEAARQRGQVEeelfslRVQMEelGKLKariEAENRALVLRDKDSaqR 2253
Cdd:pfam01576 86 EEEERSQQLQNEKKKMQQHIQDLE-EQLDEE--EAARQKLQLE------KVTTE--AKIK---KLEEDILLLEDQNS--K 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2254 LLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLK-EKMQAVQEATRLKAEAELLQqqkelAQEQA 2332
Cdd:pfam01576 150 LSKERKLLEERISEFTSNLAEEEEKAKSLSKLKNKHEAMISDLEERLKKeEKGRQELEKAKRKLEGESTD-----LQEQI 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2333 RRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQA--RAEEDAR-RFRKQAEDIGERL--Y 2407
Cdd:pfam01576 225 AELQAQIAELRAQLAKKEEELQAALARLEEETAQKNNALKKIRELEAQISELQEdlESERAARnKAEKQRRDLGEELeaL 304
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2408 RTELA-TQEKVMLVQTLETQRQQsdrDAERLREAIaelehEKDKLKQEAQLLQLKSEEMQTVR--QEQLLQ--------- 2475
Cdd:pfam01576 305 KTELEdTLDTTAAQQELRSKREQ---EVTELKKAL-----EEETRSHEAQLQEMRQKHTQALEelTEQLEQakrnkanle 376
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2476 -ETQALQQSFL---SEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQalreEQQRQQQQMQQEKQQLAASMEEARRRQHEA 2551
Cdd:pfam01576 377 kAKQALESENAelqAELRTLQQAKQDSEHKRKKLEGQLQELQARLS----ESERQRAELAEKLSKLQSELESVSSLLNEA 452
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*
gi 1920237968 2552 EEGVRRQQEELQRLAQQQQQQEKLLAEENQR---LRERLQHLEEERRAALARSEE 2603
Cdd:pfam01576 453 EGKNIKLSKDVSSLESQLQDTQELLQEETRQklnLSTRLRQLEDERNSLQEQLEE 507
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1293-1507 |
8.02e-06 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 51.69 E-value: 8.02e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1293 KVQSGSESIIQEYVDLRTRYSELSTL---TSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEA-- 1367
Cdd:COG4942 45 ALKKEEKALLKQLAALERRIAALARRiraLEQELAALEAELAELEKEIAELRAELEAQKEELAELLRALYRLGRQPPLal 124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1368 --HAQAKAQAEREAQGLQRRMQeevARREEVaveaqEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIrv 1445
Cdd:COG4942 125 llSPEDFLDAVRRLQYLKYLAP---ARREQA-----EELRADLAELAALRAELEAERAELEALLAELEEERAALEALK-- 194
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 1446 vrlqleaTERQRggaegELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELA 1507
Cdd:COG4942 195 -------AERQK-----LLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTP 244
|
|
| PTZ00491 |
PTZ00491 |
major vault protein; Provisional |
2305-2450 |
8.90e-06 |
|
major vault protein; Provisional
Pssm-ID: 240439 [Multi-domain] Cd Length: 850 Bit Score: 52.33 E-value: 8.90e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2305 MQAVQEATRlkaeaELLQQQKELA-------QEQARRLQ-EDKEQMAQ-QLAQetQGFQKTLETERQRQLEMSAEAERLR 2375
Cdd:PTZ00491 638 VEPVDERTR-----DSLQKSVQLAieittksQEAAARHQaELLEQEARgRLER--QKMHDKAKAEEQRTKLLELQAESAA 710
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 2376 LRVAEMSRAQARAEEDARRFRKQAEdigerLYRTEL-ATQEKVMLVQTLETQRQQSDRDAERlREAIAELEHEKDK 2450
Cdd:PTZ00491 711 VESSGQSRAEALAEAEARLIEAEAE-----VEQAELrAKALRIEAEAELEKLRKRQELELEY-EQAQNELEIAKAK 780
|
|
| PRK10929 |
PRK10929 |
putative mechanosensitive channel protein; Provisional |
1216-1535 |
9.11e-06 |
|
putative mechanosensitive channel protein; Provisional
Pssm-ID: 236798 [Multi-domain] Cd Length: 1109 Bit Score: 52.36 E-value: 9.11e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1216 KQRQEQIQAVPLANSQAVREQLRQEKALLEDIERHGEKveecqrfAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQ 1295
Cdd:PRK10929 29 TQELEQAKAAKTPAQAEIVEALQSALNWLEERKGSLER-------AKQYQQVIDNFPKLSAELRQQLNNERDEPRSVPPN 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1296 SGSESIIQEYVDLRTRYSELSTLTSQ---YIRFISETLRRMEeeerlaeQQRAEERERLAEVEAALEKQRQLAEAHAQAK 1372
Cdd:PRK10929 102 MSTDALEQEILQVSSQLLEKSRQAQQeqdRAREISDSLSQLP-------QQQTEARRQLNEIERRLQTLGTPNTPLAQAQ 174
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1373 AQAereaqglqrrMQEEVARReevaveaqeqkRSIQEELQhLRQSSEAEIQAKAR-QVEAAERSRLRIEEEIRVVRLQLE 1451
Cdd:PRK10929 175 LTA----------LQAESAAL-----------KALVDELE-LAQLSANNRQELARlRSELAKKRSQQLDAYLQALRNQLN 232
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1452 aTERQRggaegelqalraRAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQ-------AEAEAAREKQRAL 1524
Cdd:PRK10929 233 -SQRQR------------EAERALESTELLAEQSGDLPKSIVAQFKINRELSQALNQQAQrmdliasQQRQAASQTLQVR 299
|
330
....*....|.
gi 1920237968 1525 QALEELRLQAE 1535
Cdd:PRK10929 300 QALNTLREQSQ 310
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3917-3955 |
9.75e-06 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 45.01 E-value: 9.75e-06
10 20 30
....*....|....*....|....*....|....*....
gi 1920237968 3917 FLEGTSSIAGVLVDATKERLSVYQAMKKGIIRPGTAFEL 3955
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
2263-2479 |
9.78e-06 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 51.00 E-value: 9.78e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2263 KQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQM 2342
Cdd:TIGR02794 46 GAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAK 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2343 AQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKvmlvQT 2422
Cdd:TIGR02794 126 AKQAAEAKAKAEAEAERKAKEEAAKQAEEEAKAKAAAEAKKKAEEAKKKAEAEAKAKAEAEAKAKAEEAKAKAE----AA 201
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237968 2423 LETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQA 2479
Cdd:TIGR02794 202 KAKAAAEAAAKAEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEKQGGARGAAAG 258
|
|
| PRK10246 |
PRK10246 |
exonuclease subunit SbcC; Provisional |
1343-2047 |
1.02e-05 |
|
exonuclease subunit SbcC; Provisional
Pssm-ID: 182330 [Multi-domain] Cd Length: 1047 Bit Score: 52.11 E-value: 1.02e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1343 QRAEERERLAEVEAALEKQRQLAEAHAQAKAQAER-EAQGlqrrmqeevarrEEVAVEAQEQKRSIQEELQHLRQsseae 1421
Cdd:PRK10246 168 ERAELLEELTGTEIYGQISAMVFEQHKSARTELEKlQAQA------------SGVALLTPEQVQSLTASLQVLTD----- 230
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1422 iqakarqveaaersrlriEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAerlrrqvQDETQRKRQ 1501
Cdd:PRK10246 231 ------------------EEKQLLTAQQQQQQSLNWLTRLDELQQEASRRQQALQQALAAEEKA-------QPQLAALSL 285
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1502 AEAELALRVQAEaeaarEKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQrsaeaELQSEHASFAEKTAql 1581
Cdd:PRK10246 286 AQPARQLRPHWE-----RIQEQSAALAHTRQQIEEVNTRLQSTMALRARIRHHAAKQSA-----ELQAQQQSLNTWLA-- 353
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1582 ertlkeEHVAVVQLREE-----ATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLtqaeaekqke 1656
Cdd:PRK10246 354 ------EHDRFRQWNNElagwrAQFSQQTSDREQLRQWQQQLTHAEQKLNALPAITLTLTADEVAAALAQ---------- 417
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1657 eaerearrrgKAEEQAVRQR--ELAEQELEKQRQLAEgTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLqREAAA 1734
Cdd:PRK10246 418 ----------HAEQRPLRQRlvALHGQIVPQQKRLAQ-LQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADV-KTICE 485
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1735 ATQKRRELEAELAKVRAEMEVLL--ASKARAEEESRS-TSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEE 1811
Cdd:PRK10246 486 QEARIKDLEAQRAQLQAGQPCPLcgSTSHPAVEAYQAlEPGVNQSRLDALEKEVKKLGEEGAALRGQLDALTKQLQRDES 565
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1812 DAvRQRAEAERVLAEKLAAISEA--TRLKTEAEIA--LKEKEAENERLRRLAEDEAFQRRLLE--EQAAQHKADIEARLA 1885
Cdd:PRK10246 566 EA-QSLRQEEQALTQQWQAVCASlnITLQPQDDIQpwLDAQEEHERQLRLLSQRHELQGQIAAhnQQIIQYQQQIEQRQQ 644
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1886 QLR----------------------KASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGT 1943
Cdd:PRK10246 645 QLLtalagyaltlpqedeeaswlatRQQEAQSWQQRQNELTALQNRIQQLTPLLETLPQSDDLPHSEETVALDNWRQVHE 724
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1944 AEDTLRSK-----EQAEQEAARQRQLAAEEERRRREAEERVQKSLAAE--EEAARQRKAALEevERLKAKVEEARRLRER 2016
Cdd:PRK10246 725 QCLSLHSQlqtlqQQDVLEAQRLQKAQAQFDTALQASVFDDQQAFLAAllDEETLTQLEQLK--QNLENQRQQAQTLVTQ 802
|
730 740 750
....*....|....*....|....*....|.
gi 1920237968 2017 AEQESARQLQLAQEAAQKRLQAEEKAHAFAV 2047
Cdd:PRK10246 803 TAQALAQHQQHRPDGLDLTVTVEQIQQELAQ 833
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
1808-2508 |
1.04e-05 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 51.97 E-value: 1.04e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1808 LAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQhkadiEARLAQL 1887
Cdd:TIGR00606 165 LSEGKALKQKFDEIFSATRYIKALETLRQVRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQ-----LESSREI 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1888 RKASESEL----ERQKGlVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQ 1963
Cdd:TIGR00606 240 VKSYENELdplkNRLKE-IEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLNDLYHNHQRTVREKE 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1964 laaeeerrrrEAEERVQKSLAAEEEAAR---QRKAALE-EVERLKAKVE---EARRLRERAEQESARQLQLAQEAAQKRL 2036
Cdd:TIGR00606 319 ----------RELVDCQRELEKLNKERRllnQEKTELLvEQGRLQLQADrhqEHIRARDSLIQSLATRLELDGFERGPFS 388
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2037 QAEEK-AHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQ 2115
Cdd:TIGR00606 389 ERQIKnFHTLVIERQEDEAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEG 468
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2116 AQAQAQAAAEKLRKEAE---------------QEAARRAQAEQAALRQKQAADAEME----------------KHKQFAE 2164
Cdd:TIGR00606 469 SSDRILELDQELRKAERelskaeknsltetlkKEVKSLQNEKADLDRKLRKLDQEMEqlnhhtttrtqmemltKDKMDKD 548
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2165 QALRQ-KAQVEQELTAL------RLQLEETDHQKS----ILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGklk 2233
Cdd:TIGR00606 549 EQIRKiKSRHSDELTSLlgyfpnKKQLEDWLHSKSkeinQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYE--- 625
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2234 ariEAENRALVLRDKDSAQRLLQEEAEKM-KQVAEEAARLSVAAQeaaRLRQLAEED-----LAQQRALAEKMLKEKMQA 2307
Cdd:TIGR00606 626 ---DKLFDVCGSQDEESDLERLKEEIEKSsKQRAMLAGATAVYSQ---FITQLTDENqsccpVCQRVFQTEAELQEFISD 699
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2308 VQEATRLkAEAELLQQQKELAQEQARRlqEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSR--AQ 2385
Cdd:TIGR00606 700 LQSKLRL-APDKLKSTESELKKKEKRR--DEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETllGT 776
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2386 ARAEEDARRFRKQAEDIGERLYRtELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEM 2465
Cdd:TIGR00606 777 IMPEEESAKVCLTDVTIMERFQM-ELKDVERKIAQQAAKLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQ 855
|
730 740 750 760
....*....|....*....|....*....|....*....|...
gi 1920237968 2466 QTVRQeQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQL 2508
Cdd:TIGR00606 856 QEQIQ-HLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEV 897
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1729-2050 |
1.13e-05 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 51.66 E-value: 1.13e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1729 QREAAAATQKRRELEAELAKVRAEMEvllaSKARAEEESRSTSEKSKQRlEAEAGRFRELAEEAARLralaeEAKRQRQL 1808
Cdd:pfam17380 281 QKAVSERQQQEKFEKMEQERLRQEKE----EKAREVERRRKLEEAEKAR-QAEMDRQAAIYAEQERM-----AMEREREL 350
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1809 AEEDAVRQRAEAERVLAEKLAAisEATRLKtEAEIALKEKEAENERLRRLAEdEAFQRRLLEEQAAQHKADIEARLAQLR 1888
Cdd:pfam17380 351 ERIRQEERKRELERIRQEEIAM--EISRMR-ELERLQMERQQKNERVRQELE-AARKVKILEEERQRKIQQQKVEMEQIR 426
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1889 KASESELERQ-KGLVEDTLRQRRQVEEEilalkgsfekaaagKAELELELGRIRGTAEDTLRSKEQAEQEaaRQRQLAAE 1967
Cdd:pfam17380 427 AEQEEARQREvRRLEEERAREMERVRLE--------------EQERQQQVERLRQQEEERKRKKLELEKE--KRDRKRAE 490
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1968 EERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARrlRERAEQESARQLQLAQE---AAQKRLQAEEKAHA 2044
Cdd:pfam17380 491 EQRRKILEKELEERKQAMIEEERKRKLLEKEMEERQKAIYEEER--RREAEEERRKQQEMEERrriQEQMRKATEERSRL 568
|
....*.
gi 1920237968 2045 FAVQQK 2050
Cdd:pfam17380 569 EAMERE 574
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
2164-2603 |
1.17e-05 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 51.99 E-value: 1.17e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2164 EQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAenral 2243
Cdd:PRK03918 161 ENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEE----- 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2244 vLRDKDSAQRLLQEEAEKMKQVAEEaarlsvaaqeaaRLRQLAEedlaqQRALAEKMLKEKMQAVQEATRLKAEAELLQQ 2323
Cdd:PRK03918 236 -LKEEIEELEKELESLEGSKRKLEE------------KIRELEE-----RIEELKKEIEELEEKVKELKELKEKAEEYIK 297
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2324 QKELAQEQARRLQEDKEQMA--QQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMsRAQARAEEDARRFRKQAED 2401
Cdd:PRK03918 298 LSEFYEEYLDELREIEKRLSrlEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEEL-EERHELYEEAKAKKEELER 376
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2402 IGERLyrTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKqeAQLLQLKS---------EEMQTVRQEQ 2472
Cdd:PRK03918 377 LKKRL--TGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELK--KAIEELKKakgkcpvcgRELTEEHRKE 452
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2473 LLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEvAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAE 2552
Cdd:PRK03918 453 LLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKE-SELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLK 531
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|.
gi 1920237968 2553 EGVRRQQEELQRLAqQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEE 2603
Cdd:PRK03918 532 EKLIKLKGEIKSLK-KELEKLEELKKKLAELEKKLDELEEELAELLKELEE 581
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
2426-2615 |
1.34e-05 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 50.92 E-value: 1.34e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2426 QRQQSDRDAERLREAIAELEHEKDKLKQEAQLL--QLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKA 2503
Cdd:COG4942 21 AAAEAEAELEQLQQEIAELEKELAALKKEEKALlkQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELE 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2504 KLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASME------EARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLA 2577
Cdd:COG4942 101 AQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQylkylaPARREQAEELRADLAELAALRAELEAERAELEALL 180
|
170 180 190
....*....|....*....|....*....|....*...
gi 1920237968 2578 EENQRLRERLQHLEEERRAALARSEEIAPSRAAAARAL 2615
Cdd:COG4942 181 AELEEERAALEALKAERQKLLARLEKELAELAAELAEL 218
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
2314-2600 |
1.38e-05 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 51.88 E-value: 1.38e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2314 LKAEAELLQQQKELAQEQARRlqedkEQMAQQLAQETQGfQKTLETERQrqlemsAEAERLRLRVAEMsraqaRAEEDAR 2393
Cdd:COG3096 288 LELRRELFGARRQLAEEQYRL-----VEMARELEELSAR-ESDLEQDYQ------AASDHLNLVQTAL-----RQQEKIE 350
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2394 RFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEA----QLLQLKsEEMQTVR 2469
Cdd:COG3096 351 RYQEDLEELTERLEEQEEVVEEAAEQLAEAEARLEAAEEEVDSLKSQLADYQQALDVQQTRAiqyqQAVQAL-EKARALC 429
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2470 Q---------EQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQ------DEVAKAQA-------LREEQQRQQ 2527
Cdd:COG3096 430 GlpdltpenaEDYLAAFRAKEQQATEEVLELEQKLSVADAARRQFEKAYElvckiaGEVERSQAwqtarelLRRYRSQQA 509
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237968 2528 QQMQQEKqqLAASMEEARRRQHEAEEgVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALAR 2600
Cdd:COG3096 510 LAQRLQQ--LRAQLAELEQRLRQQQN-AERLLEEFCQRIGQQLDAAEELEELLAELEAQLEELEEQAAEAVEQ 579
|
|
| ATAD3_N |
pfam12037 |
ATPase family AAA domain-containing protein 3, N-terminal; This is the conserved N-terminal ... |
1309-1450 |
1.40e-05 |
|
ATPase family AAA domain-containing protein 3, N-terminal; This is the conserved N-terminal domain of ATPase family AAA domain-containing protein 3 (ATAD3) which is involved in dimerization and interacts with the inner surface of the outer mitochondrial membrane. This domain is found associated with the AAA ATPase domain (pfam00004). ATAD3 is essential for mitochondrial network organization, mitochondrial metabolism and cell growth at organizm and cellular level. It may also play an important role in mitochondrial protein synthesis.
Pssm-ID: 463442 [Multi-domain] Cd Length: 264 Bit Score: 49.98 E-value: 1.40e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1309 RTRYSELSTLTSQYIRFI----SETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRqlaeahaqakAQAEREAQglqR 1384
Cdd:pfam12037 56 QTRQAELQAKIKEYEAAQeqlkIERQRVEYEERRKTLQEETKQKQQRAQYQDELARKR----------YQDQLEAQ---R 122
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 1385 RMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQL 1450
Cdd:pfam12037 123 RRNEELLRKQEESVAKQEAMRIQAQRRQTEEHEAELRRETERAKAEAEAEARAKEERENEDLNLEQ 188
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
1465-1729 |
1.42e-05 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 51.56 E-value: 1.42e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1465 QALRARAEEAEAQKRQAQEEAERLRRQVqDETQRKRQA--EAELALRVQAEAEAAREKQRALQA-LEELRLQAEEAERRL 1541
Cdd:COG3206 164 QNLELRREEARKALEFLEEQLPELRKEL-EEAEAALEEfrQKNGLVDLSEEAKLLLQQLSELESqLAEARAELAEAEARL 242
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1542 RQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEAtrraqqqaeaeraraeaerele 1621
Cdd:COG3206 243 AALRAQLGSGPDALPELLQSPVIQQLRAQLAELEAELAELSARYTPNHPDVIALRAQI---------------------- 300
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1622 rwqlkanEALRLRLQAEEVAQQKSLtqaeaekqkeeaerearrrgKAEEQAVRQRelaEQELEKQRQLAEGTAQQRLAAE 1701
Cdd:COG3206 301 -------AALRAQLQQEAQRILASL--------------------EAELEALQAR---EASLQAQLAQLEARLAELPELE 350
|
250 260
....*....|....*....|....*...
gi 1920237968 1702 QELIRLRAETeqgEQQRQLLEEELARLQ 1729
Cdd:COG3206 351 AELRRLEREV---EVARELYESLLQRLE 375
|
|
| Caldesmon |
pfam02029 |
Caldesmon; |
1329-1545 |
1.57e-05 |
|
Caldesmon;
Pssm-ID: 460421 [Multi-domain] Cd Length: 495 Bit Score: 51.02 E-value: 1.57e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1329 TLRRMEEEERLAEQQRAEERERLAEVEAALEkqrqlaEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQ 1408
Cdd:pfam02029 134 EIREKEYQENKWSTEVRQAEEEGEEEEDKSE------EAEEVPTENFAKEEVKDEKIKKEKKVKYESKVFLDQKRGHPEV 207
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1409 EELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRV-VRLQLEATERQRGGAEG-ELQALRARAEEAEAQKRQAQEEAE 1486
Cdd:pfam02029 208 KSQNGEEEVTKLKVTTKRRQGGLSQSQEREEEAEVFLeAEQKLEELRRRRQEKESeEFEKLRQKQQEAELELEELKKKRE 287
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 1487 RLRRQVQDETQRKRQAEAELALRVQaeaeaaREKQRALQALEelRLQAEEAERRLRQAE 1545
Cdd:pfam02029 288 ERRKLLEEEEQRRKQEEAERKLREE------EEKRRMKEEIE--RRRAEAAEKRQKLPE 338
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1341-1495 |
1.62e-05 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 49.54 E-value: 1.62e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1341 EQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREaqglQRRMQEEVARREEVAVEAQEQKRSIqeelqhlrqSSEA 1420
Cdd:COG1579 23 EHRLKELPAELAELEDELAALEARLEAAKTELEDLEKE----IKRLELEIEEVEARIKKYEEQLGNV---------RNNK 89
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237968 1421 EIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDE 1495
Cdd:COG1579 90 EYEALQKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAE 164
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
2146-2593 |
1.67e-05 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 51.36 E-value: 1.67e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2146 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTalrlQLEETDHQKSILDEELQRLKAEVTEAARQRGQVE-----EELF 2220
Cdd:pfam10174 58 LKEQYRVTQEENQHLQLTIQALQDELRAQRDLN----QLLQQDFTTSPVDGEDKFSTPELTEENFRRLQSEherqaKELF 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2221 SLRVQMEELgklKARIEAENRALVLRDkDSAQRLL---QEEAEKMKQVAEEAARLSVAAQEAARLRQLaeEDLAQQRALA 2297
Cdd:pfam10174 134 LLRKTLEEM---ELRIETQKQTLGARD-ESIKKLLemlQSKGLPKKSGEEDWERTRRIAEAEMQLGHL--EVLLDQKEKE 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2298 EKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLqedkEQMAQQLAQETQgfqkTLETErqrqLEMSAEAERLRLR 2377
Cdd:pfam10174 208 NIHLREELHRRNQLQPDPAKTKALQTVIEMKDTKISSL----ERNIRDLEDEVQ----MLKTN----GLLHTEDREEEIK 275
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2378 VAEMSRAQARaeedarrFRK-QAEDIGERLYRTE---LATQEKVmlvQTLETQRQQSDRDAERLREAIAELEHEKDKLKQ 2453
Cdd:pfam10174 276 QMEVYKSHSK-------FMKnKIDQLKQELSKKEselLALQTKL---ETLTNQNSDCKQHIEVLKESLTAKEQRAAILQT 345
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2454 EAQLLQLKSEEMQTV-----RQEQLLQETQALQQSFLSE-KDSLLQRERCIEQEKAKLEQL---FQDEVAKAQALREEQQ 2524
Cdd:pfam10174 346 EVDALRLRLEEKESFlnkktKQLQDLTEEKSTLAGEIRDlKDMLDVKERKINVLQKKIENLqeqLRDKDKQLAGLKERVK 425
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 2525 RQQQQMQQEKQQLAaSMEEARRrqhEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEE 2593
Cdd:pfam10174 426 SLQTDSSNTDTALT-TLEEALS---EKERIIERLKEQREREDRERLEELESLKKENKDLKEKVSALQPE 490
|
|
| hsdR |
PRK11448 |
type I restriction enzyme EcoKI subunit R; Provisional |
1699-1794 |
1.74e-05 |
|
type I restriction enzyme EcoKI subunit R; Provisional
Pssm-ID: 236912 [Multi-domain] Cd Length: 1123 Bit Score: 51.49 E-value: 1.74e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1699 AAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLlasKARAEEESRSTSEKSKQRL 1778
Cdd:PRK11448 146 ALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQL---QEKAAETSQERKQKRKEIT 222
|
90
....*....|....*.
gi 1920237968 1779 EAEAGRFrELAEEAAR 1794
Cdd:PRK11448 223 DQAAKRL-ELSEEETR 237
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
1379-2015 |
2.07e-05 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 50.88 E-value: 2.07e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1379 AQGLQR---RMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATER 1455
Cdd:pfam05483 73 SEGLSRlysKLYKEAEKIKKWKVSIEAELKQKENKLQENRKIIEAQRKAIQELQFENEKVSLKLEEEIQENKDLIKENNA 152
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1456 QRGGAEgELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAElaLRVQAEaeaarekqralQALEELRLQAE 1535
Cdd:pfam05483 153 TRHLCN-LLKETCARSAEKTKKYEYEREETRQVYMDLNNNIEKMILAFEE--LRVQAE-----------NARLEMHFKLK 218
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1536 EAERRLRQAEAERARQV-----QVALETAQrSAEAElqsehasfaEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAE 1610
Cdd:pfam05483 219 EDHEKIQHLEEEYKKEIndkekQVSLLLIQ-ITEKE---------NKMKDLTFLLEESRDKANQLEEKTKLQDENLKELI 288
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1611 RARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQla 1690
Cdd:pfam05483 289 EKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLR-- 366
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1691 egTAQQRLaaeqelirlraetEQGEQQRQLLEEELARLQREAAAATQ----KRRELEaELAKVRAEMEVLLASKARAEEE 1766
Cdd:pfam05483 367 --TEQQRL-------------EKNEDQLKIITMELQKKSSELEEMTKfknnKEVELE-ELKKILAEDEKLLDEKKQFEKI 430
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1767 SRSTSEKSKQRLEAEAGRFRELAEEAARLRALaeEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLkteaeialk 1846
Cdd:pfam05483 431 AEELKGKEQELIFLLQAREKEIHDLEIQLTAI--KTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLL--------- 499
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1847 ekeaENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGL--VEDTLRQRR--------QVEEEI 1916
Cdd:pfam05483 500 ----ENKELTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDELesVREEFIQKGdevkckldKSEENA 575
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1917 LALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKsLAAEEEAARQR--- 1993
Cdd:pfam05483 576 RSIEYEVLKKEKQMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNK-LELELASAKQKfee 654
|
650 660 670
....*....|....*....|....*....|....*...
gi 1920237968 1994 ----------------KAALEEVERLKAKVEEARRLRE 2015
Cdd:pfam05483 655 iidnyqkeiedkkiseEKLLEEVEKAKAIADEAVKLQK 692
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1832-2585 |
2.08e-05 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 50.98 E-value: 2.08e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1832 SEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQ 1911
Cdd:NF041483 22 AEMDRLKTEREKAVQHAEDLGYQVEVLRAKLHEARRSLASRPAYDGADIGYQAEQLLRNAQIQADQLRADAERELRDARA 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1912 VEEEIlaLKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQ----------AEQEAARQRQLAAEEERRRREAEERVQK 1981
Cdd:NF041483 102 QTQRI--LQEHAEHQARLQAELHTEAVQRRQQLDQELAERRQtveshvnenvAWAEQLRARTESQARRLLDESRAEAEQA 179
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1982 SLAAEEEAARqrkAALEEVERLKAKVEEARRLRE----RAEQESARQLQLAQEAAQKRLQAEEKAHAF-------AVQQK 2050
Cdd:NF041483 180 LAAARAEAER---LAEEARQRLGSEAESARAEAEailrRARKDAERLLNAASTQAQEATDHAEQLRSStaaesdqARRQA 256
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2051 EQELQQTLQQEQSVLERLR-----SEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAE 2125
Cdd:NF041483 257 AELSRAAEQRMQEAEEALRearaeAEKVVAEAKEAAAKQLASAESANEQRTRTAKEEIARLVGEATKEAEALKAEAEQAL 336
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2126 KLRKEAEQEAARRAQAEQAALRQKQAAdAEMEKHKQFAEQALRQKAQVEQELTalRLQLEETDHQKSILDEELQRLKAEV 2205
Cdd:NF041483 337 ADARAEAEKLVAEAAEKARTVAAEDTA-AQLAKAARTAEEVLTKASEDAKATT--RAAAEEAERIRREAEAEADRLRGEA 413
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2206 TEAARQ-RGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAAR-----LSVAAQEA 2279
Cdd:NF041483 414 ADQAEQlKGAAKDDTKEYRAKTVELQEEARRLRGEAEQLRAEAVAEGERIRGEARREAVQQIEEAARtaeelLTKAKADA 493
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2280 ARLRQLAEEDLAQQRALA-EKMLKEKMQAVQEATRLKAEAELLQQQkelAQEQARRLQEDKEQMAQQLAQETQGFQKTLE 2358
Cdd:NF041483 494 DELRSTATAESERVRTEAiERATTLRRQAEETLERTRAEAERLRAE---AEEQAEEVRAAAERAARELREETERAIAARQ 570
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2359 TERQRQLE-MSAEAERlRLRVAEMSRAQARAEedARRFRKQAEDIGERLyRTELAtqekvmlvQTLETQRQQSDRDAERL 2437
Cdd:NF041483 571 AEAAEELTrLHTEAEE-RLTAAEEALADARAE--AERIRREAAEETERL-RTEAA--------ERIRTLQAQAEQEAERL 638
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2438 R-EAIAELEHEkdKLKQEAQLLQLKSEEMQtvRQEQLLQETQALQQSFLSEKDSLLQRercIEQEKAKleqlfqdevaka 2516
Cdd:NF041483 639 RtEAAADASAA--RAEGENVAVRLRSEAAA--EAERLKSEAQESADRVRAEAAAAAER---VGTEAAE------------ 699
|
730 740 750 760 770 780 790
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2517 qalreeqqrqqqqmqqekqQLAASMEEARRRQHEAEEGVRRQQEEL-QRLAQQQQQQEKLLAEENQRLRE 2585
Cdd:NF041483 700 -------------------ALAAAQEEAARRRREAEETLGSARAEAdQERERAREQSEELLASARKRVEE 750
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1365-1580 |
2.18e-05 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 50.21 E-value: 2.18e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1365 AEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLrqssEAEIQAKARQVEAAERsrlRIEEEIR 1444
Cdd:COG3883 14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEAL----QAEIDKLQAEIAEAEA---EIEERRE 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1445 VVRLQLEATERQrGGAEGELQAL--------------------RARAEEAEAQKrQAQEEAERLRRQVQDETQRKRQAEA 1504
Cdd:COG3883 87 ELGERARALYRS-GGSVSYLDVLlgsesfsdfldrlsalskiaDADADLLEELK-ADKAELEAKKAELEAKLAELEALKA 164
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 1505 ELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQ 1580
Cdd:COG3883 165 ELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 240
|
|
| CHASE3 |
COG5278 |
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms]; |
1340-1768 |
2.20e-05 |
|
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
Pssm-ID: 444089 [Multi-domain] Cd Length: 530 Bit Score: 50.68 E-value: 2.20e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1340 AEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSE 1419
Cdd:COG5278 84 ARAEIDELLAELRSLTADNPEQQARLDELEALIDQWLAELEQVIALRRAGGLEAALALVRSGEGKALMDEIRARLLLLAL 163
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1420 AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRK 1499
Cdd:COG5278 164 ALAALLLAAAALLLLLLALAALLALAELLLLALARALAALLLLLLLEAELAAAAALLAAAAALAALAALELLAALALALA 243
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1500 RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTA 1579
Cdd:COG5278 244 LLLAALLLALLAALALAALLAAALLALAALLLALAAAAALAAAAALELAAAEALALAELELELLLAAAAAAAAAAAAAAA 323
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1580 QLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAE 1659
Cdd:COG5278 324 ALAALLALALATALAAAAAALALLAALLAEAAAAAAEEAEAAAEAAAAALAGLAEVEAEGAAEAVELEVLAIAAAAAAAA 403
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1660 REARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKR 1739
Cdd:COG5278 404 AEAAAAAAAAAAASAAEALELAEALAEALALAEEEALALAAASSELAEAGAALALAAAEALAEELAAVAALAALAAAAAA 483
|
410 420
....*....|....*....|....*....
gi 1920237968 1740 RELEAELAKVRAEMEVLLASKARAEEESR 1768
Cdd:COG5278 484 LAEAEAAAALAAAAALSLALALAALLLAA 512
|
|
| MARTX_Nterm |
NF012221 |
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model ... |
2146-2351 |
2.29e-05 |
|
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model describes the N-terminal 1900 amino acids of MARTX family multifunctional-autoprocessing repeats-in-toxin holotoxins, which contain both repeat regions that facilitate their entry into eukaryotic target cells, and multiple effector domains.
Pssm-ID: 467957 [Multi-domain] Cd Length: 1848 Bit Score: 50.99 E-value: 2.29e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2146 LRQKQAADA-----EMEKHKQFAEQALRQKaqveqeltalrlQLEETDH---------QKSILDEELQRLKAEVTEAARQ 2211
Cdd:NF012221 1561 LADKERAEAdrqrlEQEKQQQLAAISGSQS------------QLESTDQnaletngqaQRDAILEESRAVTKELTTLAQG 1628
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2212 RGQVEEElfslRVQMEELGKlKARIEAENRAL--VLRDKDSAQRLLQEEAEKMKQ--------VAEEAARLSVAAQEAAR 2281
Cdd:NF012221 1629 LDALDSQ----ATYAGESGD-QWRNPFAGGLLdrVQEQLDDAKKISGKQLADAKQrhvdnqqkVKDAVAKSEAGVAQGEQ 1703
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2282 LRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQ 2351
Cdd:NF012221 1704 NQANAEQDIDDAKADAEKRKDDALAKQNEAQQAESDANAAANDAQSRGEQDASAAENKANQAQADAKGAK 1773
|
|
| COG3903 |
COG3903 |
Predicted ATPase [General function prediction only]; |
1526-1964 |
2.38e-05 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443109 [Multi-domain] Cd Length: 933 Bit Score: 50.79 E-value: 2.38e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1526 ALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQ---------LR 1596
Cdd:COG3903 477 AAERLAEAGERAAARRRHADYYLALAERAAAELRGPDQLAWLARLDAEHDNLRAALRWALAHGDAELALrlaaalapfWF 556
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1597 EEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQR 1676
Cdd:COG3903 557 LRGLLREGRRWLERALAAAGEAAAALAAAAALAAAAAAARAAAAAAAAAAAAAAAAAAAAAAAAAALLLLAALAAAAAAA 636
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1677 ELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVL 1756
Cdd:COG3903 637 AAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAALAAAAAALAAAAAAAALAAAAAAALAAAAA 716
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1757 LASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATR 1836
Cdd:COG3903 717 AAAAAAAAAALLAAAAAAALAAAAAAAALALAAAAAAAAAAAAAAALAAAAAAAALAALLLALAAAAAALAAAAAAAAAA 796
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1837 LKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEI 1916
Cdd:COG3903 797 AAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAALAAALAAAAAAAAAAAAAAAAAAALAAALAAAAAAAAAAALAAAAAA 876
|
410 420 430 440
....*....|....*....|....*....|....*....|....*...
gi 1920237968 1917 LALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQL 1964
Cdd:COG3903 877 AAAAAAALLAAAAAAAAAAAAAAAAAAALAAAAAAAAAAALAAAAAAA 924
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3786-3822 |
2.42e-05 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 44.01 E-value: 2.42e-05
10 20 30
....*....|....*....|....*....|....*..
gi 1920237968 3786 LRLLDAQLATGGIVDPRLGFHLPLDVAYQRGYLDKDT 3822
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
|
|
| COG5281 |
COG5281 |
Phage-related minor tail protein [Mobilome: prophages, transposons]; |
2225-2605 |
2.45e-05 |
|
Phage-related minor tail protein [Mobilome: prophages, transposons];
Pssm-ID: 444092 [Multi-domain] Cd Length: 603 Bit Score: 50.76 E-value: 2.45e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2225 QMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEK 2304
Cdd:COG5281 1 AAALAAAAALAAAAAAAAASAAAAAAAAALAAAAAAAAAAAGLAAAAAAAAAASLAAAAAAAALAAAAAAAAAAAAADAL 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2305 MQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRA 2384
Cdd:COG5281 81 AAALAEDAAAAAAAAEAALAALAAAALALAAAALAEAALAAAAAAAAAAAAAAAAAAAAAAAAAEAAKAAAAAAAAAALA 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2385 QARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEE 2464
Cdd:COG5281 161 AAAAAAAAAAAAAAAAAALAAASAAAAAAAAKAAAEAAAEAAAAAEAAAAAAAAAAEAAAAEAQALAAAALAEQAALAAA 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2465 MQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEA 2544
Cdd:COG5281 241 SAAAQALAALAAAAAAAALALAAAAELALTAQAEAAAAAAAAAAAAAQAAEAAAAAAEAQALAAAAAAAAAQLAAAAAAA 320
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 2545 R-RRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEEIA 2605
Cdd:COG5281 321 AqALRAAAQALAALAQRALAAAALAAAAQEAALAAAAAALQAALEAAAAAAAAELAAAGDWA 382
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
939-1509 |
2.57e-05 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 50.89 E-value: 2.57e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 939 DRLQAEREYGSCSRHYQQLLQSLEQGEQEESrcqrcISELKDIRL-QLEACETRTVHRLRlpldkepaRECAQRITEQQK 1017
Cdd:pfam15921 364 ERDQFSQESGNLDDQLQKLLADLHKREKELS-----LEKEQNKRLwDRDTGNSITIDHLR--------RELDDRNMEVQR 430
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1018 AQAEVDGL-----GKGVARLSAEAEKVLALPEPSPAAPTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEE 1092
Cdd:pfam15921 431 LEALLKAMksecqGQMERQMAAIQGKNESLEKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKER 510
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1093 VlraheeqlkeaqavpatlpeLEATKAALKKLRAQAEAQQPVFDALRDE---LRGAQEVGERLQQRHGERDVEVERWRER 1169
Cdd:pfam15921 511 A--------------------IEATNAEITKLRSRVDLKLQELQHLKNEgdhLRNVQTECEALKLQMAEKDKVIEILRQQ 570
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1170 VTLLLE-------RWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLR--DAKQRQEQIQAVPLANSQAvrEQLRQE 1240
Cdd:pfam15921 571 IENMTQlvgqhgrTAGAMQVEKAQLEKEINDRRLELQEFKILKDKKDAKIRelEARVSDLELEKVKLVNAGS--ERLRAV 648
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1241 KALLEDIERHGEKVEECQrfaKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQsgsesiiqeyvdLRTRYSELSTLTS 1320
Cdd:pfam15921 649 KDIKQERDQLLNEVKTSR---NELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQ------------LKSAQSELEQTRN 713
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1321 qyirfiseTLRRMEEEERLAeqqraeererlaeVEAALEKQRQLAEAHAQAKAqaereaqglqrrMQEEVARREEVAVEA 1400
Cdd:pfam15921 714 --------TLKSMEGSDGHA-------------MKVAMGMQKQITAKRGQIDA------------LQSKIQFLEEAMTNA 760
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1401 QEQKRSIQEELQHLRQsseaeiqakarqveaaersrlrieeeirvvRLQLEATERQRggAEGELQALRA---RAEEAEAQ 1477
Cdd:pfam15921 761 NKEKHFLKEEKNKLSQ------------------------------ELSTVATEKNK--MAGELEVLRSqerRLKEKVAN 808
|
570 580 590
....*....|....*....|....*....|..
gi 1920237968 1478 KRQAQEEAERLRRQVQDETQRKRQAEAELALR 1509
Cdd:pfam15921 809 MEVALDKASLQFAECQDIIQRQEQESVRLKLQ 840
|
|
| CHASE3 |
COG5278 |
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms]; |
1624-2042 |
2.61e-05 |
|
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
Pssm-ID: 444089 [Multi-domain] Cd Length: 530 Bit Score: 50.29 E-value: 2.61e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1624 QLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQE 1703
Cdd:COG5278 105 QQARLDELEALIDQWLAELEQVIALRRAGGLEAALALVRSGEGKALMDEIRARLLLLALALAALLLAAAALLLLLLALAA 184
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1704 LIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAG 1783
Cdd:COG5278 185 LLALAELLLLALARALAALLLLLLLEAELAAAAALLAAAAALAALAALELLAALALALALLLAALLLALLAALALAALLA 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1784 RFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEA 1863
Cdd:COG5278 265 AALLALAALLLALAAAAALAAAAALELAAAEALALAELELELLLAAAAAAAAAAAAAAAALAALLALALATALAAAAAAL 344
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1864 FQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGT 1943
Cdd:COG5278 345 ALLAALLAEAAAAAAEEAEAAAEAAAAALAGLAEVEAEGAAEAVELEVLAIAAAAAAAAAEAAAAAAAAAAASAAEALEL 424
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1944 AEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESAR 2023
Cdd:COG5278 425 AEALAEALALAEEEALALAAASSELAEAGAALALAAAEALAEELAAVAALAALAAAAAALAEAEAAAALAAAAALSLALA 504
|
410
....*....|....*....
gi 1920237968 2024 QLQLAQEAAQKRLQAEEKA 2042
Cdd:COG5278 505 LAALLLAAAEAALAAALAA 523
|
|
| Crescentin |
pfam19220 |
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament ... |
2146-2448 |
2.66e-05 |
|
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament proteins, named crescentin, whose cytoskeletal function is required for the vibrioid and helical shapes of Caulobacter crescentus. Without crescentin, the cells adopt a straight-rod morphology. Crescentin has characteriztic features of IF proteins including the ability to assemble into filaments in vitro without energy or cofactor requirements. In vivo, crescentin forms a helical structure that colocalizes with the inner cell curvatures beneath the cytoplasmic membrane.
Pssm-ID: 437057 [Multi-domain] Cd Length: 401 Bit Score: 50.07 E-value: 2.66e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2146 LRQKQAADAEMEKHkqfAEQALRQKAQVEQELTALRLQLEETDHQksildeeLQRLKAEVTEAARQRGQVEEELFSLRVQ 2225
Cdd:pfam19220 113 LRDKTAQAEALERQ---LAAETEQNRALEEENKALREEAQAAEKA-------LQRAEGELATARERLALLEQENRRLQAL 182
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2226 MEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEkmkqvaeeaarlsVAAQEAARLRQLAEEDLAQQRALAEKM-LKEK 2304
Cdd:pfam19220 183 SEEQAAELAELTRRLAELETQLDATRARLRALEGQ-------------LAAEQAERERAEAQLEEAVEAHRAERAsLRMK 249
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2305 MQAVQeaTRLKAEAELLQQQKELAQEQARRLQEdKEQMAQQLAQETQGFQKTLEterqrqlEMSAEAERLRLRVAEMSRA 2384
Cdd:pfam19220 250 LEALT--ARAAATEQLLAEARNQLRDRDEAIRA-AERRLKEASIERDTLERRLA-------GLEADLERRTQQFQEMQRA 319
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237968 2385 QARAEEDARRFRKQAEDIGERLYRTE---LATQEKV-MLVQTLETQRQQSDRDAERLREaiaELEHEK 2448
Cdd:pfam19220 320 RAELEERAEMLTKALAAKDAALERAEeriASLSDRIaELTKRFEVERAALEQANRRLKE---ELQRER 384
|
|
| PspC_subgroup_1 |
NF033838 |
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... |
1258-1589 |
2.68e-05 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 50.40 E-value: 2.68e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1258 QRFAKQYINAIKDYelqlvtYKAQLEPVASPAKKPKvQSGSESIIQEYVDLRTRY-SELSTLTSQyirfiSETLRRMEEE 1336
Cdd:NF033838 53 NESQKEHAKEVESH------LEKILSEIQKSLDKRK-HTQNVALNKKLSDIKTEYlYELNVLKEK-----SEAELTSKTK 120
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1337 ERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQ----RRMQEEVArreEVAVEAQEQKRSIQEELQ 1412
Cdd:NF033838 121 KELDAAFEQFKKDTLEPGKKVAEATKKVEEAEKKAKDQKEEDRRNYPtntyKTLELEIA---ESDVEVKKAELELVKEEA 197
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1413 HLRQSSEAEIQAKAR-QVEAAERSRLrieEEIRVVRLQLEATERQRGGAE-GELQALRARAEEAEAQKRQA--------- 1481
Cdd:NF033838 198 KEPRDEEKIKQAKAKvESKKAEATRL---EKIKTDREKAEEEAKRRADAKlKEAVEKNVATSEQDKPKRRAkrgvlgepa 274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1482 -----QEEAERLRRQVQDET-------QRKRQAEAE-LALRVQAEAEAAREKQR---ALQALEELRLQAEEAERRLRQAE 1545
Cdd:NF033838 275 tpdkkENDAKSSDSSVGEETlpspslkPEKKVAEAEkKVEEAKKKAKDQKEEDRrnyPTNTYKTLELEIAESDVKVKEAE 354
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 1920237968 1546 A----ERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEH 1589
Cdd:NF033838 355 LelvkEEAKEPRNEEKIKQAKAKVESKKAEATRLEKIKTDRKKAEEEA 402
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1399-1536 |
2.80e-05 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 48.77 E-value: 2.80e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1399 EAQEQKRSIQEELQHLRQ---SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGA---------EGELQA 1466
Cdd:COG1579 21 RLEHRLKELPAELAELEDelaALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVrnnkeyealQKEIES 100
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1467 LRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEE 1536
Cdd:COG1579 101 LKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAEREELAA 170
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1392-1745 |
3.06e-05 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 49.90 E-value: 3.06e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1392 RREEVAVEAQEQKRSIQEELQHLRQsseaEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARA 1471
Cdd:COG4372 21 KTGILIAALSEQLRKALFELDKLQE----ELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAEL 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1472 EEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQ 1551
Cdd:COG4372 97 AQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQAL 176
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1552 VQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEAL 1631
Cdd:COG4372 177 SEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVI 256
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1632 RLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAET 1711
Cdd:COG4372 257 LKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILL 336
|
330 340 350
....*....|....*....|....*....|....
gi 1920237968 1712 EQGEQQRQLLEEELARLQREAAAATQKRRELEAE 1745
Cdd:COG4372 337 AELADLLQLLLVGLLDNDVLELLSKGAEAGVADG 370
|
|
| PRK11637 |
PRK11637 |
AmiB activator; Provisional |
2162-2393 |
3.23e-05 |
|
AmiB activator; Provisional
Pssm-ID: 236942 [Multi-domain] Cd Length: 428 Bit Score: 49.69 E-value: 3.23e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2162 FAEQALRQKAQ---VEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQrgqveeelfsLRVQMEELGKLKARIEA 2238
Cdd:PRK11637 38 FSAHASDNRDQlksIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAISQASRK----------LRETQNTLNQLNKQIDE 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2239 ENRalvlrdkdSAQRLLQEEAEKMKqvaeeaarlSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQA----VQEAtRL 2314
Cdd:PRK11637 108 LNA--------SIAKLEQQQAAQER---------LLAAQLDAAFRQGEHTGLQLILSGEESQRGERILAyfgyLNQA-RQ 169
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2315 KAEAELLQQQKELAQEqaRRLQEDKEQMAQQLAQETQGFQKTLETER-----------------QRQL-EMSAEAERLRL 2376
Cdd:PRK11637 170 ETIAELKQTREELAAQ--KAELEEKQSQQKTLLYEQQAQQQKLEQARnerkktltglesslqkdQQQLsELRANESRLRD 247
|
250
....*....|....*...
gi 1920237968 2377 RVAEMSR-AQARAEEDAR 2393
Cdd:PRK11637 248 SIARAEReAKARAEREAR 265
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
851-1501 |
3.29e-05 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 50.35 E-value: 3.29e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 851 NQEAQEAIARlEAQHQALVALWhQLHTEMKSLLAWQSLGRDMQlirsWSLATFRTLKPEE------------QRQALRsL 918
Cdd:TIGR00618 223 VLEKELKHLR-EALQQTQQSHA-YLTQKREAQEEQLKKQQLLK----QLRARIEELRAQEavleetqerinrARKAAP-L 295
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 919 ELHYQAFLRDSQDAGGFGPEDRLQaEREYGSCSRHYQQLLQSLEQGEQEESRCQRCISELKDIRLQLEACETRTVHRLRL 998
Cdd:TIGR00618 296 AAHIKAVTQIEQQAQRIHTELQSK-MRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQ 374
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 999 PLDKEPARECAQRIT----EQQKAQAEVDGLGKGVARLSAEAEKVLALPEPSPAAptlRSELELTLGKLEQVRSLSAIYL 1074
Cdd:TIGR00618 375 HTLTQHIHTLQQQKTtltqKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHA---KKQQELQQRYAELCAAAITCTA 451
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1075 EKLKtisLVIRSTQEAEEVLRAHEEQLKEAQAVpaTLPELEATKAALKKLRAQAEAQQPVFDALR---------DELRGA 1145
Cdd:TIGR00618 452 QCEK---LEKIHLQESAQSLKEREQQLQTKEQI--HLQETRKKAVVLARLLELQEEPCPLCGSCIhpnparqdiDNPGPL 526
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1146 QEVGERLQQRHGERDVEVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLgawlrdakqRQEQIQAV 1225
Cdd:TIGR00618 527 TRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNL---------QNITVRLQ 597
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1226 PLansqaVREQLRQEKALLEDIERHGEKVEECQRFAK--QYINAIKDYELQLVTYKAQLEpvASPAKKPKVQSGSESIIQ 1303
Cdd:TIGR00618 598 DL-----TEKLSEAEDMLACEQHALLRKLQPEQDLQDvrLHLQQCSQELALKLTALHALQ--LTLTQERVREHALSIRVL 670
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1304 EYVDLRTRYSELSTLTSQY--IRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQG 1381
Cdd:TIGR00618 671 PKELLASRQLALQKMQSEKeqLTYWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELMH 750
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1382 LQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAE 1461
Cdd:TIGR00618 751 QARTVLKARTEAHFNNNEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEE 830
|
650 660 670 680
....*....|....*....|....*....|....*....|
gi 1920237968 1462 GELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQ 1501
Cdd:TIGR00618 831 EQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQAK 870
|
|
| PspA |
COG1842 |
Phage shock protein A [Transcription, Signal transduction mechanisms]; |
1667-1889 |
3.36e-05 |
|
Phage shock protein A [Transcription, Signal transduction mechanisms];
Pssm-ID: 441447 [Multi-domain] Cd Length: 217 Bit Score: 48.28 E-value: 3.36e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1667 KAEEQAVRQRELAEQELEK-QRQLAEgtaqqrlaAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRE-LEA 1744
Cdd:COG1842 16 ALLDKAEDPEKMLDQAIRDmEEDLVE--------ARQALAQVIANQKRLERQLEELEAEAEKWEEKARLALEKGREdLAR 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1745 ELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVL 1824
Cdd:COG1842 88 EALERKAELEAQAEALEAQLAQLEEQVEKLKEALRQLESKLEELKAKKDTLKARAKAAKAQEKVNEALSGIDSDDATSAL 167
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237968 1825 AEklaaiseatrlkteAEIALKEKEAENERLRRLAEDEAFQRRLLEeqaAQHKADIEARLAQLRK 1889
Cdd:COG1842 168 ER--------------MEEKIEEMEARAEAAAELAAGDSLDDELAE---LEADSEVEDELAALKA 215
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
1088-1587 |
3.38e-05 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 50.34 E-value: 3.38e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1088 QEAEEVLRAHEEQlkeaQAVPATLPELEATKAALKK-LRAQAEAQQpvfdaLRDELRGAQEVG-------ERLQQRHGER 1159
Cdd:PRK04863 496 DVARELLRRLREQ----RHLAEQLQQLRMRLSELEQrLRQQQRAER-----LLAEFCKRLGKNlddedelEQLQEELEAR 566
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1160 ----DVEVERWRERVTLLlerwQAVLAQTDVRQRELEQLGRQLRYYRESADPL----GAWLRDAKQRQEQIQ--AVPLAN 1229
Cdd:PRK04863 567 leslSESVSEARERRMAL----RQQLEQLQARIQRLAARAPAWLAAQDALARLreqsGEEFEDSQDVTEYMQqlLERERE 642
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1230 SQAVREQLRQEK-ALLEDIER----HGEKVEECQRFAKQ-------------------YINA-------------IKDYE 1272
Cdd:PRK04863 643 LTVERDELAARKqALDEEIERlsqpGGSEDPRLNALAERfggvllseiyddvsledapYFSAlygparhaivvpdLSDAA 722
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1273 LQLVT--------YKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSelstltsqyiRFISETL-RRMEEEERLAEQQ 1343
Cdd:PRK04863 723 EQLAGledcpedlYLIEGDPDSFDDSVFSVEELEKAVVVKIADRQWRYS----------RFPEVPLfGRAAREKRIEQLR 792
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1344 RaeERERLAEveaalekqrqlaeahaqAKAQAEREAQGLQRRMQE---------EVARREEVAVEAQEQKRSIQEELQHL 1414
Cdd:PRK04863 793 A--EREELAE-----------------RYATLSFDVQKLQRLHQAfsrfigshlAVAFEADPEAELRQLNRRRVELERAL 853
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1415 RQSSEAEIQAKARQVEAAERSRL--------------RIEEEIRVVRLQLEATE------RQRGGA----EGELQALRAR 1470
Cdd:PRK04863 854 ADHESQEQQQRSQLEQAKEGLSAlnrllprlnlladeTLADRVEEIREQLDEAEeakrfvQQHGNAlaqlEPIVSVLQSD 933
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1471 AEEAEAQKR---QAQEEAERLRRQVQDET---QRKRQAEAELALRVQAEAEAAREKQRalQALEELRLQAEEAERRLRQA 1544
Cdd:PRK04863 934 PEQFEQLKQdyqQAQQTQRDAKQQAFALTevvQRRAHFSYEDAAEMLAKNSDLNEKLR--QRLEQAEQERTRAREQLRQA 1011
|
570 580 590 600
....*....|....*....|....*....|....*....|...
gi 1920237968 1545 EAERARQVQValetaqrsaEAELQSEHASFAEKTAQLERTLKE 1587
Cdd:PRK04863 1012 QAQLAQYNQV---------LASLKSSYDAKRQMLQELKQELQD 1045
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
942-1380 |
3.60e-05 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 50.15 E-value: 3.60e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 942 QAEREYGSCSRHYQQLLQSLEQGEQEESRCQRCISELKDIRLQLEACE-----TRTVHRLRLPLDKEPAR--ECAQRITE 1014
Cdd:COG4717 78 EELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLqllplYQELEALEAELAELPERleELEERLEE 157
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1015 QQKAQAEVDGLGKGVARLSAEAEKVLALPEPSpaaptLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVL 1094
Cdd:COG4717 158 LRELEEELEELEAELAELQEELEELLEQLSLA-----TEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQL 232
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1095 R------AHEEQLKEAQAVPATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRE 1168
Cdd:COG4717 233 EneleaaALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGKEAEELQALPA 312
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1169 RVTLLLERWQAVLAQTDVRQREleqlgrQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEkALLEDIE 1248
Cdd:COG4717 313 LEELEEEELEELLAALGLPPDL------SPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAE-AGVEDEE 385
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1249 RHGEKVEECQRFaKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELstltSQYIRFISE 1328
Cdd:COG4717 386 ELRAALEQAEEY-QELKEELEELEEQLEELLGELEELLEALDEEELEEELEELEEELEELEEELEEL----REELAELEA 460
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 1329 TLRRMEEEERLAE--QQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQ 1380
Cdd:COG4717 461 ELEQLEEDGELAEllQELEELKAELRELAEEWAALKLALELLEEAREEYREERL 514
|
|
| PTZ00491 |
PTZ00491 |
major vault protein; Provisional |
1376-1570 |
3.81e-05 |
|
major vault protein; Provisional
Pssm-ID: 240439 [Multi-domain] Cd Length: 850 Bit Score: 50.02 E-value: 3.81e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1376 EREAQGLQRRMQEEVarreEVAVEAQEQKRSIQEELQhlrqsseaEIQAKARqveaAERSRL--RIE-EEIRVVRLQLEA 1452
Cdd:PTZ00491 643 ERTRDSLQKSVQLAI----EITTKSQEAAARHQAELL--------EQEARGR----LERQKMhdKAKaEEQRTKLLELQA 706
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1453 TerqrgGAEGELQAlRARAE-EAEAQKRQAQEEAErlrrqVQDETQRKRqaeaelALRVQAEAEAAREKQRALQALEELR 1531
Cdd:PTZ00491 707 E-----SAAVESSG-QSRAEaLAEAEARLIEAEAE-----VEQAELRAK------ALRIEAEAELEKLRKRQELELEYEQ 769
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 1920237968 1532 LQAE---EAERRLRQAEAERARQVQVAL--ETAQRSAEA--ELQSE 1570
Cdd:PTZ00491 770 AQNEleiAKAKELADIEATKFERIVEALgrETLIAIARAgpELQAK 815
|
|
| Caldesmon |
pfam02029 |
Caldesmon; |
1326-1550 |
3.89e-05 |
|
Caldesmon;
Pssm-ID: 460421 [Multi-domain] Cd Length: 495 Bit Score: 49.87 E-value: 3.89e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1326 ISETLRRMEEEERLAEQQRAEERERLAEVEAAlEKQRQLAEAHAQAKAQAEREAQglqRRMQEEVARREEVavEAQEQKR 1405
Cdd:pfam02029 100 VAERKENNEEEENSSWEKEEKRDSRLGRYKEE-ETEIREKEYQENKWSTEVRQAE---EEGEEEEDKSEEA--EEVPTEN 173
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1406 SIQEELQHLRQSSEAEIQAKARQVEaaERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKrQAQEEA 1485
Cdd:pfam02029 174 FAKEEVKDEKIKKEKKVKYESKVFL--DQKRGHPEVKSQNGEEEVTKLKVTTKRRQGGLSQSQEREEEAEVFL-EAEQKL 250
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 1486 ERLRRQVQD------ETQRKRQAEAELALrvqAEAEAAREKQRAL--------QALEELRLQAEEAERRLRQAEAERAR 1550
Cdd:pfam02029 251 EELRRRRQEkeseefEKLRQKQQEAELEL---EELKKKREERRKLleeeeqrrKQEEAERKLREEEEKRRMKEEIERRR 326
|
|
| MAD |
pfam05557 |
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ... |
1074-1542 |
4.29e-05 |
|
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
Pssm-ID: 461677 [Multi-domain] Cd Length: 660 Bit Score: 49.74 E-value: 4.29e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1074 LEKLKTISLVIRSTQEAEEVLRAHEEQLKEAQAVPATLPELEATKAALkklraQAEAQQpVFDALRDELRGAQEVGERLQ 1153
Cdd:pfam05557 51 QELQKRIRLLEKREAEAEEALREQAELNRLKKKYLEALNKKLNEKESQ-----LADARE-VISCLKNELSELRRQIQRAE 124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1154 QRHGERDVEVERWRERVTLLLERWQAV---LAQTDVRQREL---EQLGRQLRYYRESADPLGAWLRDAKQRQEQIqavpl 1227
Cdd:pfam05557 125 LELQSTNSELEELQERLDLLKAKASEAeqlRQNLEKQQSSLaeaEQRIKELEFEIQSQEQDSEIVKNSKSELARI----- 199
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1228 ANSQAVREQLRQEKALLEDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVASPAKK------------PKVQ 1295
Cdd:pfam05557 200 PELEKELERLREHNKHLNENIENKLLLKEEVEDLKRKLEREEKYREEAATLELEKEKLEQELQSwvklaqdtglnlRSPE 279
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1296 SGSESIIQEYVDLRTRYSELSTLTSQyIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEK-QRQL---------- 1364
Cdd:pfam05557 280 DLSRRIEQLQQREIVLKEENSSLTSS-ARQLEKARRELEQELAQYLKKIEDLNKKLKRHKALVRRlQRRVllltkerdgy 358
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1365 ------------AEAHAQAKAQAEREAQGLQRRMQ---EEVARREEVAVEA----QEQKRSIQEELQHLRQ--------S 1417
Cdd:pfam05557 359 railesydkeltMSNYSPQLLERIEEAEDMTQKMQahnEEMEAQLSVAEEElggyKQQAQTLERELQALRQqesladpsY 438
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1418 SEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQ--DE 1495
Cdd:pfam05557 439 SKEEVDSLRRKLETLELERQRLREQKNELEMELERRCLQGDYDPKKTKVLHLSMNPAAEAYQQRKNQLEKLQAEIErlKR 518
|
490 500 510 520
....*....|....*....|....*....|....*....|....*..
gi 1920237968 1496 TQRKRQAEAELALRVQAEAEAAREKQralqaLEELRLQAEEAERRLR 1542
Cdd:pfam05557 519 LLKKLEDDLEQVLRLPETTSTMNFKE-----VLDLRKELESAELKNQ 560
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3339-3377 |
4.46e-05 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 43.09 E-value: 4.46e-05
10 20 30
....*....|....*....|....*....|....*....
gi 1920237968 3339 LLQGSGCLAGIYLEDSKEKVTIYEAMRRGLLRPSTATLL 3377
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3339-3374 |
4.63e-05 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 43.24 E-value: 4.63e-05
10 20 30
....*....|....*....|....*....|....*.
gi 1920237968 3339 LLQGSGCLAGIYLEDSKEKVTIYEAMRRGLLRPSTA 3374
Cdd:smart00250 3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1816-2206 |
4.91e-05 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 49.74 E-value: 4.91e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1816 QRAEAERVLAEKLAAIsEATRLKTEAEialkEKEAENERLRRLAEDEAFQRRLLEEQAAqhkadIEARLAQLRKASESEL 1895
Cdd:pfam17380 281 QKAVSERQQQEKFEKM-EQERLRQEKE----EKAREVERRRKLEEAEKARQAEMDRQAA-----IYAEQERMAMEREREL 350
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1896 ERQKglVEDTLRQRRQVEEEilalkgsfekaaagkaELELELGRIRGTaeDTLRSKEQAEQEAARQRqlaaeeerrrrea 1975
Cdd:pfam17380 351 ERIR--QEERKRELERIRQE----------------EIAMEISRMREL--ERLQMERQQKNERVRQE------------- 397
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1976 EERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLR-ERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKeqel 2054
Cdd:pfam17380 398 LEAARKVKILEEERQRKIQQQKVEMEQIRAEQEEARQREvRRLEEERAREMERVRLEEQERQQQVERLRQQEEERK---- 473
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2055 qqtlqqeqsvleRLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQE 2134
Cdd:pfam17380 474 ------------RKKLELEKEKRDRKRAEEQRRKILEKELEERKQAMIEEERKRKLLEKEMEERQKAIYEEERRREAEEE 541
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 2135 aarraqaeqaalRQKQaadAEMEKHKQFAEQALRqkaqVEQELTALRLQLEETDHQKSILDEELQRLKAEVT 2206
Cdd:pfam17380 542 ------------RRKQ---QEMEERRRIQEQMRK----ATEERSRLEAMEREREMMRQIVESEKARAEYEAT 594
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1636-1812 |
4.97e-05 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 49.03 E-value: 4.97e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1636 QAEEVAQQKsltQAEAEKQKEEAEREArrrgKAEEQAVRQRELAEQELEKQRQLAEGTAQQ----RLAAEQELIRLRAET 1711
Cdd:PRK09510 88 QAEELQQKQ---AAEQERLKQLEKERL----AAQEQKKQAEEAAKQAALKQKQAEEAAAKAaaaaKAKAEAEAKRAAAAA 160
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1712 EQGEQQRQLLEEELArlQREAAAATQKRRELEA-----ELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfr 1786
Cdd:PRK09510 161 KKAAAEAKKKAEAEA--AKKAAAEAKKKAEAEAaakaaAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEA---- 234
|
170 180
....*....|....*....|....*.
gi 1920237968 1787 ELAEEAARLRALAEEAKRQRQLAEED 1812
Cdd:PRK09510 235 KAAAEKAAAAKAAEKAAAAKAAAEVD 260
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1506-1775 |
4.98e-05 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 48.99 E-value: 4.98e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1506 LALRVQAEAEAAREKQRALQALEElRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTL 1585
Cdd:COG4942 11 LALAAAAQADAAAEAEAELEQLQQ-EIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELE 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1586 KEEHVAVVQLREeatrraqqqaeaeRARAEAERELERWQLKANEALRLRLQAEEVAQqksltqaeaekqKEEAEREARRR 1665
Cdd:COG4942 90 KEIAELRAELEA-------------QKEELAELLRALYRLGRQPPLALLLSPEDFLD------------AVRRLQYLKYL 144
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1666 GKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAE 1745
Cdd:COG4942 145 APARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEE 224
|
250 260 270
....*....|....*....|....*....|
gi 1920237968 1746 LAKVRAEMEVLLASKARAEEESRSTSEKSK 1775
Cdd:COG4942 225 LEALIARLEAEAAAAAERTPAAGFAALKGK 254
|
|
| TolA |
COG3064 |
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis]; |
1340-1737 |
4.99e-05 |
|
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442298 [Multi-domain] Cd Length: 485 Bit Score: 49.27 E-value: 4.99e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1340 AEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEElqhlrqsse 1419
Cdd:COG3064 4 ALEEKAAEAAAQERLEQAEAEKRAAAEAEQKAKEEAEEERLAELEAKRQAEEEAREAKAEAEQRAAELAAE--------- 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1420 aeiqaKARQVEAAERSRLRIEEEIRvvrlqlEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRK 1499
Cdd:COG3064 75 -----AAKKLAEAEKAAAEAEKKAA------AEKAKAAKEAEAAAAAEKAAAAAEKEKAEEAKRKAEEEAKRKAEEERKA 143
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1500 RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTA 1579
Cdd:COG3064 144 AEAEAAAKAEAEAARAAAAAAAAAAAAAARAAAGAAAALVAAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAA 223
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1580 QLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAE 1659
Cdd:COG3064 224 RAAAASREAALAAVEATEEAALGGAEEAADLAAVGVLGAALAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAAL 303
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237968 1660 REARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQ 1737
Cdd:COG3064 304 AAELLGAVAAEEAVLAAAAAAGALVVRGGGAASLEAALSLLAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLA 381
|
|
| PLEC |
smart00250 |
Plectin repeat; |
4125-4153 |
5.21e-05 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 42.85 E-value: 5.21e-05
10 20
....*....|....*....|....*....
gi 1920237968 4125 VRKRRVVIVDPETGKEMSVYEAYRKGLID 4153
Cdd:smart00250 6 AQSAIGGIIDPETGQKLSVEEALRRGLID 34
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
2146-2596 |
5.25e-05 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 49.84 E-value: 5.25e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2146 LRQKQAADAEMEKHKQFA--EQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQ-RGQVEEELFSL 2222
Cdd:pfam12128 227 IRDIQAIAGIMKIRPEFTklQQEFNTLESAELRLSHLHFGYKSDETLIASRQEERQETSAELNQLLRTlDDQWKEKRDEL 306
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2223 R----VQMEELGKLKARIEA-ENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAA---------RLRQLAEE 2288
Cdd:pfam12128 307 NgelsAADAAVAKDRSELEAlEDQHGAFLDADIETAAADQEQLPSWQSELENLEERLKALTGKhqdvtakynRRRSKIKE 386
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2289 DLA------QQRALAEKMLKEKMQAVQEATRLKAEAEL---LQQQKELAQEQARRLQEDKEQMAQQLAQETQGfQKTLET 2359
Cdd:pfam12128 387 QNNrdiagiKDKLAKIREARDRQLAVAEDDLQALESELreqLEAGKLEFNEEEYRLKSRLGELKLRLNQATAT-PELLLQ 465
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2360 ERQRQLEMSAEAERLRLRVAEMSRAQaraeEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQ-RQQSDRDAERLR 2438
Cdd:pfam12128 466 LENFDERIERAREEQEAANAEVERLQ----SELRQARKRRDQASEALRQASRRLEERQSALDELELQlFPQAGTLLHFLR 541
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2439 EAIAELEHEKDKLKQEAQL----LQLKSEEMQTVRQEQLLQETQALQQSflsEKDSLLQRERCIEQEKAKLEQLFQDEVA 2514
Cdd:pfam12128 542 KEAPDWEQSIGKVISPELLhrtdLDPEVWDGSVGGELNLYGVKLDLKRI---DVPEWAASEEELRERLDKAEEALQSARE 618
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2515 KAQALreeqqrqqqqmQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLA----QQQQQQEKLLAEENQRLRERLQHL 2590
Cdd:pfam12128 619 KQAAA-----------EEQLVQANGELEKASREETFARTALKNARLDLRRLFdekqSEKDKKNKALAERKDSANERLNSL 687
|
....*.
gi 1920237968 2591 EEERRA 2596
Cdd:pfam12128 688 EAQLKQ 693
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
1676-2388 |
5.30e-05 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 49.73 E-value: 5.30e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1676 RELAEQELEKQRQLAEGTAQQRLAAE---QELIRLRAETEQGEQQRQLLEEELARLQ-REAAAATQK-RRELEAELA--- 1747
Cdd:pfam15921 158 KCLKEDMLEDSNTQIEQLRKMMLSHEgvlQEIRSILVDFEEASGKKIYEHDSMSTMHfRSLGSAISKiLRELDTEISylk 237
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1748 ----KVRAEMEVLLA-SKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAER 1822
Cdd:pfam15921 238 grifPVEDQLEALKSeSQNKIELLLQQHQDRIEQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQARNQNSMYMR 317
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1823 VLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQL-----RKASESELER 1897
Cdd:pfam15921 318 QLSDLESTVSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTERDQFSQESGNLDDQLQKLladlhKREKELSLEK 397
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1898 QK-----------GLVEDTLRQR---RQVEEEIL-ALKGSFEKAAAGkaELELELGRIRGtaedtlrSKEQAEQEAARQR 1962
Cdd:pfam15921 398 EQnkrlwdrdtgnSITIDHLRRElddRNMEVQRLeALLKAMKSECQG--QMERQMAAIQG-------KNESLEKVSSLTA 468
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1963 QLAAEEERRRREAEERVQKSLAAE--EEAARQRKAALEEVER-LKAKVEEARRLRERAEQesarQLQLAQEaaqkrLQAE 2039
Cdd:pfam15921 469 QLESTKEMLRKVVEELTAKKMTLEssERTVSDLTASLQEKERaIEATNAEITKLRSRVDL----KLQELQH-----LKNE 539
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2040 EKaHAFAVQQKEQELQQTLQQEQSVLERLRseaeaarraaeeaeaareraereaaqsrRQVEEAERLkqsaeeqaqaqaq 2119
Cdd:pfam15921 540 GD-HLRNVQTECEALKLQMAEKDKVIEILR----------------------------QQIENMTQL------------- 577
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2120 aqaaaeklrkeaeqeaarraqaeqaalrqkqaadaeMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQ 2199
Cdd:pfam15921 578 ------------------------------------VGQHGRTAGAMQVEKAQLEKEINDRRLELQEFKILKDKKDAKIR 621
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2200 RLKAEVTEAARQRGQV----EEELFSLRVQMEELGKLKARIEAENRAL--VLRDKDSAQRLLQEEAEKMKQVAEE-AARL 2272
Cdd:pfam15921 622 ELEARVSDLELEKVKLvnagSERLRAVKDIKQERDQLLNEVKTSRNELnsLSEDYEVLKRNFRNKSEEMETTTNKlKMQL 701
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2273 SVAAQEAARLR---QLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAqe 2349
Cdd:pfam15921 702 KSAQSELEQTRntlKSMEGSDGHAMKVAMGMQKQITAKRGQIDALQSKIQFLEEAMTNANKEKHFLKEEKNKLSQELS-- 779
|
730 740 750 760
....*....|....*....|....*....|....*....|...
gi 1920237968 2350 tqgfqkTLETERQR---QLE-MSAEAERLRLRVAEMSRAQARA 2388
Cdd:pfam15921 780 ------TVATEKNKmagELEvLRSQERRLKEKVANMEVALDKA 816
|
|
| PspA |
COG1842 |
Phage shock protein A [Transcription, Signal transduction mechanisms]; |
1326-1531 |
5.38e-05 |
|
Phage shock protein A [Transcription, Signal transduction mechanisms];
Pssm-ID: 441447 [Multi-domain] Cd Length: 217 Bit Score: 47.51 E-value: 5.38e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1326 ISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVAR-REEVAVEAQEQK 1404
Cdd:COG1842 14 INALLDKAEDPEKMLDQAIRDMEEDLVEARQALAQVIANQKRLERQLEELEAEAEKWEEKARLALEKgREDLAREALERK 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1405 RSIQEELQHLRQsseaeiqakarQVEAAERSRLRIEEEIRVVRLQLEATERQRggaegelQALRARAEEAEAQKR----- 1479
Cdd:COG1842 94 AELEAQAEALEA-----------QLAQLEEQVEKLKEALRQLESKLEELKAKK-------DTLKARAKAAKAQEKvneal 155
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1480 ------QAQEEAERLRRQVqDETQRKRQAEAELALR--VQAEAEAAREKQRALQALEELR 1531
Cdd:COG1842 156 sgidsdDATSALERMEEKI-EEMEARAEAAAELAAGdsLDDELAELEADSEVEDELAALK 214
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1417-1630 |
5.80e-05 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 49.06 E-value: 5.80e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1417 SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQD-- 1494
Cdd:COG3883 13 FADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGEra 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1495 -ETQRKRQAEAELAL------------RVQAEAEAAREKQRALQALEELRLQAEEAerrlrQAEAERARQVQVALETAQR 1561
Cdd:COG3883 93 rALYRSGGSVSYLDVllgsesfsdfldRLSALSKIADADADLLEELKADKAELEAK-----KAELEAKLAELEALKAELE 167
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 1562 SAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEA 1630
Cdd:COG3883 168 AAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAA 236
|
|
| Crescentin |
pfam19220 |
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament ... |
1297-1599 |
5.90e-05 |
|
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament proteins, named crescentin, whose cytoskeletal function is required for the vibrioid and helical shapes of Caulobacter crescentus. Without crescentin, the cells adopt a straight-rod morphology. Crescentin has characteriztic features of IF proteins including the ability to assemble into filaments in vitro without energy or cofactor requirements. In vivo, crescentin forms a helical structure that colocalizes with the inner cell curvatures beneath the cytoplasmic membrane.
Pssm-ID: 437057 [Multi-domain] Cd Length: 401 Bit Score: 48.91 E-value: 5.90e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1297 GSESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAaLEKQ--------RQLAEAH 1368
Cdd:pfam19220 63 AYGKLRRELAGLTRRLSAAEGELEELVARLAKLEAALREAEAAKEELRIELRDKTAQAEA-LERQlaaeteqnRALEEEN 141
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1369 AQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRsiqeeLQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRL 1448
Cdd:pfam19220 142 KALREEAQAAEKALQRAEGELATARERLALLEQENRR-----LQALSEEQAAELAELTRRLAELETQLDATRARLRALEG 216
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1449 QL--EATERQRGGAEGELQALRARAEEAEaqkrqaqeeaerLRRQVQDETQRKRQAE---AELALRVQAEAEAAREKQRa 1523
Cdd:pfam19220 217 QLaaEQAERERAEAQLEEAVEAHRAERAS------------LRMKLEALTARAAATEqllAEARNQLRDRDEAIRAAER- 283
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237968 1524 lqALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSE--HASFAEKTAQLERTlkEEHVAVVQLREEA 1599
Cdd:pfam19220 284 --RLKEASIERDTLERRLAGLEADLERRTQQFQEMQRARAELEERAEmlTKALAAKDAALERA--EERIASLSDRIAE 357
|
|
| HCR |
pfam07111 |
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ... |
1393-1553 |
5.94e-05 |
|
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.
Pssm-ID: 284517 [Multi-domain] Cd Length: 749 Bit Score: 49.36 E-value: 5.94e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1393 REEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAE 1472
Cdd:pfam07111 490 RNRLDAELQLSAHLIQQEVGRAREQGEAERQQLSEVAQQLEQELQRAQESLASVGQQLEVARQGQQESTEEAASLRQELT 569
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1473 EAEAQKRQAQEEA-----ERLRRQVQDETQ-----RKRQAEAELALRvQAEAEAAREKQRAlqalEELRLQAEEAerrlR 1542
Cdd:pfam07111 570 QQQEIYGQALQEKvaeveTRLREQLSDTKRrlneaRREQAKAVVSLR-QIQHRATQEKERN----QELRRLQDEA----R 640
|
170
....*....|..
gi 1920237968 1543 QAEAER-ARQVQ 1553
Cdd:pfam07111 641 KEEGQRlARRVQ 652
|
|
| CH_jitterbug-like_rpt3 |
cd21185 |
third calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and ... |
186-263 |
6.07e-05 |
|
third calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and similar proteins; Protein jitterbug (Jbug) is an actin-meshwork organizing protein. It is required to maintain the shape and cell orientation of the Drosophila notum epithelium during flight muscle attachment to tendon cells. Jbug contains three copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409034 Cd Length: 98 Bit Score: 44.60 E-value: 6.07e-05
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237968 186 DNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLLDPEDVDVPQPDEKSIITYVSSL 263
Cdd:cd21185 20 NNFTTDWNDGRLLCGLVNALGGSVPGWPNLDPEESENNIQRGLEAGKS-LGVEPVLTAEEMADPEVEHLGIMAYAAQL 96
|
|
| COG3899 |
COG3899 |
Predicted ATPase [General function prediction only]; |
1690-2186 |
6.19e-05 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443106 [Multi-domain] Cd Length: 1244 Bit Score: 49.47 E-value: 6.19e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1690 AEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV----------RAEMEVLLAS 1759
Cdd:COG3899 737 PDPEEEYRLALLLELAEALYLAGRFEEAEALLERALAARALAALAALRHGNPPASARAYAnlgllllgdyEEAYEFGELA 816
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1760 KARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKT 1839
Cdd:COG3899 817 LALAERLGDRRLEARALFNLGFILHWLGPLREALELLREALEAGLETGDAALALLALAAAAAAAAAAAALAAAAAAAARL 896
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1840 EAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILAL 1919
Cdd:COG3899 897 LAAAAAALAAAAAAAALAAAELARLAAAAAAAAALALAAAAAAAAAAALAAAAAAAALAAALALAAAAAAAAAAALAAAA 976
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1920 KGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEE 1999
Cdd:COG3899 977 AAAAAAAAAAAAAALEAAAAALLALLAAAAAAAAAAAALAAALLAAALAALAAAAAAAALLAAAAALALLAALAAAAAAA 1056
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2000 VERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAA 2079
Cdd:COG3899 1057 AAAAALAAAAALLAAAAAAAAAAAAAAAAAALAAALAAAALAAAAAAALALAAALAALALAAALAALALAAAARAAAALL 1136
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2080 EEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKH 2159
Cdd:COG3899 1137 LLAAALALALAALLLLAALLLALALLLLALAALALAAALAALAAALLAAAAAAAAAAALLAALLALAARLAALLALALLA 1216
|
490 500
....*....|....*....|....*..
gi 1920237968 2160 KQFAEQALRQKAQVEQELTALRLQLEE 2186
Cdd:COG3899 1217 LEAAALLLLLLLAALALAAALLALRLL 1243
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1813-2024 |
6.40e-05 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 48.61 E-value: 6.40e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1813 AVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIeARLAQLRKASE 1892
Cdd:COG4942 18 QADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAEL-AELEKEIAELR 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1893 SELERQKGLVEDTL----RQRRQVEEEILALKGSFEKAAAGKA-------ELELELGRIRGTAEDTLRSKEQAEQEAARQ 1961
Cdd:COG4942 97 AELEAQKEELAELLralyRLGRQPPLALLLSPEDFLDAVRRLQylkylapARREQAEELRADLAELAALRAELEAERAEL 176
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237968 1962 RQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQ 2024
Cdd:COG4942 177 EALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAA 239
|
|
| YqiK |
COG2268 |
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown]; |
2258-2393 |
6.59e-05 |
|
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
Pssm-ID: 441869 [Multi-domain] Cd Length: 439 Bit Score: 48.72 E-value: 6.59e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2258 EAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEAtrlKAEAELLQQQKELAQEQARRLQE 2337
Cdd:COG2268 196 EIIRDARIAEAEAERETEIAIAQANREAEEAELEQEREIETARIAEAEAELAKK---KAEERREAETARAEAEAAYEIAE 272
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 2338 DKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDAR 2393
Cdd:COG2268 273 ANAEREVQRQLEIAEREREIELQEKEAEREEAELEADVRKPAEAEKQAAEAEAEAE 328
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3120-3156 |
6.80e-05 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 42.85 E-value: 6.80e-05
10 20 30
....*....|....*....|....*....|....*..
gi 1920237968 3120 LRLLDAQLSTGGIVDPSKSHRVPLDVACARGYLDKET 3156
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
|
|
| CH_NAV2 |
cd21285 |
calponin homology (CH) domain found in neuron navigator 2; Neuron navigator 2 (NAV2), also ... |
41-145 |
7.32e-05 |
|
calponin homology (CH) domain found in neuron navigator 2; Neuron navigator 2 (NAV2), also called helicase APC down-regulated 1 (HELAD1), pore membrane and/or filament-interacting-like protein 2 (POMFIL2), retinoic acid inducible in neuroblastoma 1 (RAINB1), Steerin-2 (STEERIN2), or Unc-53 homolog 2 (unc53H2), possesses 3' to 5' helicase activity and exonuclease activity. It is involved in neuronal development, specifically in the development of different sensory organs. NAV2 contains a single copy of the CH domain at the N-terminus. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409134 Cd Length: 121 Bit Score: 45.34 E-value: 7.32e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 41 DRVQKKTFTKWVNKHLIKhwrAEAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKG--RMRFHKLQNVQIALDYLRHR 118
Cdd:cd21285 8 NGFDKQIYTDWANHYLAK---SGHKRLIKDLQQDVTDGVLLAEIIQVVANEKIEDINGcpKNRSQMIENIDACLSFLAAK 84
|
90 100
....*....|....*....|....*..
gi 1920237968 119 QVKLVNIRNDDIADGNPKLTLGLIWTI 145
Cdd:cd21285 85 GINIQGLSAEEIRNGNLKAILGLFFSL 111
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
2165-2392 |
7.33e-05 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 48.67 E-value: 7.33e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2165 QALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKL--KARIEAENRA 2242
Cdd:COG3883 13 FADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEieERREELGERA 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2243 LVLRDKDSAQRLLQE--EAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAEL 2320
Cdd:COG3883 93 RALYRSGGSVSYLDVllGSESFSDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAE 172
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 2321 LQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDA 2392
Cdd:COG3883 173 LEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 244
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1811-2289 |
7.41e-05 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 49.00 E-value: 7.41e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1811 EDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLleEQAAQHKADIEARLAQLRKA 1890
Cdd:COG4717 77 EEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQEL--EALEAELAELPERLEELEER 154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1891 seselerqkglvedtLRQRRQVEEEILALKgsfEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEER 1970
Cdd:COG4717 155 ---------------LEELRELEEELEELE---AELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELE 216
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1971 RRREAEervqKSLAAEEEAARQRKAALEEVERLKakveearRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQK 2050
Cdd:COG4717 217 EAQEEL----EELEEELEQLENELEAAALEERLK-------EARLLLLIAAALLALLGLGGSLLSLILTIAGVLFLVLGL 285
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2051 EQELQQTLQQEQSVLERLRseaeaarraaeeaEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKE 2130
Cdd:COG4717 286 LALLFLLLAREKASLGKEA-------------EELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQEL 352
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2131 AEQEAARRAQAEQAALRQKQAADaeMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQ-KSILDEELQRLKAEVTEAA 2209
Cdd:COG4717 353 LREAEELEEELQLEELEQEIAAL--LAEAGVEDEEELRAALEQAEEYQELKEELEELEEQlEELLGELEELLEALDEEEL 430
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2210 RQR-GQVEEELFSLRVQMEELGKLKARIEAENRAlvLRDKDSAQRLLQEEAE---KMKQVAEEAARLSVAAQEAARLRQL 2285
Cdd:COG4717 431 EEElEELEEELEELEEELEELREELAELEAELEQ--LEEDGELAELLQELEElkaELRELAEEWAALKLALELLEEAREE 508
|
....
gi 1920237968 2286 AEED 2289
Cdd:COG4717 509 YREE 512
|
|
| Golgin_A5 |
pfam09787 |
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ... |
1461-1595 |
7.72e-05 |
|
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.
Pssm-ID: 462900 [Multi-domain] Cd Length: 305 Bit Score: 48.22 E-value: 7.72e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1461 EGELQALRARAEEAEAQkrqAQEEAERLRRQVQDetqRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERR 1540
Cdd:pfam09787 67 RGQIQQLRTELQELEAQ---QQEEAESSREQLQE---LEEQLATERSARREAEAELERLQEELRYLEEELRRSKATLQSR 140
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1541 LRQAEAERARQ-VQVALETAQRSAEAELQSE-HA---SFAEKTAQLERTLKEEHVAVVQL 1595
Cdd:pfam09787 141 IKDREAEIEKLrNQLTSKSQSSSSQSELENRlHQlteTLIQKQTMLEALSTEKNSLVLQL 200
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1988-2332 |
7.96e-05 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 48.97 E-value: 7.96e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1988 EAARQRKAALEEVERLKAKVEEARRLRER----AEQESARQLQLAQEAA----QKRLQAEEKAHAFAVQQKEQELQqtlq 2059
Cdd:pfam17380 286 ERQQQEKFEKMEQERLRQEKEEKAREVERrrklEEAEKARQAEMDRQAAiyaeQERMAMERERELERIRQEERKRE---- 361
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2060 qeqsvLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERlKQSAEEQAQAQAQAQAAAEKLRKEAEQEAArra 2139
Cdd:pfam17380 362 -----LERIRQEEIAMEISRMRELERLQMERQQKNERVRQELEAAR-KVKILEEERQRKIQQQKVEMEQIRAEQEEA--- 432
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2140 qaeqaalRQKQAADAEMEKhkqfaeqalrqkaqvEQELTALRLQLEETDHQKSILDEElqrlkaevtEAARQRGQVEeel 2219
Cdd:pfam17380 433 -------RQREVRRLEEER---------------AREMERVRLEEQERQQQVERLRQQ---------EEERKRKKLE--- 478
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2220 fslrvqMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEK 2299
Cdd:pfam17380 479 ------LEKEKRDRKRAEEQRRKILEKELEERKQAMIEEERKRKLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERR 552
|
330 340 350
....*....|....*....|....*....|....*..
gi 1920237968 2300 MLKEKMQ-AVQEATRLKA---EAELLQQQKELAQEQA 2332
Cdd:pfam17380 553 RIQEQMRkATEERSRLEAmerEREMMRQIVESEKARA 589
|
|
| CH_PLS1_rpt1 |
cd21323 |
first calponin homology (CH) domain found in plastin-1; Plastin-1, also called ... |
44-156 |
8.85e-05 |
|
first calponin homology (CH) domain found in plastin-1; Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. It contains four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409172 Cd Length: 145 Bit Score: 45.80 E-value: 8.85e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 44 QKKTFTKWVNKHL-----IKHWRAEAQRHISdLYEDLRDGHNLISLLEVLSGDSLPR----EKGRMRFHKLQNVQIALDY 114
Cdd:cd21323 25 EKVAFVNWINKALegdpdCKHVVPMNPTDES-LFKSLADGILLCKMINLSQPDTIDErainKKKLTPFTISENLNLALNS 103
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 1920237968 115 LRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQV 156
Cdd:cd21323 104 ASAIGCTVVNIGSLDLKEGKPHLVLGLLWQIIKVGLFADIEI 145
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
1670-2380 |
1.05e-04 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 48.56 E-value: 1.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1670 EQAVRQRELAEQELEKQRQlAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELAR-----LQREAAAATQKRRELEA 1744
Cdd:pfam05483 98 EAELKQKENKLQENRKIIE-AQRKAIQELQFENEKVSLKLEEEIQENKDLIKENNATRhlcnlLKETCARSAEKTKKYEY 176
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1745 ELAKVR---AEMEVLLASKARAEEESRSTSEKSkqRLEAEAgrfrELAEEAARLRALAEEAKRQRQlaeedavrqraEAE 1821
Cdd:pfam05483 177 EREETRqvyMDLNNNIEKMILAFEELRVQAENA--RLEMHF----KLKEDHEKIQHLEEEYKKEIN-----------DKE 239
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1822 RVLAEKLAAISEATRLKTEAEIALKEKEaenERLRRLAEDEAFQRRLLEeQAAQHKADIEARLAQLRKASESELERQKGL 1901
Cdd:pfam05483 240 KQVSLLLIQITEKENKMKDLTFLLEESR---DKANQLEEKTKLQDENLK-ELIEKKDHLTKELEDIKMSLQRSMSTQKAL 315
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1902 VED---TLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEerrrreaeer 1978
Cdd:pfam05483 316 EEDlqiATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITME---------- 385
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1979 VQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQ--ESARQLQLAQEAAQKRLQAEEK-AHAFAVQqkeqeLQ 2055
Cdd:pfam05483 386 LQKKSSELEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQfeKIAEELKGKEQELIFLLQAREKeIHDLEIQ-----LT 460
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2056 QTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLkqsaeeqaqaqaqaqaaaeklrkeaeqea 2135
Cdd:pfam05483 461 AIKTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQEASDM----------------------------- 511
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2136 arraqaeQAALRQKQAadaEMEKHKQFAEQALRQKAQVEQELTALRLQLE--------ETDHQKSILD---EELQRLKAE 2204
Cdd:pfam05483 512 -------TLELKKHQE---DIINCKKQEERMLKQIENLEEKEMNLRDELEsvreefiqKGDEVKCKLDkseENARSIEYE 581
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2205 VTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEeaarLSVAAQEAARLRQ 2284
Cdd:pfam05483 582 VLKKEKQMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNKLELE----LASAKQKFEEIID 657
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2285 LAEEDLAQQRALAEKMLKEKMQA---VQEATRLKAEAELLQQQK--------ELAQEQARRLQEDKEQ---MAQQLAQET 2350
Cdd:pfam05483 658 NYQKEIEDKKISEEKLLEEVEKAkaiADEAVKLQKEIDKRCQHKiaemvalmEKHKHQYDKIIEERDSelgLYKNKEQEQ 737
|
730 740 750 760
....*....|....*....|....*....|....*....|.
gi 1920237968 2351 QGFQKTLETER----------QRQLEMS-AEAERLRLRVAE 2380
Cdd:pfam05483 738 SSAKAALEIELsnikaellslKKQLEIEkEEKEKLKMEAKE 778
|
|
| PksD |
COG3321 |
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ... |
1863-2388 |
1.07e-04 |
|
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];
Pssm-ID: 442550 [Multi-domain] Cd Length: 1386 Bit Score: 48.72 E-value: 1.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1863 AFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEkAAAGKAELELELGRIRG 1942
Cdd:COG3321 867 PFQREDAAAALLAAALAAALAAAAALGALLLAALAAALAAALLALAAAAAAALALAAAALAA-LLALVALAAAAAALLAL 945
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1943 TAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESA 2022
Cdd:COG3321 946 AAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAALALLAAAALLLAAAAAAAALLALA 1025
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2023 RQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEE 2102
Cdd:COG3321 1026 ALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELALAAAALALAAALAAAALALALAALAAA 1105
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2103 AERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRL 2182
Cdd:COG3321 1106 LLLLALLAALALAAAAAALLALAALLAAAAAAAALAAAAAAAAALALAAAAAALAAALAAALLAAAALLLALALALAAAL 1185
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2183 QLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEElgklkARIEAENRALVLRDKDSAQRLLQEEAEKM 2262
Cdd:COG3321 1186 AAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAA-----ALALLALAAAAAAVAALAAAAAALLAALA 1260
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2263 KQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQM 2342
Cdd:COG3321 1261 ALALLAAAAGLAALAAAAAAAAAALALAAAAAAAAAALAALLAAAAAAAAAAAAAAAAAALAAALLAAALAALAAAVAAA 1340
|
490 500 510 520
....*....|....*....|....*....|....*....|....*.
gi 1920237968 2343 AQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARA 2388
Cdd:COG3321 1341 LALAAAAAAAAAAAAAAAAAAALAAAAGAAAAAAALALAALAAAVA 1386
|
|
| PRK05035 |
PRK05035 |
electron transport complex protein RnfC; Provisional |
1500-1784 |
1.07e-04 |
|
electron transport complex protein RnfC; Provisional
Pssm-ID: 235334 [Multi-domain] Cd Length: 695 Bit Score: 48.41 E-value: 1.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1500 RQAEAELALRVQAEAEAAREKQR--ALQAleelRLQAEEAER--RLRQAEAERARQVQVALETAQRSAEAELQSEHASFA 1575
Cdd:PRK05035 432 RQAKAEIRAIEQEKKKAEEAKARfeARQA----RLEREKAAReaRHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIV 507
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1576 EKTAQlertlKEEHVAVVQLREEATRRAQQQAEAERARAEAERelerwQLKANEALRLRLQAEEVAQQKSLTqaeAEKQK 1655
Cdd:PRK05035 508 IKAGA-----RPDNSAVIAAREARKAQARARQAEKQAAAAADP-----KKAAVAAAIARAKAKKAAQQAANA---EAEEE 574
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1656 EEAEREARRRGKAEEQAvrqRELAEQELEKQRQLAEGTAQQRLAAEQELIrLRAETEQGEQQRQLLEEElarlqreaaAA 1735
Cdd:PRK05035 575 VDPKKAAVAAAIARAKA---KKAAQQAASAEPEEQVAEVDPKKAAVAAAI-ARAKAKKAEQQANAEPEE---------PV 641
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 1920237968 1736 TQKRRELEAELAKVRAEmevlLASKARAEEESRSTSEKSKQRLEAEAGR 1784
Cdd:PRK05035 642 DPRKAAVAAAIARAKAR----KAAQQQANAEPEEAEDPKKAAVAAAIAR 686
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
1146-1551 |
1.08e-04 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 48.89 E-value: 1.08e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1146 QEVGERLQQRHGERDVEVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAV 1225
Cdd:TIGR00606 694 QEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETL 773
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1226 planSQAVREQLRQEKALLED---IERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSE--- 1299
Cdd:TIGR00606 774 ----LGTIMPEEESAKVCLTDvtiMERFQMELKDVERKIAQQAAKLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIElnr 849
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1300 SIIQEYVD-LRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQR---------AEERERLAEVEAALEKQRQLAEAHA 1369
Cdd:TIGR00606 850 KLIQDQQEqIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTevqslireiKDAKEQDSPLETFLEKDQQEKEELI 929
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1370 QAKAQAEREAQGLQRRMQEEVarrEEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQ 1449
Cdd:TIGR00606 930 SSKETSNKKAQDKVNDIKEKV---KNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLEECEKHQEKINEDMRLMRQD 1006
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1450 LEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALrvqaeaeAAREKQRALQALEE 1529
Cdd:TIGR00606 1007 IDTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQMQVLQMKQEHQKLEENIDL-------IKRNHVLALGRQKG 1079
|
410 420
....*....|....*....|..
gi 1920237968 1530 LRLQAEEAERRLRQAEAERARQ 1551
Cdd:TIGR00606 1080 YEKEIKHFKKELREPQFRDAEE 1101
|
|
| CH_PLS3_rpt1 |
cd21325 |
first calponin homology (CH) domain found in plastin-3; Plastin-3, also called T-plastin, is ... |
44-157 |
1.15e-04 |
|
first calponin homology (CH) domain found in plastin-3; Plastin-3, also called T-plastin, is an actin-bundling protein found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Plastin- 3 contains four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409174 Cd Length: 148 Bit Score: 45.43 E-value: 1.15e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 44 QKKTFTKWVNKHLIKHWRAeaqRHI-------SDLYEDLRDGHNLISLLEVLSGDSLPR----EKGRMRFHKLQNVQIAL 112
Cdd:cd21325 25 EKYAFVNWINKALENDPDC---RHVipmnpntDDLFKAVGDGIVLCKMINLSVPDTIDErainKKKLTPFIIQENLNLAL 101
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 1920237968 113 DYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVS 157
Cdd:cd21325 102 NSASAIGCHVVNIGAEDLRAGKPHLVLGLLWQIIKIGLFADIELS 146
|
|
| hsdR |
PRK11448 |
type I restriction enzyme EcoKI subunit R; Provisional |
1472-1551 |
1.18e-04 |
|
type I restriction enzyme EcoKI subunit R; Provisional
Pssm-ID: 236912 [Multi-domain] Cd Length: 1123 Bit Score: 48.79 E-value: 1.18e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1472 EEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAE------AEAAREKQRALQA-LEELRLQAEEAERRLRQA 1544
Cdd:PRK11448 138 EDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELvaleglAAELEEKQQELEAqLEQLQEKAAETSQERKQK 217
|
....*..
gi 1920237968 1545 EAERARQ 1551
Cdd:PRK11448 218 RKEITDQ 224
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1474-1599 |
1.20e-04 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 47.88 E-value: 1.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1474 AEAQKRQAQEEAERLRRQVQDETQRKRQAEaELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLR----QAEAERA 1549
Cdd:PRK09510 61 VEQYNRQQQQQKSAKRAEEQRKKKEQQQAE-ELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAAlkqkQAEEAAA 139
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1550 RQVQVA----------LETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEA 1599
Cdd:PRK09510 140 KAAAAAkakaeaeakrAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEA 199
|
|
| PRK07735 |
PRK07735 |
NADH-quinone oxidoreductase subunit C; |
1336-1565 |
1.23e-04 |
|
NADH-quinone oxidoreductase subunit C;
Pssm-ID: 236081 [Multi-domain] Cd Length: 430 Bit Score: 48.05 E-value: 1.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1336 EERLAEQQRAEERERLAEVEAAlEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELqhlr 1415
Cdd:PRK07735 11 KKEAARRAKEEARKRLVAKHGA-EISKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAALAKQKREGTEEV---- 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1416 qsSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDE 1495
Cdd:PRK07735 86 --TEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKE 163
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1496 TQRKRQAEAELALrvqaeaEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEA 1565
Cdd:PRK07735 164 KAKAKAAAAAKAK------AAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGN 227
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1987-2600 |
1.23e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 48.37 E-value: 1.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1987 EEAARQRKAALEEVERLKAKVEEARRlreraeqeSARQLQLAQEAAQKRLQAEEKAhafavqqkeqelqqtlqqeqSVLE 2066
Cdd:COG4913 224 FEAADALVEHFDDLERAHEALEDARE--------QIELLEPIRELAERYAAARERL--------------------AELE 275
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2067 RLRSEAEAARRAAEEAEAARERAEREAAQSRRQvEEAERLKQsaeeqaqaqaqaqaaaeKLRKEAEQEAARRAQAEQAAL 2146
Cdd:COG4913 276 YLRAALRLWFAQRRLELLEAELEELRAELARLE-AELERLEA-----------------RLDALREELDELEAQIRGNGG 337
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2147 RQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETdhqKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQM 2226
Cdd:COG4913 338 DRLEQLEREIERLERELEERERRRARLEALLAALGLPLPAS---AEEFAALRAEAAALLEALEEELEALEEALAEAEAAL 414
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2227 EELGKLKARIEAENRALVLRDKDSAQRLLQeeaekMKQVAEEAARLS-VAAQEAARLRQLAEEDLAQQRAlAEKML---- 2301
Cdd:COG4913 415 RDLRRELRELEAEIASLERRKSNIPARLLA-----LRDALAEALGLDeAELPFVGELIEVRPEEERWRGA-IERVLggfa 488
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2302 ------KEKMQAVQEAT-RLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETE--RQRQLEMSAEAE 2372
Cdd:COG4913 489 ltllvpPEHYAAALRWVnRLHLRGRLVYERVRTGLPDPERPRLDPDSLAGKLDFKPHPFRAWLEAElgRRFDYVCVDSPE 568
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2373 RLRLRVAEMSRA-QARAeeDARRFRKQAEDIGERLYRTELATQEKvmlVQTLETQRQQSDRDAERLREAIAELEHEKDKL 2451
Cdd:COG4913 569 ELRRHPRAITRAgQVKG--NGTRHEKDDRRRIRSRYVLGFDNRAK---LAALEAELAELEEELAEAEERLEALEAELDAL 643
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2452 KQEAQLLQ-LKSEEMQTVRQEQLLQETQALQQsflsekdsllQRERcIEQEKAKLEQLfQDEVAKAQALREEQQRQQQQM 2530
Cdd:COG4913 644 QERREALQrLAEYSWDEIDVASAEREIAELEA----------ELER-LDASSDDLAAL-EEQLEELEAELEELEEELDEL 711
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237968 2531 QQEKQQLAASMEEARRRQHEAEEGVRRqQEELQRLAQQQQQQEKLLAEENQRLRERL-QHLEEERRAALAR 2600
Cdd:COG4913 712 KGEIGRLEKELEQAEEELDELQDRLEA-AEDLARLELRALLEERFAAALGDAVERELrENLEERIDALRAR 781
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
1670-1865 |
1.24e-04 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 48.47 E-value: 1.24e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1670 EQAVRQRELAEQELEKQRQLAEgTAQQRLAA---EQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAEL 1746
Cdd:COG3206 171 EEARKALEFLEEQLPELRKELE-EAEAALEEfrqKNGLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQL 249
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1747 AKVRAEMEVLLASKA--------------RAEEESRSTSEKSK-QRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEE 1811
Cdd:COG3206 250 GSGPDALPELLQSPViqqlraqlaeleaeLAELSARYTPNHPDvIALRAQIAALRAQLQQEAQRILASLEAELEALQARE 329
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 1920237968 1812 DAVRQRAEAERVLAEKLAAIS-EATRLKTEAEIALKEKEAENERLRRLAEDEAFQ 1865
Cdd:COG3206 330 ASLQAQLAQLEARLAELPELEaELRRLEREVEVARELYESLLQRLEEARLAEALT 384
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
2169-2412 |
1.26e-04 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 47.53 E-value: 1.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2169 QKAQVEQELTALRLQleetdhQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDK 2248
Cdd:TIGR02794 44 DPGAVAQQANRIQQQ------KKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEK 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2249 DSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARlrQLAEEDLAQQRALAEKMLKE-KMQAVQEA-----TRLKAEAELLQ 2322
Cdd:TIGR02794 118 QKQAEEAKAKQAAEAKAKAEAEAERKAKEEAAK--QAEEEAKAKAAAEAKKKAEEaKKKAEAEAkakaeAEAKAKAEEAK 195
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2323 QQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQaedI 2402
Cdd:TIGR02794 196 AKAEAAKAKAAAEAAAKAEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEKQGGARGAAAGSEVDKYAAIIQQA---I 272
|
250
....*....|
gi 1920237968 2403 GERLYRTELA 2412
Cdd:TIGR02794 273 QQNLYDDPSF 282
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
2163-2552 |
1.36e-04 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 48.23 E-value: 1.36e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2163 AEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVT--EAARQRGQVEEELFSLRVQMEELGKLKARIEAEN 2240
Cdd:COG4717 76 LEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEklEKLLQLLPLYQELEALEAELAELPERLEELEERL 155
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2241 RALVlRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKekmQAVQEATRLKAEAEL 2320
Cdd:COG4717 156 EELR-ELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELE---EAQEELEELEEELEQ 231
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2321 LQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLET----------------------ERQRQLEMSAEAERLRLRV 2378
Cdd:COG4717 232 LENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSliltiagvlflvlgllallfllLAREKASLGKEAEELQALP 311
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2379 AEMSRAQARAEEDARRFRKQAEDIGER---LYRTELATQEKVMLVQTLETQRQQSDRDAER---LREAIAELEHEKDKLK 2452
Cdd:COG4717 312 ALEELEEEELEELLAALGLPPDLSPEElleLLDRIEELQELLREAEELEEELQLEELEQEIaalLAEAGVEDEEELRAAL 391
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2453 QEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQ 2532
Cdd:COG4717 392 EQAEEYQELKEELEELEEQLEELLGELEELLEALDEEELEEELEELEEELEELEEELEELREELAELEAELEQLEEDGEL 471
|
410 420
....*....|....*....|
gi 1920237968 2533 EKQQLAASMEEARRRQHEAE 2552
Cdd:COG4717 472 AELLQELEELKAELRELAEE 491
|
|
| KpsE |
COG3524 |
Capsule polysaccharide export protein KpsE/RkpR [Cell wall/membrane/envelope biogenesis]; |
2162-2328 |
1.37e-04 |
|
Capsule polysaccharide export protein KpsE/RkpR [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442746 [Multi-domain] Cd Length: 370 Bit Score: 47.54 E-value: 1.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2162 FAEQALrqkAQVEQELTALRLQLEETDHQKSILDEElqrlkAEVTEAARQRGQVEEELFSLRVQMEELgklkarieaenR 2241
Cdd:COG3524 181 FAEEEV---ERAEERLRDAREALLAFRNRNGILDPE-----ATAEALLQLIATLEGQLAELEAELAAL-----------R 241
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2242 AlVLRDKDSAQRLLQEEAEKM-KQVAEEAARLSVAAQEAARLRQLAE-EDLAQQRALAEKMLKEKMQAVQEAtrlKAEAE 2319
Cdd:COG3524 242 S-YLSPNSPQVRQLRRRIAALeKQIAAERARLTGASGGDSLASLLAEyERLELEREFAEKAYTSALAALEQA---RIEAA 317
|
....*....
gi 1920237968 2320 llQQQKELA 2328
Cdd:COG3524 318 --RQQRYLA 324
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
1299-1510 |
1.41e-04 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 48.09 E-value: 1.41e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1299 ESIIQEYVDLRTRYSelSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERL------AEVEAALEKQRQLAEAHAQAK 1372
Cdd:COG3206 155 NALAEAYLEQNLELR--REEARKALEFLEEQLPELRKELEEAEAALEEFRQKNglvdlsEEAKLLLQQLSELESQLAEAR 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1373 AQAeREAQGLQRRMQEEVARREEVAVEA-------------QEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRI 1439
Cdd:COG3206 233 AEL-AEAEARLAALRAQLGSGPDALPELlqspviqqlraqlAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQQEA 311
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 1440 EEEIRVVRLQLEATERQRGGAEGELQALRARAE---EAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRV 1510
Cdd:COG3206 312 QRILASLEAELEALQAREASLQAQLAQLEARLAelpELEAELRRLEREVEVARELYESLLQRLEEARLAEALTV 385
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1701-1851 |
1.42e-04 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 46.46 E-value: 1.42e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1701 EQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRS-TSEKSKQRLE 1779
Cdd:COG1579 16 DSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNvRNNKEYEALQ 95
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 1780 AE----AGRFRELAEEAARLRALAEEAKRQRQLAEEdavrQRAEAERVLAEKLAAISEATRlKTEAEIALKEKEAE 1851
Cdd:COG1579 96 KEieslKRRISDLEDEILELMERIEELEEELAELEA----ELAELEAELEEKKAELDEELA-ELEAELEELEAERE 166
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
1409-1783 |
1.42e-04 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 47.97 E-value: 1.42e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1409 EELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEAT----ERQRGGAEGELQALRARAEEAEAQKRQAQEE 1484
Cdd:pfam07888 30 ELLQNRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWERQrrelESRVAELKEELRQSREKHEELEEKYKELSAS 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1485 AERL---RRQVQDETQRKRQAEAELALRVQAEAEAAREKQralQALEELRLQAEEAERRLRQAEAERaRQVQVALETAQ- 1560
Cdd:pfam07888 110 SEELseeKDALLAQRAAHEARIRELEEDIKTLTQRVLERE---TELERMKERAKKAGAQRKEEEAER-KQLQAKLQQTEe 185
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1561 --RSAEAELQSEHASFAEKTAQLERtLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKAnEALRLRLqaE 1638
Cdd:pfam07888 186 elRSLSKEFQELRNSLAQRDTQVLQ-LQDTITTLTQKLTTAHRKEAENEALLEELRSLQERLNASERKV-EGLGEEL--S 261
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1639 EVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQR 1718
Cdd:pfam07888 262 SMAAQRDRTQAELHQARLQAAQLTLQLADASLALREGRARWAQERETLQQSAEADKDRIEKLSAELQRLEERLQEERMER 341
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237968 1719 QLLEEELArlqREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSE---KSKQRLEAEAG 1783
Cdd:pfam07888 342 EKLEVELG---REKDCNRVQLSESRRELQELKASLRVAQKEKEQLQAEKQELLEyirQLEQRLETVAD 406
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3083-3119 |
1.43e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 41.70 E-value: 1.43e-04
10 20 30
....*....|....*....|....*....|....*..
gi 1920237968 3083 KLLSAEKAVTGYKDPYSGQSVSLFQALKKGLIPREQG 3119
Cdd:smart00250 2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| GBP_C |
cd16269 |
Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal ... |
2256-2354 |
1.45e-04 |
|
Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal domain. Guanylate-binding proteins (GBPs) are synthesized after activation of the cell by interferons. The biochemical properties of GBPs are clearly different from those of Ras-like and heterotrimeric GTP-binding proteins. They bind guanine nucleotides with low affinity (micromolar range), are stable in their absence, and have a high turnover GTPase. In addition to binding GDP/GTP, they have the unique ability to bind GMP with equal affinity and hydrolyze GTP not only to GDP, but also to GMP. This C-terminal domain has been shown to mediate inhibition of endothelial cell proliferation by inflammatory cytokines.
Pssm-ID: 293879 [Multi-domain] Cd Length: 291 Bit Score: 47.19 E-value: 1.45e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2256 QEEAEKMKQVAEEAARLSVAAQEAARL----RQLAEEDLAQQRALAE--KMLKEKMQavQEATRLKAEAELLQQQKElaQ 2329
Cdd:cd16269 191 QALTEKEKEIEAERAKAEAAEQERKLLeeqqRELEQKLEDQERSYEEhlRQLKEKME--EERENLLKEQERALESKL--K 266
|
90 100
....*....|....*....|....*
gi 1920237968 2330 EQARRLQEDKEQMAQQLAQETQGFQ 2354
Cdd:cd16269 267 EQEALLEEGFKEQAELLQEEIRSLK 291
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
2163-2597 |
1.54e-04 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 48.41 E-value: 1.54e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2163 AEQALRQKAQVEQELTALRLQL-------EETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLK-A 2234
Cdd:COG3096 524 LEQRLRQQQNAERLLEEFCQRIgqqldaaEELEELLAELEAQLEELEEQAAEAVEQRSELRQQLEQLRARIKELAARApA 603
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2235 RIEAENRALVLRD------KDSA------QRLLQEE----------AEKMKQVAEEAARLSVAA-QEAARLRQLAE---- 2287
Cdd:COG3096 604 WLAAQDALERLREqsgealADSQevtaamQQLLEREreatverdelAARKQALESQIERLSQPGgAEDPRLLALAErlgg 683
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2288 -------EDLAQQRA-LAEKMLKEKMQA--VQEATRLKAEAE----------LLQQQKELAQEQARRLQEDKEQMAQQLA 2347
Cdd:COG3096 684 vllseiyDDVTLEDApYFSALYGPARHAivVPDLSAVKEQLAgledcpedlyLIEGDPDSFDDSVFDAEELEDAVVVKLS 763
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2348 QETQGFQKTLE------TERQRQLE-MSAEAERLRLRVAEMS---RAQARAEEDARRFrkqaedIGERLYRTELATQEKV 2417
Cdd:COG3096 764 DRQWRYSRFPEvplfgrAAREKRLEeLRAERDELAEQYAKASfdvQKLQRLHQAFSQF------VGGHLAVAFAPDPEAE 837
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2418 MlvQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQsflsEKDSLLQRERC 2497
Cdd:COG3096 838 L--AALRQRRSELERELAQHRAQEQQLRQQLDQLKEQLQLLNKLLPQANLLADETLADRLEELRE----ELDAAQEAQAF 911
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2498 IEQEKAKLEQLfqdeVAKAQALReeqqrqqqQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLA 2577
Cdd:COG3096 912 IQQHGKALAQL----EPLVAVLQ--------SDPEQFEQLQADYLQAKEQQRRLKQQIFALSEVVQRRPHFSYEDAVGLL 979
|
490 500
....*....|....*....|....
gi 1920237968 2578 EENQ----RLRERLQHLEEERRAA 2597
Cdd:COG3096 980 GENSdlneKLRARLEQAEEARREA 1003
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3955-3989 |
1.57e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 41.70 E-value: 1.57e-04
10 20 30
....*....|....*....|....*....|....*
gi 1920237968 3955 LLEAQAATGYVIDPIKGLKLTVEEAVRMGIVGPEF 3989
Cdd:smart00250 3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
2167-2607 |
1.65e-04 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 47.97 E-value: 1.65e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2167 LRQKAQVEQ-ELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIeaenralvl 2245
Cdd:PRK01156 188 LEEKLKSSNlELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRYESEI--------- 258
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2246 RDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQK 2325
Cdd:PRK01156 259 KTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILSNIDAEINKYHAIIKKLSVLQKDY 338
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2326 ELAQEQARRLQEDKEQMAQQLAQET--QGFQKTLETERQRQLEMSAEAERLRlrvAEMSRAQARAEEDARRFRKQAEDIG 2403
Cdd:PRK01156 339 NDYIKKKSRYDDLNNQILELEGYEMdyNSYLKSIESLKKKIEEYSKNIERMS---AFISEILKIQEIDPDAIKKELNEIN 415
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2404 ERL--YRTELAT--QEKVMLVQTLETQRQQSD----------------------------RDAERLREAIAELEHEKDKL 2451
Cdd:PRK01156 416 VKLqdISSKVSSlnQRIRALRENLDELSRNMEmlngqsvcpvcgttlgeeksnhiinhynEKKSRLEEKIREIEIEVKDI 495
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2452 KQEAQLLQ-----LKSEEM-QTVRQEQLLQETQALQQSFLSE----KDSLLQRERCIEQEKA-KLEQLFQDEVAKAQALR 2520
Cdd:PRK01156 496 DEKIVDLKkrkeyLESEEInKSINEYNKIESARADLEDIKIKinelKDKHDKYEEIKNRYKSlKLEDLDSKRTSWLNALA 575
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2521 EEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEG-----------VRRQQEELQRLaqqqqQQEKLLAEENQRLRERLQH 2589
Cdd:PRK01156 576 VISLIDIETNRSRSNEIKKQLNDLESRLQEIEIGfpddksyidksIREIENEANNL-----NNKYNEIQENKILIEKLRG 650
|
490
....*....|....*...
gi 1920237968 2590 LEEERRAALARSEEIAPS 2607
Cdd:PRK01156 651 KIDNYKKQIAEIDSIIPD 668
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
1783-2029 |
1.84e-04 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 47.58 E-value: 1.84e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1783 GRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDE 1862
Cdd:pfam07888 34 NRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWERQRRELESRVAELKEELRQSREKHEELEEKYKELSASSEEL 113
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1863 AFQRRLLEEQAAQHKADIE------ARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELE 1936
Cdd:pfam07888 114 SEEKDALLAQRAAHEARIReleediKTLTQRVLERETELERMKERAKKAGAQRKEEEAERKQLQAKLQQTEEELRSLSKE 193
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1937 LGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAeervqKSLAAEEEAARQRKAALEE-VERLKAKVEEARRLRE 2015
Cdd:pfam07888 194 FQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKEAEN-----EALLEELRSLQERLNASERkVEGLGEELSSMAAQRD 268
|
250
....*....|....*
gi 1920237968 2016 RAEQESAR-QLQLAQ 2029
Cdd:pfam07888 269 RTQAELHQaRLQAAQ 283
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1299-1643 |
2.00e-04 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 47.45 E-value: 2.00e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1299 ESIIQEYVDLRTRYSELST---LTSQYIRFISETLRRMEEEERLAE-QQRAEE-RERLAEVEAALEKQRQLAEAHAQAKA 1373
Cdd:COG4717 98 EELEEELEELEAELEELREeleKLEKLLQLLPLYQELEALEAELAElPERLEElEERLEELRELEEELEELEAELAELQE 177
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1374 QAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQsseaEIQAKARQVEAAERSRLRIEEEIRVVRLQ---- 1449
Cdd:COG4717 178 ELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQE----ELEELEEELEQLENELEAAALEERLKEARllll 253
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1450 --------------LEATERQRGGA----EGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQ 1511
Cdd:COG4717 254 iaaallallglggsLLSLILTIAGVlflvLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPD 333
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1512 AEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQV----------------------QVALETAQRSAEAELQS 1569
Cdd:COG4717 334 LSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAAllaeagvedeeelraaleqaeeYQELKEELEELEEQLEE 413
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 1570 -----EHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKAnEALRLRLQAEEVAQQ 1643
Cdd:COG4717 414 llgelEELLEALDEEELEEELEELEEELEELEEELEELREELAELEAELEQLEEDGELAELLQ-ELEELKAELRELAEE 491
|
|
| PLEC |
smart00250 |
Plectin repeat; |
4264-4297 |
2.17e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 41.31 E-value: 2.17e-04
10 20 30
....*....|....*....|....*....|....
gi 1920237968 4264 EETGPVAGILDTETLEKVSITEAMHRNLVDNITG 4297
Cdd:smart00250 5 EAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| CHASE3 |
COG5278 |
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms]; |
2161-2607 |
2.30e-04 |
|
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
Pssm-ID: 444089 [Multi-domain] Cd Length: 530 Bit Score: 47.21 E-value: 2.30e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2161 QFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAEN 2240
Cdd:COG5278 76 SFLEPYEEARAEIDELLAELRSLTADNPEQQARLDELEALIDQWLAELEQVIALRRAGGLEAALALVRSGEGKALMDEIR 155
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2241 RALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAEL 2320
Cdd:COG5278 156 ARLLLLALALAALLLAAAALLLLLLALAALLALAELLLLALARALAALLLLLLLEAELAAAAALLAAAAALAALAALELL 235
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2321 LQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAE 2400
Cdd:COG5278 236 AALALALALLLAALLLALLAALALAALLAAALLALAALLLALAAAAALAAAAALELAAAEALALAELELELLLAAAAAAA 315
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2401 DIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQAL 2480
Cdd:COG5278 316 AAAAAAAAALAALLALALATALAAAAAALALLAALLAEAAAAAAEEAEAAAEAAAAALAGLAEVEAEGAAEAVELEVLAI 395
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2481 QQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQE 2560
Cdd:COG5278 396 AAAAAAAAAEAAAAAAAAAAASAAEALELAEALAEALALAEEEALALAAASSELAEAGAALALAAAEALAEELAAVAALA 475
|
410 420 430 440
....*....|....*....|....*....|....*....|....*..
gi 1920237968 2561 ELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEEIAPS 2607
Cdd:COG5278 476 ALAAAAAALAEAEAAAALAAAAALSLALALAALLLAAAEAALAAALA 522
|
|
| CH_PLS1_rpt3 |
cd21329 |
third calponin homology (CH) domain found in plastin-1; Plastin-1, also called ... |
39-152 |
2.33e-04 |
|
third calponin homology (CH) domain found in plastin-1; Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. It contains four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409178 Cd Length: 118 Bit Score: 43.82 E-value: 2.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 39 ERDRVQKKTFTKWVNKHLIKHWraeaqrhISDLYEDLRDGHNLISLLEV---------LSGDSLPREKGRMRfhKLQNVQ 109
Cdd:cd21329 2 EGESSEERTFRNWMNSLGVNPY-------VNHLYSDLCDALVIFQLYEMtrvpvdwghVNKPPYPALGGNMK--KIENCN 72
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 1920237968 110 IALDYLRHR-QVKLVNIRNDDIADGNPKLTLGLIWTIILHFQIS 152
Cdd:cd21329 73 YAVELGKNKaKFSLVGIAGSDLNEGNKTLTLALIWQLMRRYTLN 116
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
2366-2600 |
2.33e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 47.60 E-value: 2.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2366 EMSAEAERLRLRVAEMSRAQARAEeDARRFRKQAEDIgERLYRTELATQEKVMLVQTLETQRQ--QSDRDAERLREAIAE 2443
Cdd:COG4913 222 DTFEAADALVEHFDDLERAHEALE-DAREQIELLEPI-RELAERYAAARERLAELEYLRAALRlwFAQRRLELLEAELEE 299
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2444 LEHEKDKLKQEAQLLQLKSEEMQtvrqEQLLQETQALQQSFLSEKDSLlqrERCIEQEKAKLEQLFQdevaKAQALREEQ 2523
Cdd:COG4913 300 LRAELARLEAELERLEARLDALR----EELDELEAQIRGNGGDRLEQL---EREIERLERELEERER----RRARLEALL 368
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237968 2524 QRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQqqqekllaeenQRLRERLQHLEEERRAALAR 2600
Cdd:COG4913 369 AALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAAL-----------RDLRRELRELEAEIASLERR 434
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1462-1684 |
2.42e-04 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 47.11 E-value: 2.42e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1462 GELQALRARAEEAEAQ-KRQAQEEAERLRRQVQDETQRKRQAEAElalRVQAEAEAAREKQRALQALEELRlQAEEAERR 1540
Cdd:PRK09510 65 NRQQQQQKSAKRAEEQrKKKEQQQAEELQQKQAAEQERLKQLEKE---RLAAQEQKKQAEEAAKQAALKQK-QAEEAAAK 140
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1541 LRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEAtrraqqqaeaeraraeaerel 1620
Cdd:PRK09510 141 AAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEA--------------------- 199
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 1621 erwQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELE 1684
Cdd:PRK09510 200 ---KKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVD 260
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
938-1599 |
2.44e-04 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 47.66 E-value: 2.44e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 938 EDRLQAEREYGSCSRHYQQLLQSLEQGEQEESRCQRCISELKDIRLQLEACETRTVHRLRLPLDKEPARECAQRITEQQK 1017
Cdd:pfam02463 293 KEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLE 372
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1018 AQAEVDGLGKGVARLSAEAEKVLALPEPSPAAPTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAH 1097
Cdd:pfam02463 373 EELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEE 452
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1098 EEQLKEAQAVPATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGAQE-------VGERLQQRHGERDVEVERWRERV 1170
Cdd:pfam02463 453 LEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKEskarsglKVLLALIKDGVGGRIISAHGRLG 532
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1171 TLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAV--------PLANSQAVREQLRQEKA 1242
Cdd:pfam02463 533 DLGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLklplksiaVLEIDPILNLAQLDKAT 612
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1243 LLEDIERHGEKV----EECQRFAKQYINAiKDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTL 1318
Cdd:pfam02463 613 LEADEDDKRAKVvegiLKDTELTKLKESA-KAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAK 691
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1319 TSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGL-QRRMQEEVARREEVA 1397
Cdd:pfam02463 692 EEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSrLKKEEKEEEKSELSL 771
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1398 VEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRvvrLQLEATERQRGGAEGELQALRARAEEAEAQ 1477
Cdd:pfam02463 772 KEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLE---EEQLLIEQEEKIKEEELEELALELKEEQKL 848
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1478 KRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALE 1557
Cdd:pfam02463 849 EKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAE 928
|
650 660 670 680
....*....|....*....|....*....|....*....|..
gi 1920237968 1558 TAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEA 1599
Cdd:pfam02463 929 ILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLLLAKE 970
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
2368-2616 |
2.48e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 46.68 E-value: 2.48e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2368 SAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHE 2447
Cdd:COG4942 19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAE 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2448 KDKLKQEAQLLQLKSEEMQTVRQEQLLqetqaLQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQ 2527
Cdd:COG4942 99 LEAQKEELAELLRALYRLGRQPPLALL-----LSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAER 173
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2528 QqmqqEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEEIAPS 2607
Cdd:COG4942 174 A----ELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAAGFA 249
|
....*....
gi 1920237968 2608 RAAAARALP 2616
Cdd:COG4942 250 ALKGKLPWP 258
|
|
| TPH |
pfam13868 |
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ... |
1667-1860 |
2.64e-04 |
|
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.
Pssm-ID: 464007 [Multi-domain] Cd Length: 341 Bit Score: 46.45 E-value: 2.64e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1667 KAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAEL 1746
Cdd:pfam13868 149 EEREEDERILEYLKEKAEREEEREAEREEIEEEKEREIARLRAQQEKAQDEKAERDELRAKLYQEEQERKERQKEREEAE 228
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1747 AKVRAEMEVLLASKARAEEESRsTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAE 1826
Cdd:pfam13868 229 KKARQRQELQQAREEQIELKER-RLAEEAEREEEEFERMLRKQAEDEEIEQEEAEKRRMKRLEHRRELEKQIEEREEQRA 307
|
170 180 190
....*....|....*....|....*....|....
gi 1920237968 1827 KLAAISEATRLKTEAEIALKEKEAENERLRRLAE 1860
Cdd:pfam13868 308 AEREEELEEGERLREEEAERRERIEEERQKKLKE 341
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
1361-1540 |
2.83e-04 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 47.08 E-value: 2.83e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1361 QRQLAEAHAQAK---AQAEREAQglqrrmqeevARREEVAVEAqeqkrsiQEELQHLRQSSEAEIQAKARQVEAAERsRL 1437
Cdd:PRK12704 30 EAKIKEAEEEAKrilEEAKKEAE----------AIKKEALLEA-------KEEIHKLRNEFEKELRERRNELQKLEK-RL 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1438 RIEEEIrvVRLQLEATERQRGGAEGELQALRARAEEAEAQkrqaQEEAERLRRQVQDETQR-----KRQAEAELALRVqa 1512
Cdd:PRK12704 92 LQKEEN--LDRKLELLEKREEELEKKEKELEQKQQELEKK----EEELEELIEEQLQELERisgltAEEAKEILLEKV-- 163
|
170 180
....*....|....*....|....*....
gi 1920237968 1513 EAEAAREKQRALQALEElrlQA-EEAERR 1540
Cdd:PRK12704 164 EEEARHEAAVLIKEIEE---EAkEEADKK 189
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1758-1955 |
3.01e-04 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 46.72 E-value: 3.01e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1758 ASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQrqlaEEDAVRQRAEAERVLAEKLAAISEATRL 1837
Cdd:PRK09510 72 KSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQ----AEEAAKQAALKQKQAEEAAAKAAAAAKA 147
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1838 KTEAEIALKEKEAENerlrrlAEDEAfQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEIL 1917
Cdd:PRK09510 148 KAEAEAKRAAAAAKK------AAAEA-KKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAA 220
|
170 180 190
....*....|....*....|....*....|....*...
gi 1920237968 1918 ALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAE 1955
Cdd:PRK09510 221 AEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAE 258
|
|
| MARTX_Nterm |
NF012221 |
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model ... |
1667-1879 |
3.02e-04 |
|
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model describes the N-terminal 1900 amino acids of MARTX family multifunctional-autoprocessing repeats-in-toxin holotoxins, which contain both repeat regions that facilitate their entry into eukaryotic target cells, and multiple effector domains.
Pssm-ID: 467957 [Multi-domain] Cd Length: 1848 Bit Score: 47.52 E-value: 3.02e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1667 KAEEQAVRQRELAEQ-----ELEKQRQLAEGTAQQrlaAEQELIRLRAETEQGEQQRQLLEEElarlqreaaaatqkRRE 1741
Cdd:NF012221 1555 DAAQNALADKERAEAdrqrlEQEKQQQLAAISGSQ---SQLESTDQNALETNGQAQRDAILEE--------------SRA 1617
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1742 LEAELAKVRAEMEVLLAS-------------------KARAEEESRSTSEKSKQRLEAEAGRF----RELAEEAARLRAL 1798
Cdd:NF012221 1618 VTKELTTLAQGLDALDSQatyagesgdqwrnpfagglLDRVQEQLDDAKKISGKQLADAKQRHvdnqQKVKDAVAKSEAG 1697
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1799 AEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRlLEEQAAQHKA 1878
Cdd:NF012221 1698 VAQGEQNQANAEQDIDDAKADAEKRKDDALAKQNEAQQAESDANAAANDAQSRGEQDASAAENKANQAQ-ADAKGAKQDE 1776
|
.
gi 1920237968 1879 D 1879
Cdd:NF012221 1777 S 1777
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
2146-2320 |
3.03e-04 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 46.72 E-value: 3.03e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2146 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEE---LFSL 2222
Cdd:PRK09510 80 QRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEakrAAAA 159
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2223 RVQMEELGKLKARIEAENRALVLRDK----DSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAE 2298
Cdd:PRK09510 160 AKKAAAEAKKKAEAEAAKKAAAEAKKkaeaEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAE 239
|
170 180
....*....|....*....|..
gi 1920237968 2299 KmlKEKMQAVQEATRLKAEAEL 2320
Cdd:PRK09510 240 K--AAAAKAAEKAAAAKAAAEV 259
|
|
| Crescentin |
pfam19220 |
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament ... |
1670-1917 |
3.10e-04 |
|
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament proteins, named crescentin, whose cytoskeletal function is required for the vibrioid and helical shapes of Caulobacter crescentus. Without crescentin, the cells adopt a straight-rod morphology. Crescentin has characteriztic features of IF proteins including the ability to assemble into filaments in vitro without energy or cofactor requirements. In vivo, crescentin forms a helical structure that colocalizes with the inner cell curvatures beneath the cytoplasmic membrane.
Pssm-ID: 437057 [Multi-domain] Cd Length: 401 Bit Score: 46.60 E-value: 3.10e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1670 EQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRaeteqgeQQRQLLEEELARLQR-------EAAAATQKRREL 1742
Cdd:pfam19220 128 AAETEQNRALEEENKALREEAQAAEKALQRAEGELATAR-------ERLALLEQENRRLQAlseeqaaELAELTRRLAEL 200
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1743 EAELAKVRAEMEVLLASKAraeeESRSTSEKSKQRLEAEAGRFR-ELAEEAARLRALAEEAKRQRQLAEE--DAVRQRAE 1819
Cdd:pfam19220 201 ETQLDATRARLRALEGQLA----AEQAERERAEAQLEEAVEAHRaERASLRMKLEALTARAAATEQLLAEarNQLRDRDE 276
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1820 AERVLAEKLaaiSEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQK 1899
Cdd:pfam19220 277 AIRAAERRL---KEASIERDTLERRLAGLEADLERRTQQFQEMQRARAELEERAEMLTKALAAKDAALERAEERIASLSD 353
|
250
....*....|....*...
gi 1920237968 1900 GLveDTLRQRRQVEEEIL 1917
Cdd:pfam19220 354 RI--AELTKRFEVERAAL 369
|
|
| PRK10246 |
PRK10246 |
exonuclease subunit SbcC; Provisional |
1674-2390 |
3.46e-04 |
|
exonuclease subunit SbcC; Provisional
Pssm-ID: 182330 [Multi-domain] Cd Length: 1047 Bit Score: 47.10 E-value: 3.46e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1674 RQRELAEQEleKQRQLAEGTAQQRLAAEQ-ELIRLRAeTEQGEQQRQLLEeelaRLQREAAAATQKRR---ELEAELAKV 1749
Cdd:PRK10246 251 RLDELQQEA--SRRQQALQQALAAEEKAQpQLAALSL-AQPARQLRPHWE----RIQEQSAALAHTRQqieEVNTRLQST 323
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1750 RAEMEVLLASKARAEEESRSTSEKSKQRLeAEAGRFRELAEEAARLRALAEEAKRQRQlaEEDAVRQRAEAERvlaEKLA 1829
Cdd:PRK10246 324 MALRARIRHHAAKQSAELQAQQQSLNTWL-AEHDRFRQWNNELAGWRAQFSQQTSDRE--QLRQWQQQLTHAE---QKLN 397
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1830 AISEATRLKTEAEIAlkekeaenERLRRLAEDEAFQRRLLEEQaAQHkADIEARLAQLrKASESELERQKGLVEDTLRQR 1909
Cdd:PRK10246 398 ALPAITLTLTADEVA--------AALAQHAEQRPLRQRLVALH-GQI-VPQQKRLAQL-QVAIQNVTQEQTQRNAALNEM 466
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1910 RQVEEEilalkgsfekaaagKAElelELGRIRGTAEDTLRSKEQAEQEAarQRQLAAEEERRRREAEERVQKSLAAEEEA 1989
Cdd:PRK10246 467 RQRYKE--------------KTQ---QLADVKTICEQEARIKDLEAQRA--QLQAGQPCPLCGSTSHPAVEAYQALEPGV 527
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1990 ARQRKAALE-EVERLKakvEEARRLRERAEQeSARQLQLAQEAAQkRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERL 2068
Cdd:PRK10246 528 NQSRLDALEkEVKKLG---EEGAALRGQLDA-LTKQLQRDESEAQ-SLRQEEQALTQQWQAVCASLNITLQPQDDIQPWL 602
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2069 rseaeaarraaeeaeaareraereaaqsrrqvEEAERLKQsaeeqaqaqaqaqaaaeklrkeaeqeaarraqaEQAALRQ 2148
Cdd:PRK10246 603 --------------------------------DAQEEHER---------------------------------QLRLLSQ 617
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2149 KQAADAEMEKH----KQFAEQALRQKAQVEQELTALRLQLEETDHQKSIL---DEELQRLKAEVTEAARQRGQVE--EEL 2219
Cdd:PRK10246 618 RHELQGQIAAHnqqiIQYQQQIEQRQQQLLTALAGYALTLPQEDEEASWLatrQQEAQSWQQRQNELTALQNRIQqlTPL 697
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2220 FSLRVQMEELGKLKARIEAEN------RALVLRDKDSA--QRLLQEEAEKMKQVAEEAARL--SVAAQEAARLRQLAEED 2289
Cdd:PRK10246 698 LETLPQSDDLPHSEETVALDNwrqvheQCLSLHSQLQTlqQQDVLEAQRLQKAQAQFDTALqaSVFDDQQAFLAALLDEE 777
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2290 LAQQralAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQK--TLETERQRQLEM 2367
Cdd:PRK10246 778 TLTQ---LEQLKQNLENQRQQAQTLVTQTAQALAQHQQHRPDGLDLTVTVEQIQQELAQLAQQLREntTRQGEIRQQLKQ 854
|
730 740
....*....|....*....|....
gi 1920237968 2368 SAEA-ERLRLRVAEMSRAQARAEE 2390
Cdd:PRK10246 855 DADNrQQQQALMQQIAQATQQVED 878
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1421-1588 |
3.60e-04 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 45.30 E-value: 3.60e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1421 EIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQvQDETQRKR 1500
Cdd:COG1579 11 DLQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQ-LGNVRNNK 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1501 QAEAelalrVQAEAEAAREKQRAL-QALEELRLQAEEAERRLRQAEAERARQVQ--VALETAQRSAEAELQSEHASFAEK 1577
Cdd:COG1579 90 EYEA-----LQKEIESLKRRISDLeDEILELMERIEELEEELAELEAELAELEAelEEKKAELDEELAELEAELEELEAE 164
|
170
....*....|.
gi 1920237968 1578 TAQLERTLKEE 1588
Cdd:COG1579 165 REELAAKIPPE 175
|
|
| MAP7 |
pfam05672 |
MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is ... |
1359-1490 |
3.61e-04 |
|
MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is presumably controlled by tissue-specific microtubule-associated proteins (MAPs). The 115-kDa epithelial MAP (E-MAP-115/MAP7) has been identified as a microtubule-stabilising protein predominantly expressed in cell lines of epithelial origin. The binding of this microtubule associated protein is nucleotide independent.
Pssm-ID: 461709 [Multi-domain] Cd Length: 153 Bit Score: 43.88 E-value: 3.61e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1359 EKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHL--RQSSEAEIQAKARQVEAAERSR 1436
Cdd:pfam05672 11 EAARILAEKRRQAREQREREEQERLEKEEEERLRKEELRRRAEEERARREEEARRLeeERRREEEERQRKAEEEAEEREQ 90
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 1437 LRIEEEIRVVRLQLEATERQRGGAEGELQalraraeeaEAQKRQAQEEAERLRR 1490
Cdd:pfam05672 91 REQEEQERLQKQKEEAEAKAREEAERQRQ---------EREKIMQQEEQERLER 135
|
|
| COG2433 |
COG2433 |
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only]; |
1666-1847 |
3.69e-04 |
|
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain] Cd Length: 644 Bit Score: 46.77 E-value: 3.69e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1666 GKAEEQAVRQRELAEQELEKQRQLAEGTAQQR-LAAEQELIRLRAEteqgeqQRQLLEEELARLQREAAAATQKRRELEA 1744
Cdd:COG2433 375 GLSIEEALEELIEKELPEEEPEAEREKEHEEReLTEEEEEIRRLEE------QVERLEAEVEELEAELEEKDERIERLER 448
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1745 ELAKVRAEMEvllaSKARAEEESRstsekskqRLEAEAGRF-RELAEEAARLRALAEEAKRQRQLAEEDavrqrAEAERV 1823
Cdd:COG2433 449 ELSEARSEER----REIRKDREIS--------RLDREIERLeRELEEERERIEELKRKLERLKELWKLE-----HSGELV 511
|
170 180
....*....|....*....|....
gi 1920237968 1824 LAEKLAAISEATRLKTEAEIALKE 1847
Cdd:COG2433 512 PVKVVEKFTKEAIRRLEEEYGLKE 535
|
|
| PTZ00491 |
PTZ00491 |
major vault protein; Provisional |
1667-1838 |
3.72e-04 |
|
major vault protein; Provisional
Pssm-ID: 240439 [Multi-domain] Cd Length: 850 Bit Score: 46.93 E-value: 3.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1667 KAEEQAVRQR-ELAEQElekqrqlaegtAQQRLaaEQELIRLRAETEqgEQQRQLLeeelarlqreaaaatqkrrELEAE 1745
Cdd:PTZ00491 662 KSQEAAARHQaELLEQE-----------ARGRL--ERQKMHDKAKAE--EQRTKLL-------------------ELQAE 707
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1746 LAKVRAemevllASKARAEEESRSTSEKSKQRLEAEAGRFRelaEEAARLRALAE-EAKRQRQLAEEDAVRQRAEAERVL 1824
Cdd:PTZ00491 708 SAAVES------SGQSRAEALAEAEARLIEAEAEVEQAELR---AKALRIEAEAElEKLRKRQELELEYEQAQNELEIAK 778
|
170
....*....|....
gi 1920237968 1825 AEKLAAIsEATRLK 1838
Cdd:PTZ00491 779 AKELADI-EATKFE 791
|
|
| EmrA |
COG1566 |
Multidrug resistance efflux pump EmrA [Defense mechanisms]; |
1399-1534 |
3.76e-04 |
|
Multidrug resistance efflux pump EmrA [Defense mechanisms];
Pssm-ID: 441174 [Multi-domain] Cd Length: 331 Bit Score: 46.19 E-value: 3.76e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1399 EAQEQKRSIQEELQHLRQSS--EAEIQAKARQVEAAERSRLRIEEEI-RVVRLQleateRQRGGAEGELQALRARAEEAE 1475
Cdd:COG1566 87 QAEAQLAAAEAQLARLEAELgaEAEIAAAEAQLAAAQAQLDLAQRELeRYQALY-----KKGAVSQQELDEARAALDAAQ 161
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 1476 AQKRQAQEEAERLRRQVQDETQrKRQAEAELAlrvQAEAEAAREKQRalqaLEELRLQA 1534
Cdd:COG1566 162 AQLEAAQAQLAQAQAGLREEEE-LAAAQAQVA---QAEAALAQAELN----LARTTIRA 212
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3006-3043 |
3.79e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 40.54 E-value: 3.79e-04
10 20 30
....*....|....*....|....*....|....*...
gi 1920237968 3006 RRALRGSGVIAGVWLEEAGQKLSIYEALRKDLLQPEAA 3043
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| PRK11281 |
PRK11281 |
mechanosensitive channel MscK; |
2153-2349 |
3.80e-04 |
|
mechanosensitive channel MscK;
Pssm-ID: 236892 [Multi-domain] Cd Length: 1113 Bit Score: 46.83 E-value: 3.80e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2153 DAEMEKHKQFAEQALR---QKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQR------GQVEEELFSLR 2223
Cdd:PRK11281 55 EAEDKLVQQDLEQTLAlldKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDDNDEETRETlstlslRQLESRLAQTL 134
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2224 VQMEELGklKARIEAENRALVLRDK--------DSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAeedLAQQRA 2295
Cdd:PRK11281 135 DQLQNAQ--NDLAEYNSQLVSLQTQperaqaalYANSQRLQQIRNLLKGGKVGGKALRPSQRVLLQAEQAL---LNAQND 209
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2296 LAEKMLK--EKMQAVQEATR--LKAEAELLQQQKELAQE--QARRLQEDKEQMAQQLAQE 2349
Cdd:PRK11281 210 LQRKSLEgnTQLQDLLQKQRdyLTARIQRLEHQLQLLQEaiNSKRLTLSEKTVQEAQSQD 269
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
2246-2416 |
4.04e-04 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 45.99 E-value: 4.04e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2246 RDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATR-LKAEAELLQQQ 2324
Cdd:TIGR02794 65 KEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAkAKAEAEAERKA 144
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2325 KELAQEQARrlqEDKEQMAQQLAQetqgfQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGE 2404
Cdd:TIGR02794 145 KEEAAKQAE---EEAKAKAAAEAK-----KKAEEAKKKAEAEAKAKAEAEAKAKAEEAKAKAEAAKAKAAAEAAAKAEAE 216
|
170
....*....|..
gi 1920237968 2405 RLYRTELATQEK 2416
Cdd:TIGR02794 217 AAAAAAAEAERK 228
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
1307-1587 |
4.08e-04 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 46.43 E-value: 4.08e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1307 DLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQrAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRM 1386
Cdd:pfam07888 77 ELESRVAELKEELRQSREKHEELEEKYKELSASSEEL-SEEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETELERM 155
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1387 QEEVARREEVAVEAQEQKRSIQEELQHLRQ---SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGE 1463
Cdd:pfam07888 156 KERAKKAGAQRKEEEAERKQLQAKLQQTEEelrSLSKEFQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKEAENEAL 235
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1464 LQALRARAEEAEAQKRQAQ------EEAERLRRQVQDETQRKRQAEAELALRVQAEAEAARE-KQRALQALEELRLQAE- 1535
Cdd:pfam07888 236 LEELRSLQERLNASERKVEglgeelSSMAAQRDRTQAELHQARLQAAQLTLQLADASLALREgRARWAQERETLQQSAEa 315
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 1920237968 1536 EAERRLRQAEAERARQVQVALETAQR-SAEAELQSEHASFAEKTAQLERTLKE 1587
Cdd:pfam07888 316 DKDRIEKLSAELQRLEERLQEERMEReKLEVELGREKDCNRVQLSESRRELQE 368
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3414-3450 |
4.18e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 40.54 E-value: 4.18e-04
10 20 30
....*....|....*....|....*....|....*..
gi 1920237968 3414 KLLSAEKAVTGYRDPYSGSTISLFQAMKKGLVLREHG 3450
Cdd:smart00250 2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1408-1766 |
4.24e-04 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 46.05 E-value: 4.24e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1408 QEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAER 1487
Cdd:COG4372 12 RLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQA 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1488 LRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAEL 1567
Cdd:COG4372 92 AQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQ 171
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1568 QSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLT 1647
Cdd:COG4372 172 ELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEEL 251
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1648 QAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELAR 1727
Cdd:COG4372 252 LEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELA 331
|
330 340 350
....*....|....*....|....*....|....*....
gi 1920237968 1728 LQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEE 1766
Cdd:COG4372 332 LAILLAELADLLQLLLVGLLDNDVLELLSKGAEAGVADG 370
|
|
| PRK05035 |
PRK05035 |
electron transport complex protein RnfC; Provisional |
1667-1896 |
4.45e-04 |
|
electron transport complex protein RnfC; Provisional
Pssm-ID: 235334 [Multi-domain] Cd Length: 695 Bit Score: 46.48 E-value: 4.45e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1667 KAEE--------QAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEEL--------ARLQR 1730
Cdd:PRK05035 447 KAEEakarfearQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARpdnsaviaAREAR 526
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1731 EAAAATQKRRELEAE-----LAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfrelaeeAARLRALAeeAKRQ 1805
Cdd:PRK05035 527 KAQARARQAEKQAAAaadpkKAAVAAAIARAKAKKAAQQAANAEAEEEVDPKKAAVA---------AAIARAKA--KKAA 595
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1806 RQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAedeafqrrlleeqAAQHKAdiEARLA 1885
Cdd:PRK05035 596 QQAASAEPEEQVAEVDPKKAAVAAAIARAKAKKAEQQANAEPEEPVDPRKAAVA-------------AAIARA--KARKA 660
|
250
....*....|.
gi 1920237968 1886 QLRKASESELE 1896
Cdd:PRK05035 661 AQQQANAEPEE 671
|
|
| PRK00409 |
PRK00409 |
recombination and DNA strand exchange inhibitor protein; Reviewed |
2433-2563 |
4.83e-04 |
|
recombination and DNA strand exchange inhibitor protein; Reviewed
Pssm-ID: 234750 [Multi-domain] Cd Length: 782 Bit Score: 46.36 E-value: 4.83e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2433 DAERLREAIAELEHEKDKLKQEAQLLqlkseemqtvrqEQLLQETQALQQSFLSEKDSLLQRErciEQEKAKLEQLFQDE 2512
Cdd:PRK00409 514 DKEKLNELIASLEELERELEQKAEEA------------EALLKEAEKLKEELEEKKEKLQEEE---DKLLEEAEKEAQQA 578
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2513 V--AKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGV-------RRQQEELQ 2563
Cdd:PRK00409 579 IkeAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKekkkkkqKEKQEELK 638
|
|
| PRK12678 |
PRK12678 |
transcription termination factor Rho; Provisional |
1417-1551 |
4.89e-04 |
|
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain] Cd Length: 672 Bit Score: 46.44 E-value: 4.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1417 SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERlRRQVQDET 1496
Cdd:PRK12678 69 TPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARR-GAARKAGE 147
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 1920237968 1497 QRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQ 1551
Cdd:PRK12678 148 GGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDD 202
|
|
| COG3903 |
COG3903 |
Predicted ATPase [General function prediction only]; |
1591-2046 |
4.97e-04 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443109 [Multi-domain] Cd Length: 933 Bit Score: 46.55 E-value: 4.97e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1591 AVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEE 1670
Cdd:COG3903 477 AAERLAEAGERAAARRRHADYYLALAERAAAELRGPDQLAWLARLDAEHDNLRAALRWALAHGDAELALRLAAALAPFWF 556
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1671 QAVRQRElAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVR 1750
Cdd:COG3903 557 LRGLLRE-GRRWLERALAAAGEAAAALAAAAALAAAAAAARAAAAAAAAAAAAAAAAAAAAAAAAAALLLLAALAAAAAA 635
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1751 AEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAA 1830
Cdd:COG3903 636 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAALAAAAAALAAAAAAAALAAAAAAALAAAA 715
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1831 ISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRR 1910
Cdd:COG3903 716 AAAAAAAAAAALLAAAAAAALAAAAAAAALALAAAAAAAAAAAAAAALAAAAAAAALAALLLALAAAAAALAAAAAAAAA 795
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1911 QVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAA 1990
Cdd:COG3903 796 AAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAALAAALAAAAAAAAAAAAAAAAAAALAAALAAAAAAAAAAALAAAAA 875
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 1991 RQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFA 2046
Cdd:COG3903 876 AAAAAAAALLAAAAAAAAAAAAAAAAAAALAAAAAAAAAAALAAAAAAAAAAAAAA 931
|
|
| rne |
PRK10811 |
ribonuclease E; Reviewed |
1334-1599 |
5.20e-04 |
|
ribonuclease E; Reviewed
Pssm-ID: 236766 [Multi-domain] Cd Length: 1068 Bit Score: 46.57 E-value: 5.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1334 EEEERLAEQQRAEERER----------LAEVEAALEKQR---QLAEAHAQAKAQAEREAQGLQRR----MQEEVARREEV 1396
Cdd:PRK10811 507 EEAMALPSEEEFAERKRpeqpalatfaMPDVPPAPTPAEpaaPVVAAAPKAAAATPPAQPGLLSRffgaLKALFSGGEET 586
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1397 AVEAQEQKRSIQEElqhlRQSSEaeiQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEA 1476
Cdd:PRK10811 587 KPQEQPAPKAEAKP----ERQQD---RRKPRQNNRRDRNERRDTRDNRTRREGRENREENRRNRRQAQQQTAETRESQQA 659
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1477 QKRQAQEEAERLRRQVQDETQRKRQAEAELAlrvQAEAEAarekqralQALEELRLQAEEAERRLRQAEAERAR------ 1550
Cdd:PRK10811 660 EVTEKARTQDEQQQAPRRERQRRRNDEKRQA---QQEAKA--------LNVEEQSVQETEQEERVQQVQPRRKQrqlnqk 728
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 1551 ---QVQVALETAQRSAEAELQSEHASfAEKTAQLERTLKEEHVAVVQLREEA 1599
Cdd:PRK10811 729 vriEQSVAEEAVAPVVEETVAAEPVV-QEVPAPRTELVKVPLPVVAQTAPEQ 779
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3375-3410 |
5.40e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 40.16 E-value: 5.40e-04
10 20 30
....*....|....*....|....*....|....*.
gi 1920237968 3375 TLLLEAQAATGFLVDPVRNQRLYVHEAVKAGVVGPE 3410
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE 36
|
|
| YqiK |
COG2268 |
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown]; |
1657-1820 |
5.70e-04 |
|
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
Pssm-ID: 441869 [Multi-domain] Cd Length: 439 Bit Score: 46.02 E-value: 5.70e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1657 EAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEgtaQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREaaaat 1736
Cdd:COG2268 229 EQEREIETARIAEAEAELAKKKAEERREAETARAE---AEAAYEIAEANAEREVQRQLEIAEREREIELQEKEAE----- 300
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1737 QKRRELEAELaKVRAEmevllASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEedAVRQ 1816
Cdd:COG2268 301 REEAELEADV-RKPAE-----AEKQAAEAEAEAEAEAIRAKGLAEAEGKRALAEAWNKLGDAAILLMLIEKLPE--IAEA 372
|
....
gi 1920237968 1817 RAEA 1820
Cdd:COG2268 373 AAKP 376
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3046-3079 |
5.90e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 40.16 E-value: 5.90e-04
10 20 30
....*....|....*....|....*....|....
gi 1920237968 3046 LLEAQAGTGHIIDPTTSARLTVDEAVRAGLVGPE 3079
Cdd:smart00250 3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE 36
|
|
| MARTX_Nterm |
NF012221 |
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model ... |
2273-2551 |
5.91e-04 |
|
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model describes the N-terminal 1900 amino acids of MARTX family multifunctional-autoprocessing repeats-in-toxin holotoxins, which contain both repeat regions that facilitate their entry into eukaryotic target cells, and multiple effector domains.
Pssm-ID: 467957 [Multi-domain] Cd Length: 1848 Bit Score: 46.37 E-value: 5.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2273 SVAAQEAARLRQLAEEDLAQQRALAEKmlkekmqavqeatrlkaeaellqqqkELAQEQARRLQEDKeqmAQQLAqETQG 2352
Cdd:NF012221 1538 SESSQQADAVSKHAKQDDAAQNALADK--------------------------ERAEADRQRLEQEK---QQQLA-AISG 1587
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2353 FQKTLETERQRQLEMSAEAERlrlrvaemsraqARAEEDARRFRKQAEDIGERLyrtelatqekvmlvQTLETQRQQSDR 2432
Cdd:NF012221 1588 SQSQLESTDQNALETNGQAQR------------DAILEESRAVTKELTTLAQGL--------------DALDSQATYAGE 1641
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2433 DAERLREAIAE--LEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSllqrerciEQEKAKLEQLFQ 2510
Cdd:NF012221 1642 SGDQWRNPFAGglLDRVQEQLDDAKKISGKQLADAKQRHVDNQQKVKDAVAKSEAGVAQG--------EQNQANAEQDID 1713
|
250 260 270 280
....*....|....*....|....*....|....*....|..
gi 1920237968 2511 DEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRR-QHEA 2551
Cdd:NF012221 1714 DAKADAEKRKDDALAKQNEAQQAESDANAAANDAQSRgEQDA 1755
|
|
| PRK11637 |
PRK11637 |
AmiB activator; Provisional |
1678-1963 |
5.96e-04 |
|
AmiB activator; Provisional
Pssm-ID: 236942 [Multi-domain] Cd Length: 428 Bit Score: 45.84 E-value: 5.96e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1678 LAEQELEKQRQLAegTAQQRLAAEQELIRlraeteQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLL 1757
Cdd:PRK11637 38 FSAHASDNRDQLK--SIQQDIAAKEKSVR------QQQQQRASLLAQLKKQEEAISQASRKLRETQNTLNQLNKQIDELN 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1758 ASKARAEEESRSTSEKSKQRLEAEagrFRELAEEAARLRALAEEAKRqrqlaeedavrqraeAERVLAeKLAAISEAtRL 1837
Cdd:PRK11637 110 ASIAKLEQQQAAQERLLAAQLDAA---FRQGEHTGLQLILSGEESQR---------------GERILA-YFGYLNQA-RQ 169
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1838 KTEAEialkekeaenerLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVedtlrqrrqveeeil 1917
Cdd:PRK11637 170 ETIAE------------LKQTREELAAQKAELEEKQSQQKTLLYEQQAQQQKLEQARNERKKTLT--------------- 222
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 1918 ALKGSFEKAAAGKAELELELGRIRGT-AEDTLRSKEQAEQEA-------ARQRQ 1963
Cdd:PRK11637 223 GLESSLQKDQQQLSELRANESRLRDSiARAEREAKARAEREAreaarvrDKQKQ 276
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1991-2480 |
6.08e-04 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 45.91 E-value: 6.08e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1991 RQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRS 2070
Cdd:COG4717 64 RKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAE 143
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2071 EAeaarraaeeaeaareraereaaqsrRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAArraqaeqaalRQKQ 2150
Cdd:COG4717 144 LP-------------------------ERLEELEERLEELRELEEELEELEAELAELQEELEELLE----------QLSL 188
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2151 AADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKA-EVTEAARQRGQVEEELFSLRVQMEEL 2229
Cdd:COG4717 189 ATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALeERLKEARLLLLIAAALLALLGLGGSL 268
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2230 GKLKARI------EAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKE 2303
Cdd:COG4717 269 LSLILTIagvlflVLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEE 348
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2304 KMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKtleTERQRQLEMSAEAERLRLRVAEMSR 2383
Cdd:COG4717 349 LQELLREAEELEEELQLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQEL---KEELEELEEQLEELLGELEELLEAL 425
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2384 AQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAER--LREAIAELEHEKDKLKQEAQLLQLK 2461
Cdd:COG4717 426 DEEELEEELEELEEELEELEEELEELREELAELEAELEQLEEDGELAELLQELeeLKAELRELAEEWAALKLALELLEEA 505
|
490
....*....|....*....
gi 1920237968 2462 SEEMQTVRQEQLLQETQAL 2480
Cdd:COG4717 506 REEYREERLPPVLERASEY 524
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
2169-2508 |
6.14e-04 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 46.17 E-value: 6.14e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2169 QKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVT-------EAARQRGQVEEELFSLRVQMEELGKLKARIEAENR 2241
Cdd:TIGR04523 118 QKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEklnnkynDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKLL 197
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2242 AL-----VLRDKDSAQRLLQEEAEKMK----QVAEEAARLSVAAQEAARLRQLAEE---DLAQQRALAEKMLKEKMQAVQ 2309
Cdd:TIGR04523 198 KLelllsNLKKKIQKNKSLESQISELKkqnnQLKDNIEKKQQEINEKTTEISNTQTqlnQLKDEQNKIKKQLSEKQKELE 277
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2310 EATR-----------LKAEAELLQQQKE---------LAQEQARRLQEDKEQMAQ------QLAQETQGFQKTLETERQR 2363
Cdd:TIGR04523 278 QNNKkikelekqlnqLKSEISDLNNQKEqdwnkelksELKNQEKKLEEIQNQISQnnkiisQLNEQISQLKKELTNSESE 357
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2364 QLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAE 2443
Cdd:TIGR04523 358 NSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIK 437
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237968 2444 LEHEKDKLKQEAQLLQLKSEEMQTVRqEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQL 2508
Cdd:TIGR04523 438 NNSEIKDLTNQDSVKELIIKNLDNTR-ESLETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKL 501
|
|
| Filament |
pfam00038 |
Intermediate filament protein; |
1323-1587 |
6.16e-04 |
|
Intermediate filament protein;
Pssm-ID: 459643 [Multi-domain] Cd Length: 313 Bit Score: 45.30 E-value: 6.16e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1323 IRFISETLRRMEEEERLAEQQRAEERERLAEV-EAALEKQRQLAEAHAQAKAQAEREaqglqrrmqeevarREEVAVEAQ 1401
Cdd:pfam00038 20 VRFLEQQNKLLETKISELRQKKGAEPSRLYSLyEKEIEDLRRQLDTLTVERARLQLE--------------LDNLRLAAE 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1402 EQKRSIQEELQhLRQSSEAEIQAKARQVEAAERSRLRIEEEIrvvrlqleaterqrggaegelQALRaraEEAEAQKRQA 1481
Cdd:pfam00038 86 DFRQKYEDELN-LRTSAENDLVGLRKDLDEATLARVDLEAKI---------------------ESLK---EELAFLKKNH 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1482 QEEAERLRRQVQDETQ-------RKRQAEAELA-LRVQAEAEAAREKQRA----LQALEELRLQAEEAERRLRQAEAERA 1549
Cdd:pfam00038 141 EEEVRELQAQVSDTQVnvemdaaRKLDLTSALAeIRAQYEEIAAKNREEAeewyQSKLEELQQAAARNGDALRSAKEEIT 220
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 1550 ---RQVQ-------------VALETAQRSAEAELQSEHASFAEKTAQLERTLKE 1587
Cdd:pfam00038 221 elrRTIQsleielqslkkqkASLERQLAETEERYELQLADYQELISELEAELQE 274
|
|
| CH_PLS2_rpt1 |
cd21324 |
first calponin homology (CH) domain found in plastin-2; Plastin-2, also called L-plastin, or ... |
44-156 |
6.42e-04 |
|
first calponin homology (CH) domain found in plastin-2; Plastin-2, also called L-plastin, or LC64P, or lymphocyte cytosolic protein 1 (LCP-1), is an actin-binding protein that plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-2 contains four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409173 Cd Length: 145 Bit Score: 43.07 E-value: 6.42e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 44 QKKTFTKWVNKHLIKHWRAeaqRHI-------SDLYEDLRDGHNLISLLEVLSGDSLPR----EKGRMRFHKLQNVQIAL 112
Cdd:cd21324 25 EKYAFVNWINKALENDPDC---KHVipmnpntDDLFKAVGDGIVLCKMINFSVPDTIDErtinKKKLTPFTIQENLNLAL 101
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 1920237968 113 DYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQV 156
Cdd:cd21324 102 NSASAIGCHVVNIGAEDLKEGKPYLVLGLLWQVIKIGLFADIEL 145
|
|
| TPH |
pfam13868 |
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ... |
2150-2473 |
6.53e-04 |
|
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.
Pssm-ID: 464007 [Multi-domain] Cd Length: 341 Bit Score: 45.29 E-value: 6.53e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2150 QAADAEMEKHKQFAEQALRQKAQVEQELtalRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEEL 2229
Cdd:pfam13868 16 LAAKCNKERDAQIAEKKRIKAEEKEEER---RLDEMMEEERERALEEEEEKEEERKEERKRYRQELEEQIEEREQKRQEE 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2230 GKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARlSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQ 2309
Cdd:pfam13868 93 YEEKLQEREQMDEIVERIQEEDQAEAEEKLEKQRQLREEIDE-FNEEQAEWKELEKEEEREEDERILEYLKEKAEREEER 171
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2310 EATRLKAEAELLQQQKELA--QEQARRLQEDKEQMAQQLAQETQGF---QKTLETERQRQLEMSAEAERLRLRVAEMSRA 2384
Cdd:pfam13868 172 EAEREEIEEEKEREIARLRaqQEKAQDEKAERDELRAKLYQEEQERkerQKEREEAEKKARQRQELQQAREEQIELKERR 251
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2385 QARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQtLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEE 2464
Cdd:pfam13868 252 LAEEAEREEEEFERMLRKQAEDEEIEQEEAEKRRMKR-LEHRRELEKQIEEREEQRAAEREEELEEGERLREEEAERRER 330
|
....*....
gi 1920237968 2465 MQTVRQEQL 2473
Cdd:pfam13868 331 IEEERQKKL 339
|
|
| CEP63 |
pfam17045 |
Centrosomal protein of 63 kDa; CEP63 is a family of eukaryotic proteins involved in centriole ... |
1666-1788 |
6.81e-04 |
|
Centrosomal protein of 63 kDa; CEP63 is a family of eukaryotic proteins involved in centriole activity.
Pssm-ID: 465338 [Multi-domain] Cd Length: 264 Bit Score: 44.81 E-value: 6.81e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1666 GKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQ-ELIRLRAETEQGEQQRQLLEEELARLQR-----EAAAATQKR 1739
Cdd:pfam17045 127 GKLEEFRQKSLEWEQQRLQYQQQVASLEAQRKALAEQsSLIQSAAYQVQLEGRKQCLEASQSEIQRlrsklERAQDSLCA 206
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 1740 RELEAELAKVRAE-----MEVLLASKARAEEESRStSEKSKQRLEAEAGRFREL 1788
Cdd:pfam17045 207 QELELERLRMRVSelgdsNRKLLEEQQRLLEELRM-SQRQLQVLQNELMELKAT 259
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
2188-2349 |
7.13e-04 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 45.57 E-value: 7.13e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2188 DHQKSILDEELQRLKAEVTEAARQRGQVEEELfsLRVQMEELGKLKARieaenralvlRDKDSAQRLLQEEAEKMKQVAE 2267
Cdd:PRK09510 69 QQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQ--ERLKQLEKERLAAQ----------EQKKQAEEAAKQAALKQKQAEE 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2268 EAARLSVAAQEAARLRQLAEEDLAQQrALAEKMLKEKMQAVQEA---TRLKAEAELLQQQKELAQEQArrlQEDKEQMAQ 2344
Cdd:PRK09510 137 AAAKAAAAAKAKAEAEAKRAAAAAKK-AAAEAKKKAEAEAAKKAaaeAKKKAEAEAAAKAAAEAKKKA---EAEAKKKAA 212
|
....*
gi 1920237968 2345 QLAQE 2349
Cdd:PRK09510 213 AEAKK 217
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1670-1951 |
7.26e-04 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 45.28 E-value: 7.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1670 EQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV 1749
Cdd:COG4372 69 EQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAER 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1750 RAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLA 1829
Cdd:COG4372 149 EEELKELEEQLESLQEELAALEQELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLE 228
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1830 AISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQR 1909
Cdd:COG4372 229 AKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALS 308
|
250 260 270 280
....*....|....*....|....*....|....*....|..
gi 1920237968 1910 RQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSK 1951
Cdd:COG4372 309 LIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVGL 350
|
|
| COG4995 |
COG4995 |
Uncharacterized conserved protein, contains CHAT domain [Function unknown]; |
2009-2460 |
8.28e-04 |
|
Uncharacterized conserved protein, contains CHAT domain [Function unknown];
Pssm-ID: 444019 [Multi-domain] Cd Length: 711 Bit Score: 45.73 E-value: 8.28e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2009 EARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARER 2088
Cdd:COG4995 3 ALALLALLAALLAALALALLALALLLLLAALAAAALLLLALLALLLALAAAAAAALAAAALALALLAAAALALLLLALAL 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2089 AEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALR 2168
Cdd:COG4995 83 AALALALLAAALALALAAAALAALALLAALLALAAAAALLALLAALALLALLAALAAALAAAAAAALAAALAAAAAAAAA 162
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2169 QKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDK 2248
Cdd:COG4995 163 AALLALALALAAAALALLALLLAALAAALAAAAAALALLLALLLLAALAAALAAALAALLLALLALAAALLALLLLALLA 242
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2249 DSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELA 2328
Cdd:COG4995 243 LAAAAAALAAAAAALLALAAALLLLAALAALAAAAAAAALAALALAAALALAAAALALALLLAAAAAAALAALALLLLAA 322
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2329 QEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYR 2408
Cdd:COG4995 323 LLLLLAALALLALLLLLAAAALLAAALAAALALAAALALALLAALLLLLAALLALLLEALLLLLLALLAALLLLAAALLA 402
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 2409 TELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQL 2460
Cdd:COG4995 403 LAAAQLLRLLLAALALLLALAAYAAARLALLALIEYIILPDRLYAFVQLYQL 454
|
|
| PRK12678 |
PRK12678 |
transcription termination factor Rho; Provisional |
1331-1519 |
8.28e-04 |
|
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain] Cd Length: 672 Bit Score: 45.67 E-value: 8.28e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1331 RRMEEEERLAEQQRAEERERLAEVEAALEKQRQlAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEE 1410
Cdd:PRK12678 78 RRAARAAAAARQAEQPAAEAAAAKAEAAPAARA-AAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEA 156
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1411 LQHLRQSSEAEIQAKARQVEAAERSRLRieeeiRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRR 1490
Cdd:PRK12678 157 RADAAERTEEEERDERRRRGDREDRQAE-----AERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRR 231
|
170 180
....*....|....*....|....*....
gi 1920237968 1491 QVQDETQRKRQAEAELALRVQAEAEAARE 1519
Cdd:PRK12678 232 RRDRRDARGDDNREDRGDRDGDDGEGRGG 260
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
2149-2437 |
8.45e-04 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 45.71 E-value: 8.45e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2149 KQAADAEMEKHKQFaEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAaRQRGQVEEELfslRVQMEE 2228
Cdd:COG3096 402 QQALDVQQTRAIQY-QQAVQALEKARALCGLPDLTPENAEDYLAAFRAKEQQATEEVLEL-EQKLSVADAA---RRQFEK 476
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2229 LGKLKARIEAEnralVLRDK--DSAQRLLQEEAEKMKQvaeeAARLSVAAQEAARLRQLAEEdlaQQRA--LAEKMLKEK 2304
Cdd:COG3096 477 AYELVCKIAGE----VERSQawQTARELLRRYRSQQAL----AQRLQQLRAQLAELEQRLRQ---QQNAerLLEEFCQRI 545
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2305 MQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLaQETQGFQKTLETERQRQLEMSAEAERLRlrvaEMSRA 2384
Cdd:COG3096 546 GQQLDAAEELEELLAELEAQLEELEEQAAEAVEQRSELRQQL-EQLRARIKELAARAPAWLAAQDALERLR----EQSGE 620
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 1920237968 2385 QARAEEDARRFRKQAedigerLYRTELATQEKvmlvQTLETQRQQSDRDAERL 2437
Cdd:COG3096 621 ALADSQEVTAAMQQL------LEREREATVER----DELAARKQALESQIERL 663
|
|
| TolA |
COG3064 |
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis]; |
1667-2048 |
8.61e-04 |
|
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442298 [Multi-domain] Cd Length: 485 Bit Score: 45.42 E-value: 8.61e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1667 KAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAEL 1746
Cdd:COG3064 20 QAEAEKRAAAEAEQKAKEEAEEERLAELEAKRQAEEEAREAKAEAEQRAAELAAEAAKKLAEAEKAAAEAEKKAAAEKAK 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1747 AKVRAEMEVLLASKARAEEESRSTSEKSK--QRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVL 1824
Cdd:COG3064 100 AAKEAEAAAAAEKAAAAAEKEKAEEAKRKaeEEAKRKAEEERKAAEAEAAAKAEAEAARAAAAAAAAAAAAAARAAAGAA 179
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1825 AEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVED 1904
Cdd:COG3064 180 AALVAAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAARAAAASREAALAAVEATEEAALGGAEEAADLAAVGV 259
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1905 TLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLA 1984
Cdd:COG3064 260 LGAALAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAALAAELLGAVAAEEAVLAAAAAAGALVVRGGGAASLEA 339
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 1985 AEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQ 2048
Cdd:COG3064 340 ALSLLAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLADVEEAAGAGILAAAGGGGLLGL 403
|
|
| PRK00409 |
PRK00409 |
recombination and DNA strand exchange inhibitor protein; Reviewed |
1332-1448 |
8.95e-04 |
|
recombination and DNA strand exchange inhibitor protein; Reviewed
Pssm-ID: 234750 [Multi-domain] Cd Length: 782 Bit Score: 45.59 E-value: 8.95e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1332 RMEEEERLAEQQRAEERERLAEVEA---ALEKQ-RQLAEAHAQAKAQAEREAQGLQRRMQEEVAR-----REEVAVEAQE 1402
Cdd:PRK00409 524 SLEELERELEQKAEEAEALLKEAEKlkeELEEKkEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEiikelRQLQKGGYAS 603
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 1920237968 1403 QKRS-IQEELQHLRQSSEAEIQAKARQVEAAErsRLRIEEEIRVVRL 1448
Cdd:PRK00409 604 VKAHeLIEARKRLNKANEKKEKKKKKQKEKQE--ELKVGDEVKYLSL 648
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
1081-1736 |
9.16e-04 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 45.48 E-value: 9.16e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1081 SLVIRSTQEAEEVLRAHEEQLKEAQAVPATLPE-LEATKAALKKLRAQAEAQqpvfdalRDELR-GAQEVGERLQQRHGE 1158
Cdd:pfam05483 158 NLLKETCARSAEKTKKYEYEREETRQVYMDLNNnIEKMILAFEELRVQAENA-------RLEMHfKLKEDHEKIQHLEEE 230
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1159 RDVEVERWRERVTLLLerwqavlAQTDVRQRELEQLGRQLRYYRESADPLgawlrdakQRQEQIQAVPLANSQAVREQLR 1238
Cdd:pfam05483 231 YKKEINDKEKQVSLLL-------IQITEKENKMKDLTFLLEESRDKANQL--------EEKTKLQDENLKELIEKKDHLT 295
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1239 QEkalLEDIERHGEKVEECQRFAKQYINAIKDYELQLVTYK-AQLEPVASPAK---------KPKVQSGSESIIQEYVDL 1308
Cdd:pfam05483 296 KE---LEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKeAQMEELNKAKAahsfvvtefEATTCSLEELLRTEQQRL 372
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1309 RTRYSELSTLTSQYIRFISEtLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLaEAHAQAKAQAEREAQGLQRRMQE 1388
Cdd:pfam05483 373 EKNEDQLKIITMELQKKSSE-LEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQF-EKIAEELKGKEQELIFLLQAREK 450
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1389 EVARREEVAVEAQEQKRSIQEELQHLRQSSEAEiqaKARQVEAAERSRLRIEEEIRVVR------LQLEATERQRGGAEG 1462
Cdd:pfam05483 451 EIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKE---KLKNIELTAHCDKLLLENKELTQeasdmtLELKKHQEDIINCKK 527
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1463 ELQALRARAEEAEAQKRQAQEEAERLRR---QVQDETQRKRQAEAELALRVQAEAEAAREKQRALQ-ALEELRLQAEEAE 1538
Cdd:pfam05483 528 QEERMLKQIENLEEKEMNLRDELESVREefiQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILEnKCNNLKKQIENKN 607
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1539 RRLRQAEAE-RARQVQVALETAQRSA--------EAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRaqqqaea 1609
Cdd:pfam05483 608 KNIEELHQEnKALKKKGSAENKQLNAyeikvnklELELASAKQKFEEIIDNYQKEIEDKKISEEKLLEEVEKA------- 680
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1610 eraraeaerelerwQLKANEALRLRLQAEEVAQQKsltqaeaekqkEEAEREARRRGKAEEQAVRQRELAEQELEKQRQL 1689
Cdd:pfam05483 681 --------------KAIADEAVKLQKEIDKRCQHK-----------IAEMVALMEKHKHQYDKIIEERDSELGLYKNKEQ 735
|
650 660 670 680
....*....|....*....|....*....|....*....|....*..
gi 1920237968 1690 AEGTAqqRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAAT 1736
Cdd:pfam05483 736 EQSSA--KAALEIELSNIKAELLSLKKQLEIEKEEKEKLKMEAKENT 780
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1399-1752 |
9.49e-04 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 44.89 E-value: 9.49e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1399 EAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQK 1478
Cdd:COG4372 10 KARLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQL 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1479 RQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERA---RQVQVA 1555
Cdd:COG4372 90 QAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLEslqEELAAL 169
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1556 LETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRL 1635
Cdd:COG4372 170 EQELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKE 249
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1636 QAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGE 1715
Cdd:COG4372 250 ELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLE 329
|
330 340 350
....*....|....*....|....*....|....*..
gi 1920237968 1716 QQRQLLEEELARLQREAAAATQKRRELEAELAKVRAE 1752
Cdd:COG4372 330 LALAILLAELADLLQLLLVGLLDNDVLELLSKGAEAG 366
|
|
| PRK12678 |
PRK12678 |
transcription termination factor Rho; Provisional |
1340-1501 |
9.64e-04 |
|
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain] Cd Length: 672 Bit Score: 45.28 E-value: 9.64e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1340 AEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSE 1419
Cdd:PRK12678 65 AAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARK 144
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1420 AEIQAKARQVEAAERSRLRIEEEirvvrlqlEATERQRGGAEGELQALRARAEEAEAQKRQAQEEaERLRRQVQDETQRK 1499
Cdd:PRK12678 145 AGEGGEQPATEARADAAERTEEE--------ERDERRRRGDREDRQAEAERGERGRREERGRDGD-DRDRRDRREQGDRR 215
|
..
gi 1920237968 1500 RQ 1501
Cdd:PRK12678 216 EE 217
|
|
| MAD |
pfam05557 |
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ... |
2146-2603 |
1.02e-03 |
|
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
Pssm-ID: 461677 [Multi-domain] Cd Length: 660 Bit Score: 45.50 E-value: 1.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2146 LRQKQAADAEMEKHKQFAEQALRQKAQV----EQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFS 2221
Cdd:pfam05557 78 NRLKKKYLEALNKKLNEKESQLADAREVisclKNELSELRRQIQRAELELQSTNSELEELQERLDLLKAKASEAEQLRQN 157
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2222 LRVQMEELGKLKARIEAENRALVLRDKDSaqrllqEEAEKMKqvaEEAARLSVAAQEAARLRqlaeEDLAQQRALAEKML 2301
Cdd:pfam05557 158 LEKQQSSLAEAEQRIKELEFEIQSQEQDS------EIVKNSK---SELARIPELEKELERLR----EHNKHLNENIENKL 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2302 KEKMQAVQEATRLkaeaellqQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKT-------------LETERQRQLEMS 2368
Cdd:pfam05557 225 LLKEEVEDLKRKL--------EREEKYREEAATLELEKEKLEQELQSWVKLAQDTglnlrspedlsrrIEQLQQREIVLK 296
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2369 AEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTE-----------LATQEKVMLVQTLE------TQRQQSD 2431
Cdd:pfam05557 297 EENSSLTSSARQLEKARRELEQELAQYLKKIEDLNKKLKRHKalvrrlqrrvlLLTKERDGYRAILEsydkelTMSNYSP 376
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2432 RDAERLREA----------IAELEHEKDKLKQEA----QLLQLKSEEMQTVRQeqllQETQALQQSFLSEKDSLLQRERC 2497
Cdd:pfam05557 377 QLLERIEEAedmtqkmqahNEEMEAQLSVAEEELggykQQAQTLERELQALRQ----QESLADPSYSKEEVDSLRRKLET 452
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2498 IEQEKAKLEQlfQDEVAKAQALREEQQRQQQQMQQEKQQLaasmeearrRQHEAEEGVRRQQEELQRLAqqqqqqeklla 2577
Cdd:pfam05557 453 LELERQRLRE--QKNELEMELERRCLQGDYDPKKTKVLHL---------SMNPAAEAYQQRKNQLEKLQ----------- 510
|
490 500
....*....|....*....|....*.
gi 1920237968 2578 EENQRLRERLQHLEEERRAALARSEE 2603
Cdd:pfam05557 511 AEIERLKRLLKKLEDDLEQVLRLPET 536
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
2360-2605 |
1.03e-03 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 45.43 E-value: 1.03e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2360 ERQRQLEMSAEA-ERLRLRVAEMSRAQARAEEDARRFRKQAEdigerlYRTELATQEKVMLVQTLETQRQQsdrdAERLR 2438
Cdd:TIGR02168 176 ETERKLERTRENlDRLEDILNELERQLKSLERQAEKAERYKE------LKAELRELELALLVLRLEELREE----LEELQ 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2439 EAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEqlLQETQALQQSFLSEKDSLLQR-ERCIEQEKAKLEQLFQDEVAKAQ 2517
Cdd:TIGR02168 246 EELKEAEEELEELTAELQELEEKLEELRLEVSE--LEEEIEELQKELYALANEISRlEQQKQILRERLANLERQLEELEA 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2518 ALREeqqrqqqqmqqekqqLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAA 2597
Cdd:TIGR02168 324 QLEE---------------LESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKV 388
|
....*...
gi 1920237968 2598 LARSEEIA 2605
Cdd:TIGR02168 389 AQLELQIA 396
|
|
| PLEC |
smart00250 |
Plectin repeat; |
4374-4411 |
1.06e-03 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 39.39 E-value: 1.06e-03
10 20 30
....*....|....*....|....*....|....*...
gi 1920237968 4374 QRFLEVQYLTGGLIEPDTPGRVALDEALQRGTVDARTA 4411
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1323-1566 |
1.12e-03 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 44.82 E-value: 1.12e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1323 IRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQE 1402
Cdd:COG3883 18 IQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARALYR 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1403 QKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALR-------AraeEAE 1475
Cdd:COG3883 98 SGGSVSYLDVLLGSESFSDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKaeleaakA---ELE 174
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1476 AQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVA 1555
Cdd:COG3883 175 AQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGAAGA 254
|
250
....*....|.
gi 1920237968 1556 LETAQRSAEAE 1566
Cdd:COG3883 255 AGAAAGSAGAA 265
|
|
| GBP_C |
cd16269 |
Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal ... |
1341-1468 |
1.14e-03 |
|
Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal domain. Guanylate-binding proteins (GBPs) are synthesized after activation of the cell by interferons. The biochemical properties of GBPs are clearly different from those of Ras-like and heterotrimeric GTP-binding proteins. They bind guanine nucleotides with low affinity (micromolar range), are stable in their absence, and have a high turnover GTPase. In addition to binding GDP/GTP, they have the unique ability to bind GMP with equal affinity and hydrolyze GTP not only to GDP, but also to GMP. This C-terminal domain has been shown to mediate inhibition of endothelial cell proliferation by inflammatory cytokines.
Pssm-ID: 293879 [Multi-domain] Cd Length: 291 Bit Score: 44.49 E-value: 1.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1341 EQQRAEERERLAEVEAALEKQRQLAEAHAQAKAqAEREaqglqRRMQEEVARREEVavEAQEQKRSIQEELQHLRQSSEA 1420
Cdd:cd16269 177 QSKEAEAEAILQADQALTEKEKEIEAERAKAEA-AEQE-----RKLLEEQQRELEQ--KLEDQERSYEEHLRQLKEKMEE 248
|
90 100 110 120
....*....|....*....|....*....|....*....|....*...
gi 1920237968 1421 EIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRggaegELQALR 1468
Cdd:cd16269 249 ERENLLKEQERALESKLKEQEALLEEGFKEQAELLQE-----EIRSLK 291
|
|
| SPEC |
smart00150 |
Spectrin repeats; |
608-700 |
1.15e-03 |
|
Spectrin repeats;
Pssm-ID: 197544 [Multi-domain] Cd Length: 101 Bit Score: 41.16 E-value: 1.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 608 HGFVAAATKELMWLSDREEEEVGFDWSDRNTNMAAKKEGYSALMHELELKEKKIKEIQSTGDRLLREDHPARPTAESFQA 687
Cdd:smart00150 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLIEEGHPDAEEIEERLE 80
|
90
....*....|...
gi 1920237968 688 ALQTQWSWMLQLC 700
Cdd:smart00150 81 ELNERWEELKELA 93
|
|
| AtpF |
COG0711 |
FoF1-type ATP synthase, membrane subunit b or b' [Energy production and conversion]; FoF1-type ... |
1464-1563 |
1.16e-03 |
|
FoF1-type ATP synthase, membrane subunit b or b' [Energy production and conversion]; FoF1-type ATP synthase, membrane subunit b or b' is part of the Pathway/BioSystem: FoF1-type ATP synthase
Pssm-ID: 440475 [Multi-domain] Cd Length: 152 Bit Score: 42.47 E-value: 1.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1464 LQALRARAEEAE---AQKRQAQEEAERLRRQVQDEtQRKRQAEAElALRVQAEAEAAREKQRALQALEelrlqaEEAERR 1540
Cdd:COG0711 26 LKALDERQEKIAdglAEAERAKEEAEAALAEYEEK-LAEARAEAA-EIIAEARKEAEAIAEEAKAEAE------AEAERI 97
|
90 100
....*....|....*....|...
gi 1920237968 1541 LRQAEAERARQVQVALETAQRSA 1563
Cdd:COG0711 98 IAQAEAEIEQERAKALAELRAEV 120
|
|
| TPH |
pfam13868 |
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ... |
1348-1599 |
1.18e-03 |
|
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.
Pssm-ID: 464007 [Multi-domain] Cd Length: 341 Bit Score: 44.52 E-value: 1.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1348 RERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKAR 1427
Cdd:pfam13868 12 NSKLLAAKCNKERDAQIAEKKRIKAEEKEEERRLDEMMEEERERALEEEEEKEEERKEERKRYRQELEEQIEEREQKRQE 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1428 QVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELA 1507
Cdd:pfam13868 92 EYEEKLQEREQMDEIVERIQEEDQAEAEEKLEKQRQLREEIDEFNEEQAEWKELEKEEEREEDERILEYLKEKAEREEER 171
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1508 LRVQAEAEAAREKQRAlqaleELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELqsehasfAEKTAQLERTLKE 1587
Cdd:pfam13868 172 EAEREEIEEEKEREIA-----RLRAQQEKAQDEKAERDELRAKLYQEEQERKERQKEREE-------AEKKARQRQELQQ 239
|
250
....*....|..
gi 1920237968 1588 EHVAVVQLREEA 1599
Cdd:pfam13868 240 AREEQIELKERR 251
|
|
| PRK10929 |
PRK10929 |
putative mechanosensitive channel protein; Provisional |
1471-1855 |
1.20e-03 |
|
putative mechanosensitive channel protein; Provisional
Pssm-ID: 236798 [Multi-domain] Cd Length: 1109 Bit Score: 45.43 E-value: 1.20e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1471 AEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAR---EKQRALQALEELRLQAEEAERRLRQAEAE 1547
Cdd:PRK10929 104 TDALEQEILQVSSQLLEKSRQAQQEQDRAREISDSLSQLPQQQTEARRqlnEIERRLQTLGTPNTPLAQAQLTALQAESA 183
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1548 RARQVQVALETAQRSAE-----AELQSEhaSFAEKTAQLERTLkeehvavvqlreeatrraqqqaeaeraraeaereler 1622
Cdd:PRK10929 184 ALKALVDELELAQLSANnrqelARLRSE--LAKKRSQQLDAYL------------------------------------- 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1623 wqlkanEALRLRLQAEevaqqksltqaeaekqkeeaerearrrgkaeeqavRQRElAEQELEKQRQLAEGTAQQRLAAEQ 1702
Cdd:PRK10929 225 ------QALRNQLNSQ-----------------------------------RQRE-AERALESTELLAEQSGDLPKSIVA 262
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1703 ELIRLRAETEQGEQQRQLLeEELARLQREAAAATQKRREleaELAKVRAEMEVLLASKARAE----EESRSTSEKSKQRL 1778
Cdd:PRK10929 263 QFKINRELSQALNQQAQRM-DLIASQQRQAASQTLQVRQ---ALNTLREQSQWLGVSNALGEalraQVARLPEMPKPQQL 338
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1779 EAEAGRFRelaeeAARLR--ALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAA---------------ISEATRLK--- 1838
Cdd:PRK10929 339 DTEMAQLR-----VQRLRyeDLLNKQPQLRQIRQADGQPLTAEQNRILDAQLRTqrellnsllsggdtlILELTKLKvan 413
|
410
....*....|....*...
gi 1920237968 1839 TEAEIALKE-KEAENERL 1855
Cdd:PRK10929 414 SQLEDALKEvNEATHRYL 431
|
|
| PspC_subgroup_1 |
NF033838 |
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... |
1666-2048 |
1.21e-03 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 45.00 E-value: 1.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1666 GKAEEQAVRQRELAEQELEK-----QRQLAEGTAQQRLAAEQELIRLRAE--TEQGEQQRQLLEEELARLQREAAAATQK 1738
Cdd:NF033838 50 SSGNESQKEHAKEVESHLEKilseiQKSLDKRKHTQNVALNKKLSDIKTEylYELNVLKEKSEAELTSKTKKELDAAFEQ 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1739 RRELEAELAKVRAEMEVLLA-----SKARAEEESRSTSEKSKQRLEAEAGRFrELAEEAARLRALAEEAKRQRqlaEEDA 1813
Cdd:NF033838 130 FKKDTLEPGKKVAEATKKVEeaekkAKDQKEEDRRNYPTNTYKTLELEIAES-DVEVKKAELELVKEEAKEPR---DEEK 205
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1814 VRQrAEAErVLAEKlaaiSEATRLKteaEIALKEKEAENERLRRLaedEAFQRRLLEEQAAQHKADIEARLAqlRKASES 1893
Cdd:NF033838 206 IKQ-AKAK-VESKK----AEATRLE---KIKTDREKAEEEAKRRA---DAKLKEAVEKNVATSEQDKPKRRA--KRGVLG 271
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1894 ELERQKGLVEDTLRQRRQVEEEIL---ALKGSFEKAAAGKAELELElGRIRGTAEDTLR-----SKEQAEQEAARqrqla 1965
Cdd:NF033838 272 EPATPDKKENDAKSSDSSVGEETLpspSLKPEKKVAEAEKKVEEAK-KKAKDQKEEDRRnyptnTYKTLELEIAE----- 345
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1966 aeEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKA---KVEEARRLRERAEQESARQLQLAQEAAQKrlQAEEKA 2042
Cdd:NF033838 346 --SDVKVKEAELELVKEEAKEPRNEEKIKQAKAKVESKKAeatRLEKIKTDRKKAEEEAKRKAAEEDKVKEK--PAEQPQ 421
|
....*.
gi 1920237968 2043 HAFAVQ 2048
Cdd:NF033838 422 PAPAPQ 427
|
|
| PRK05035 |
PRK05035 |
electron transport complex protein RnfC; Provisional |
1412-1700 |
1.30e-03 |
|
electron transport complex protein RnfC; Provisional
Pssm-ID: 235334 [Multi-domain] Cd Length: 695 Bit Score: 44.94 E-value: 1.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1412 QHLRQSsEAEIQAKARQVEAAERSRLRIEEeirvvrlqleateRQrggaegelqalrARAEeaeaqkRQAQEEAERLRRQ 1491
Cdd:PRK05035 429 QYYRQA-KAEIRAIEQEKKKAEEAKARFEA-------------RQ------------ARLE------REKAAREARHKKA 476
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1492 VQDETQRKRQAEAELALRVQAEAEAAREK----------QRALQALEELR-LQAEEAERRLRQAEAERARQVQVALETAQ 1560
Cdd:PRK05035 477 AEARAAKDKDAVAAALARVKAKKAAATQPivikagarpdNSAVIAAREARkAQARARQAEKQAAAAADPKKAAVAAAIAR 556
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1561 RSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERElerwqlkANEALRLRLQAEEV 1640
Cdd:PRK05035 557 AKAKKAAQQAANAEAEEEVDPKKAAVAAAIARAKAKKAAQQAASAEPEEQVAEVDPKKA-------AVAAAIARAKAKKA 629
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1641 AQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAA 1700
Cdd:PRK05035 630 EQQANAEPEEPVDPRKAAVAAAIARAKARKAAQQQANAEPEEAEDPKKAAVAAAIARAKA 689
|
|
| PLEC |
smart00250 |
Plectin repeat; |
2755-2791 |
1.39e-03 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 39.00 E-value: 1.39e-03
10 20 30
....*....|....*....|....*....|....*..
gi 1920237968 2755 KLLSAERAVTGYKDPYTGEQISLFQAMKKDLIVREHG 2791
Cdd:smart00250 2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
1768-1916 |
1.47e-03 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 44.77 E-value: 1.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1768 RSTSEKSKQRLEAEAGRFRELAEEAArlralaEEAKRQRQL-AEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALK 1846
Cdd:PRK12704 26 KKIAEAKIKEAEEEAKRILEEAKKEA------EAIKKEALLeAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLD 99
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 1847 EKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKasesELERQKGLVEDTLRQR--RQVEEEI 1916
Cdd:PRK12704 100 RKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQ----ELERISGLTAEEAKEIllEKVEEEA 167
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
1336-1581 |
1.48e-03 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 44.50 E-value: 1.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1336 EERLAEQQRaEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQ----------EEVARREEVAVEAQEQKR 1405
Cdd:pfam07888 33 QNRLEECLQ-ERAELLQAQEAANRQREKEKERYKRDREQWERQRRELESRVAelkeelrqsrEKHEELEEKYKELSASSE 111
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1406 SIQEE---LQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQ 1482
Cdd:pfam07888 112 ELSEEkdaLLAQRAAHEARIRELEEDIKTLTQRVLERETELERMKERAKKAGAQRKEEEAERKQLQAKLQQTEEELRSLS 191
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1483 EEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVAleTAQRS 1562
Cdd:pfam07888 192 KEFQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKEAENEALLEELRSLQERLNASERKVEGLGEELSSMA--AQRDR 269
|
250
....*....|....*....
gi 1920237968 1563 AEAELQSEHASFAEKTAQL 1581
Cdd:pfam07888 270 TQAELHQARLQAAQLTLQL 288
|
|
| DUF4670 |
pfam15709 |
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ... |
2199-2393 |
1.54e-03 |
|
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.
Pssm-ID: 464815 [Multi-domain] Cd Length: 522 Bit Score: 44.56 E-value: 1.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2199 QRLKAEVTEAARQRGQVEEElfslrvqmeelgklkarieaenralvlrdkdsaQRLLQEEAEKMKQVAEEAARLSVAAQE 2278
Cdd:pfam15709 341 ERAEMRRLEVERKRREQEEQ---------------------------------RRLQQEQLERAEKMREELELEQQRRFE 387
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2279 AARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEA---ELLQQQKELAQEQARRLQEDKE---QMAQQLAQETQG 2352
Cdd:pfam15709 388 EIRLRKQRLEEERQRQEEEERKQRLQLQAAQERARQQQEEfrrKLQELQRKKQQEEAERAEAEKQrqkELEMQLAEEQKR 467
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 1920237968 2353 FQKTLETERQRQLEMSAEAERLRLRVAEMSRaqARAEEDAR 2393
Cdd:pfam15709 468 LMEMAEEERLEYQRQKQEAEEKARLEAEERR--QKEEEAAR 506
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
853-1810 |
1.75e-03 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 44.78 E-value: 1.75e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 853 EAQEAIARLEAQHQALVAlwhQLHTEMKSLLAWQSLGRDMQLIRSWSLATFRTLKpEEQRQALRSLELHYQAFLRDSQDA 932
Cdd:pfam01576 219 DLQEQIAELQAQIAELRA---QLAKKEEELQAALARLEEETAQKNNALKKIRELE-AQISELQEDLESERAARNKAEKQR 294
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 933 GGFGPE-DRLQAEREYGSCSRHYQQLLQSleQGEQEESRCQRCISElkdirlqleacETRtVHRLRLPLDKEPARECAQR 1011
Cdd:pfam01576 295 RDLGEElEALKTELEDTLDTTAAQQELRS--KREQEVTELKKALEE-----------ETR-SHEAQLQEMRQKHTQALEE 360
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1012 ITEQQKaQAEVDGLGKGVARLSAEAEkVLALPEPSPAAPTLRSELELTLGKLE-QVRSLSAIYLEKLKtislvirstQEA 1090
Cdd:pfam01576 361 LTEQLE-QAKRNKANLEKAKQALESE-NAELQAELRTLQQAKQDSEHKRKKLEgQLQELQARLSESER---------QRA 429
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1091 EEVLRAHEEQLkEAQAVPATLPELEATKAALKKLRAQAEAQ-QPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRER 1169
Cdd:pfam01576 430 ELAEKLSKLQS-ELESVSSLLNEAEGKNIKLSKDVSSLESQlQDTQELLQEETRQKLNLSTRLRQLEDERNSLQEQLEEE 508
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1170 VtlllerwqavlaqtdVRQRELEqlgRQLRyyresadPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLEDIEr 1249
Cdd:pfam01576 509 E---------------EAKRNVE---RQLS-------TLQAQLSDMKKKLEEDAGTLEALEEGKKRLQRELEALTQQLE- 562
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1250 hgEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVASPAKKPK----VQSGSESIIQEYVDLRTRYS----ELSTLTSQ 1321
Cdd:pfam01576 563 --EKAAAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSNLEKKQKkfdqMLAEEKAISARYAEERDRAEaearEKETRALS 640
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1322 YIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEreaqglqrRMQEEVARREEVAVEAQ 1401
Cdd:pfam01576 641 LARALEEALEAKEELERTNKQLRAEMEDLVSSKDDVGKNVHELERSKRALEQQVE--------EMKTQLEELEDELQATE 712
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1402 EQKRSIQEELQHLRQSSEAEIQAKArqvEAAERSRLRIEEEIRVVRLQLEATERQRGGA-------EGELQALRARAEEA 1474
Cdd:pfam01576 713 DAKLRLEVNMQALKAQFERDLQARD---EQGEEKRRQLVKQVRELEAELEDERKQRAQAvaakkklELDLKELEAQIDAA 789
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1475 EAQKRQAQEEAERLRRQVQDetqrkRQAEAELALRVQAEAEA-AREKQRALQALEELRLQAEE----AERRLRQAEAERA 1549
Cdd:pfam01576 790 NKGREEAVKQLKKLQAQMKD-----LQRELEEARASRDEILAqSKESEKKLKNLEAELLQLQEdlaaSERARRQAQQERD 864
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1550 R-QVQVALETAQRSAeaeLQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQqaeaeraraeaerelerwqlkaN 1628
Cdd:pfam01576 865 ElADEIASGASGKSA---LQDEKRRLEARIAQLEEELEEEQSNTELLNDRLRKSTLQ----------------------V 919
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1629 EALRLRLQAEEVAQQKSltqaeaekqkeeaerearrrgkaeEQAVRQRELAEQELEKQRQLAEGTAQQRLAAeqELIRLR 1708
Cdd:pfam01576 920 EQLTTELAAERSTSQKS------------------------ESARQQLERQNKELKAKLQEMEGTVKSKFKS--SIAALE 973
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1709 AETEQgeqqrqlLEEELARLQREAAAATQKRRELEAELAKVRAEMEvllaSKARAEEESRSTSEKSKQRLEAEAGRFREL 1788
Cdd:pfam01576 974 AKIAQ-------LEEQLEQESRERQAANKLVRRTEKKLKEVLLQVE----DERRHADQYKDQAEKGNSRMKQLKRQLEEA 1042
|
970 980
....*....|....*....|..
gi 1920237968 1789 AEEAArlRALAEEAKRQRQLAE 1810
Cdd:pfam01576 1043 EEEAS--RANAARRKLQRELDD 1062
|
|
| Caldesmon |
pfam02029 |
Caldesmon; |
1334-1598 |
1.76e-03 |
|
Caldesmon;
Pssm-ID: 460421 [Multi-domain] Cd Length: 495 Bit Score: 44.47 E-value: 1.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1334 EEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREA-----------QGL-----------QRRMQEEVA 1391
Cdd:pfam02029 5 EEAARERRRRAREERRRQKEEEEPSGQVTESVEPNEHNSYEEDSELkpsgqggldeeEAFldrtakreerrQKRLQEALE 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1392 RREEVAVEAQEQKRSIQEELQHlRQSSEAEIQAKARQVEaAERSRLRIEEEIRVVRL---QLEATERQRGGAEGELQALR 1468
Cdd:pfam02029 85 RQKEFDPTIADEKESVAERKEN-NEEEENSSWEKEEKRD-SRLGRYKEEETEIREKEyqeNKWSTEVRQAEEEGEEEEDK 162
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1469 ARAEEAEAQKRQAQEEAERLRRQVQDE---------------TQRKRQA--EAELALRVQAEAEAAREKQRALQALE-EL 1530
Cdd:pfam02029 163 SEEAEEVPTENFAKEEVKDEKIKKEKKvkyeskvfldqkrghPEVKSQNgeEEVTKLKVTTKRRQGGLSQSQEREEEaEV 242
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1531 RLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAE--KTAQLERTLKEEHVAVVQLREE 1598
Cdd:pfam02029 243 FLEAEQKLEELRRRRQEKESEEFEKLRQKQQEAELELEELKKKREErrKLLEEEEQRRKQEEAERKLREE 312
|
|
| PRK10920 |
PRK10920 |
putative uroporphyrinogen III C-methyltransferase; Provisional |
2309-2387 |
1.77e-03 |
|
putative uroporphyrinogen III C-methyltransferase; Provisional
Pssm-ID: 236795 Cd Length: 390 Bit Score: 44.32 E-value: 1.77e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 2309 QEATRLKAEAELLQQQKELAQEQArrlQEDKEQMAQQLAQETqgfqKTLETERQRQLEMSAEAERLRLRVAEMSRAQAR 2387
Cdd:PRK10920 60 QQAQNQTATNDALANQLTALQKAQ---ESQKQELEGILKQQA----KALDQANRQQAALAKQLDELQQKVATISGSDAK 131
|
|
| HCR |
pfam07111 |
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ... |
1382-2047 |
1.86e-03 |
|
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.
Pssm-ID: 284517 [Multi-domain] Cd Length: 749 Bit Score: 44.36 E-value: 1.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1382 LQRRMQeevARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRL---RIEEEIRVVR-LQLEATERQR 1457
Cdd:pfam07111 21 LERRLD---TQRPTVTMWEQDVSGDGQGPGRRGRSLELEGSQALSQQAELISRQLQelrRLEEEVRLLReTSLQQKMRLE 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1458 GGA-EGELQALRARAEEAEAQK-RQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKqralqALEELRLQAE 1535
Cdd:pfam07111 98 AQAmELDALAVAEKAGQAEAEGlRAALAGAEMVRKNLEEGSQRELEEIQRLHQEQLSSLTQAHEE-----ALSSLTSKAE 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1536 EAERRLRQAEAERARQVQvALETAQRsaEAELQSEHASFAEKTAQLERTLKEEHVAVV--QLREEATRRAQQQAEAERAR 1613
Cdd:pfam07111 173 GLEKSLNSLETKRAGEAK-QLAEAQK--EAELLRKQLSKTQEELEAQVTLVESLRKYVgeQVPPEVHSQTWELERQELLD 249
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1614 AEAERELERWQLKAN-EALRLRLQaeevaqqkSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEG 1692
Cdd:pfam07111 250 TMQHLQEDRADLQATvELLQVRVQ--------SLTHMLALQEEELTRKIQPSDSLEPEFPKKCRSLLNRWREKVFALMVQ 321
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1693 TAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQReaaAATQKRRELEAELAKVRA-EMEVLLASKARAEEESRSTS 1771
Cdd:pfam07111 322 LKAQDLEHRDSVKQLRGQVAELQEQVTSQSQEQAILQR---ALQDKAAEVEVERMSAKGlQMELSRAQEARRRQQQQTAS 398
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1772 EKSKQRLEAEAGRFRELAEEAARLRaLAEEAKRQRQLAEE--DAVRQRAEAERVLAEKLAAIS---EATRLKTEAEIALK 1846
Cdd:pfam07111 399 AEEQLKFVVNAMSSTQIWLETTMTR-VEQAVARIPSLSNRlsYAVRKVHTIKGLMARKVALAQlrqESCPPPPPAPPVDA 477
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1847 EKEAENERLRRlaedeafQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKA 1926
Cdd:pfam07111 478 DLSLELEQLRE-------ERNRLDAELQLSAHLIQQEVGRAREQGEAERQQLSEVAQQLEQELQRAQESLASVGQQLEVA 550
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1927 AAGKAELELELGRIRGTAEDTLRSKEQAEQEA-----ARQRQLAAEEERRRREAEERVQK---SLAAEEEAARQRKAALE 1998
Cdd:pfam07111 551 RQGQQESTEEAASLRQELTQQQEIYGQALQEKvaeveTRLREQLSDTKRRLNEARREQAKavvSLRQIQHRATQEKERNQ 630
|
650 660 670 680 690
....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 1999 EVERLK--AKVEEARRLRERAEQ-ESARQLQLAQEAAQKRLQAEEKAHAFAV 2047
Cdd:pfam07111 631 ELRRLQdeARKEEGQRLARRVQElERDKNLMLATLQQEGLLSRYKQQRLLAV 682
|
|
| TolA |
COG3064 |
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis]; |
1449-1912 |
1.96e-03 |
|
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442298 [Multi-domain] Cd Length: 485 Bit Score: 44.26 E-value: 1.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1449 QLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALE 1528
Cdd:COG3064 17 RLEQAEAEKRAAAEAEQKAKEEAEEERLAELEAKRQAEEEAREAKAEAEQRAAELAAEAAKKLAEAEKAAAEAEKKAAAE 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1529 ELRLQAEEAERRLRQAEAERARQVQValETAQRSAE--AELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQ 1606
Cdd:COG3064 97 KAKAAKEAEAAAAAEKAAAAAEKEKA--EEAKRKAEeeAKRKAEEERKAAEAEAAAKAEAEAARAAAAAAAAAAAAAARA 174
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1607 AEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQ 1686
Cdd:COG3064 175 AAGAAAALVAAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAARAAAASREAALAAVEATEEAALGGAEEAADL 254
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1687 RQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEE 1766
Cdd:COG3064 255 AAVGVLGAALAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAALAAELLGAVAAEEAVLAAAAAAGALVVRGGGA 334
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1767 SRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALK 1846
Cdd:COG3064 335 ASLEAALSLLAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLADVEEAAGAGILAAAGGGGLLGLRLDLGAALLEA 414
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 1847 EKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQV 1912
Cdd:COG3064 415 ASAVELRVLLALAGAAGAVVALLVKLVADLAGGLVGIGKALTGDADALLGILKAVALDGGAVLADL 480
|
|
| FAM184 |
pfam15665 |
Family with sequence similarity 184, A and B; The function of FAM184 is not known. |
1408-1598 |
2.00e-03 |
|
Family with sequence similarity 184, A and B; The function of FAM184 is not known.
Pssm-ID: 464788 [Multi-domain] Cd Length: 211 Bit Score: 42.73 E-value: 2.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1408 QEELQHLRQSSEAEIQakarqveaaersrlRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAER 1487
Cdd:pfam15665 13 EAEIQALKEAHEEEIQ--------------QILAETREKILQYKSKIGEELDLKRRIQTLEESLEQHERMKRQALTEFEQ 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1488 LRRQVQDetqRKRQAEAELALRVQAEA----EAAREKQRALQALEELRLQAE-EAERRLRQAEAERARQVQVALETaqrs 1562
Cdd:pfam15665 79 YKRRVEE---RELKAEAEHRQRVVELSreveEAKRAFEEKLESFEQLQAQFEqEKRKALEELRAKHRQEIQELLTT---- 151
|
170 180 190
....*....|....*....|....*....|....*.
gi 1920237968 1563 aeaeLQSEHASFAEKTAQLERTLKEEHVAVVQLREE 1598
Cdd:pfam15665 152 ----QRAQSASSLAEQEKLEELHKAELESLRKEVED 183
|
|
| CH_FIMB_rpt1 |
cd21294 |
first calponin homology (CH) domain found in Saccharomyces cerevisiae fimbrin and similar ... |
71-146 |
2.06e-03 |
|
first calponin homology (CH) domain found in Saccharomyces cerevisiae fimbrin and similar proteins; Fimbrin binds to actin, and functionally associates with actin structures involved in the development and maintenance of cell polarity. Members of this family contain four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409143 Cd Length: 125 Bit Score: 41.28 E-value: 2.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 71 LYEDLRDGHNLISLLEvlsgDSLP-------------REKGRMRFHKLQNVQIALDYLRHRQVKLVNIRNDDIADGNPKL 137
Cdd:cd21294 38 LFDECKDGLVLSKLIN----DSVPdtidervlnkpprKNKPLNNFQMIENNNIVINSAKAIGCSVVNIGAGDIIEGREHL 113
|
....*....
gi 1920237968 138 TLGLIWTII 146
Cdd:cd21294 114 ILGLIWQII 122
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
2421-2604 |
2.09e-03 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 44.34 E-value: 2.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2421 QTLETQRQQSDR----DAERLREAIAELEHEKDKLKqeaqllqlKSEEMQTVRQEQLLQETQ--ALQQSFLSEKDSLLQR 2494
Cdd:pfam17380 281 QKAVSERQQQEKfekmEQERLRQEKEEKAREVERRR--------KLEEAEKARQAEMDRQAAiyAEQERMAMERERELER 352
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2495 ERcIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEK 2574
Cdd:pfam17380 353 IR-QEERKRELERIRQEEIAMEISRMRELERLQMERQQKNERVRQELEAARKVKILEEERQRKIQQQKVEMEQIRAEQEE 431
|
170 180 190
....*....|....*....|....*....|.
gi 1920237968 2575 LLAEENQRL-RERLQHLEEERRAALARSEEI 2604
Cdd:pfam17380 432 ARQREVRRLeEERAREMERVRLEEQERQQQV 462
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
1703-2050 |
2.10e-03 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 44.12 E-value: 2.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1703 ELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAElaKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEA 1782
Cdd:pfam07888 30 ELLQNRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWE--RQRRELESRVAELKEELRQSREKHEELEEKYKELS 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1783 GRFRELAEEAARLRAlAEEAKRQRQLAEEDAVrqRAEAERVLaeklaaiseatrlkteaeialkEKEAENERLRRLAEDE 1862
Cdd:pfam07888 108 ASSEELSEEKDALLA-QRAAHEARIRELEEDI--KTLTQRVL----------------------ERETELERMKERAKKA 162
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1863 AFQRRLLEEQAAQHKADIEARLAQLRKASeSELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRG 1942
Cdd:pfam07888 163 GAQRKEEEAERKQLQAKLQQTEEELRSLS-KEFQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKEAENEALLEELRS 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1943 TAE-------------------DTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALE----- 1998
Cdd:pfam07888 242 LQErlnaserkveglgeelssmAAQRDRTQAELHQARLQAAQLTLQLADASLALREGRARWAQERETLQQSAEADkdrie 321
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237968 1999 ----EVERLKAKVEEARRLRERAEQESARQ--LQLAQEAAQKRLQAEEKAhAFAVQQK 2050
Cdd:pfam07888 322 klsaELQRLEERLQEERMEREKLEVELGREkdCNRVQLSESRRELQELKA-SLRVAQK 378
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1340-1593 |
2.16e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.54 E-value: 2.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1340 AEQQRAEERERlaeVEAALEkqrqlaEAHAQAKAQAEREAQGL---------------------QRRMQEEVARREEVAV 1398
Cdd:PHA03247 1586 AKQQRAEATDR---VTAALR------EALAAHERRAQSEAESLanlktllrvaaipataaktldQARSVAEIVDQIELLL 1656
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1399 EAQEQKRSIQEE----LQHLRQSSEAEIQAKARQVEAAERSRLRieeeirvVRLQLEATERQRggaegeLQALRARAEEA 1474
Cdd:PHA03247 1657 EQTEKAAELDVAavdwLEHARRVFEAHPLTAARGGGPDPLARLH-------ARLDALGETRRR------TEALRRSLEAA 1723
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1475 EAQKRQAQEEAERLRRQvqdetqrkrqaeaelALRVQAEAEAAREKQRALQALEE--LRLQAEEAERRLrqaeAERARQV 1552
Cdd:PHA03247 1724 EAEWDEVWGRFGRVRGG---------------AWKSPEALRAAREQLRALQTATNtvLGLRADAHYERL----PAKYQGA 1784
|
250 260 270 280
....*....|....*....|....*....|....*....|.
gi 1920237968 1553 qVALETAQRSAEAElqsEHASFAEKTAQLERTLKEEHVAVV 1593
Cdd:PHA03247 1785 -LGAKSAERAGAVE---ELGAAVARHDGLLARLREEVVARV 1821
|
|
| COG4995 |
COG4995 |
Uncharacterized conserved protein, contains CHAT domain [Function unknown]; |
2025-2460 |
2.19e-03 |
|
Uncharacterized conserved protein, contains CHAT domain [Function unknown];
Pssm-ID: 444019 [Multi-domain] Cd Length: 711 Bit Score: 44.19 E-value: 2.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2025 LQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAE 2104
Cdd:COG4995 1 LLALALLALLAALLAALALALLALALLLLLAALAAAALLLLALLALLLALAAAAAAALAAAALALALLAAAALALLLLAL 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2105 RLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQL 2184
Cdd:COG4995 81 ALAALALALLAAALALALAAAALAALALLAALLALAAAAALLALLAALALLALLAALAAALAAAAAAALAAALAAAAAAA 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2185 EETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQ 2264
Cdd:COG4995 161 AAAALLALALALAAAALALLALLLAALAAALAAAAAALALLLALLLLAALAAALAAALAALLLALLALAAALLALLLLAL 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2265 VAEEAARLSvAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQ 2344
Cdd:COG4995 241 LALAAAAAA-LAAAAAALLALAAALLLLAALAALAAAAAAAALAALALAAALALAAAALALALLLAAAAAAALAALALLL 319
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2345 QLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEM-SRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTL 2423
Cdd:COG4995 320 LAALLLLLAALALLALLLLLAAAALLAAALAAALALAaALALALLAALLLLLAALLALLLEALLLLLLALLAALLLLAAA 399
|
410 420 430
....*....|....*....|....*....|....*..
gi 1920237968 2424 ETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQL 2460
Cdd:COG4995 400 LLALAAAQLLRLLLAALALLLALAAYAAARLALLALI 436
|
|
| PRK00409 |
PRK00409 |
recombination and DNA strand exchange inhibitor protein; Reviewed |
1464-1588 |
2.19e-03 |
|
recombination and DNA strand exchange inhibitor protein; Reviewed
Pssm-ID: 234750 [Multi-domain] Cd Length: 782 Bit Score: 44.43 E-value: 2.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1464 LQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEaarekqralQALEELRLQAEEAERRLRQ 1543
Cdd:PRK00409 525 LEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQ---------QAIKEAKKEADEIIKELRQ 595
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 1920237968 1544 AEAERARQVqvaletaqrsAEAELQSEHASFAEKTAQLERTLKEE 1588
Cdd:PRK00409 596 LQKGGYASV----------KAHELIEARKRLNKANEKKEKKKKKQ 630
|
|
| PRK12705 |
PRK12705 |
hypothetical protein; Provisional |
1358-1564 |
2.30e-03 |
|
hypothetical protein; Provisional
Pssm-ID: 237178 [Multi-domain] Cd Length: 508 Bit Score: 43.93 E-value: 2.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1358 LEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEvaRREEVAVEAQEQKRSIQEELQHLrQSSEAEIQAKARQVEAaersrl 1437
Cdd:PRK12705 25 LKKRQRLAKEAERILQEAQKEAEEKLEAALLE--AKELLLRERNQQRQEARREREEL-QREEERLVQKEEQLDA------ 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1438 RIEEeirvvrlqLEATERQRGGAEgelQALRARAEEAEAQKRQAQEEAERLrrqvqdETQRKRQAEAELALRVQAEAEaa 1517
Cdd:PRK12705 96 RAEK--------LDNLENQLEERE---KALSARELELEELEKQLDNELYRV------AGLTPEQARKLLLKLLDAELE-- 156
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 1920237968 1518 REKQRALQALEelrlqaEEAerrlrQAEAERARQVQVAlETAQRSAE 1564
Cdd:PRK12705 157 EEKAQRVKKIE------EEA-----DLEAERKAQNILA-QAMQRIAS 191
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
1337-1598 |
2.31e-03 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 44.43 E-value: 2.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1337 ERLAEQQRAEERERLAEVEaALEKQRQLAEAHAQAkaqaereaqgLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRq 1416
Cdd:pfam10174 453 ERLKEQREREDRERLEELE-SLKKENKDLKEKVSA----------LQPELTEKESSLIDLKEHASSLASSGLKKDSKLK- 520
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1417 SSEAEIQA-------------KARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQK----- 1478
Cdd:pfam10174 521 SLEIAVEQkkeecsklenqlkKAHNAEEAVRTNPEINDRIRLLEQEVARYKEESGKAQAEVERLLGILREVENEKndkdk 600
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1479 ----------RQAQEEAERLR--RQVQDETQRKRQAEAELALRVQ-AEAEAAREKQRA--LQALEELRLQAEEAERRLRQ 1543
Cdd:pfam10174 601 kiaelesltlRQMKEQNKKVAniKHGQQEMKKKGAQLLEEARRREdNLADNSQQLQLEelMGALEKTRQELDATKARLSS 680
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237968 1544 AE--------------AERARQVQVALETAQRSAEAELQSEHASFA--EKTAQLERTLKEEhvaVVQLREE 1598
Cdd:pfam10174 681 TQqslaekdghltnlrAERRKQLEEILEMKQEALLAAISEKDANIAllELSSSKKKKTQEE---VMALKRE 748
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
2146-2310 |
2.33e-03 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 42.99 E-value: 2.33e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2146 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQ 2225
Cdd:COG1579 9 LLDLQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNN 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2226 mEELGKLKARIEAENRALVLRDkDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKM 2305
Cdd:COG1579 89 -KEYEALQKEIESLKRRISDLE-DEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAERE 166
|
....*
gi 1920237968 2306 QAVQE 2310
Cdd:COG1579 167 ELAAK 171
|
|
| Rabaptin |
pfam03528 |
Rabaptin; |
1328-1600 |
2.35e-03 |
|
Rabaptin;
Pssm-ID: 367545 [Multi-domain] Cd Length: 486 Bit Score: 43.94 E-value: 2.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1328 ETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEaHAQAKAQAEREAQGLQRRMQEevARREEVAVEAQEQKRSI 1407
Cdd:pfam03528 96 DEVKSQWQEEVASLQAIMKETVREYEVQFHRRLEQERAQ-WNQYRESAEREIADLRRRLSE--GQEEENLEDEMKKAQED 172
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1408 QEELQHLRQSSEAEIQA-KARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALraraeeaEAQKRQAQEEAE 1486
Cdd:pfam03528 173 AEKLRSVVMPMEKEIAAlKAKLTEAEDKIKELEASKMKELNHYLEAEKSCRTDLEMYVAVL-------NTQKSVLQEDAE 245
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1487 RLRRQVQDETQRkrqaeaeLALRVQAEAEAAREKQRAL-QALEELRLQAEEAERRLRQAEAERARQVqvalETAQRSAEA 1565
Cdd:pfam03528 246 KLRKELHEVCHL-------LEQERQQHNQLKHTWQKANdQFLESQRLLMRDMQRMESVLTSEQLRQV----EEIKKKDQE 314
|
250 260 270
....*....|....*....|....*....|....*
gi 1920237968 1566 ELQSEHASFAEKTAQLERTLKEEHVAVVQLREEAT 1600
Cdd:pfam03528 315 EHKRARTHKEKETLKSDREHTVSIHAVFSPAGVET 349
|
|
| CH_PLS_rpt1 |
cd21292 |
first calponin homology (CH) domain found in the plastin family; The plastin family includes ... |
44-146 |
2.36e-03 |
|
first calponin homology (CH) domain found in the plastin family; The plastin family includes plastin-1, -2, and -3, which are all actin-bundling proteins. Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. Plastin-2, also called L-plastin, LC64P, or lymphocyte cytosolic protein 1 (LCP-1), plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-3, also called T-plastin, is found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Members of this family contain four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409141 Cd Length: 145 Bit Score: 41.50 E-value: 2.36e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 44 QKKTFTKWVNKHLIKHwrAEAQRHI------SDLYEDLRDGHNLISLLEVLSGDSLPR----EKGRMRFHKLQNVQIALD 113
Cdd:cd21292 25 EKVAFVNWINKNLGDD--PDCKHLLpmdpntDDLFEKVKDGILLCKMINLSVPDTIDErainKKKLTVFTIHENLTLALN 102
|
90 100 110
....*....|....*....|....*....|...
gi 1920237968 114 YLRHRQVKLVNIRNDDIADGNPKLTLGLIWTII 146
Cdd:cd21292 103 SASAIGCNVVNIGAEDLKEGKPHLVLGLLWQII 135
|
|
| PRK09039 |
PRK09039 |
peptidoglycan -binding protein; |
1449-1564 |
2.37e-03 |
|
peptidoglycan -binding protein;
Pssm-ID: 181619 [Multi-domain] Cd Length: 343 Bit Score: 43.42 E-value: 2.37e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1449 QLEATERQR-GGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQAL 1527
Cdd:PRK09039 67 DLLSLERQGnQDLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQELDSEKQVSARALAQVELLNQQI 146
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 1920237968 1528 EELR---------LQAEEAERRLRQAE-AERARQVQVALetAQRSAE 1564
Cdd:PRK09039 147 AALRrqlaaleaaLDASEKRDRESQAKiADLGRRLNVAL--AQRVQE 191
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1352-1559 |
2.39e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.54 E-value: 2.39e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1352 AEVEAALEKQRQLAEAHAQAKAQAEREAQGLQ--RRMQEEVARREEVAVEAQEQKRSIQE-----------ELQHLRQSS 1418
Cdd:PHA03247 1150 STVDAAVRAHGVLADAVAALSPAVRDPACPLAflVALADSAAGYVKATRLALDARRAIARlgalgaaaadlAVAVRRENP 1229
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1419 EAE------IQAKARQVEAAERSRLRIEEEIRVVrLQLEATERQRGGAEGELQAL-------RARAEEAEAQkrqAQEEA 1485
Cdd:PHA03247 1230 QAEgdraalLEAAARAVTAAREGLAACEGEFGGL-LHAEGSAGDPSPSGRALQELgkvvgatRRRADELEAA---AADLA 1305
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 1486 ERLRRQVQDETQRKRQAEAELAL-RVQAEAEAAREKQRALQALE-ELRLQAEEAERRLRQAEAERARQVQVALETA 1559
Cdd:PHA03247 1306 EKMAARRARASRERWAADVEAALdRVENRAEFDAVELRRLQALAaTHGYNPRDFRKRAEQALAANAKTATLALEAA 1381
|
|
| Rabaptin |
pfam03528 |
Rabaptin; |
2265-2508 |
2.53e-03 |
|
Rabaptin;
Pssm-ID: 367545 [Multi-domain] Cd Length: 486 Bit Score: 43.94 E-value: 2.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2265 VAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKML--KEKMQAVQEATRLKAEAELLQQQKELAQEQAR--------- 2333
Cdd:pfam03528 6 LQQRVAELEKENAEFYRLKQQLEAEFNQKRAKFKELYlaKEEDLKRQNAVLQEAQVELDALQNQLALARAEmenikavat 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2334 ----RLQEDKEQMAQQLAQETQGFQKTL-ETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRfrkqaedigerlyR 2408
Cdd:pfam03528 86 vsenTKQEAIDEVKSQWQEEVASLQAIMkETVREYEVQFHRRLEQERAQWNQYRESAEREIADLRR-------------R 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2409 TELATQEkvmlvQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQA--------- 2479
Cdd:pfam03528 153 LSEGQEE-----ENLEDEMKKAQEDAEKLRSVVMPMEKEIAALKAKLTEAEDKIKELEASKMKELNHYLEAekscrtdle 227
|
250 260 270 280
....*....|....*....|....*....|....*....|.
gi 1920237968 2480 -------LQQSFLSEKDSLLQRE-----RCIEQEKAKLEQL 2508
Cdd:pfam03528 228 myvavlnTQKSVLQEDAEKLRKElhevcHLLEQERQQHNQL 268
|
|
| Borrelia_P83 |
pfam05262 |
Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins. |
1670-1814 |
2.57e-03 |
|
Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins.
Pssm-ID: 114011 [Multi-domain] Cd Length: 489 Bit Score: 43.84 E-value: 2.57e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1670 EQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV 1749
Cdd:pfam05262 209 QEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQI 288
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237968 1750 RAEM---EVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAV 1814
Cdd:pfam05262 289 EIKKndeEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPTSLNEDAI 356
|
|
| SPEC |
smart00150 |
Spectrin repeats; |
511-605 |
2.62e-03 |
|
Spectrin repeats;
Pssm-ID: 197544 [Multi-domain] Cd Length: 101 Bit Score: 40.39 E-value: 2.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 511 LRYLQDLLAWVEENQRRLDSAEWGVDLPSVEAQLGSHRGLHQSVEEFRTKIERARTDEGQL---SPATRGAYRDCLGRLD 587
Cdd:smart00150 4 LRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLieeGHPDAEEIEERLEELN 83
|
90
....*....|....*...
gi 1920237968 588 LQYAKLLSSSKARLRSLE 605
Cdd:smart00150 84 ERWEELKELAEERRQKLE 101
|
|
| TolA |
COG3064 |
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis]; |
1412-1901 |
3.00e-03 |
|
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442298 [Multi-domain] Cd Length: 485 Bit Score: 43.49 E-value: 3.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1412 QHLRQSSEAEIQAKARqVEAAERSRLRIEEEIRVVRLQLEATERQRGGA--EGELQALRARAEEAEAQKRQAQEEAERLR 1489
Cdd:COG3064 2 QEALEEKAAEAAAQER-LEQAEAEKRAAAEAEQKAKEEAEEERLAELEAkrQAEEEAREAKAEAEQRAAELAAEAAKKLA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1490 RQVQDETQRKRQAEAELAlRVQAEAEAAREKQRALQALEELRLQAEEaerrlRQAEAERARQVQVALETAQRSAEAELQS 1569
Cdd:COG3064 81 EAEKAAAEAEKKAAAEKA-KAAKEAEAAAAAEKAAAAAEKEKAEEAK-----RKAEEEAKRKAEEERKAAEAEAAAKAEA 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1570 EHASFAEKTAQLERTLKEEhVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQA 1649
Cdd:COG3064 155 EAARAAAAAAAAAAAAAAR-AAAGAAAALVAAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAARAAAASREAA 233
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1650 EAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQ 1729
Cdd:COG3064 234 LAAVEATEEAALGGAEEAADLAAVGVLGAALAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAALAAELLGAVAA 313
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1730 REAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLA 1809
Cdd:COG3064 314 EEAVLAAAAAAGALVVRGGGAASLEAALSLLAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLADVEEAAGAGILA 393
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1810 EEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRK 1889
Cdd:COG3064 394 AAGGGGLLGLRLDLGAALLEAASAVELRVLLALAGAAGAVVALLVKLVADLAGGLVGIGKALTGDADALLGILKAVALDG 473
|
490
....*....|..
gi 1920237968 1890 ASESELERQKGL 1901
Cdd:COG3064 474 GAVLADLLLLGG 485
|
|
| Caldesmon |
pfam02029 |
Caldesmon; |
2306-2603 |
3.34e-03 |
|
Caldesmon;
Pssm-ID: 460421 [Multi-domain] Cd Length: 495 Bit Score: 43.32 E-value: 3.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2306 QAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRvaemsrAQ 2385
Cdd:pfam02029 6 EAARERRRRAREERRRQKEEEEPSGQVTESVEPNEHNSYEEDSELKPSGQGGLDEEEAFLDRTAKREERRQK------RL 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2386 ARAEEDARRFRKQAEDIGErlyrtELATQEKVMLVQTLETQRQQSDRDAERLREAIAELE------------------HE 2447
Cdd:pfam02029 80 QEALERQKEFDPTIADEKE-----SVAERKENNEEEENSSWEKEEKRDSRLGRYKEEETEirekeyqenkwstevrqaEE 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2448 KDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQ 2527
Cdd:pfam02029 155 EGEEEEDKSEEAEEVPTENFAKEEVKDEKIKKEKKVKYESKVFLDQKRGHPEVKSQNGEEEVTKLKVTTKRRQGGLSQSQ 234
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2528 QQMQQEKQQLAA--SMEEARRRQHEAEEgvrrqqEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRA-------AL 2598
Cdd:pfam02029 235 EREEEAEVFLEAeqKLEELRRRRQEKES------EEFEKLRQKQQEAELELEELKKKREERRKLLEEEEQRrkqeeaeRK 308
|
....*
gi 1920237968 2599 ARSEE 2603
Cdd:pfam02029 309 LREEE 313
|
|
| CusB_dom_1 |
pfam00529 |
Cation efflux system protein CusB domain 1; The cation efflux system protein CusB from E. coli ... |
1415-1568 |
3.35e-03 |
|
Cation efflux system protein CusB domain 1; The cation efflux system protein CusB from E. coli can be divided into four different domains, the first three domains of the protein are mostly beta-strands and the fourth forms an all alpha-helical domain. This entry represents the first beta-domain (domain 1) of CusB and it is formed by the N and C-terminal ends of the polypeptide (residues 89-102 and 324-385). CusB is part of the copper-transporting efflux system CusCFBA. This domain can also be found in other membrane-fusion proteins, such as HlyD, MdtN, MdtE and AaeA. HlyD is a component of the prototypical alpha-haemolysin (HlyA) bacterial type I secretion system, along with the other components HlyB and TolC. HlyD is anchored in the cytoplasmic membrane by a single transmembrane domain and has a large periplasmic domain within the carboxy-terminal 100 amino acids, HlyB and HlyD form a stable complex that binds the recombinant protein bearing a C-terminal HlyA signal sequence and ATP in the cytoplasm. HlyD, HlyB and TolC combine to form the three-component ABC transporter complex that forms a trans-membrane channel or pore through which HlyA can be transferred directly to the extracellular medium. Cutinase has been shown to be transported effectively through this pore.
Pssm-ID: 425733 [Multi-domain] Cd Length: 322 Bit Score: 43.18 E-value: 3.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1415 RQSSEAEIQAKARQVEA-AERSRLRIE-EEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERlRRQV 1492
Cdd:pfam00529 54 TDYQAALDSAEAQLAKAqAQVARLQAElDRLQALESELAISRQDYDGATAQLRAAQAAVKAAQAQLAQAQIDLAR-RRVL 132
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 1493 QDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQ 1568
Cdd:pfam00529 133 APIGGISRESLVTAGALVAQAQANLLATVAQLDQIYVQITQSAAENQAEVRSELSGAQLQIAEAEAELKLAKLDLE 208
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
2271-2389 |
3.38e-03 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 43.61 E-value: 3.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2271 RLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEA---TRLKAEAELLQQQKELaQEQARRLQEDKEQMAQQLA 2347
Cdd:PRK12704 25 RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEihkLRNEFEKELRERRNEL-QKLEKRLLQKEENLDRKLE 103
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 1920237968 2348 ------QETQGFQKTLETERQRQLEMSAEAERLR-------LRVAEMSRAQARAE 2389
Cdd:PRK12704 104 llekreEELEKKEKELEQKQQELEKKEEELEELIeeqlqelERISGLTAEEAKEI 158
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
2166-2599 |
3.41e-03 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 43.66 E-value: 3.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2166 ALRQKAQVEQ-ELTALRLQLEEtdhQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEaeNRALV 2244
Cdd:pfam10174 335 AKEQRAAILQtEVDALRLRLEE---KESFLNKKTKQLQDLTEEKSTLAGEIRDLKDMLDVKERKINVLQKKIE--NLQEQ 409
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2245 LRDKDsaqRLLQEEAEKMKQVAEEAARLSVA---AQEAARLRQLAEEDLAQQRALAEKMLKEKMQAV-QEATRLKAEAEL 2320
Cdd:pfam10174 410 LRDKD---KQLAGLKERVKSLQTDSSNTDTAlttLEEALSEKERIIERLKEQREREDRERLEELESLkKENKDLKEKVSA 486
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2321 LQQQKelaQEQARRLQEDKEQmAQQLAQ---ETQGFQKTLETERQRQLEmsaEAERLrlrVAEMSRAQaRAEEDARrfrk 2397
Cdd:pfam10174 487 LQPEL---TEKESSLIDLKEH-ASSLASsglKKDSKLKSLEIAVEQKKE---ECSKL---ENQLKKAH-NAEEAVR---- 551
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2398 QAEDIGERLYRTELATQEKVMlvqtlETQRQQSDrdAERLREAIAELEHEK-DKLKQEAQLLQLKSEEMQTvrQEQLLQE 2476
Cdd:pfam10174 552 TNPEINDRIRLLEQEVARYKE-----ESGKAQAE--VERLLGILREVENEKnDKDKKIAELESLTLRQMKE--QNKKVAN 622
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2477 TQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQdevakaqalreeqqrqqqqmqqekqqLAASMEEARRRQHEAEEGVR 2556
Cdd:pfam10174 623 IKHGQQEMKKKGAQLLEEARRREDNLADNSQQLQ--------------------------LEELMGALEKTRQELDATKA 676
|
410 420 430 440
....*....|....*....|....*....|....*....|...
gi 1920237968 2557 RQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALA 2599
Cdd:pfam10174 677 RLSSTQQSLAEKDGHLTNLRAERRKQLEEILEMKQEALLAAIS 719
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1055-1271 |
3.48e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 43.75 E-value: 3.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1055 ELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAHEEQLKEAQAVPATLPELEATKAALkklraqaEAQQPV 1134
Cdd:COG4913 614 ALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAEREIAELEAELERL-------DASSDD 686
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1135 FDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTLLLERWQAVLAQTDvrqrELEQLGRQLRYYResadpLGAWLRD 1214
Cdd:COG4913 687 LAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLE----AAEDLARLELRAL-----LEERFAA 757
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 1215 AKQRqeqiqavplANSQAVREQLRQE-KALLEDIERHGEKVEEC-QRFAKQYINAIKDY 1271
Cdd:COG4913 758 ALGD---------AVERELRENLEERiDALRARLNRAEEELERAmRAFNREWPAETADL 807
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3749-3780 |
3.50e-03 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 37.85 E-value: 3.50e-03
10 20 30
....*....|....*....|....*....|..
gi 1920237968 3749 RLLSAERAVTGYRDPYTEQTISLFQAMKKDLI 3780
Cdd:smart00250 2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLI 33
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
2263-2604 |
3.52e-03 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 43.79 E-value: 3.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2263 KQVAEEAARLSVAA--QEAARLRQLAEEDLAQQRALAEKmlkekmqavqEATRLKAEAELLQQQKELAQEQARR--LQED 2338
Cdd:PRK04863 260 KHLITESTNYVAADymRHANERRVHLEEALELRRELYTS----------RRQLAAEQYRLVEMARELAELNEAEsdLEQD 329
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2339 KEQMA--QQLAQETQGFQKTLE------TERQRQLEMSAEAERLRLRVAEMSRAQAR-AEEDARRFRKQAEDIGERL--- 2406
Cdd:PRK04863 330 YQAASdhLNLVQTALRQQEKIEryqadlEELEERLEEQNEVVEEADEQQEENEARAEaAEEEVDELKSQLADYQQALdvq 409
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2407 YRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEaqLLQLksEEMQTVRQEQLLQETQALQ--QSF 2484
Cdd:PRK04863 410 QTRAIQYQQAVQALERAKQLCGLPDLTADNAEDWLEEFQAKEQEATEE--LLSL--EQKLSVAQAAHSQFEQAYQlvRKI 485
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2485 LSEKDsllqRERCIEQEKAKLEQL--FQDEVAKAQALReeqqrqqqqmqqekqqlaASMEEARRRQHEAEEGVRRQQEEL 2562
Cdd:PRK04863 486 AGEVS----RSEAWDVARELLRRLreQRHLAEQLQQLR------------------MRLSELEQRLRQQQRAERLLAEFC 543
|
330 340 350 360
....*....|....*....|....*....|....*....|..
gi 1920237968 2563 QRLAQQQQQQEkLLAEENQRLRERLQHLEEERRAALARSEEI 2604
Cdd:PRK04863 544 KRLGKNLDDED-ELEQLQEELEARLESLSESVSEARERRMAL 584
|
|
| PLEC |
smart00250 |
Plectin repeat; |
2718-2751 |
3.54e-03 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 37.85 E-value: 3.54e-03
10 20 30
....*....|....*....|....*....|....
gi 1920237968 2718 LLEAQAASGFLLDPVRNRRLAVNEAVKEGIVGPE 2751
Cdd:smart00250 3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE 36
|
|
| Tektin |
pfam03148 |
Tektin family; Tektins are cytoskeletal proteins. They have been demonstrated in such cellular ... |
1684-1957 |
3.62e-03 |
|
Tektin family; Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers of tektin heterodimers with axial periodicity matching tubulin. Tektin polypeptides consist of several alpha-helical regions that are predicted to form coiled coils. Indeed, tektins share considerable structural similarities with intermediate filament proteins. Possible functional roles for tektins are: stabilization of tubulin protofilaments; attachment of A and B-tubules in ciliary/flagellar microtubule doublets and C-tubules in centrioles; binding of axonemal components.
Pssm-ID: 460827 [Multi-domain] Cd Length: 383 Bit Score: 43.31 E-value: 3.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1684 EKQRQLAEgtaQQRLAAE---QELIRLRAETEQGEQQRQllEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASK 1760
Cdd:pfam03148 6 QELYREAE---AQRNDAErlrQESRRLRNETDAKTKWDQ--YDSNRRLGERIQDITFWKSELEKELEELDEEIELLLEEK 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1761 ARAEEESRSTSEK---SKQRLEAEAGRF----------RELAEEA-------ARLRALAEEAkrQRQLAEEDAVRQRAEA 1820
Cdd:pfam03148 81 RRLEKALEALEEPlhiAQECLTLREKRQgidlvhdeveKELLKEVeliegiqELLQRTLEQA--WEQLRLLRAARHKLEK 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1821 ErvLAEKLAAI---SEATRLK-TEAEIALKEKEAENERLRRLAED-EAFQRRLLE--EQAAQHKADIEARLAQLRKASES 1893
Cdd:pfam03148 159 D--LSDKKEALeidEKCLSLNnTSPNISYKPGPTRIPPNSSTPEEwEKFTQDNIEraEKERAASAQLRELIDSILEQTAN 236
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 1894 ELERQKGLVEDTLRQRrqVEEeilalkgsFEKAaagKAELELELGRIRgtaedtlrsKEQAEQE 1957
Cdd:pfam03148 237 DLRAQADAVNFALRKR--IEE--------TEDA---KNKLEWQLKKTL---------QEIAELE 278
|
|
| PTZ00491 |
PTZ00491 |
major vault protein; Provisional |
2251-2384 |
3.64e-03 |
|
major vault protein; Provisional
Pssm-ID: 240439 [Multi-domain] Cd Length: 850 Bit Score: 43.47 E-value: 3.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2251 AQRLLQE-----EAEKMK-QVAEEAAR---LSVAAQEAARlrQLAEEDLAQQRALAEKMLKEkMQAVQEATRLKAEAELL 2321
Cdd:PTZ00491 672 AELLEQEargrlERQKMHdKAKAEEQRtklLELQAESAAV--ESSGQSRAEALAEAEARLIE-AEAEVEQAELRAKALRI 748
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237968 2322 QQQKELAQEQARRLQEdkeqmaqqLAQETQgfQKTLETERQRQLeMSAEAERL--------RLRVAEMSRA 2384
Cdd:PTZ00491 749 EAEAELEKLRKRQELE--------LEYEQA--QNELEIAKAKEL-ADIEATKFerivealgRETLIAIARA 808
|
|
| PRK11281 |
PRK11281 |
mechanosensitive channel MscK; |
1174-1447 |
3.66e-03 |
|
mechanosensitive channel MscK;
Pssm-ID: 236892 [Multi-domain] Cd Length: 1113 Bit Score: 43.75 E-value: 3.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1174 LERWQAVLAQTDVRQRELEQLGRQLryyrESADplgawlRDAKQRQEQIQAVPLANSQAVREQLrqEKALLEDIERhgeK 1253
Cdd:PRK11281 65 LEQTLALLDKIDRQKEETEQLKQQL----AQAP------AKLRQAQAELEALKDDNDEETRETL--STLSLRQLES---R 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1254 VEECQRFAKQYINAIKDYELQLVTYKAQLEpvaspakkpKVQSGSESIIQEYVDLRTRYSelSTLTSQyiRFISETLR-R 1332
Cdd:PRK11281 130 LAQTLDQLQNAQNDLAEYNSQLVSLQTQPE---------RAQAALYANSQRLQQIRNLLK--GGKVGG--KALRPSQRvL 196
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1333 MEEEERLAEQQRAEERERLA---EVEAALEKQRQLAEAHAQakaQAEREAQGLQRRM-QEEVARREEVAVEAQEQKRS-- 1406
Cdd:PRK11281 197 LQAEQALLNAQNDLQRKSLEgntQLQDLLQKQRDYLTARIQ---RLEHQLQLLQEAInSKRLTLSEKTVQEAQSQDEAar 273
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1407 ------IQEELQHLRQSSEAEIQAKAR-------------QVEAAERSRLRIEEEIRVVR 1447
Cdd:PRK11281 274 iqanplVAQELEINLQLSQRLLKATEKlntltqqnlrvknWLDRLTQSERNIKEQISVLK 333
|
|
| RecN |
COG0497 |
DNA repair ATPase RecN [Replication, recombination and repair]; |
1463-1597 |
3.67e-03 |
|
DNA repair ATPase RecN [Replication, recombination and repair];
Pssm-ID: 440263 [Multi-domain] Cd Length: 555 Bit Score: 43.53 E-value: 3.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1463 ELQALRARAEEAEAQKRQAQEEAERLRRQVQdetqrkrqaE-AELALRVQAEAEAAREKQRaLQALEELRLQAEEAERRL 1541
Cdd:COG0497 166 AWRALKKELEELRADEAERARELDLLRFQLE---------ElEAAALQPGEEEELEEERRR-LSNAEKLREALQEALEAL 235
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 1542 RQAEAerarQVQVALETAQRSAEaELQSEHASFAEKTAQLERtlkeehvAVVQLRE 1597
Cdd:COG0497 236 SGGEG----GALDLLGQALRALE-RLAEYDPSLAELAERLES-------ALIELEE 279
|
|
| PspA |
COG1842 |
Phage shock protein A [Transcription, Signal transduction mechanisms]; |
1438-1568 |
3.73e-03 |
|
Phage shock protein A [Transcription, Signal transduction mechanisms];
Pssm-ID: 441447 [Multi-domain] Cd Length: 217 Bit Score: 42.12 E-value: 3.73e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1438 RIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAA 1517
Cdd:COG1842 20 KAEDPEKMLDQAIRDMEEDLVEARQALAQVIANQKRLERQLEELEAEAEKWEEKARLALEKGREDLAREALERKAELEAQ 99
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 1518 REKQRAL-----QALEELRLQAEEAERRLRQAEAERArqvqvALETAQRSAEAELQ 1568
Cdd:COG1842 100 AEALEAQlaqleEQVEKLKEALRQLESKLEELKAKKD-----TLKARAKAAKAQEK 150
|
|
| PRK07735 |
PRK07735 |
NADH-quinone oxidoreductase subunit C; |
1800-2049 |
3.79e-03 |
|
NADH-quinone oxidoreductase subunit C;
Pssm-ID: 236081 [Multi-domain] Cd Length: 430 Bit Score: 43.05 E-value: 3.79e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1800 EEAKRQRQLAEEDAVRQRAEAERVLAEKLAA-ISEATRLKTEAEIALKEKEAENerlrrlaedeafqrrlLEEQAAQHKA 1878
Cdd:PRK07735 2 DPEKDLEDLKKEAARRAKEEARKRLVAKHGAeISKLEEENREKEKALPKNDDMT----------------IEEAKRRAAA 65
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1879 DIEARLAQLRKASESELERQkglvedtlrqrrqVEEEILALKGSFEKAAAGKAElELELGRIRGTAEDTLRSKEQAEQEA 1958
Cdd:PRK07735 66 AAKAKAAALAKQKREGTEEV-------------TEEEKAKAKAKAAAAAKAKAA-ALAKQKREGTEEVTEEEKAAAKAKA 131
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1959 ARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQA 2038
Cdd:PRK07735 132 AAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAA 211
|
250
....*....|.
gi 1920237968 2039 EEKAHAFAVQQ 2049
Cdd:PRK07735 212 KAKAAALAKQK 222
|
|
| KpsE |
COG3524 |
Capsule polysaccharide export protein KpsE/RkpR [Cell wall/membrane/envelope biogenesis]; |
1364-1551 |
3.99e-03 |
|
Capsule polysaccharide export protein KpsE/RkpR [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442746 [Multi-domain] Cd Length: 370 Bit Score: 42.92 E-value: 3.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1364 LAEAHAQAKAQAEREAQGLQRRMQEEVARreevaveAQEQKRSIQEELQHLRQsseaeiqakarqveaaERSRLRIEEEI 1443
Cdd:COG3524 160 LAESEELVNQLSERAREDAVRFAEEEVER-------AEERLRDAREALLAFRN----------------RNGILDPEATA 216
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1444 RVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAL-RVQAEaeaarekqr 1522
Cdd:COG3524 217 EALLQLIATLEGQLAELEAELAALRSYLSPNSPQVRQLRRRIAALEKQIAAERARLTGASGGDSLaSLLAE--------- 287
|
170 180 190
....*....|....*....|....*....|.
gi 1920237968 1523 alqaLEELRLQAEEAERRLRQAEA--ERARQ 1551
Cdd:COG3524 288 ----YERLELEREFAEKAYTSALAalEQARI 314
|
|
| DUF4670 |
pfam15709 |
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ... |
2095-2281 |
4.11e-03 |
|
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.
Pssm-ID: 464815 [Multi-domain] Cd Length: 522 Bit Score: 43.40 E-value: 4.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2095 QSRRQVEEAERLKQsAEEQAQAQAQAQAAAEKLRKeaeqeaarraqaeqAALRQKQAADAEMEKHKQFAEQALRQKAQVE 2174
Cdd:pfam15709 360 QRRLQQEQLERAEK-MREELELEQQRRFEEIRLRK--------------QRLEEERQRQEEEERKQRLQLQAAQERARQQ 424
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2175 QEltALRLQLEETDHQKsildeelQRLKAEVTEAARQRGQVEEElfslrvQMEELGKLKARIEAENRALVLRDKDSAQRL 2254
Cdd:pfam15709 425 QE--EFRRKLQELQRKK-------QQEEAERAEAEKQRQKELEM------QLAEEQKRLMEMAEEERLEYQRQKQEAEEK 489
|
170 180 190
....*....|....*....|....*....|..
gi 1920237968 2255 LQEEAEKMKQVAEEAARLSVA-----AQEAAR 2281
Cdd:pfam15709 490 ARLEAEERRQKEEEAARLALEeamkqAQEQAR 521
|
|
| Crescentin |
pfam19220 |
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament ... |
1109-1519 |
4.15e-03 |
|
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament proteins, named crescentin, whose cytoskeletal function is required for the vibrioid and helical shapes of Caulobacter crescentus. Without crescentin, the cells adopt a straight-rod morphology. Crescentin has characteriztic features of IF proteins including the ability to assemble into filaments in vitro without energy or cofactor requirements. In vivo, crescentin forms a helical structure that colocalizes with the inner cell curvatures beneath the cytoplasmic membrane.
Pssm-ID: 437057 [Multi-domain] Cd Length: 401 Bit Score: 43.13 E-value: 4.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1109 ATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGaqevgerLQQRHGERDVEVERWRERvtllLERWQAVLAQTDVRQ 1188
Cdd:pfam19220 38 AILRELPQAKSRLLELEALLAQERAAYGKLRRELAG-------LTRRLSAAEGELEELVAR----LAKLEAALREAEAAK 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1189 RELEQLGRQLRYYRESADplgawlRDAKQRQEQIQAVPLANsQAVREQLRQEKALLEDIERHGEKVEECQRFAKQyinai 1268
Cdd:pfam19220 107 EELRIELRDKTAQAEALE------RQLAAETEQNRALEEEN-KALREEAQAAEKALQRAEGELATARERLALLEQ----- 174
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1269 kdyelqlvtykaqlepvaspaKKPKVQSGSESIIQEYVDLRTRYSELSTL---TSQYIRFISETLRRMEEEERLAEQQRA 1345
Cdd:pfam19220 175 ---------------------ENRRLQALSEEQAAELAELTRRLAELETQldaTRARLRALEGQLAAEQAERERAEAQLE 233
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1346 EERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLrqssEAEIQAK 1425
Cdd:pfam19220 234 EAVEAHRAERASLRMKLEALTARAAATEQLLAEARNQLRDRDEAIRAAERRLKEASIERDTLERRLAGL----EADLERR 309
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1426 ARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRqaeAE 1505
Cdd:pfam19220 310 TQQFQEMQRARAELEERAEMLTKALAAKDAALERAEERIASLSDRIAELTKRFEVERAALEQANRRLKEELQRER---AE 386
|
410
....*....|....
gi 1920237968 1506 LALrVQAEAEAARE 1519
Cdd:pfam19220 387 RAL-AQGALEIARE 399
|
|
| ClpA |
COG0542 |
ATP-dependent Clp protease, ATP-binding subunit ClpA [Posttranslational modification, protein ... |
1430-1547 |
4.16e-03 |
|
ATP-dependent Clp protease, ATP-binding subunit ClpA [Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 440308 [Multi-domain] Cd Length: 836 Bit Score: 43.53 E-value: 4.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1430 EAAerSRLRIE-----EEIRVVRLQLEATERqrggaegELQALRaraeeaEAQKRQAQEEAERLRRQVQDETQRKRQAEA 1504
Cdd:COG0542 397 EAA--ARVRMEidskpEELDELERRLEQLEI-------EKEALK------KEQDEASFERLAELRDELAELEEELEALKA 461
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 1920237968 1505 elalRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAE 1547
Cdd:COG0542 462 ----RWEAEKELIEEIQELKEELEQRYGKIPELEKELAELEEE 500
|
|
| SAC6 |
COG5069 |
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton]; |
18-263 |
4.24e-03 |
|
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton];
Pssm-ID: 227401 [Multi-domain] Cd Length: 612 Bit Score: 43.39 E-value: 4.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 18 VTLAAVWHW---HRGRRRVQDEQ-------DERDRVQKKTFTKWVNKHLIKHwraeaqrHISDLYEDLRDGHNLISLLEV 87
Cdd:COG5069 344 LDLAFVAHLfntHPGQEPLEEEEkpeieefDAEGEFEARVFTFWLNSLDVSP-------EITNLFGDLRDQLILLQALSK 416
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 88 LSGD---SLPREKGR-------MRFHKLQNVQIALDYLRHRQVKLVNIRNDDIADGNpKLTLGLIW-------TIILHFQ 150
Cdd:COG5069 417 KLMPmtvTHKLVKKQpasgieeNRFKAFENENYAVDLGITEGFSLVGIKGLEILDGI-RLKLTLVWqvlrsntALFNHVL 495
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 151 ISDiqvsgqsEDMTAKEKLLLWSQRMV------EGCQGLRCDNFTTSWRDGRLFNAIIHrhkPTLIDMNKVyRQTNLENL 224
Cdd:COG5069 496 KKD-------GCGLSDSDLCAWLGSLGlkgdkeEGIRSFGDPAGSVSGVFYLDVLKGIH---SELVDYDLV-TRGFTEFD 564
|
250 260 270 280
....*....|....*....|....*....|....*....|....*.
gi 1920237968 225 DQA-------FSVAERDLGVTRLLDPEDVDVPQPdEKSIITYVSSL 263
Cdd:COG5069 565 DIAdarslaiSSKILRSLGAIIKFLPEDINGVRP-RLDVLTFIESL 609
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
2307-2561 |
4.27e-03 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 42.89 E-value: 4.27e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2307 AVQEATRLKAEAELLQQQKEL--AQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRA 2384
Cdd:COG3883 5 ALAAPTPAFADPQIQAKQKELseLQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEER 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2385 QARAEEDARRFRKQ------------AEDIGE---RLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKD 2449
Cdd:COG3883 85 REELGERARALYRSggsvsyldvllgSESFSDfldRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKA 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2450 KLKQEAQLLQLKSEEmqtvrQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQ 2529
Cdd:COG3883 165 ELEAAKAELEAQQAE-----QEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 239
|
250 260 270
....*....|....*....|....*....|..
gi 1920237968 2530 MQQEKQQLAASMEEARRRQHEAEEGVRRQQEE 2561
Cdd:COG3883 240 AAAAASAAGAGAAGAAGAAAGSAGAAGAAAGA 271
|
|
| PRK07735 |
PRK07735 |
NADH-quinone oxidoreductase subunit C; |
1710-1932 |
4.31e-03 |
|
NADH-quinone oxidoreductase subunit C;
Pssm-ID: 236081 [Multi-domain] Cd Length: 430 Bit Score: 43.05 E-value: 4.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1710 ETEQGEQQRQLLEEELARLQ-----------REAAAATQKRRELEAELAKVRAEMEV-----LLASKARAEEESRSTSEK 1773
Cdd:PRK07735 11 KKEAARRAKEEARKRLVAKHgaeiskleeenREKEKALPKNDDMTIEEAKRRAAAAAkakaaALAKQKREGTEEVTEEEK 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1774 SKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEED--AVRQRAEAERVLAEKLAAISEATRLKTEAEIAL---KEK 1848
Cdd:PRK07735 91 AKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAkaAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKakaKAA 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1849 EAENERLRRLAEDEAFQRRLLEEQAAQH-KADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEE-ILALKGSFEKA 1926
Cdd:PRK07735 171 AAAKAKAAALAKQKAAEAGEGTEEVTEEeKAKAKAKAAAAAKAKAAALAKQKASQGNGDSGDEDAKAKaIAAAKAKAAAA 250
|
....*.
gi 1920237968 1927 AAGKAE 1932
Cdd:PRK07735 251 ARAKTK 256
|
|
| PRK12678 |
PRK12678 |
transcription termination factor Rho; Provisional |
1365-1553 |
4.43e-03 |
|
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain] Cd Length: 672 Bit Score: 43.35 E-value: 4.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1365 AEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEir 1444
Cdd:PRK12678 67 AATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARK-- 144
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1445 vvRLQLEATERQRGGAEGELQALRARAEEAEA--QKRQAQEEAERLRRQVQDETQRKRQaEAELALRVQAEAEAAREKQR 1522
Cdd:PRK12678 145 --AGEGGEQPATEARADAAERTEEEERDERRRrgDREDRQAEAERGERGRREERGRDGD-DRDRRDRREQGDRREERGRR 221
|
170 180 190
....*....|....*....|....*....|.
gi 1920237968 1523 ALQALEELRLQAEEAERRLRQAEAERARQVQ 1553
Cdd:PRK12678 222 DGGDRRGRRRRRDRRDARGDDNREDRGDRDG 252
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1718-2030 |
4.46e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 42.97 E-value: 4.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1718 RQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESrstsEKSKQRLEAEAGRFRELAEEAARLRA 1797
Cdd:COG4372 19 RPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEEL----EQARSELEQLEEELEELNEQLQAAQA 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1798 LAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHK 1877
Cdd:COG4372 95 ELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQ 174
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1878 ADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQE 1957
Cdd:COG4372 175 ALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEE 254
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237968 1958 AARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQE 2030
Cdd:COG4372 255 VILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKK 327
|
|
| GBP_C |
pfam02841 |
Guanylate-binding protein, C-terminal domain; Transcription of the anti-viral ... |
2260-2354 |
4.49e-03 |
|
Guanylate-binding protein, C-terminal domain; Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP.
Pssm-ID: 460721 [Multi-domain] Cd Length: 297 Bit Score: 42.66 E-value: 4.49e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2260 EKMKQVAEEAARLSVAAQEAARLRQLAEEDL----AQQRALAE--KMLKEKMQAVQEatRLKAEAELLQQQKElaQEQAR 2333
Cdd:pfam02841 201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEqmmeAQERSYQEhvKQLIEKMEAERE--QLLAEQERMLEHKL--QEQEE 276
|
90 100
....*....|....*....|.
gi 1920237968 2334 RLQEDKEQMAQQLAQETQGFQ 2354
Cdd:pfam02841 277 LLKEGFKTEAESLQKEIQDLK 297
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1873-2049 |
4.49e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 42.83 E-value: 4.49e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1873 AAQHKADIEARLAQLRKasesELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKE 1952
Cdd:COG4942 18 QADAAAEAEAELEQLQQ----EIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIA 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1953 QAEQEAARQRQLAAEEERRRREAEER--------------VQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRErAE 2018
Cdd:COG4942 94 ELRAELEAQKEELAELLRALYRLGRQpplalllspedfldAVRRLQYLKYLAPARREQAEELRADLAELAALRAELE-AE 172
|
170 180 190
....*....|....*....|....*....|.
gi 1920237968 2019 QESARQLQLAQEAAQKRLQAEEKAHAFAVQQ 2049
Cdd:COG4942 173 RAELEALLAELEEERAALEALKAERQKLLAR 203
|
|
| hsdR |
PRK11448 |
type I restriction enzyme EcoKI subunit R; Provisional |
1668-1768 |
4.58e-03 |
|
type I restriction enzyme EcoKI subunit R; Provisional
Pssm-ID: 236912 [Multi-domain] Cd Length: 1123 Bit Score: 43.40 E-value: 4.58e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1668 AEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELA 1747
Cdd:PRK11448 143 LLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEKAAETSQERKQKRKEIT 222
|
90 100
....*....|....*....|.
gi 1920237968 1748 KvRAEMEVLLaskarAEEESR 1768
Cdd:PRK11448 223 D-QAAKRLEL-----SEEETR 237
|
|
| TolC |
COG1538 |
Outer membrane protein TolC [Cell wall/membrane/envelope biogenesis]; |
1307-1568 |
4.66e-03 |
|
Outer membrane protein TolC [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 441147 [Multi-domain] Cd Length: 367 Bit Score: 42.72 E-value: 4.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1307 DLRTRYSELSTLTSQyIRFISETLRRMEEEERLAEQQRAEERERLAEVEAAlekQRQLAEAHAQaKAQAEREAQGLQRRM 1386
Cdd:COG1538 77 EVAQAYFDLLAAQEQ-LALAEENLALAEELLELARARYEAGLASRLDVLQA---EAQLAQARAQ-LAQAEAQLAQARNAL 151
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1387 QEEVARREEVAVEAQEQKRSIQEELQHLRQSSEA------EIQAKARQVEAAERsRLRIEEEIRVVRLQLEATERQRGGA 1460
Cdd:COG1538 152 ALLLGLPPPAPLDLPDPLPPLPPLPPSLPGLPSEalerrpDLRAAEAQLEAAEA-EIGVARAAFLPSLSLSASYGYSSSD 230
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1461 EGELQ-------------------ALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQ 1521
Cdd:COG1538 231 DLFSGgsdtwsvglslslplfdggRNRARVRAAKAQLEQAEAQYEQTVLQALQEVEDALAALRAAREQLEALEEALEAAE 310
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 1522 RALQALEEL-------RLQAEEAERRLRQAEAERarqvqVALETAQRSAEAELQ 1568
Cdd:COG1538 311 EALELARARyraglasLLDVLDAQRELLQAQLNL-----IQARYDYLLALVQLY 359
|
|
| TolA |
COG3064 |
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis]; |
1476-1948 |
4.67e-03 |
|
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442298 [Multi-domain] Cd Length: 485 Bit Score: 43.10 E-value: 4.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1476 AQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQR--ALQALEELRLQAEEAERRLRQAEAERARQvQ 1553
Cdd:COG3064 1 AQEALEEKAAEAAAQERLEQAEAEKRAAAEAEQKAKEEAEEERLAELeaKRQAEEEAREAKAEAEQRAAELAAEAAKK-L 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1554 VALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVA----VVQLREEATRRAQQQAEAERARAEAERELERWQLKANE 1629
Cdd:COG3064 80 AEAEKAAAEAEKKAAAEKAKAAKEAEAAAAAEKAAAAAekekAEEAKRKAEEEAKRKAEEERKAAEAEAAAKAEAEAARA 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1630 ALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRA 1709
Cdd:COG3064 160 AAAAAAAAAAAAARAAAGAAAALVAAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAARAAAASREAALAAVEA 239
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1710 ETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELA 1789
Cdd:COG3064 240 TEEAALGGAEEAADLAAVGVLGAALAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAALAAELLGAVAAEEAVLA 319
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1790 EEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLL 1869
Cdd:COG3064 320 AAAAAGALVVRGGGAASLEAALSLLAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLADVEEAAGAGILAAAGGGG 399
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237968 1870 EEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTL 1948
Cdd:COG3064 400 LLGLRLDLGAALLEAASAVELRVLLALAGAAGAVVALLVKLVADLAGGLVGIGKALTGDADALLGILKAVALDGGAVLA 478
|
|
| YqiK |
COG2268 |
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown]; |
1787-1915 |
4.70e-03 |
|
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
Pssm-ID: 441869 [Multi-domain] Cd Length: 439 Bit Score: 42.94 E-value: 4.70e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1787 ELAEEAARLRALAE-EAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQ 1865
Cdd:COG2268 196 EIIRDARIAEAEAErETEIAIAQANREAEEAELEQEREIETARIAEAEAELAKKKAEERREAETARAEAEAAYEIAEANA 275
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 1920237968 1866 RRLLEEQAAQHKADIEARLAQLRKA-SESELERQKGLVEDTLRQRRQVEEE 1915
Cdd:COG2268 276 EREVQRQLEIAEREREIELQEKEAErEEAELEADVRKPAEAEKQAAEAEAE 326
|
|
| PRK09039 |
PRK09039 |
peptidoglycan -binding protein; |
2227-2355 |
4.75e-03 |
|
peptidoglycan -binding protein;
Pssm-ID: 181619 [Multi-domain] Cd Length: 343 Bit Score: 42.65 E-value: 4.75e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2227 EELGKLKARIEAENRALVLRDKDSAQrlLQEeaekmkQVAEEAARLSVAAQEAARLRQLAEEdLAQQRALAEKMLKEKMQ 2306
Cdd:PRK09039 53 SALDRLNSQIAELADLLSLERQGNQD--LQD------SVANLRASLSAAEAERSRLQALLAE-LAGAGAAAEGRAGELAQ 123
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237968 2307 AVQEATRLKAEA----ELLQQQKELAQEQARRLQ--------EDKEQMAQ----------QLAQETQGFQK 2355
Cdd:PRK09039 124 ELDSEKQVSARAlaqvELLNQQIAALRRQLAALEaaldasekRDRESQAKiadlgrrlnvALAQRVQELNR 194
|
|
| Borrelia_P83 |
pfam05262 |
Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins. |
1332-1519 |
4.92e-03 |
|
Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins.
Pssm-ID: 114011 [Multi-domain] Cd Length: 489 Bit Score: 43.07 E-value: 4.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1332 RMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQglqrrmQEEVARREEVAVEAQEQKRSIQEEL 1411
Cdd:pfam05262 206 RESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQ------QEAKNLPKPADTSSPKEDKQVAENQ 279
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1412 QHLRQSSEAEIQAKARQVeaaersrlrieeeirvvrlqLEATERQRGGAEGElqalrARAEEAEAQKRqaQEEAERLRRQ 1491
Cdd:pfam05262 280 KREIEKAQIEIKKNDEEA--------------------LKAKDHKAFDLKQE-----SKASEKEAEDK--ELEAQKKREP 332
|
170 180
....*....|....*....|....*....
gi 1920237968 1492 VQDETQR-KRQAEAElalrVQAEAEAARE 1519
Cdd:pfam05262 333 VAEDLQKtKPQVEAQ----PTSLNEDAID 357
|
|
| PRK12705 |
PRK12705 |
hypothetical protein; Provisional |
1458-1598 |
4.96e-03 |
|
hypothetical protein; Provisional
Pssm-ID: 237178 [Multi-domain] Cd Length: 508 Bit Score: 42.77 E-value: 4.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1458 GGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQrkrqaEAELALRVQAEAEAAREKQralqaleelRLQAEEa 1537
Cdd:PRK12705 19 GVLVVLLKKRQRLAKEAERILQEAQKEAEEKLEAALLEAK-----ELLLRERNQQRQEARRERE---------ELQREE- 83
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 1538 ERRLRQAEAERARQVQVALETAQRS-AEAELQSEHASFAEKTAQLERTLKEehvaVVQLREE 1598
Cdd:PRK12705 84 ERLVQKEEQLDARAEKLDNLENQLEeREKALSARELELEELEKQLDNELYR----VAGLTPE 141
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
2176-2519 |
5.22e-03 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 43.09 E-value: 5.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2176 ELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELG----KLKARIEA-ENralvlrDKDS 2250
Cdd:TIGR04523 104 DLSKINSEIKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNnkynDLKKQKEElEN------ELNL 177
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2251 AQRLLQEEAEKMKQVAEEAAR----LSVAAQEAARLRQLAEE--DLAQQRALAEKMLKEKMQAVQEatrLKAEAELLQQQ 2324
Cdd:TIGR04523 178 LEKEKLNIQKNIDKIKNKLLKlellLSNLKKKIQKNKSLESQisELKKQNNQLKDNIEKKQQEINE---KTTEISNTQTQ 254
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2325 -KELAQEQarrlQEDKEQMAQQLAQETQGFQKTLETERQRQlEMSAEAERLRlrvaemsraqaraeedarrfRKQAEDIG 2403
Cdd:TIGR04523 255 lNQLKDEQ----NKIKKQLSEKQKELEQNNKKIKELEKQLN-QLKSEISDLN--------------------NQKEQDWN 309
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2404 ERLyRTELATQEKVmlVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQlkseemqtvrqEQLLQETQALQQs 2483
Cdd:TIGR04523 310 KEL-KSELKNQEKK--LEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQ-----------RELEEKQNEIEK- 374
|
330 340 350
....*....|....*....|....*....|....*.
gi 1920237968 2484 FLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQAL 2519
Cdd:TIGR04523 375 LKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQK 410
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
1334-1521 |
5.34e-03 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 43.06 E-value: 5.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1334 EEEERLAEQQRAEERERLAEVEaalEKQRQLAEAHAQAKAQAER--------EAQGLQRRMQ-----EEVARREEVAVEA 1400
Cdd:TIGR00927 649 GERPTEAEGENGEESGGEAEQE---GETETKGENESEGEIPAERkgeqegegEIEAKEADHKgeteaEEVEHEGETEAEG 725
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1401 QEQKRSIQ--EELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEirvvrlqlEATERQRGGAEGELQA---LRARAEEAE 1475
Cdd:TIGR00927 726 TEDEGEIEtgEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGE--------TEAEGKEDEDEGEIQAgedGEMKGDEGA 797
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 1920237968 1476 AQKRQAQEEAERlRRQVQDETQRKRQAEAELALRVQAEAEAAREKQ 1521
Cdd:TIGR00927 798 EGKVEHEGETEA-GEKDEHEGQSETQADDTEVKDETGEQELNAENQ 842
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
2164-2605 |
5.43e-03 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 43.13 E-value: 5.43e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2164 EQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQ-RGQVEEELFSLRVQMEELGKLKARIEAENRA 2242
Cdd:TIGR02169 233 EALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKiKDLGEEEQLRVKEKIGELEAEIASLERSIAE 312
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2243 LVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEE------DLAQQRALAEKMLKEKMQAVQEATRLKA 2316
Cdd:TIGR02169 313 KERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEyaelkeELEDLRAELEEVDKEFAETRDELKDYRE 392
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2317 EAELLQQQKELAQEQARRLQEDKEQMAQQLAQetqgFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFR 2396
Cdd:TIGR02169 393 KLEKLKREINELKRELDRLQEELQRLSEELAD----LNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYE 468
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2397 KQAEDIGERLYRTELATQEKVMLVQTLETQRQQSdRDAERLREAIAELEHEKDK--LKQEAQLLQLKSE----------- 2463
Cdd:TIGR02169 469 QELYDLKEEYDRVEKELSKLQRELAEAEAQARAS-EERVRGGRAVEEVLKASIQgvHGTVAQLGSVGERyataievaagn 547
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2464 ---------EMQTVRQEQLLQETQALQQSFL-------SEKD-SLLQRERCI----------EQEKAKLEQLFQDEV--- 2513
Cdd:TIGR02169 548 rlnnvvvedDAVAKEAIELLKRRKAGRATFLplnkmrdERRDlSILSEDGVIgfavdlvefdPKYEPAFKYVFGDTLvve 627
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2514 ----------------------------------------------AKAQALREEQQRQQQQMQQEKQQLA---ASMEEA 2544
Cdd:TIGR02169 628 dieaarrlmgkyrmvtlegelfeksgamtggsraprggilfsrsepAELQRLRERLEGLKRELSSLQSELRrieNRLDEL 707
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237968 2545 RRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEenqrLRERLQHLEEERRAALARSEEIA 2605
Cdd:TIGR02169 708 SQELSDASRKIGEIEKEIEQLEQEEEKLKERLEE----LEEDLSSLEQEIENVKSELKELE 764
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
1460-1569 |
5.45e-03 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 42.84 E-value: 5.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1460 AEGELQALRARAE-EAEAQKR----QAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALE-ELRLQ 1533
Cdd:PRK12704 36 AEEEAKRILEEAKkEAEAIKKeallEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREeELEKK 115
|
90 100 110
....*....|....*....|....*....|....*.
gi 1920237968 1534 AEEAERRLRQAEAERARqvqvaLETAQRSAEAELQS 1569
Cdd:PRK12704 116 EKELEQKQQELEKKEEE-----LEELIEEQLQELER 146
|
|
| hsdR |
PRK11448 |
type I restriction enzyme EcoKI subunit R; Provisional |
1781-1890 |
5.56e-03 |
|
type I restriction enzyme EcoKI subunit R; Provisional
Pssm-ID: 236912 [Multi-domain] Cd Length: 1123 Bit Score: 43.02 E-value: 5.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1781 EAGRFRELAEEAARLRALAEEAKRQRQLAEE--------DAVRQRAEAERVLAEKLAAISEATRLKTEAEIA-LKEKEAE 1851
Cdd:PRK11448 130 KPGPFVPPEDPENLLHALQQEVLTLKQQLELqarekaqsQALAEAQQQELVALEGLAAELEEKQQELEAQLEqLQEKAAE 209
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 1920237968 1852 NErlrrlAEDEAFQRRLLEEQAAQHKAD-IEARL---AQLRKA 1890
Cdd:PRK11448 210 TS-----QERKQKRKEITDQAAKRLELSeEETRIlidQQLRKA 247
|
|
| CCCAP |
pfam15964 |
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in ... |
2224-2519 |
5.72e-03 |
|
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in eukaryotes. CCCAP is also known as SDCCAG8, serologically defined colon cancer antigen 8. It is associated with the centrosome.
Pssm-ID: 435040 [Multi-domain] Cd Length: 703 Bit Score: 42.97 E-value: 5.72e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2224 VQMEE---LGKLKARIEAEN-RALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLA-EEDLAQQRALAE 2298
Cdd:pfam15964 341 VQMTEeanFEKTKALIQCEQlKSELERQKERLEKELASQQEKRAQEKEALRKEMKKEREELGATMLAlSQNVAQLEAQVE 420
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2299 KMLKEKMQAVQEATrlKAEAELLQQQKELAQEQAR-RLQEDKEQMAQQLAQETQgfqKTLETERQRQLEMS-AEAERLRL 2376
Cdd:pfam15964 421 KVTREKNSLVSQLE--EAQKQLASQEMDVTKVCGEmRYQLNQTKMKKDEAEKEH---REYRTKTGRQLEIKdQEIEKLGL 495
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2377 RVAEMSRAQARAEEDARRFRKQAEDIGERLYRTE----LATQEKVMLVQTL----ETQRQQSDRDAERLREAIAELE--H 2446
Cdd:pfam15964 496 ELSESKQRLEQAQQDAARAREECLKLTELLGESEhqlhLTRLEKESIQQSFsneaKAQALQAQQREQELTQKMQQMEaqH 575
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2447 EKD----------------KLKQEAQLLQLKSEEM-QTVRQE--QLLQETQALQQSFLSEK-------DSLLQRERCIEQ 2500
Cdd:pfam15964 576 DKTvneqyslltsqntfiaKLKEECCTLAKKLEEItQKSRSEveQLSQEKEYLQDRLEKLQkrneeleEQCVQHGRMHER 655
|
330
....*....|....*....
gi 1920237968 2501 EKAKLEQLFQDEVAKAQAL 2519
Cdd:pfam15964 656 MKQRLRQLDKHCQATAQQL 674
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1924-2406 |
6.00e-03 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 42.83 E-value: 6.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1924 EKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERvQKSLAAEEEAARQRKAALEEVERL 2003
Cdd:COG4717 49 ERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEE-LEELEAELEELREELEKLEKLLQL 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2004 KAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRseaeaARRAAEEAE 2083
Cdd:COG4717 128 LPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEE-----LQDLAEELE 202
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2084 AARERAEREAAQSRRQVEEAERLKQsAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFA 2163
Cdd:COG4717 203 ELQQRLAELEEELEEAQEELEELEE-ELEQLENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGVLFL 281
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2164 EQAL---------RQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAArqrgqveEELFSLRVQMEELGKLKA 2234
Cdd:COG4717 282 VLGLlallflllaREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSP-------EELLELLDRIEELQELLR 354
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2235 RIE-AENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLkekmqAVQEATR 2313
Cdd:COG4717 355 EAEeLEEELQLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQELKEELEELEEQLEELLGELEELL-----EALDEEE 429
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2314 LKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQ-ETQGFQKTLETERQRQLEMSAEAER--LRLRVAE--MSRAQARA 2388
Cdd:COG4717 430 LEEELEELEEELEELEEELEELREELAELEAELEQlEEDGELAELLQELEELKAELRELAEewAALKLALelLEEAREEY 509
|
490
....*....|....*....
gi 1920237968 2389 EEDAR-RFRKQAEDIGERL 2406
Cdd:COG4717 510 REERLpPVLERASEYFSRL 528
|
|
| SPFH_like_u3 |
cd03406 |
Uncharacterized family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily; This ... |
1326-1434 |
6.02e-03 |
|
Uncharacterized family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily; This model summarizes an uncharacterized family of proteins similar to stomatin, prohibitin, flotillin, HflK/C (SPFH) and podocin. The conserved domain common to the SPFH superfamily has also been referred to as the Band 7 domain. Many superfamily members are associated with lipid rafts. Individual proteins of the SPFH superfamily may cluster to form membrane microdomains which may in turn recruit multiprotein complexes. Microdomains formed from flotillin proteins may in addition be dynamic units with their own regulatory functions. Flotillins have been implicated in signal transduction, vesicle trafficking, cytoskeleton rearrangement and are known to interact with a variety of proteins. Stomatin interacts with and regulates members of the degenerin/epithelia Na+ channel family in mechanosensory cells of Caenorhabditis elegans and vertebrate neurons and participates in trafficking of Glut1 glucose transporters. Prohibitin may act as a chaperone for the stabilization of mitochondrial proteins. Prokaryotic HflK/C plays a role in the decision between lysogenic and lytic cycle growth during lambda phage infection. Flotillins have been implicated in the progression of prion disease, in the pathogenesis of neurodegenerative diseases such as Parkinson's and Alzheimer's disease and, in cancer invasion and metastasis. Mutations in the podocin gene give rise to autosomal recessive steroid resistant nephritic syndrome.
Pssm-ID: 259804 [Multi-domain] Cd Length: 293 Bit Score: 41.90 E-value: 6.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1326 ISETLRR----MEEEE-RLAeqqRAEERERLAEVEAALEKQRQLAEahaqakaqAEREAQGLQRRMQEEVARReevavEA 1400
Cdd:cd03406 160 IPEAIRRnyeaMEAEKtKLL---IAEQHQKVVEKEAETERKRAVIE--------AEKDAEVAKIQMQQKIMEK-----EA 223
|
90 100 110
....*....|....*....|....*....|....*.
gi 1920237968 1401 QEQKRSIQEELQHLRQSS--EAEIQAKARQVEAAER 1434
Cdd:cd03406 224 EKKISEIEDEMHLAREKAraDAEYYRALREAEANKL 259
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
2174-2604 |
6.06e-03 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 42.89 E-value: 6.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2174 EQELTALRLQLEETDHQKSILDEELQRLKAEVTeAARQRGQV-EEELFSLRVQMEElgklKARIEAEnRALVLRDKDSAQ 2252
Cdd:pfam10174 302 ESELLALQTKLETLTNQNSDCKQHIEVLKESLT-AKEQRAAIlQTEVDALRLRLEE----KESFLNK-KTKQLQDLTEEK 375
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2253 RLLQEEAEKMKQVAEEAARLSVAAQEaaRLRQLAEEDLAQQRALAEkmLKEKMQAVQEATRLKAEAelLQQQKELAQEQA 2332
Cdd:pfam10174 376 STLAGEIRDLKDMLDVKERKINVLQK--KIENLQEQLRDKDKQLAG--LKERVKSLQTDSSNTDTA--LTTLEEALSEKE 449
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2333 R---RLQEDKEQMAQQLAQEtqgfqktLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRT 2409
Cdd:pfam10174 450 RiieRLKEQREREDRERLEE-------LESLKKENKDLKEKVSALQPELTEKESSLIDLKEHASSLASSGLKKDSKLKSL 522
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2410 ELATQEKVMLVQTLETQRQ--QSDRDAERLREAIAelehekDKLKQEAQLLQLKSEEMQTVRqeqllqetqalqqsflSE 2487
Cdd:pfam10174 523 EIAVEQKKEECSKLENQLKkaHNAEEAVRTNPEIN------DRIRLLEQEVARYKEESGKAQ----------------AE 580
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2488 KDSLLQRERCIEQEK-------AKLEQLFQDEVaKAQALREEQQRQQQQMQQEKQqlAASMEEARRRQHEAEEGVRRQQ- 2559
Cdd:pfam10174 581 VERLLGILREVENEKndkdkkiAELESLTLRQM-KEQNKKVANIKHGQQEMKKKG--AQLLEEARRREDNLADNSQQLQl 657
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 2560 ----EELQRLAQQQQQQEKLLAEENQRLRERLQHLEE---ERRAALarsEEI 2604
Cdd:pfam10174 658 eelmGALEKTRQELDATKARLSSTQQSLAEKDGHLTNlraERRKQL---EEI 706
|
|
| TPH |
pfam13868 |
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ... |
1321-1515 |
6.06e-03 |
|
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.
Pssm-ID: 464007 [Multi-domain] Cd Length: 341 Bit Score: 42.21 E-value: 6.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1321 QYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQlaeahaqakAQAEREAQgLQRRMQEEVARREEVAVEA 1400
Cdd:pfam13868 156 RILEYLKEKAEREEEREAEREEIEEEKEREIARLRAQQEKAQD---------EKAERDEL-RAKLYQEEQERKERQKERE 225
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1401 QEQKRsiQEELQHLRQSSEAEIQAKARQvEAAERSRLRIEEEiRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQ 1480
Cdd:pfam13868 226 EAEKK--ARQRQELQQAREEQIELKERR-LAEEAEREEEEFE-RMLRKQAEDEEIEQEEAEKRRMKRLEHRRELEKQIEE 301
|
170 180 190
....*....|....*....|....*....|....*
gi 1920237968 1481 AQEEAERLRRQVQDETQRKRQAEAELALRVQAEAE 1515
Cdd:pfam13868 302 REEQRAAEREEELEEGERLREEEAERRERIEEERQ 336
|
|
| TolA |
COG3064 |
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis]; |
1331-1738 |
6.14e-03 |
|
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442298 [Multi-domain] Cd Length: 485 Bit Score: 42.72 E-value: 6.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1331 RRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAK-AQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQE 1409
Cdd:COG3064 24 EKRAAAEAEQKAKEEAEEERLAELEAKRQAEEEAREAKAEAEqRAAELAAEAAKKLAEAEKAAAEAEKKAAAEKAKAAKE 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1410 ELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLR 1489
Cdd:COG3064 104 AEAAAAAEKAAAAAEKEKAEEAKRKAEEEAKRKAEEERKAAEAEAAAKAEAEAARAAAAAAAAAAAAAARAAAGAAAALV 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1490 RQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQS 1569
Cdd:COG3064 184 AAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAARAAAASREAALAAVEATEEAALGGAEEAADLAAVGVLGAA 263
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1570 EHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQA 1649
Cdd:COG3064 264 LAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAALAAELLGAVAAEEAVLAAAAAAGALVVRGGGAASLEAALSL 343
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1650 EAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQ 1729
Cdd:COG3064 344 LAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLADVEEAAGAGILAAAGGGGLLGLRLDLGAALLEAASAVELRVL 423
|
....*....
gi 1920237968 1730 REAAAATQK 1738
Cdd:COG3064 424 LALAGAAGA 432
|
|
| PRK10246 |
PRK10246 |
exonuclease subunit SbcC; Provisional |
1003-1600 |
6.27e-03 |
|
exonuclease subunit SbcC; Provisional
Pssm-ID: 182330 [Multi-domain] Cd Length: 1047 Bit Score: 42.87 E-value: 6.27e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1003 EPARECAQRITEQQKAQAEVDGLGKGVARLSaeaekvLALPepspaAPTLRSELEltlGKLEQVRSLSAIYlEKLKTISL 1082
Cdd:PRK10246 254 ELQQEASRRQQALQQALAAEEKAQPQLAALS------LAQP-----ARQLRPHWE---RIQEQSAALAHTR-QQIEEVNT 318
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1083 VIRSTQEAEEVLRAHeeQLKEAQAVPATLPELeatkaalkklrAQAEAQQPVFDALRDELRG-----AQEVGERLQQRhg 1157
Cdd:PRK10246 319 RLQSTMALRARIRHH--AAKQSAELQAQQQSL-----------NTWLAEHDRFRQWNNELAGwraqfSQQTSDREQLR-- 383
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1158 erdveveRWRERVTLLLERWQAVLAQT-----DVRQRELEQLGRQlRYYRESADPLGAWLRDAKQRQEQIQAvplANSQA 1232
Cdd:PRK10246 384 -------QWQQQLTHAEQKLNALPAITltltaDEVAAALAQHAEQ-RPLRQRLVALHGQIVPQQKRLAQLQV---AIQNV 452
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1233 VREQLRQEKALLEDIERHGEKVEE-------CQRFAKqyinaIKDYELQ--------------------LVTYKAqLEPV 1285
Cdd:PRK10246 453 TQEQTQRNAALNEMRQRYKEKTQQladvktiCEQEAR-----IKDLEAQraqlqagqpcplcgstshpaVEAYQA-LEPG 526
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1286 ASPAKKPKVQSGSESIIQEYVDLRtrySELSTLTSQYIRFISETLRRMEEEERLAEQQRaeererlaEVEAALEKQRQLA 1365
Cdd:PRK10246 527 VNQSRLDALEKEVKKLGEEGAALR---GQLDALTKQLQRDESEAQSLRQEEQALTQQWQ--------AVCASLNITLQPQ 595
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1366 EAHAQ-AKAQAEREAQGLQRRMQEEVarrEEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQV--EAAERSRL--RIE 1440
Cdd:PRK10246 596 DDIQPwLDAQEEHERQLRLLSQRHEL---QGQIAAHNQQIIQYQQQIEQRQQQLLTALAGYALTLpqEDEEASWLatRQQ 672
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1441 EEIRVVRLQLEATERQRGGAE--------GELQALRARAEEAEAQK-RQAQEEAERLRRQVQ-------DETQRKRQAEA 1504
Cdd:PRK10246 673 EAQSWQQRQNELTALQNRIQQltplletlPQSDDLPHSEETVALDNwRQVHEQCLSLHSQLQtlqqqdvLEAQRLQKAQA 752
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1505 ELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLrqaeaERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERT 1584
Cdd:PRK10246 753 QFDTALQASVFDDQQAFLAALLDEETLTQLEQLKQNL-----ENQRQQAQTLVTQTAQALAQHQQHRPDGLDLTVTVEQI 827
|
650
....*....|....*.
gi 1920237968 1585 LKEEHVAVVQLREEAT 1600
Cdd:PRK10246 828 QQELAQLAQQLRENTT 843
|
|
| PLN02939 |
PLN02939 |
transferase, transferring glycosyl groups |
2163-2416 |
6.28e-03 |
|
transferase, transferring glycosyl groups
Pssm-ID: 215507 [Multi-domain] Cd Length: 977 Bit Score: 42.97 E-value: 6.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2163 AEQALRQKAQVEQELTALRLQLEETDHQK----------SILDEELQRLKAEVTEAARQRGQVEEELF-SLRVQMEELGK 2231
Cdd:PLN02939 158 LEKILTEKEALQGKINILEMRLSETDARIklaaqekihvEILEEQLEKLRNELLIRGATEGLCVHSLSkELDVLKEENML 237
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2232 LKARIEAEnRALVLRDKDSAQRLLQEEAEKM---KQVAEEAARLSVAAQEAARLRQLAEED------------------- 2289
Cdd:PLN02939 238 LKDDIQFL-KAELIEVAETEERVFKLEKERSlldASLRELESKFIVAQEDVSKLSPLQYDCwwekvenlqdlldratnqv 316
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2290 ------LAQQRALAEK--MLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTL---- 2357
Cdd:PLN02939 317 ekaalvLDQNQDLRDKvdKLEASLKEANVSKFSSYKVELLQQKLKLLEERLQASDHEIHSYIQLYQESIKEFQDTLsklk 396
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237968 2358 ETERQRQLEMSAEA------ERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEK 2416
Cdd:PLN02939 397 EESKKRSLEHPADDmpsefwSRILLLIDGWLLEKKISNNDAKLLREMVWKRDGRIREAYLSCKGK 461
|
|
| PRK12678 |
PRK12678 |
transcription termination factor Rho; Provisional |
1420-1573 |
6.44e-03 |
|
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain] Cd Length: 672 Bit Score: 42.58 E-value: 6.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1420 AEIQAKARQVEAAERSRLRIEEEIrvvrlqlEATERQRGGAEGElqALRARAEEAEAQKRQAQEEAERLRRQVQDETQRK 1499
Cdd:PRK12678 29 PELRALAKQLGIKGTSGMRKGELI-------AAIKEARGGGAAA--AAATPAAPAAAARRAARAAAAARQAEQPAAEAAA 99
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 1500 RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHAS 1573
Cdd:PRK12678 100 AKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERR 173
|
|
| CHASE3 |
COG5278 |
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms]; |
1109-1552 |
6.60e-03 |
|
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
Pssm-ID: 444089 [Multi-domain] Cd Length: 530 Bit Score: 42.59 E-value: 6.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1109 ATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTLLLERWQAVLAQTDVRQ 1188
Cdd:COG5278 83 EARAEIDELLAELRSLTADNPEQQARLDELEALIDQWLAELEQVIALRRAGGLEAALALVRSGEGKALMDEIRARLLLLA 162
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1189 RELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLEDIERHGEKVEECQRFAKQYINAI 1268
Cdd:COG5278 163 LALAALLLAAAALLLLLLALAALLALAELLLLALARALAALLLLLLLEAELAAAAALLAAAAALAALAALELLAALALAL 242
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1269 KDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEER 1348
Cdd:COG5278 243 ALLLAALLLALLAALALAALLAAALLALAALLLALAAAAALAAAAALELAAAEALALAELELELLLAAAAAAAAAAAAAA 322
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1349 ERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQ 1428
Cdd:COG5278 323 AALAALLALALATALAAAAAALALLAALLAEAAAAAAEEAEAAAEAAAAALAGLAEVEAEGAAEAVELEVLAIAAAAAAA 402
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1429 VEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAL 1508
Cdd:COG5278 403 AAEAAAAAAAAAAASAAEALELAEALAEALALAEEEALALAAASSELAEAGAALALAAAEALAEELAAVAALAALAAAAA 482
|
410 420 430 440
....*....|....*....|....*....|....*....|....
gi 1920237968 1509 RVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQV 1552
Cdd:COG5278 483 ALAEAEAAAALAAAAALSLALALAALLLAAAEAALAAALAAALA 526
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3674-3712 |
6.65e-03 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 36.92 E-value: 6.65e-03
10 20 30
....*....|....*....|....*....|....*....
gi 1920237968 3674 YLYGTGCVAGIYRPGSRQTLTIYQALKKGQLSAEVARQL 3712
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
2098-2479 |
6.67e-03 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 42.57 E-value: 6.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2098 RQVEEAERlkqsaeeqaqaqaqaqaaaeKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAeQALRQKAQVEQEL 2177
Cdd:pfam07888 73 RQRRELES--------------------RVAELKEELRQSREKHEELEEKYKELSASSEELSEEKD-ALLAQRAAHEARI 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2178 TALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRAL--VLRDKDSAQRLL 2255
Cdd:pfam07888 132 RELEEDIKTLTQRVLERETELERMKERAKKAGAQRKEEEAERKQLQAKLQQTEEELRSLSKEFQELrnSLAQRDTQVLQL 211
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2256 QEEAEKMKQVAEEAARlSVAAQEAAR--LRQLAEEDLAQQRALAekMLKEKMQAVQeATRLKAEAELLQQQKELAQ---- 2329
Cdd:pfam07888 212 QDTITTLTQKLTTAHR-KEAENEALLeeLRSLQERLNASERKVE--GLGEELSSMA-AQRDRTQAELHQARLQAAQltlq 287
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2330 --EQARRLQEDKEQMaqqlAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEmsraqaraeedARRFRKQAEdigerly 2407
Cdd:pfam07888 288 laDASLALREGRARW----AQERETLQQSAEADKDRIEKLSAELQRLEERLQE-----------ERMEREKLE------- 345
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237968 2408 rTELATQEKVMLVQTLETQRQQSDrdaerLREAIAELEHEKDKLKQEAQllqlksEEMQTVRQEQLLQETQA 2479
Cdd:pfam07888 346 -VELGREKDCNRVQLSESRRELQE-----LKASLRVAQKEKEQLQAEKQ------ELLEYIRQLEQRLETVA 405
|
|
| CHASE3 |
COG5278 |
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms]; |
1695-2106 |
6.95e-03 |
|
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
Pssm-ID: 444089 [Multi-domain] Cd Length: 530 Bit Score: 42.59 E-value: 6.95e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1695 QQRLAAEQELIRLRAETEQGEQQRQLL---EEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTS 1771
Cdd:COG5278 83 EARAEIDELLAELRSLTADNPEQQARLdelEALIDQWLAELEQVIALRRAGGLEAALALVRSGEGKALMDEIRARLLLLA 162
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1772 EKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAE 1851
Cdd:COG5278 163 LALAALLLAAAALLLLLLALAALLALAELLLLALARALAALLLLLLLEAELAAAAALLAAAAALAALAALELLAALALAL 242
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1852 NERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKA 1931
Cdd:COG5278 243 ALLLAALLLALLAALALAALLAAALLALAALLLALAAAAALAAAAALELAAAEALALAELELELLLAAAAAAAAAAAAAA 322
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1932 ELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEAR 2011
Cdd:COG5278 323 AALAALLALALATALAAAAAALALLAALLAEAAAAAAEEAEAAAEAAAAALAGLAEVEAEGAAEAVELEVLAIAAAAAAA 402
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2012 RLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAER 2091
Cdd:COG5278 403 AAEAAAAAAAAAAASAAEALELAEALAEALALAEEEALALAAASSELAEAGAALALAAAEALAEELAAVAALAALAAAAA 482
|
410
....*....|....*
gi 1920237968 2092 EAAQSRRQVEEAERL 2106
Cdd:COG5278 483 ALAEAEAAAALAAAA 497
|
|
| Nop14 |
pfam04147 |
Nop14-like family; Emg1 and Nop14 are novel proteins whose interaction is required for the ... |
1741-1862 |
7.02e-03 |
|
Nop14-like family; Emg1 and Nop14 are novel proteins whose interaction is required for the maturation of the 18S rRNA and for 40S ribosome production.
Pssm-ID: 461196 Cd Length: 835 Bit Score: 42.61 E-value: 7.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1741 ELEAELAKVRAEM--EVLLASKARAEEesrstseKSKQRLEAEAGRFR---ELAEEAARLRALAEEAKRQRQLAEEDAVR 1815
Cdd:pfam04147 155 EEEPERKKSKKEVmeEVIAKSKLHKYE-------RQKAKEEDEELREEldkELKDLRSLLSGSKRPKPEQAKKPEEKPDR 227
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237968 1816 QRAEAE-----RVLA-EKLAAISEatRLKTEAEIALKEKE----AENERLRRLAEDE 1862
Cdd:pfam04147 228 KKPDDDydklvRELAfDKRAKPSD--RTKTEEELAEEEKErlekLEEERLRRMRGEE 282
|
|
| CH_PLS_FIM_rpt2 |
cd21218 |
second calponin homology (CH) domain found in the plastin/fimbrin family; This family includes ... |
50-145 |
7.03e-03 |
|
second calponin homology (CH) domain found in the plastin/fimbrin family; This family includes plastin and fimbrin. Plastin has three isoforms, plastin-1, -2, and -3, which are all actin-bundling proteins. Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. Plastin-2, also called L-plastin, LC64P, or lymphocyte cytosolic protein 1 (LCP-1), plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-3, also called T-plastin, is found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Fimbrin has been found in plants and fungi. Arabidopsis thaliana fimbrin (AtFIM) includes fimbrin-1, -2, -3, -4, and -5; they cross-link actin filaments (F-actin) in a calcium independent manner. They stabilize and prevent F-actin depolymerization mediated by profilin. They act as key regulators of actin cytoarchitecture, probably involved in cell cycle, cell division, cell elongation and cytoplasmic tractus. AtFIM5 is an actin bundling factor that is required for pollen germination and pollen tube growth. Fungal fimbrin binds to actin, and functionally associates with actin structures involved in the development and maintenance of cell polarity. Members of this family contain four copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409067 Cd Length: 114 Bit Score: 39.20 E-value: 7.03e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 50 KWVNKHLikhWRAEAQR-HISDLYEDLRDGHNLISLLEVLSGDSLPREKGRM---RFHKLQNVQIALDYLRhrQVKLVN- 124
Cdd:cd21218 17 RWVNYHL---KKAGPTKkRVTNFSSDLKDGEVYALLLHSLAPELCDKELVLEvlsEEDLEKRAEKVLQAAE--KLGCKYf 91
|
90 100
....*....|....*....|.
gi 1920237968 125 IRNDDIADGNPKLTLGLIWTI 145
Cdd:cd21218 92 LTPEDIVSGNPRLNLAFVATL 112
|
|
| hsdR |
PRK11448 |
type I restriction enzyme EcoKI subunit R; Provisional |
2194-2289 |
7.10e-03 |
|
type I restriction enzyme EcoKI subunit R; Provisional
Pssm-ID: 236912 [Multi-domain] Cd Length: 1123 Bit Score: 42.63 E-value: 7.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2194 LDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALvlrdkdsAQRLLQEEAEKMKQVAEEAARLS 2273
Cdd:PRK11448 147 LQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQEL-------EAQLEQLQEKAAETSQERKQKRK 219
|
90
....*....|....*.
gi 1920237968 2274 VAAQEAARLRQLAEED 2289
Cdd:PRK11448 220 EITDQAAKRLELSEEE 235
|
|
| ATAD3_N |
pfam12037 |
ATPase family AAA domain-containing protein 3, N-terminal; This is the conserved N-terminal ... |
1667-1807 |
7.19e-03 |
|
ATPase family AAA domain-containing protein 3, N-terminal; This is the conserved N-terminal domain of ATPase family AAA domain-containing protein 3 (ATAD3) which is involved in dimerization and interacts with the inner surface of the outer mitochondrial membrane. This domain is found associated with the AAA ATPase domain (pfam00004). ATAD3 is essential for mitochondrial network organization, mitochondrial metabolism and cell growth at organizm and cellular level. It may also play an important role in mitochondrial protein synthesis.
Pssm-ID: 463442 [Multi-domain] Cd Length: 264 Bit Score: 41.51 E-value: 7.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1667 KAEEQaVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEA-- 1744
Cdd:pfam12037 52 KKQEQ-TRQAELQAKIKEYEAAQEQLKIERQRVEYEERRKTLQEETKQKQQRAQYQDELARKRYQDQLEAQRRRNEELlr 130
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237968 1745 ---ELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEA-GRFRELAE-EAARLRALAEEAKRQRQ 1807
Cdd:pfam12037 131 kqeESVAKQEAMRIQAQRRQTEEHEAELRRETERAKAEAEAeARAKEEREnEDLNLEQLREKANEERE 198
|
|
| COG1340 |
COG1340 |
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; |
1695-1957 |
7.24e-03 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 41.82 E-value: 7.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1695 QQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRstseKS 1774
Cdd:COG1340 1 SKTDELSSSLEELEEKIEELREEIEELKEKRDELNEELKELAEKRDELNAQVKELREEAQELREKRDELNEKVK----EL 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1775 KQRLEAEAGRFRELAEEAARLRALAEEAKRQRQlaEEDAVRQR--------------AEAERVLAEKLAaiseatRLKTE 1840
Cdd:COG1340 77 KEERDELNEKLNELREELDELRKELAELNKAGG--SIDKLRKEierlewrqqtevlsPEEEKELVEKIK------ELEKE 148
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1841 AEIALKEKEAENERLRRLAEDEAFQrrlleEQAAQHKADIEARLAQLRKASES------ELERQKGLVEDTLRQRRQVEE 1914
Cdd:COG1340 149 LEKAKKALEKNEKLKELRAELKELR-----KEAEEIHKKIKELAEEAQELHEEmielykEADELRKEADELHKEIVEAQE 223
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 1920237968 1915 EILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQE 1957
Cdd:COG1340 224 KADELHEEIIELQKELRELRKELKKLRKKQRALKREKEKEELE 266
|
|
| MAP7 |
pfam05672 |
MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is ... |
1328-1441 |
7.38e-03 |
|
MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is presumably controlled by tissue-specific microtubule-associated proteins (MAPs). The 115-kDa epithelial MAP (E-MAP-115/MAP7) has been identified as a microtubule-stabilising protein predominantly expressed in cell lines of epithelial origin. The binding of this microtubule associated protein is nucleotide independent.
Pssm-ID: 461709 [Multi-domain] Cd Length: 153 Bit Score: 40.02 E-value: 7.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1328 ETLRRMEEEERLAEQQRA-EERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGL-QRRMQEEVARREEVAVEAQEQKR 1405
Cdd:pfam05672 11 EAARILAEKRRQAREQRErEEQERLEKEEEERLRKEELRRRAEEERARREEEARRLeEERRREEEERQRKAEEEAEEREQ 90
|
90 100 110
....*....|....*....|....*....|....*.
gi 1920237968 1406 SIQEELQHLRQSSEAeiqAKARQVEAAERSRLRIEE 1441
Cdd:pfam05672 91 REQEEQERLQKQKEE---AEAKAREEAERQRQEREK 123
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1167-1438 |
7.47e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 42.06 E-value: 7.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1167 RERVTLLLERWQAVLAQTDVR---QRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKAL 1243
Cdd:COG4942 2 RKLLLLALLLALAAAAQADAAaeaEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAAL 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1244 LEDIERHGEKVEECQRFAKQYINAIKdyELQLVTYKAQLEPvaspakKPKVQSGSESIIQEYvdlrtRYSELSTLTSQYI 1323
Cdd:COG4942 82 EAELAELEKEIAELRAELEAQKEELA--ELLRALYRLGRQP------PLALLLSPEDFLDAV-----RRLQYLKYLAPAR 148
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1324 RFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRqlaeahaQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQ 1403
Cdd:COG4942 149 REQAEELRADLAELAALRAELEAERAELEALLAELEEER-------AALEALKAERQKLLARLEKELAELAAELAELQQE 221
|
250 260 270
....*....|....*....|....*....|....*
gi 1920237968 1404 KRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLR 1438
Cdd:COG4942 222 AEELEALIARLEAEAAAAAERTPAAGFAALKGKLP 256
|
|
| ERM_helical |
pfam20492 |
Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related ... |
1675-1768 |
7.52e-03 |
|
Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related proteins, ezrin, radixin and moesin. Ezrin was first identified as a constituent of microvilli, radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions, and moesin as a heparin binding protein. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain, an extended alpha-helical domain and a charged C-terminal domain (pfam00769). Ezrin, radixin and merlin also contain a polyproline linker region between the helical and C-terminal domains. The N-terminal domain is highly conserved and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily, designated the FERM domain. ERM proteins crosslink actin filaments with plasma membranes. They co-localize with CD44 at actin filament plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains. This is the alpha-helical domain, which is involved in intramolecular masking of protein-protein interaction sites, regulating the activity of this proteins.
Pssm-ID: 466641 [Multi-domain] Cd Length: 120 Bit Score: 39.52 E-value: 7.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1675 QRELAEQELEKQRQLAEGTAQQRLAAEQELIRLraeteqgEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEME 1754
Cdd:pfam20492 21 ETKKAQEELEESEETAEELEEERRQAEEEAERL-------EQKRQEAEEEKERLEESAEMEAEEKEQLEAELAEAQEEIA 93
|
90
....*....|....
gi 1920237968 1755 VLLASKARAEEESR 1768
Cdd:pfam20492 94 RLEEEVERKEEEAR 107
|
|
| PspA_IM30 |
pfam04012 |
PspA/IM30 family; This family includes PspA a protein that suppresses sigma54-dependent ... |
1325-1541 |
7.78e-03 |
|
PspA/IM30 family; This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homolog in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma(54) containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator.
Pssm-ID: 461130 [Multi-domain] Cd Length: 215 Bit Score: 41.20 E-value: 7.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1325 FISETLRRMEEEERLAEQQRAEERERLAEVEaalekqRQLAEAHAQAKaQAEREAqglqRRMQEEVARREEVAVEAQEQK 1404
Cdd:pfam04012 12 NIHEGLDKAEDPEKMLEQAIRDMQSELVKAR------QALAQTIARQK-QLERRL----EQQTEQAKKLEEKAQAALTKG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1405 RsiqeelQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEE 1484
Cdd:pfam04012 81 N------EELAREALAEKKSLEKQAEALETQLAQQRSAVEQLRKQLAALETKIQQLKAKKNLLKARLKAAKAQEAVQTSL 154
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237968 1485 AERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQaLEELRLQAEEAERRL 1541
Cdd:pfam04012 155 GSLSTSSATDSFERIEEKIEEREARADAAAELASAVDLDAK-LEQAGIQMEVSEDVL 210
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
1486-2468 |
7.81e-03 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 42.73 E-value: 7.81e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1486 ERLRRQVQDETQRKRQAEAELALRVQAEAEA--------AREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALE 1557
Cdd:TIGR00606 189 ETLRQVRQTQGQKVQEHQMELKYLKQYKEKAceirdqitSKEAQLESSREIVKSYENELDPLKNRLKEIEHNLSKIMKLD 268
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1558 ---TAQRSAEAELQSEHASFAEKTAQL----ERTLKE-EHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANE 1629
Cdd:TIGR00606 269 neiKALKSRKKQMEKDNSELELKMEKVfqgtDEQLNDlYHNHQRTVREKERELVDCQRELEKLNKERRLLNQEKTELLVE 348
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1630 ALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQrelaEQELEKQRQLAEG-TAQQRLAAEQELIRLR 1708
Cdd:TIGR00606 349 QGRLQLQADRHQEHIRARDSLIQSLATRLELDGFERGPFSERQIKN----FHTLVIERQEDEAkTAAQLCADLQSKERLK 424
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1709 aeteqgEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKAR--AEEESRSTSEKSKQRLEAEAGRFR 1786
Cdd:TIGR00606 425 ------QEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRilELDQELRKAERELSKAEKNSLTET 498
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1787 ELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLaaiseaTRLKTEA-EIALKEKEAENERLRRLAEDEAFQ 1865
Cdd:TIGR00606 499 LKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEML------TKDKMDKdEQIRKIKSRHSDELTSLLGYFPNK 572
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1866 RRLLE--EQAAQHKADIEARLAQLRKasesELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAElELELGRIRGT 1943
Cdd:TIGR00606 573 KQLEDwlHSKSKEINQTRDRLAKLNK----ELASLEQNKNHINNELESKEEQLSSYEDKLFDVCGSQDE-ESDLERLKEE 647
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1944 AEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESAR 2023
Cdd:TIGR00606 648 IEKSSKQRAMLAGATAVYSQFITQLTDENQSCCPVCQRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDE 727
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2024 QLQLAqEAAQKRLQAEEKAhafavqqkeqelqqtlqqeqsvLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEA 2103
Cdd:TIGR00606 728 MLGLA-PGRQSIIDLKEKE----------------------IPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESA 784
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2104 ERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQaADAEMEKHKQFAEQALRQKAQVEQEltalrlq 2183
Cdd:TIGR00606 785 KVCLTDVTIMERFQMELKDVERKIAQQAAKLQGSDLDRTVQQVNQEK-QEKQHELDTVVSKIELNRKLIQDQQ------- 856
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2184 lEETDHQKSILDeELQRLKAEVTEAARQRGQVEEELFSLRVQMEELgklkARIEAENRALVLRDKDSAQRLLQEEAEKMK 2263
Cdd:TIGR00606 857 -EQIQHLKSKTN-ELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSL----IREIKDAKEQDSPLETFLEKDQQEKEELIS 930
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2264 QVAEEAARLSVAAQE-AARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQM 2342
Cdd:TIGR00606 931 SKETSNKKAQDKVNDiKEKVKNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLEECEKHQEKINEDMRLMRQDIDTQ 1010
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2343 AQQLAQETQGFQKTLETERQRQLEMSAE---AERLRLRVAEMSRAQARAEEDARRFRKQaedigERLYRTELATQEKVML 2419
Cdd:TIGR00606 1011 KIQERWLQDNLTLRKRENELKEVEEELKqhlKEMGQMQVLQMKQEHQKLEENIDLIKRN-----HVLALGRQKGYEKEIK 1085
|
970 980 990 1000 1010
....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237968 2420 VQTLETQRQQSDRDAERLREAIAELEHEKDKLKQ--------EAQLLQLKSEEMQTV 2468
Cdd:TIGR00606 1086 HFKKELREPQFRDAEEKYREMMIVMRTTELVNKDldiyyktlDQAIMKFHSMKMEEI 1142
|
|
| CH_PARVA_B_rpt2 |
cd21306 |
second calponin homology (CH) domain found in the alpha/beta parvin subfamily; The alpha/beta ... |
43-149 |
7.99e-03 |
|
second calponin homology (CH) domain found in the alpha/beta parvin subfamily; The alpha/beta parvin subfamily includes alpha-parvin and beta-parvin. Alpha-parvin, also called actopaxin, calponin-like integrin-linked kinase-binding protein (CH-ILKBP), or matrix-remodeling-associated protein 2, plays a role in sarcomere organization and in smooth muscle cell contraction. It is required for normal development of the embryonic cardiovascular system, and for normal septation of the heart outflow tract. Beta-parvin, also called affixin, is an adapter protein that plays a role in integrin signaling via ILK and in activation of the GTPases Cdc42 and Rac1 by guanine exchange factors, such as ARHGEF6. Both alpha-parvin and beta-parvin are involved in the reorganization of the actin cytoskeleton and the formation of lamellipodia, and both play roles in cell adhesion, cell spreading, establishment or maintenance of cell polarity, and cell migration. Members of this subfamily contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409155 Cd Length: 121 Bit Score: 39.32 E-value: 7.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 43 VQKKTFTKWVNKHLIKhwraeAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRF----HKLQNVQIALDYLRHR 118
Cdd:cd21306 16 VVKKSLITFVNKHLNK-----LNLEVTDLDTQFHDGVYLVLLMGLLEGYFVPLHSFHLTPtsfeQKVHNVQFAFELMQDA 90
|
90 100 110
....*....|....*....|....*....|.
gi 1920237968 119 QVKLVNIRNDDIADGNPKLTLGLIWTIILHF 149
Cdd:cd21306 91 GLPKPKARPEDIVNLDLKSTLRVLYNLFTKY 121
|
|
| PRK11637 |
PRK11637 |
AmiB activator; Provisional |
1207-1407 |
8.40e-03 |
|
AmiB activator; Provisional
Pssm-ID: 236942 [Multi-domain] Cd Length: 428 Bit Score: 41.99 E-value: 8.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1207 PLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLEDIERHGEKVEECQRfakqyinAIKDYELQLVTYKAQL-EPV 1285
Cdd:PRK11637 37 AFSAHASDNRDQLKSIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAISQASR-------KLRETQNTLNQLNKQIdELN 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1286 ASPAKKPKVQSGSESIIQEYVDLRTRYSE-------LSTLTSQ-------YIRFISE----------------TLRRMEE 1335
Cdd:PRK11637 110 ASIAKLEQQQAAQERLLAAQLDAAFRQGEhtglqliLSGEESQrgerilaYFGYLNQarqetiaelkqtreelAAQKAEL 189
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1336 EERLAEQQ----------------RAEERERLAEVEAALEK-QRQLAE--------------AHAQAKAQAEREAQGLQR 1384
Cdd:PRK11637 190 EEKQSQQKtllyeqqaqqqkleqaRNERKKTLTGLESSLQKdQQQLSElranesrlrdsiarAEREAKARAEREAREAAR 269
|
250 260
....*....|....*....|....
gi 1920237968 1385 -RMQEEVARREEVAVEAQEQKRSI 1407
Cdd:PRK11637 270 vRDKQKQAKRKGSTYKPTESERSL 293
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
2160-2351 |
8.42e-03 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 42.50 E-value: 8.42e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2160 KQFAEQALRQKAQVEQELTALRlqleETDHQKSILDEELQRLkaevtEAARQRGQVEEELFSLRVQMEELGKLKARIEAE 2239
Cdd:pfam10174 568 ARYKEESGKAQAEVERLLGILR----EVENEKNDKDKKIAEL-----ESLTLRQMKEQNKKVANIKHGQQEMKKKGAQLL 638
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2240 NRALVLRD---KDSAQRLLQE---EAEKMKQVAEEA-ARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEAT 2312
Cdd:pfam10174 639 EEARRREDnlaDNSQQLQLEElmgALEKTRQELDATkARLSSTQQSLAEKDGHLTNLRAERRKQLEEILEMKQEALLAAI 718
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 1920237968 2313 RLK----AEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQ 2351
Cdd:pfam10174 719 SEKdaniALLELSSSKKKKTQEEVMALKREKDRLVHQLKQQTQ 761
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1332-1597 |
8.46e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 41.81 E-value: 8.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1332 RMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEEL 1411
Cdd:COG4372 3 RLGEKVGKARLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEEL 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1412 QHLRQsseaEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQ 1491
Cdd:COG4372 83 EELNE----QLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQ 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1492 VQDETQRKRQAEAELALRVQAEAEaaREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEH 1571
Cdd:COG4372 159 LESLQEELAALEQELQALSEAEAE--QALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALS 236
|
250 260
....*....|....*....|....*.
gi 1920237968 1572 ASFAEKTAQLERTLKEEHVAVVQLRE 1597
Cdd:COG4372 237 ALLDALELEEDKEELLEEVILKEIEE 262
|
|
| PRK07353 |
PRK07353 |
F0F1 ATP synthase subunit B'; Validated |
1331-1426 |
8.60e-03 |
|
F0F1 ATP synthase subunit B'; Validated
Pssm-ID: 235999 [Multi-domain] Cd Length: 140 Bit Score: 39.60 E-value: 8.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1331 RRMEEEERLAEQQRAEERERLAEVEA-ALEKQRQLAEAHAQAK---AQAEREAQGLqrrmqeevaRREEVAV---EAQEQ 1403
Cdd:PRK07353 32 KVVEEREDYIRTNRAEAKERLAEAEKlEAQYEQQLASARKQAQaviAEAEAEADKL---------AAEALAEaqaEAQAS 102
|
90 100
....*....|....*....|...
gi 1920237968 1404 KRSIQEELQHLRQSSEAEIQAKA 1426
Cdd:PRK07353 103 KEKARREIEQQKQAALAQLEQQV 125
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3673-3709 |
8.66e-03 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 36.69 E-value: 8.66e-03
10 20 30
....*....|....*....|....*....|....*..
gi 1920237968 3673 RYLYGTGCVAGIYRPGSRQTLTIYQALKKGQLSAEVA 3709
Cdd:smart00250 2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| Apolipoprotein |
pfam01442 |
Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a ... |
1349-1520 |
8.80e-03 |
|
Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E.
Pssm-ID: 460211 [Multi-domain] Cd Length: 175 Bit Score: 40.32 E-value: 8.80e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1349 ERLAEVEAALEK-QRQLAEAHAQAKAQAEREAQGLQRRMQEEV-ARREEVAVEAQEQKRSIQEELQHLRQsseaeiQAKA 1426
Cdd:pfam01442 4 DSLDELSTYAEElQEQLGPVAQELVDRLEKETEALRERLQKDLeEVRAKLEPYLEELQAKLGQNVEELRQ------RLEP 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1427 RQVEAAERSRLRIEEEIRVVRlqlEATERQRGGAEGELQALRARAEE-AEAQKRQAQEEAERLRRQVQDETQ----RKRQ 1501
Cdd:pfam01442 78 YTEELRKRLNADAEELQEKLA---PYGEELRERLEQNVDALRARLAPyAEELRQKLAERLEELKESLAPYAEevqaQLSQ 154
|
170
....*....|....*....
gi 1920237968 1502 AEAELALRVQAEAEAAREK 1520
Cdd:pfam01442 155 RLQELREKLEPQAEDLREK 173
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
2413-2663 |
8.94e-03 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 42.27 E-value: 8.94e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2413 TQEKVMLVQTLETQRQQSDRDAERLREaIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLL 2492
Cdd:pfam02463 165 SRLKRKKKEALKKLIEETENLAELIID-LEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQ 243
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2493 QRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQ 2572
Cdd:pfam02463 244 ELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEK 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2573 EKLLAEENQRLRERLQHLEEERRAALARSEEIAPSRAAAARALPNGQDAADGPAAAAEPEHAFD-GLRRKVPAQRLQEVG 2651
Cdd:pfam02463 324 KKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSsAAKLKEEELELKSEE 403
|
250
....*....|..
gi 1920237968 2652 VLSAEELQQLAQ 2663
Cdd:pfam02463 404 EKEAQLLLELAR 415
|
|
| PRK00106 |
PRK00106 |
ribonuclease Y; |
2261-2436 |
9.15e-03 |
|
ribonuclease Y;
Pssm-ID: 178867 [Multi-domain] Cd Length: 535 Bit Score: 42.16 E-value: 9.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2261 KMKQvAEEAARLSV--AAQEAARLRQLAEEDLAQQRALAEKMLKEK-----MQAVQEAT--RLKAEAELLQQQKELAQEQ 2331
Cdd:PRK00106 25 KMKS-AKEAAELTLlnAEQEAVNLRGKAERDAEHIKKTAKRESKALkkellLEAKEEARkyREEIEQEFKSERQELKQIE 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2332 ARRLQE------------DKEQMAQQLAQETQGFQKTLEtERQRQLEMSAEAERLRL-RVAEMSRAQAR----------- 2387
Cdd:PRK00106 104 SRLTERatsldrkdenlsSKEKTLESKEQSLTDKSKHID-EREEQVEKLEEQKKAELeRVAALSQAEAReiilaetenkl 182
|
170 180 190 200
....*....|....*....|....*....|....*....|....*....
gi 1920237968 2388 AEEDARRFRKQAEDIGERLYRTelatqEKVMLVQTLetQRQQSDRDAER 2436
Cdd:PRK00106 183 THEIATRIREAEREVKDRSDKM-----AKDLLAQAM--QRLAGEYVTEQ 224
|
|
| PLN02316 |
PLN02316 |
synthase/transferase |
1472-1535 |
9.40e-03 |
|
synthase/transferase
Pssm-ID: 215180 [Multi-domain] Cd Length: 1036 Bit Score: 42.16 E-value: 9.40e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237968 1472 EEAEAQKRQAQEEAERLRrqvQDETQRKRQAE--AELALRVQAEAEAAREKQRALQALEELRLQAE 1535
Cdd:PLN02316 253 EKRRELEKLAKEEAERER---QAEEQRRREEEkaAMEADRAQAKAEVEKRREKLQNLLKKASRSAD 315
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1670-1835 |
9.44e-03 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 41.74 E-value: 9.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1670 EQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV 1749
Cdd:COG3883 115 SDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAA 194
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1750 RAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLA 1829
Cdd:COG3883 195 EAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGAAGAAGAAAGSAGAAGAAAGAAGA 274
|
....*.
gi 1920237968 1830 AISEAT 1835
Cdd:COG3883 275 GAAAAS 280
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
2201-2495 |
9.54e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 41.81 E-value: 9.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2201 LKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRAlvlrdkdsAQRLLQEEAEKMKQVAEEAARLSVAAQEAA 2280
Cdd:COG4372 29 LSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQ--------ARSELEQLEEELEELNEQLQAAQAELAQAQ 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2281 RLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETE 2360
Cdd:COG4372 101 EELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAE 180
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2361 RQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREA 2440
Cdd:COG4372 181 AEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEI 260
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*
gi 1920237968 2441 IAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRE 2495
Cdd:COG4372 261 EELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALED 315
|
|
| COG3899 |
COG3899 |
Predicted ATPase [General function prediction only]; |
1926-2427 |
9.71e-03 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443106 [Multi-domain] Cd Length: 1244 Bit Score: 42.15 E-value: 9.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1926 AAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKA 2005
Cdd:COG3899 736 PPDPEEEYRLALLLELAEALYLAGRFEEAEALLERALAARALAALAALRHGNPPASARAYANLGLLLLGDYEEAYEFGEL 815
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2006 KVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQ-----KEQELQQTLQQEQSVLERLRSEAEAARRAAE 2080
Cdd:COG3899 816 ALALAERLGDRRLEARALFNLGFILHWLGPLREALELLREALEAgletgDAALALLALAAAAAAAAAAAALAAAAAAAAR 895
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2081 EAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHK 2160
Cdd:COG3899 896 LLAAAAAALAAAAAAAALAAAELARLAAAAAAAAALALAAAAAAAAAAALAAAAAAAALAAALALAAAAAAAAAAALAAA 975
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2161 QFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAEN 2240
Cdd:COG3899 976 AAAAAAAAAAAAAAALEAAAAALLALLAAAAAAAAAAAALAAALLAAALAALAAAAAAAALLAAAAALALLAALAAAAAA 1055
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2241 RALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAEL 2320
Cdd:COG3899 1056 AAAAAALAAAAALLAAAAAAAAAAAAAAAAAALAAALAAAALAAAAAAALALAAALAALALAAALAALALAAAARAAAAL 1135
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 2321 LQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAE 2400
Cdd:COG3899 1136 LLLAAALALALAALLLLAALLLALALLLLALAALALAAALAALAAALLAAAAAAAAAAALLAALLALAARLAALLALALL 1215
|
490 500
....*....|....*....|....*..
gi 1920237968 2401 DIGERLYRTELATQEKVMLVQTLETQR 2427
Cdd:COG3899 1216 ALEAAALLLLLLLAALALAAALLALRL 1242
|
|
| WEMBL |
pfam05701 |
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required ... |
1330-1713 |
9.86e-03 |
|
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required for the chloroplast avoidance response under high intensity blue light. This avoidance response consists in the relocation of chloroplasts on the anticlinal side of exposed cells. Acts in association with PMI2 to maintain the velocity of chloroplast photo-relocation movement via the regulation of cp-actin filaments. Thus several member-sequences are described as "myosin heavy chain-like".
Pssm-ID: 461718 [Multi-domain] Cd Length: 562 Bit Score: 41.94 E-value: 9.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1330 LRRMEEEERLAEQQRAEERERLAEVEAALEKQRQL-AEAHAQAKAQAEREAQGLQRRMQEEVARREEVAveaqeqkrSIQ 1408
Cdd:pfam05701 228 LKQAEEELQRLNQQLLSAKDLKSKLETASALLLDLkAELAAYMESKLKEEADGEGNEKKTSTSIQAALA--------SAK 299
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1409 EELQHLRQSSE-AEIQAKARQVeAAERSRLRIEEEIRVVrlqleATERQRGGA--------EGELQALRARAEEAEAQKR 1479
Cdd:pfam05701 300 KELEEVKANIEkAKDEVNCLRV-AAASLRSELEKEKAEL-----ASLRQREGMasiavsslEAELNRTKSEIALVQAKEK 373
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1480 QAQEEAERLRRQVQdetqrkrqaeaelalrvQAEAEAAREKQRALQALEELRLQAEEAErrlrQAEAErARQVQVALETA 1559
Cdd:pfam05701 374 EAREKMVELPKQLQ-----------------QAAQEAEEAKSLAQAAREELRKAKEEAE----QAKAA-ASTVESRLEAV 431
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237968 1560 QRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATrraqqqaeaeraraeaerelerwqLKANEALRLRLQAEE 1639
Cdd:pfam05701 432 LKEIEAAKASEKLALAAIKALQESESSAESTNQEDSPRGVT------------------------LSLEEYYELSKRAHE 487
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237968 1640 VAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQ 1713
Cdd:pfam05701 488 AEELANKRVAEAVSQIEEAKESELRSLEKLEEVNREMEERKEALKIALEKAEKAKEGKLAAEQELRKWRAEHEQ 561
|
|
|