|
Name |
Accession |
Description |
Interval |
E-value |
| CH_PLEC-like_rpt1 |
cd21188 |
first calponin homology (CH) domain found in the plectin/dystonin/MACF1 family; This family ... |
41-157 |
2.92e-73 |
|
first calponin homology (CH) domain found in the plectin/dystonin/MACF1 family; This family includes plectin, dystonin and microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 (MACF1). Plectin, also called PCN, PLTN, hemidesmosomal protein 1 (HD1), or plectin-1, is a structural component of muscle. It interlinks intermediate filaments with microtubules and microfilaments, and anchors intermediate filaments to desmosomes or hemidesmosomes. It could also bind muscle proteins such as actin to membrane complexes in muscle. Dystonin, also called 230 kDa bullous pemphigoid antigen, 230/240 kDa bullous pemphigoid antigen, bullous pemphigoid antigen 1 (BPA or BPAG1), dystonia musculorum protein, or hemidesmosomal plaque protein, is a cytoskeletal linker protein that acts as an integrator of intermediate filaments, actin, and microtubule cytoskeleton networks. It is required for anchoring either intermediate filaments to the actin cytoskeleton in neural and muscle cells, or keratin-containing intermediate filaments to hemidesmosomes in epithelial cells. MACF1, also called 620 kDa actin-binding protein (ABP620), actin cross-linking family protein 7 (ACF7), macrophin-1, or trabeculin-alpha, is a large protein containing numerous spectrin and leucine-rich repeat (LRR) domains. It facilitates actin-microtubule interactions at the cell periphery and couples the microtubule network to cellular junctions. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409037 Cd Length: 105 Bit Score: 240.00 E-value: 2.92e-73
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 41 DRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRERdvirssrlprekGRMRFHKLQNVQIALD 120
Cdd:cd21188 1 DAVQKKTFTKWVNKHLIKARRRVVDLFEDLRDGHNLISLLEVLSGESLPRER------------GRMRFHRLQNVQTALD 68
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237962 121 YLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQ 157
Cdd:cd21188 69 FLKYRKIKLVNIRAEDIVDGNPKLTLGLIWTIILHFQ 105
|
|
| CH_PLEC_rpt1 |
cd21235 |
first calponin homology (CH) domain found in plectin and similar proteins; Plectin, also ... |
38-168 |
2.01e-70 |
|
first calponin homology (CH) domain found in plectin and similar proteins; Plectin, also called PCN, PLTN, hemidesmosomal protein 1 (HD1), or plectin-1, is a structural component of muscle. It interlinks intermediate filaments with microtubules and microfilaments, and anchors intermediate filaments to desmosomes or hemidesmosomes. It can also bind muscle proteins such as actin to membrane complexes in muscle. Plectin contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409084 Cd Length: 119 Bit Score: 232.61 E-value: 2.01e-70
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 38 DERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRErdvirssrlpreKGRMRFHKLQNVQI 117
Cdd:cd21235 1 DERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRE------------KGRMRFHKLQNVQI 68
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 1920237962 118 ALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVSGQSE 168
Cdd:cd21235 69 ALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVSGQSE 119
|
|
| CH_PLEC_rpt2 |
cd21238 |
second calponin homology (CH) domain found in plectin and similar proteins; Plectin, also ... |
170-275 |
1.14e-69 |
|
second calponin homology (CH) domain found in plectin and similar proteins; Plectin, also called PCN, PLTN, hemidesmosomal protein 1 (HD1), or plectin-1, is a structural component of muscle. It interlinks intermediate filaments with microtubules and microfilaments and anchors intermediate filaments to desmosomes or hemidesmosomes. It can also bind muscle proteins such as actin to membrane complexes in muscle. Plectin contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409087 Cd Length: 106 Bit Score: 229.91 E-value: 1.14e-69
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 170 MTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 249
Cdd:cd21238 1 MTAKEKLLLWSQRMVEGYQGLRCDNFTSSWRDGRLFNAIIHRHKPMLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 80
|
90 100
....*....|....*....|....*.
gi 1920237962 250 PEDVDVPQPDEKSIITYVSSLYDAMP 275
Cdd:cd21238 81 PEDVDVPQPDEKSIITYVSSLYDAMP 106
|
|
| CH_DYST_rpt1 |
cd21236 |
first calponin homology (CH) domain found in dystonin and similar proteins; Dystonin, also ... |
34-166 |
1.35e-69 |
|
first calponin homology (CH) domain found in dystonin and similar proteins; Dystonin, also called 230 kDa bullous pemphigoid antigen, 230/240 kDa bullous pemphigoid antigen, bullous pemphigoid antigen 1 (BPA or BPAG1), dystonia musculorum protein, or hemidesmosomal plaque protein, is a cytoskeletal linker protein that acts as an integrator of intermediate filaments, actin, and microtubule cytoskeleton networks. It is required for anchoring either intermediate filaments to the actin cytoskeleton in neural and muscle cells, or keratin-containing intermediate filaments to hemidesmosomes in epithelial cells. Dystonin contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409085 Cd Length: 128 Bit Score: 230.64 E-value: 1.35e-69
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 34 EGKKDERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRErdvirssrlpreKGRMRFHKLQ 113
Cdd:cd21236 8 ERYKDERDKVQKKTFTKWINQHLMKVRKHVNDLYEDLRDGHNLISLLEVLSGDTLPRE------------KGRMRFHRLQ 75
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|...
gi 1920237962 114 NVQIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVSGQ 166
Cdd:cd21236 76 NVQIALDYLKRRQVKLVNIRNDDITDGNPKLTLGLIWTIILHFQISDIHVTGE 128
|
|
| CH_PLEC-like_rpt2 |
cd21189 |
second calponin homology (CH) domain found in the plectin/dystonin/MACF1 family; This family ... |
171-275 |
4.00e-64 |
|
second calponin homology (CH) domain found in the plectin/dystonin/MACF1 family; This family includes plectin, dystonin and microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 (MACF1). Plectin, also called PCN, PLTN, hemidesmosomal protein 1 (HD1), or plectin-1, is a structural component of muscle. It interlinks intermediate filaments with microtubules and microfilaments, and anchors intermediate filaments to desmosomes or hemidesmosomes. It could also bind muscle proteins such as actin to membrane complexes in muscle. Dystonin, also called 230 kDa bullous pemphigoid antigen, 230/240 kDa bullous pemphigoid antigen, bullous pemphigoid antigen 1 (BPA or BPAG1), dystonia musculorum protein, or hemidesmosomal plaque protein, is a cytoskeletal linker protein that acts as an integrator of intermediate filaments, actin, and microtubule cytoskeleton networks. It is required for anchoring either intermediate filaments to the actin cytoskeleton in neural and muscle cells, or keratin-containing intermediate filaments to hemidesmosomes in epithelial cells. MACF1, also called 620 kDa actin-binding protein (ABP620), actin cross-linking family protein 7 (ACF7), macrophin-1, or trabeculin-alpha, is a large protein containing numerous spectrin and leucine-rich repeat (LRR) domains. It facilitates actin-microtubule interactions at the cell periphery and couples the microtubule network to cellular junctions. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409038 Cd Length: 105 Bit Score: 213.79 E-value: 4.00e-64
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 171 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 250
Cdd:cd21189 1 SAKEALLLWARRTTEGYPGVRVTNFTSSWRDGLAFNAIIHRNRPDLIDFRSVRNQSNRENLENAFNVAEKEFGVTRLLDP 80
|
90 100
....*....|....*....|....*
gi 1920237962 251 EDVDVPQPDEKSIITYVSSLYDAMP 275
Cdd:cd21189 81 EDVDVPEPDEKSIITYVSSLYDVFP 105
|
|
| CH_MACF1_rpt1 |
cd21237 |
first calponin homology (CH) domain found in microtubule-actin cross-linking factor 1, ... |
38-167 |
1.92e-60 |
|
first calponin homology (CH) domain found in microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 (MACF1) and similar proteins; MACF1, also called 620 kDa actin-binding protein (ABP620), actin cross-linking family protein 7 (ACF7), macrophin-1, or trabeculin-alpha, is a large protein containing numerous spectrin and leucine-rich repeat (LRR) domains. It facilitates actin-microtubule interactions at the cell periphery and couples the microtubule network to cellular junctions. MACF1 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409086 Cd Length: 118 Bit Score: 204.11 E-value: 1.92e-60
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 38 DERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGdslprerdvirsSRLPREKGRMRFHKLQNVQI 117
Cdd:cd21237 1 DERDRVQKKTFTKWVNKHLMKVRKHINDLYEDLRDGHNLISLLEVLSG------------VKLPREKGRMRFHRLQNVQI 68
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 1920237962 118 ALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVSGQS 167
Cdd:cd21237 69 ALDFLKQRQVKLVNIRNDDITDGNPKLTLGLIWTIILHFQISDIYISGES 118
|
|
| CH_DYST_rpt2 |
cd21239 |
second calponin homology (CH) domain found in dystonin and similar proteins; Dystonin, also ... |
171-275 |
1.43e-56 |
|
second calponin homology (CH) domain found in dystonin and similar proteins; Dystonin, also called 230 kDa bullous pemphigoid antigen, 230/240 kDa bullous pemphigoid antigen, bullous pemphigoid antigen 1 (BPA or BPAG1), dystonia musculorum protein, or hemidesmosomal plaque protein, is a cytoskeletal linker protein that acts as an integrator of intermediate filaments, actin, and microtubule cytoskeleton networks. It is required for anchoring either intermediate filaments to the actin cytoskeleton in neural and muscle cells, or keratin-containing intermediate filaments to hemidesmosomes in epithelial cells. Dystonin contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409088 Cd Length: 104 Bit Score: 192.51 E-value: 1.43e-56
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 171 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLLDP 250
Cdd:cd21239 1 SAKERLLLWSQQMTEGYTGIRCENFTTCWRDGRLFNAIIHKYRPDLIDMNTVAVQSNLANLEHAFYVAEK-LGVTRLLDP 79
|
90 100
....*....|....*....|....*
gi 1920237962 251 EDVDVPQPDEKSIITYVSSLYDAMP 275
Cdd:cd21239 80 EDVDVSSPDEKSVITYVSSLYDVFP 104
|
|
| CH_MACF1_rpt2 |
cd21240 |
second calponin homology (CH) domain found in microtubule-actin cross-linking factor 1, ... |
169-275 |
6.27e-50 |
|
second calponin homology (CH) domain found in microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 (MACF1) and similar proteins; MACF1, also called 620 kDa actin-binding protein (ABP620), actin cross-linking family protein 7 (ACF7), macrophin-1, or trabeculin-alpha, is a large protein containing numerous spectrin and leucine-rich repeat (LRR) domains. It facilitates actin-microtubule interactions at the cell periphery and couples the microtubule network to cellular junctions. MACF1 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409089 Cd Length: 107 Bit Score: 173.30 E-value: 6.27e-50
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 169 DMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLL 248
Cdd:cd21240 2 DMSAKEKLLLWTQKVTAGYTGIKCTNFSSCWSDGKMFNALIHRYRPDLVDMERVQIQSNRENLEQAFEVAER-LGVTRLL 80
|
90 100
....*....|....*....|....*..
gi 1920237962 249 DPEDVDVPQPDEKSIITYVSSLYDAMP 275
Cdd:cd21240 81 DAEDVDVPSPDEKSVITYVSSIYDAFP 107
|
|
| CH_DMD-like_rpt1 |
cd21186 |
first calponin homology (CH) domain found in the dystrophin family; The dystrophin family ... |
43-158 |
9.84e-48 |
|
first calponin homology (CH) domain found in the dystrophin family; The dystrophin family includes dystrophin and its paralog, utrophin. Dystrophin, encoded by the DMD gene, is a large, submembrane cytoskeletal protein that is the main component of the dystrophin-glycoprotein complex (DGC) in skeletal muscles. It links the transmembrane DGC to the actin cytoskeleton through binding strongly to the cytoplasmic tail of beta-dystroglycan, the transmembrane subunit of a highly O-glycosylated cell-surface protein. Dystrophin is also involved in maintaining the structural integrity of cells, as well as in the formation of the blood-brain barrier (BBB). Utrophin, also called dystrophin-related protein 1 (DRP-1), is an autosomal dystrophin homolog that increases dystrophic muscle function and reduces pathology. It is broadly expressed in both the mRNA and protein levels, and occurs in the cerebrovascular endothelium. Utrophin forms the utrophin-glycoprotein complex (UGC) by interacting with dystroglycans (DGs) and sarcoglycan-dystroglycans, as well as sarcoglycan and sarcospan (SG-SSPN) subcomplexes. It may act as a scaffolding protein that stabilizes lipid microdomains and clusters mechanosensitive channel subunits, and links the F-actin cytoskeleton to the cell membrane via the associated glycoprotein complex. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409035 Cd Length: 107 Bit Score: 167.17 E-value: 9.84e-48
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 43 VQKKTFTKWVNKHLIKAQR-HISDLYEDLRDGHNLISLLEVLSGdslprerdvirsSRLPREKGRMRFHKLQNVQIALDY 121
Cdd:cd21186 2 VQKKTFTKWINSQLSKANKpPIKDLFEDLRDGTRLLALLEVLTG------------KKLKPEKGRMRVHHLNNVNRALQV 69
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237962 122 LRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQI 158
Cdd:cd21186 70 LEQNNVKLVNISSNDIVDGNPKLTLGLVWSIILHWQV 106
|
|
| CH_SPTB-like_rpt1 |
cd21246 |
first calponin homology (CH) domain found in the beta-I spectrin-like subfamily; The beta-I ... |
38-154 |
9.52e-46 |
|
first calponin homology (CH) domain found in the beta-I spectrin-like subfamily; The beta-I spectrin-like family includes beta-I, -II, -III and -IV spectrins. Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. Beta-I spectrin, also called spectrin beta chain, erythrocytic (SPTB), may be involved in anaemia pathogenesis. Beta-II spectrin, also called spectrin beta chain, non-erythrocytic 1 (SPTBN1), or fodrin beta chain, is a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. Beta-III spectrin, also called spectrin beta chain, non-erythrocytic 2 (SPTBN2), or spinocerebellar ataxia 5 protein (SCA5), may play a crucial role as a longer actin-membrane cross-linker or fulfill the need for greater extensible flexibility than can be provided by the other smaller conventional spectrins. Beta-IV spectrin is also called spectrin, non-erythroid beta chain 3 (SPTBN3) or spectrin beta chain, non-erythrocytic 4 (SPTBN4). Its mutation associates with congenital myopathy, neuropathy, and central deafness. Members of this subfamily contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409095 Cd Length: 117 Bit Score: 161.77 E-value: 9.52e-46
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 38 DERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRErdvirssrlprEKGRMRFHKLQNVQI 117
Cdd:cd21246 11 DEREAVQKKTFTKWVNSHLARVGCRINDLYTDLRDGRMLIKLLEVLSGERLPKP-----------TKGKMRIHCLENVDK 79
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237962 118 ALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIIL 154
Cdd:cd21246 80 ALQFLKEQRVHLENMGSHDIVDGNHRLTLGLIWTIIL 116
|
|
| CH_beta_spectrin_rpt2 |
cd21194 |
second calponin homology (CH) domain found in the beta spectrin family; The beta spectrin ... |
171-271 |
6.68e-45 |
|
second calponin homology (CH) domain found in the beta spectrin family; The beta spectrin family includes beta-I, -II, -III, -IV and -V spectrins. Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. Beta-I spectrin, also called spectrin beta chain, erythrocytic (SPTB), may be involved in anaemia pathogenesis. Beta-II spectrin, also called spectrin beta chain, non-erythrocytic 1 (SPTBN1), or fodrin beta chain, is a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. Beta-IV spectrin is also called spectrin, non-erythroid beta chain 3 (SPTBN3) or spectrin beta chain, non-erythrocytic 4 (SPTBN4). Its mutation associates with congenital myopathy, neuropathy, and central deafness. Beta-III spectrin is also called spectrin beta chain, non-erythrocytic 2 (SPTBN2), or spinocerebellar ataxia 5 protein (SCA5). Beta-V spectrin, also called spectrin beta chain, non-erythrocytic 5 (SPTBN5), is a mammalian ortholog of Drosophila beta H spectrin. Beta-III and Beta-V spectrins may play crucial roles as longer actin-membrane cross-linkers or fulfill the need for greater extensible flexibility than can be provided by the other smaller conventional spectrins. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409043 Cd Length: 105 Bit Score: 159.11 E-value: 6.68e-45
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 171 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 250
Cdd:cd21194 2 SAKDALLLWCQRKTAGYPGVNIQNFTTSWRDGLAFNALIHAHRPDLIDYNRLDPNDHLGNLNNAFDVAEQELGIAKLLDA 81
|
90 100
....*....|....*....|.
gi 1920237962 251 EDVDVPQPDEKSIITYVSSLY 271
Cdd:cd21194 82 EDVDVARPDEKSIMTYVASYY 102
|
|
| SAC6 |
COG5069 |
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton]; |
42-382 |
3.17e-43 |
|
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton];
Pssm-ID: 227401 [Multi-domain] Cd Length: 612 Bit Score: 170.12 E-value: 3.17e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 42 RVQKKTFTKWVNKHLIKA-QRHISDLYEDLRDGHNLISLLEVLSGDSLprerdvIRSSRLPRekgrMRFHKLQNVQIALD 120
Cdd:COG5069 8 KVQKKTFTKWTNEKLISGgQKEFGDLDTDLKDGVKLAQLLEALQKDNA------GEYNETPE----TRIHVMENVSGRLE 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 121 YLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQvsgQSEDMTAKEKLLLWSQRMVEGCQ-GLRCDNFTTSW 199
Cdd:COG5069 78 FIKGKGVKLFNIGPQDIVDGNPKLILGLIWSLISRLTIATIN---EEGELTKHINLLLWCDEDTGGYKpEVDTFDFFRSW 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 200 RDGRLFNAIIHRHKPTLIDMNKVYRQTNLE--NLDQAFSVAERDLGVTRLLDPEDV-DVPQPDEKSIITYVS------SL 270
Cdd:COG5069 155 RDGLAFSALIHDSRPDTLDPNVLDLQKKNKalNNFQAFENANKVIGIARLIGVEDIvNVSIPDERSIMTYVSwyiirfGL 234
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 271 YD----AMPRVPDVQDGVKANElQLRwQEYRELVLLLLQWIRAHTAGFEERRFPSSFEEIEILWCQFLKFKETE--LPAK 344
Cdd:COG5069 235 LEkidiALHRVYRLLEADETLI-QLR-LPYEIILLRLLNLIHLKQANWKVVNFSKDVSDGENYTDLLNQLNALCsrAPLE 312
|
330 340 350
....*....|....*....|....*....|....*....
gi 1920237962 345 EAD-KNRSKGIYQSLEgAVQAGQLKVPPGYHPLDVEKEW 382
Cdd:COG5069 313 TTDlHSLAGQILQNAE-KYDCRKYLPPAGNPKLDLAFVA 350
|
|
| CH_SPTB_like_rpt2 |
cd21248 |
second calponin homology (CH) domain found in the beta-I spectrin-like subfamily; The beta-I ... |
171-271 |
4.64e-43 |
|
second calponin homology (CH) domain found in the beta-I spectrin-like subfamily; The beta-I spectrin-like family includes beta-I, -II, -III and -IV spectrins. Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. Beta-I spectrin, also called spectrin beta chain, erythrocytic (SPTB), may be involved in anaemia pathogenesis. Beta-II spectrin, also called spectrin beta chain, non-erythrocytic 1 (SPTBN1), or fodrin beta chain, is a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. Beta-III spectrin, also called spectrin beta chain, non-erythrocytic 2 (SPTBN2), or spinocerebellar ataxia 5 protein (SCA5), may play a crucial role as a longer actin-membrane cross-linker or fulfill the need for greater extensible flexibility than can be provided by the other smaller conventional spectrins. Beta-IV spectrin is also called spectrin, non-erythroid beta chain 3 (SPTBN3) or spectrin beta chain, non-erythrocytic 4 (SPTBN4). Its mutation associates with congenital myopathy, neuropathy, and central deafness. Members of this subfamily contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409097 Cd Length: 105 Bit Score: 153.71 E-value: 4.64e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 171 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 250
Cdd:cd21248 2 SAKDALLLWCQMKTAGYPNVNVRNFTTSWRDGLAFNALIHKHRPDLIDYDKLSKSNALYNLQNAFNVAEQKLGLTKLLDP 81
|
90 100
....*....|....*....|.
gi 1920237962 251 EDVDVPQPDEKSIITYVSSLY 271
Cdd:cd21248 82 EDVNVEQPDEKSIITYVVTYY 102
|
|
| CH_beta_spectrin_rpt1 |
cd21193 |
first calponin homology (CH) domain found in the beta spectrin family; The beta spectrin ... |
37-154 |
4.34e-42 |
|
first calponin homology (CH) domain found in the beta spectrin family; The beta spectrin family includes beta-I, -II, -III, -IV and -V spectrins. Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. Beta-I spectrin, also called spectrin beta chain, erythrocytic (SPTB), may be involved in anaemia pathogenesis. Beta-II spectrin, also called spectrin beta chain, non-erythrocytic 1 (SPTBN1), or fodrin beta chain, is a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. Beta-IV spectrin is also called spectrin, non-erythroid beta chain 3 (SPTBN3) or spectrin beta chain, non-erythrocytic 4 (SPTBN4). Its mutation associates with congenital myopathy, neuropathy, and central deafness. Beta-III spectrin is also called spectrin beta chain, non-erythrocytic 2 (SPTBN2), or spinocerebellar ataxia 5 protein (SCA5). Beta-V spectrin, also called spectrin beta chain, non-erythrocytic 5 (SPTBN5), is a mammalian ortholog of Drosophila beta H spectrin. Beta-III and Beta-V spectrins may play crucial roles as longer actin-membrane cross-linkers or fulfill the need for greater extensible flexibility than can be provided by the other smaller conventional spectrins. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409042 Cd Length: 116 Bit Score: 151.29 E-value: 4.34e-42
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 37 KDERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRErdvirssrlprEKGRMRFHKLQNVQ 116
Cdd:cd21193 10 QEERINIQKKTFTKWINSFLEKANLEIGDLFTDLSDGKLLLKLLEIISGEKLGKP-----------NRGRLRVQKIENVN 78
|
90 100 110
....*....|....*....|....*....|....*...
gi 1920237962 117 IALDYLrHRQVKLVNIRNDDIADGNPKLTLGLIWTIIL 154
Cdd:cd21193 79 KALAFL-KTKVRLENIGAEDIVDGNPRLILGLIWTIIL 115
|
|
| CH_SYNE1_rpt1 |
cd21241 |
first calponin homology (CH) domain found in synaptic nuclear envelope protein 1 and similar ... |
39-158 |
9.75e-42 |
|
first calponin homology (CH) domain found in synaptic nuclear envelope protein 1 and similar proteins; Synaptic nuclear envelope protein 1 (SYNE-1), also called nesprin-1, enaptin, KASH domain-containing protein 1 (KASH1), myocyte nuclear envelope protein 1 (MYNE-1), or nuclear envelope spectrin repeat protein 1, is a multi-isomeric modular protein which forms a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. SYNE-1 also acts as a component of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. SYNE-1 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409090 Cd Length: 113 Bit Score: 150.22 E-value: 9.75e-42
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 39 ERDRVQKKTFTKWVNKHLIKAQR--HISDLYEDLRDGHNLISLLEVLSGDslprerdvirssRLPREKGRM--RFHKLQN 114
Cdd:cd21241 1 EQERVQKKTFTNWINSYLAKRKPpmKVEDLFEDIKDGTKLLALLEVLSGE------------KLPCEKGRRlkRVHFLSN 68
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 1920237962 115 VQIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQI 158
Cdd:cd21241 69 INTALKFLESKKIKLVNINPTDIVDGKPSIVLGLIWTIILYFQI 112
|
|
| CH_SYNE1_rpt2 |
cd21243 |
second calponin homology (CH) domain found in synaptic nuclear envelope protein 1 (SYNE-1) and ... |
170-275 |
5.47e-41 |
|
second calponin homology (CH) domain found in synaptic nuclear envelope protein 1 (SYNE-1) and similar proteins; SYNE-1, also called nesprin-1, enaptin, KASH domain-containing protein 1 (KASH1), myocyte nuclear envelope protein 1 (MYNE-1), or nuclear envelope spectrin repeat protein 1, is a multi-isomeric modular protein which forms a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. SYNE-1 also acts as a component of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. SYNE-1 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409092 Cd Length: 109 Bit Score: 147.85 E-value: 5.47e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 170 MTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 249
Cdd:cd21243 4 GGAKKALLKWVQNAAAKRFGIEVKDFGPSWRDGVAFNAIIHSIRPDLVDMESLKRRSNRENLETAFTVAEKELGIPRLLD 83
|
90 100
....*....|....*....|....*.
gi 1920237962 250 PEDVDVPQPDEKSIITYVSSLYDAMP 275
Cdd:cd21243 84 PEDVDVDKPDEKSIMTYVAQFLKKYP 109
|
|
| CH_SYNE-like_rpt1 |
cd21190 |
first calponin homology (CH) domain found in the synaptic nuclear envelope protein family; The ... |
39-158 |
7.26e-41 |
|
first calponin homology (CH) domain found in the synaptic nuclear envelope protein family; The synaptic nuclear envelope (SYNE) family includes SYNE-1, -2 and calmin. SYNE-1 (also called nesprin-1, enaptin, KASH domain-containing protein 1, KASH1, myocyte nuclear envelope protein 1, MYNE-1, or nuclear envelope spectrin repeat protein 1) and SYNE-2 (also called nesprin-2, KASH domain-containing protein 2, KASH2, nuclear envelope spectrin repeat protein 2, nucleus and actin connecting element protein, or protein NUANCE) may act redundantly. They are multi-isomeric modular proteins which form a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. They also act as components of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. Calmin, also called calponin-like transmembrane domain protein, is a protein with calponin homology (CH) and transmembrane domains expressed in maturing spermatogenic cells. It may be involved in the development and/or maintenance of neuronal functions. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409039 Cd Length: 113 Bit Score: 147.72 E-value: 7.26e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 39 ERDRVQKKTFTKWVNKHLikaQRH-----ISDLYEDLRDGHNLISLLEVLSGDslprerdvirssRLPREKGRM--RFHK 111
Cdd:cd21190 1 EQERVQKKTFTNWINSHL---AKLsqpivINDLFVDIKDGTALLRLLEVLSGQ------------KLPIESGRVlqRAHK 65
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 1920237962 112 LQNVQIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQI 158
Cdd:cd21190 66 LSNIRNALDFLTKRCIKLVNINSTDIVDGKPSIVLGLIWTIILYFQI 112
|
|
| CH_ACTN_rpt2 |
cd21216 |
second calponin homology (CH) domain found in the alpha-actinin family; The alpha-actinin ... |
158-273 |
8.66e-40 |
|
second calponin homology (CH) domain found in the alpha-actinin family; The alpha-actinin (ACTN) family includes alpha-actinin-1, -2, -3, and -4. They are F-actin cross-linking proteins which are thought to anchor actin to a variety of intracellular structures. ACTN1 mutations cause congenital macrothrombocytopenia. ACTN2 mutations are associated with cardiomyopathies, as well as skeletal muscle disorder. ACTN3 is critical in anchoring the myofibrillar actin filaments and plays a key role in muscle contraction. ACTN4 is associated with cell motility and cancer invasion. It is probably involved in vesicular trafficking via its association with the CART complex, which is necessary for efficient transferrin receptor recycling but not for epidermal growth factor receptor (EGFR) degradation. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409065 Cd Length: 115 Bit Score: 144.81 E-value: 8.66e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 158 ISDIQVsgqsEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSV 237
Cdd:cd21216 1 IQDISV----EELSAKEGLLLWCQRKTAPYKNVNVQNFHTSWKDGLAFCALIHRHRPDLLDYDKLRKDDPRENLNLAFDV 76
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237962 238 AERDLGVTRLLDPED-VDVPQPDEKSIITYVSSLYDA 273
Cdd:cd21216 77 AEKHLDIPKMLDAEDiVNTPRPDERSVMTYVSCYYHA 113
|
|
| CH_SPTBN4_rpt1 |
cd21318 |
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 4 (SPTBN4) ... |
38-154 |
3.41e-38 |
|
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 4 (SPTBN4) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN4, also called beta-IV spectrin, or spectrin, non-erythroid beta chain 3 (SPTBN3), is a novel spectrin isolated as an interactor of the receptor tyrosine phosphatase-like protein ICA512. Its mutation associates with congenital myopathy, neuropathy, and central deafness. SPTBN4 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409167 Cd Length: 139 Bit Score: 141.32 E-value: 3.41e-38
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 38 DERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRErdvirssrlprEKGRMRFHKLQNVQI 117
Cdd:cd21318 33 DEREAVQKKTFTKWVNSHLARVPCRINDLYTDLRDGYVLTRLLEVLSGEQLPKP-----------TRGRMRIHSLENVDK 101
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237962 118 ALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIIL 154
Cdd:cd21318 102 ALQFLKEQRVHLENVGSHDIVDGNHRLTLGLIWTIIL 138
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1314-1889 |
1.14e-37 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 156.64 E-value: 1.14e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1314 DLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEErERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRM 1393
Cdd:COG1196 226 EAELLLLKLRELEAELEELEAELEELEAELEELEAELAELE-AELEELRLELEELELELEEAQAEEYELLAELARLEQDI 304
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1394 QEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLrIEEEIRVVRLQLEATERQRGGAEGELQA 1473
Cdd:COG1196 305 ARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEE-AEAELAEAEEALLEAEAELAEAEEELEE 383
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1474 LRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEA 1553
Cdd:COG1196 384 LAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLE 463
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1554 ERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEAtrraqqqaEAERARAEAERELERWQLK 1633
Cdd:COG1196 464 LLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLL--------AGLRGLAGAVAVLIGVEAA 535
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1634 ANEALRLRLQAeevAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIR 1713
Cdd:COG1196 536 YEAALEAALAA---ALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREAD 612
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1714 LRAETEQGEqqrqLLEEELARLQREAAAATqkRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfr 1793
Cdd:COG1196 613 ARYYVLGDT----LLGRTLVAARLEAALRR--AVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELE---- 682
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1794 ELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQR 1873
Cdd:COG1196 683 ELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDL 762
|
570
....*....|....*.
gi 1920237962 1874 RLLEEQAAQHKADIEA 1889
Cdd:COG1196 763 EELERELERLEREIEA 778
|
|
| CH_ACTN_rpt1 |
cd21214 |
first calponin homology (CH) domain found in the alpha-actinin family; The alpha-actinin (ACTN) ... |
41-154 |
1.49e-37 |
|
first calponin homology (CH) domain found in the alpha-actinin family; The alpha-actinin (ACTN) family includes alpha-actinin-1, -2, -3, and -4. They are F-actin cross-linking proteins which are thought to anchor actin to a variety of intracellular structures. ACTN1 mutations cause congenital macrothrombocytopenia. ACTN2 mutations are associated with cardiomyopathies, as well as skeletal muscle disorder. ACTN3 is critical in anchoring the myofibrillar actin filaments and plays a key role in muscle contraction. ACTN4 is associated with cell motility and cancer invasion. It is probably involved in vesicular trafficking via its association with the CART complex, which is necessary for efficient transferrin receptor recycling but not for epidermal growth factor receptor (EGFR) degradation. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409063 Cd Length: 105 Bit Score: 137.91 E-value: 1.49e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 41 DRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPrerdvirssrlPREKGRMRFHKLQNVQIALD 120
Cdd:cd21214 3 EKQQRKTFTAWCNSHLRKAGTQIENIEEDFRDGLKLMLLLEVISGERLP-----------KPERGKMRFHKIANVNKALD 71
|
90 100 110
....*....|....*....|....*....|....
gi 1920237962 121 YLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIIL 154
Cdd:cd21214 72 FIASKGVKLVSIGAEEIVDGNLKMTLGMIWTIIL 105
|
|
| CH_SPTBN2_rpt2 |
cd21321 |
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 2 (SPTBN2) ... |
167-271 |
2.07e-37 |
|
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 2 (SPTBN2) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN2, also called beta-III spectrin, or spinocerebellar ataxia 5 protein (SCA5), probably plays an important role in the neuronal membrane skeleton. Mutations in SPTBN2 is associated with spinocerebellar ataxia type 5. SPTBN2 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409170 Cd Length: 119 Bit Score: 138.27 E-value: 2.07e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 167 SEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTR 246
Cdd:cd21321 1 KEKKSAKDALLLWCQMKTAGYPNVNVHNFTTSWRDGLAFNAIVHKHRPDLIDFETLKKSNAHYNLQNAFNVAEKELGLTK 80
|
90 100
....*....|....*....|....*
gi 1920237962 247 LLDPEDVDVPQPDEKSIITYVSSLY 271
Cdd:cd21321 81 LLDPEDVNVDQPDEKSIITYVATYY 105
|
|
| CH_SPTBN2_rpt1 |
cd21317 |
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 2 (SPTBN2) ... |
37-154 |
3.66e-37 |
|
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 2 (SPTBN2) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN2, also called beta-III spectrin, or spinocerebellar ataxia 5 protein (SCA5), probably plays an important role in the neuronal membrane skeleton. Mutations in SPTBN2 is associated with spinocerebellar ataxia type 5. SPTBN2 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409166 Cd Length: 132 Bit Score: 137.88 E-value: 3.66e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 37 KDERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRErdvirssrlprEKGRMRFHKLQNVQ 116
Cdd:cd21317 25 ADEREAVQKKTFTKWVNSHLARVTCRIGDLYTDLRDGRMLIRLLEVLSGEQLPKP-----------TKGRMRIHCLENVD 93
|
90 100 110
....*....|....*....|....*....|....*...
gi 1920237962 117 IALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIIL 154
Cdd:cd21317 94 KALQFLKEQKVHLENMGSHDIVDGNHRLTLGLIWTIIL 131
|
|
| CH_SPTB_rpt2 |
cd21319 |
second calponin homology (CH) domain found in spectrin beta chain, erythrocytic (SPTB) and ... |
167-271 |
4.48e-37 |
|
second calponin homology (CH) domain found in spectrin beta chain, erythrocytic (SPTB) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTB, also called beta-I spectrin, may be involved in anaemia pathogenesis. SPTB contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409168 Cd Length: 112 Bit Score: 137.06 E-value: 4.48e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 167 SEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTR 246
Cdd:cd21319 1 RETRSAKDALLLWCQMKTAGYPNVNVTNFTSSWKDGLAFNALIHKHRPDLVDFGKLKKSNARHNLEHAFNVAERQLGITK 80
|
90 100
....*....|....*....|....*
gi 1920237962 247 LLDPEDVDVPQPDEKSIITYVSSLY 271
Cdd:cd21319 81 LLDPEDVFTENPDEKSIITYVVAFY 105
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1342-2047 |
1.04e-36 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 154.53 E-value: 1.04e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1342 EEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSiqEELQHL 1421
Cdd:PTZ00121 1101 EEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKA--EDAKKA 1178
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1422 RQSSEAEIQAKARQVEAAERSRlRIEEEIRVVRLQlEATERQRGGAEGELQALRaRAEEAeaqkRQAQEEAERLRRQVQD 1501
Cdd:PTZ00121 1179 EAARKAEEVRKAEELRKAEDAR-KAEAARKAEEER-KAEEARKAEDAKKAEAVK-KAEEA----KKDAEEAKKAEEERNN 1251
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1502 ETQRKRQAEAELALRVQAEAEAAREKQRAlqalEELRlQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASF 1581
Cdd:PTZ00121 1252 EEIRKFEEARMAHFARRQAAIKAEEARKA----DELK-KAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEE 1326
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1582 AEKTAQ-LERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEAlrlRLQAEEVAQQKSLTQaeaek 1660
Cdd:PTZ00121 1327 AKKKADaAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAA---KKKAEEKKKADEAKK----- 1398
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1661 qkeeaerearrrgKAEEQAVRQRELAEQELEKQR-QLAEGTAQQRLAAEqELIRLRAETEQGEQQRQLLE-----EELAR 1734
Cdd:PTZ00121 1399 -------------KAEEDKKKADELKKAAAAKKKaDEAKKKAEEKKKAD-EAKKKAEEAKKADEAKKKAEeakkaEEAKK 1464
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1735 LQREAAAATQKRRELE----AELAKVRAEmevllASKARAEEESRSTSEKSKqrleAEAGRFRELAEEAARLRAlAEEAK 1810
Cdd:PTZ00121 1465 KAEEAKKADEAKKKAEeakkADEAKKKAE-----EAKKKADEAKKAAEAKKK----ADEAKKAEEAKKADEAKK-AEEAK 1534
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1811 RQRQLAEEDAVRQRAEAERvlAEKLAAISEatrlKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEAR 1890
Cdd:PTZ00121 1535 KADEAKKAEEKKKADELKK--AEELKKAEE----KKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMK 1608
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1891 LAQLRKASESELERQKGLVEDTLRQRrqveEEILALKGSFEKAAAGKAELELELGRIRGT-----AEDTLRSKEQA--EQ 1963
Cdd:PTZ00121 1609 AEEAKKAEEAKIKAEELKKAEEEKKK----VEQLKKKEAEEKKKAEELKKAEEENKIKAAeeakkAEEDKKKAEEAkkAE 1684
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1964 EAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEE----VERLKAKVEEARRLRERAEQESARQLQLAQEAA 2039
Cdd:PTZ00121 1685 EDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEEnkikAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKK 1764
|
....*...
gi 1920237962 2040 QKRLQAEE 2047
Cdd:PTZ00121 1765 EEEKKAEE 1772
|
|
| CH_SpAIN1-like_rpt1 |
cd21215 |
first calponin homology (CH) domain found in Schizosaccharomyces pombe alpha-actinin-like ... |
43-156 |
1.24e-36 |
|
first calponin homology (CH) domain found in Schizosaccharomyces pombe alpha-actinin-like protein 1 and similar proteins; Schizosaccharomyces pombe alpha-actinin-like protein 1 (SpAIN1) binds to actin and is involved in actin-ring formation and organization. It plays a role in cytokinesis and is involved in septation. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409064 Cd Length: 107 Bit Score: 135.61 E-value: 1.24e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 43 VQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRERdvirssrlprEKGRMRFHKLQNVQIALDYL 122
Cdd:cd21215 4 VQKKTFTKWLNTKLSSRGLSITDLVTDLSDGVRLIQLLEIIGDESLGRYN----------KNPKMRVQKLENVNKALEFI 73
|
90 100 110
....*....|....*....|....*....|....
gi 1920237962 123 RHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHF 156
Cdd:cd21215 74 KSRGVKLTNIGAEDIVDGNLKLILGLLWTLILRF 107
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1347-1948 |
1.66e-36 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 152.78 E-value: 1.66e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1347 AEQQRAEERERLAEVEAAL-EKQRQLAEAHAQAKaQAEReAQGLQrrmqeevarreevaveAQEQKRSIQEELQHLRQSs 1425
Cdd:COG1196 177 AERKLEATEENLERLEDILgELERQLEPLERQAE-KAER-YRELK----------------EELKELEAELLLLKLREL- 237
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1426 EAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQR 1505
Cdd:COG1196 238 EAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEER 317
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1506 KRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALEtaQRSAEAELQSEHASFAEKT 1585
Cdd:COG1196 318 LEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAE--AEEELEELAEELLEALRAA 395
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1586 AQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEA 1665
Cdd:COG1196 396 AELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALL 475
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1666 EREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIR------LRAETEQGEQQRQLLEEELARLQREA 1739
Cdd:COG1196 476 EAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAgavavlIGVEAAYEAALEAALAAALQNIVVED 555
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1740 AAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEED 1819
Cdd:COG1196 556 DEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAA 635
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1820 AVRQRAEAERVLAEKLAA--ISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKA 1897
Cdd:COG1196 636 LRRAVTLAGRLREVTLEGegGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEE 715
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|.
gi 1920237962 1898 SESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIR 1948
Cdd:COG1196 716 RLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELE 766
|
|
| CH_SYNE2_rpt1 |
cd21242 |
first calponin homology (CH) domain found in synaptic nuclear envelope protein 2; Synaptic ... |
39-158 |
6.23e-36 |
|
first calponin homology (CH) domain found in synaptic nuclear envelope protein 2; Synaptic nuclear envelope protein 2 (SYNE-2), also called nesprin-2, KASH domain-containing protein 2 (KASH2), nuclear envelope spectrin repeat protein 2, nucleus and actin connecting element protein, or protein NUANCE, is a multi-isomeric modular protein which forms a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. SYNE-2 also acts as a component of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. SYNE-2 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409091 Cd Length: 111 Bit Score: 133.80 E-value: 6.23e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 39 ERDRVQKKTFTKWVNKHLIKAQ--RHISDLYEDLRDGHNLISLLEVLSGDslprerdvirssRLPREKGRMRFHKLQNVQ 116
Cdd:cd21242 1 EQEQTQKRTFTNWINSQLAKHSppSVVSDLFTDIQDGHRLLDLLEVLSGQ------------QLPREKGHNVFQCRSNIE 68
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 1920237962 117 IALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQI 158
Cdd:cd21242 69 TALSFLKNKSIKLINIHVPDIIEGKPSIILGLIWTIILHFHI 110
|
|
| CH_SPTBN5_rpt2 |
cd21249 |
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 5 (SPTBN5) ... |
170-271 |
1.12e-35 |
|
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 5 (SPTBN5) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN5, also called beta-V spectrin, is a mammalian ortholog of Drosophila beta H spectrin that may play a crucial role as a longer actin-membrane cross-linker or to fulfill the need for greater extensible flexibility than can be provided by the other smaller conventional spectrins. SPTBN5 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409098 Cd Length: 109 Bit Score: 132.68 E-value: 1.12e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 170 MTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 249
Cdd:cd21249 3 RSAKEALLIWCQRKTAGYTNVNVQDFSRSWRDGLAFNALIHAHRPDLIDYGSLRPDRPLYNLANAFLVAEQELGISQLLD 82
|
90 100
....*....|....*....|..
gi 1920237962 250 PEDVDVPQPDEKSIITYVSSLY 271
Cdd:cd21249 83 PEDVAVPHPDERSIMTYVSLYY 104
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1474-2074 |
3.68e-35 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 148.16 E-value: 3.68e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1474 LRARAEEAEAQKRQAQEEAERL---RRQVqdETQR---KRQAE-AELALRVQAEAEAaREKQRALQALEELRLQAEEAER 1546
Cdd:COG1196 170 YKERKEEAERKLEATEENLERLediLGEL--ERQLeplERQAEkAERYRELKEELKE-LEAELLLLKLRELEAELEELEA 246
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1547 RLRQAEAERARqvqvaLETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAvvQLREEATRRAQQQAEAERARAEAERE 1626
Cdd:COG1196 247 ELEELEAELEE-----LEAELAELEAELEELRLELEELELELEEAQAEEYEL--LAELARLEQDIARLEERRRELEERLE 319
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1627 LERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEaerearrrgKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLA 1706
Cdd:COG1196 320 ELEEELAELEEELEELEEELEELEEELEEAEEELEEAE---------AELAEAEEALLEAEAELAEAEEELEELAEELLE 390
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1707 AEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLE 1786
Cdd:COG1196 391 ALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLE 470
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1787 AEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLA----AISEATRLKTEAEIALKEKEAENERL 1862
Cdd:COG1196 471 EAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRglagAVAVLIGVEAAYEAALEAALAAALQN 550
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1863 RRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELEL 1942
Cdd:COG1196 551 IVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAA 630
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1943 ELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRE 2022
Cdd:COG1196 631 RLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELA 710
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 2023 RAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLER 2074
Cdd:COG1196 711 EAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDL 762
|
|
| CH_DMD-like_rpt2 |
cd21187 |
second calponin homology (CH) domain found in the dystrophin family; The dystrophin family ... |
174-275 |
1.23e-34 |
|
second calponin homology (CH) domain found in the dystrophin family; The dystrophin family includes dystrophin and its paralog, utrophin. Dystrophin, encoded by the DMD gene, is a large, submembrane cytoskeletal protein that is the main component of the dystrophin-glycoprotein complex (DGC) in skeletal muscles. It links the transmembrane DGC to the actin cytoskeleton through binding strongly to the cytoplasmic tail of beta-dystroglycan, the transmembrane subunit of a highly O-glycosylated cell-surface protein. Dystrophin is also involved in maintaining the structural integrity of cells, as well as in the formation of the blood-brain barrier (BBB). Utrophin, also called dystrophin-related protein 1 (DRP-1), is an autosomal dystrophin homolog that increases dystrophic muscle function and reduces pathology. It is broadly expressed in both the mRNA and protein levels, and occurs in the cerebrovascular endothelium. Utrophin forms the utrophin-glycoprotein complex (UGC) by interacting with dystroglycans (DGs) and sarcoglycan-dystroglycans, as well as sarcoglycan and sarcospan (SG-SSPN) subcomplexes. It may act as a scaffolding protein that stabilizes lipid microdomains and clusters mechanosensitive channel subunits, and link the F-actin cytoskeleton to the cell membrane via the associated glycoprotein complex. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409036 Cd Length: 104 Bit Score: 129.47 E-value: 1.23e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 174 EKLLL-WSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 252
Cdd:cd21187 2 EKTLLaWCRQSTRGYEQVDVKNFTTSWRDGLAFNALIHRHRPDLFDFDSLVKDSPESRLEHAFTVAHEHLGIEKLLDPED 81
|
90 100
....*....|....*....|...
gi 1920237962 253 VDVPQPDEKSIITYVSSLYDAMP 275
Cdd:cd21187 82 VNVEQPDKKSILMYVTSLFQVLP 104
|
|
| CH_SPTBN4_rpt2 |
cd21322 |
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 4 (SPTBN4) ... |
155-271 |
1.54e-34 |
|
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 4 (SPTBN4) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN4, also called beta-IV spectrin, or spectrin, non-erythroid beta chain 3 (SPTBN3), is a novel spectrin isolated as an interactor of the receptor tyrosine phosphatase-like protein ICA512. Its mutation associates with congenital myopathy, neuropathy, and central deafness. SPTBN4 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409171 Cd Length: 130 Bit Score: 130.56 E-value: 1.54e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 155 HFQISDIQVSGQSEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQA 234
Cdd:cd21322 1 QIQVIKIETEDNRETRSAKDALLLWCQMKTAGYPEVNIQNFTTSWRDGLAFNALIHRHRPDLIDFSKLTKSNATYNLQQA 80
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237962 235 FSVAERDLGVTRLLDPEDVDVPQPDEKSIITYVSSLY 271
Cdd:cd21322 81 FNTAEQHLGLTKLLDPEDVNMEAPDEKSIITYVVSFY 117
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1677-2245 |
2.86e-34 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 145.46 E-value: 2.86e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1677 EQAVRQRELAEQELEKQRQLAegtAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV 1756
Cdd:COG1196 210 EKAERYRELKEELKELEAELL---LLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEA 286
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1757 RAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQlAEEDAVRQRAEAERVLAEKLA 1836
Cdd:COG1196 287 QAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEE-ELEEAEEELEEAEAELAEAEE 365
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1837 AISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKgLVEDTLRQR 1916
Cdd:COG1196 366 ALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEE-EEEEEEEAL 444
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1917 RQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEA 1996
Cdd:COG1196 445 EEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAG 524
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1997 ARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRL-----QAEEKAHAFAVQQKEQELQQTLQQEQSV 2071
Cdd:COG1196 525 AVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAgratfLPLDKIRARAALAAALARGAIGAAVDLV 604
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2072 LERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQA 2151
Cdd:COG1196 605 ASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEEL 684
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2152 ALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRV 2231
Cdd:COG1196 685 AERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEE 764
|
570
....*....|....
gi 1920237962 2232 QMEELGKLKARIEA 2245
Cdd:COG1196 765 LERELERLEREIEA 778
|
|
| CH_DMD_rpt1 |
cd21231 |
first calponin homology (CH) domain found in dystrophin and similar proteins; Dystrophin, ... |
38-158 |
6.60e-34 |
|
first calponin homology (CH) domain found in dystrophin and similar proteins; Dystrophin, encoded by the DMD gene, is a large, submembrane cytoskeletal protein that is the main component of the dystrophin-glycoprotein complex (DGC) in skeletal muscles. It links the transmembrane DGC to the actin cytoskeleton through binding strongly to the cytoplasmic tail of beta-dystroglycan, the transmembrane subunit of a highly O-glycosylated cell-surface protein. It is involved in maintaining the structural integrity of cells, as well as in the formation of the blood-brain barrier (BBB). Mutations in dystrophin lead to Duchenne muscular dystrophy (DMD). Moreover, dystrophin deficiency is associated with abnormal cerebral diffusion and perfusion, as well as in acute Trypanosoma cruzi infection. The dystrophin subfamily has been characterized by a compact cluster of domains comprising four EF-hand-like motifs and a ZZ-domain, followed by a looser region with two coiled-coils. These domains are believed to be involved in protein-protein interactions. In addition, dystrophin contains two syntrophin binding sites (SBSs) and a long N-terminal extension that comprises two actin-binding calponin homology (CH) domains, approximately 24 spectrin repeats (SRs) and a WW domain. This model corresponds to the first CH domain.
Pssm-ID: 409080 Cd Length: 111 Bit Score: 127.73 E-value: 6.60e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 38 DERDRVQKKTFTKWVNKHLIKAQR-HISDLYEDLRDGHNLISLLEVLSGdslprerdvirsSRLPREKGRMRFHKLQNVQ 116
Cdd:cd21231 1 YEREDVQKKTFTKWINAQFAKFGKpPIEDLFTDLQDGRRLLELLEGLTG------------QKLVKEKGSTRVHALNNVN 68
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 1920237962 117 IALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQI 158
Cdd:cd21231 69 KALQVLQKNNVDLVNIGSADIVDGNHKLTLGLIWSIILHWQV 110
|
|
| CH_SPTBN1_rpt2 |
cd21320 |
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 1 (SPTBN1) ... |
171-271 |
5.66e-32 |
|
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 1 (SPTBN1) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN1, also called beta-II spectrin, fodrin beta chain, or spectrin, non-erythroid beta chain 1, is also a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. SPTBN1 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409169 Cd Length: 108 Bit Score: 122.13 E-value: 5.66e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 171 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 250
Cdd:cd21320 2 SAKDALLLWCQMKTAGYPNVNIHNFTTSWRDGMAFNALIHKHRPDLIDFDKLKKSNAHYNLQNAFNLAEQHLGLTKLLDP 81
|
90 100
....*....|....*....|.
gi 1920237962 251 EDVDVPQPDEKSIITYVSSLY 271
Cdd:cd21320 82 EDISVDHPDEKSIITYVVTYY 102
|
|
| CH_dFLNA-like_rpt1 |
cd21311 |
first calponin homology (CH) domain found in Drosophila melanogaster filamin-A (dFLNA) and ... |
42-159 |
6.05e-32 |
|
first calponin homology (CH) domain found in Drosophila melanogaster filamin-A (dFLNA) and similar proteins; Drosophila melanogaster filamin-A (dFLNA or dFLN-A), also called actin-binding protein 280 (ABP-280) or filamin-1, is involved in germline ring canal formation. It may tether actin microfilaments within the ovarian ring canal to the cell membrane and contributes to actin microfilament organization. dFLNA contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409160 Cd Length: 124 Bit Score: 122.56 E-value: 6.05e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 42 RVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRErdvirssrlpREKGRMRFHKLQNVQIALDY 121
Cdd:cd21311 14 RIQQNTFTRWANEHLKTANKHIADLETDLSDGLRLIALVEVLSGKKFPKF----------NKRPTFRSQKLENVSVALKF 83
|
90 100 110
....*....|....*....|....*....|....*....
gi 1920237962 122 LRHRQ-VKLVNIRNDDIADGNPKLTLGLIWTIILHFQIS 159
Cdd:cd21311 84 LEEDEgIKIVNIDSSDIVDGKLKLILGLIWTLILHYSIS 122
|
|
| CH_jitterbug-like_rpt1 |
cd21227 |
first calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and ... |
43-158 |
1.09e-31 |
|
first calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and similar proteins; Protein jitterbug (Jbug) is an actin-meshwork organizing protein. It is required to maintain the shape and cell orientation of the Drosophila notum epithelium during flight muscle attachment to tendon cells. Jbug contains three copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409076 Cd Length: 109 Bit Score: 121.24 E-value: 1.09e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 43 VQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRerdVIRssrlpreKGRMRFHKLQNVQIALDYL 122
Cdd:cd21227 4 IQKNTFTNWVNEQLKPTGMSVEDLATDLEDGVKLIALVEILQGRKLGR---VIK-------KPLNQHQKLENVTLALKAM 73
|
90 100 110
....*....|....*....|....*....|....*.
gi 1920237962 123 RHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQI 158
Cdd:cd21227 74 AEDGIKLVNIGNEDIVNGNLKLILGLIWHLILRYQI 109
|
|
| CH_SYNE-like_rpt2 |
cd21192 |
second calponin homology (CH) domain found in the synaptic nuclear envelope protein (SYNE) ... |
170-268 |
4.19e-31 |
|
second calponin homology (CH) domain found in the synaptic nuclear envelope protein (SYNE) family; The SYNE family includes SYNE-1, -2 and calmin. SYNE-1 (also called nesprin-1, enaptin, KASH domain-containing protein 1, KASH1, myocyte nuclear envelope protein 1, MYNE-1, or nuclear envelope spectrin repeat protein 1) and SYNE-2 (also called nesprin-2, KASH domain-containing protein 2, KASH2, nuclear envelope spectrin repeat protein 2, nucleus and actin connecting element protein, or protein NUANCE) may act redundantly. They are multi-isomeric modular proteins which form a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. They also act as components of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. Calmin, also called calponin-like transmembrane domain protein, is a protein with calponin homology (CH) and transmembrane domains expressed in maturing spermatogenic cells. It may be involved in the development and/or maintenance of neuronal functions. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409041 Cd Length: 107 Bit Score: 119.84 E-value: 4.19e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 170 MTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 249
Cdd:cd21192 2 GSAEKALLKWVQAEIGKYYGIRVTDFDKSWRDGVAFLALIHAIRPDLVDMKTVKNRSPRDNLELAFRIAEQHLNIPRLLE 81
|
90
....*....|....*....
gi 1920237962 250 PEDVDVPQPDEKSIITYVS 268
Cdd:cd21192 82 VEDVLVDKPDERSIMTYVS 100
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1335-1939 |
1.19e-30 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 134.50 E-value: 1.19e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1335 ETLRRMEEEERLAEQQRAE------------ERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREE 1402
Cdd:PTZ00121 1209 EEERKAEEARKAEDAKKAEavkkaeeakkdaEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEE 1288
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1403 V--AVEAQ--EQKRSIQEELQHLRQSSEAEiQAKARQVEA---AERSRLRIEEEirvvRLQLEATERQRGGAEGELQALR 1475
Cdd:PTZ00121 1289 KkkADEAKkaEEKKKADEAKKKAEEAKKAD-EAKKKAEEAkkkADAAKKKAEEA----KKAAEAAKAEAEAAADEAEAAE 1363
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1476 ARAEEAEAQKRQAQEEAERLRRQVQ-----DETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRlQAEEAERRLRQ 1550
Cdd:PTZ00121 1364 EKAEAAEKKKEEAKKKADAAKKKAEekkkaDEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKK-KADEAKKKAEE 1442
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1551 A--------EAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLR--EEATRRAQQQAEAERAR 1620
Cdd:PTZ00121 1443 AkkadeakkKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKkaAEAKKKADEAKKAEEAK 1522
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1621 AEAERELERWQLKANEALRlrlqAEEVAQQKSLTQAEAEKQKEEAEREARRRgKAEEqavrQRELAEQELEKQRQLAEGT 1700
Cdd:PTZ00121 1523 KADEAKKAEEAKKADEAKK----AEEKKKADELKKAEELKKAEEKKKAEEAK-KAEE----DKNMALRKAEEAKKAEEAR 1593
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1701 AQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEK 1780
Cdd:PTZ00121 1594 IEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEED 1673
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1781 SKQRLEAEagrfRELAEEAARLRALAEEAKRQRQLAEedaVRQRAEAERVLAEKLAAISEATRLKteAEIALKEKEAENE 1860
Cdd:PTZ00121 1674 KKKAEEAK----KAEEDEKKAAEALKKEAEEAKKAEE---LKKKEAEEKKKAEELKKAEEENKIK--AEEAKKEAEEDKK 1744
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 1861 RLRRLAEDEAFQRRLleeqaAQHKADIEARLAQLRKASESELErqKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAE 1939
Cdd:PTZ00121 1745 KAEEAKKDEEEKKKI-----AHLKKEEEKKAEEIRKEKEAVIE--EELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKE 1816
|
|
| CH_SpAIN1-like_rpt2 |
cd21291 |
second calponin homology (CH) domain found in Schizosaccharomyces pombe alpha-actinin-like ... |
158-273 |
1.60e-30 |
|
second calponin homology (CH) domain found in Schizosaccharomyces pombe alpha-actinin-like protein 1 and similar proteins; Schizosaccharomyces pombe alpha-actinin-like protein 1 (SpAIN1) binds to actin and is involved in actin-ring formation and organization. It plays a role in cytokinesis and is involved in septation. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409140 Cd Length: 115 Bit Score: 118.40 E-value: 1.60e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 158 ISDIQvsgqSEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSV 237
Cdd:cd21291 1 IADIN----EEGLTAKEGLLLWCQRKTAGYDEVDVQDFTTSWTDGLAFCALIHRHRPDLIDYDKLDKKDHRGNMQLAFDI 76
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237962 238 AERDLGVTRLLDPEDV-DVPQPDEKSIITYVSSLYDA 273
Cdd:cd21291 77 ASKEIGIPQLLDVEDVcDVAKPDERSIMTYVAYYFHA 113
|
|
| CH_SYNE2_rpt2 |
cd21244 |
second calponin homology (CH) domain found in synaptic nuclear envelope protein 2 (SYNE-2) and ... |
170-268 |
1.66e-30 |
|
second calponin homology (CH) domain found in synaptic nuclear envelope protein 2 (SYNE-2) and similar proteins; SYNE-2, also called nesprin-2, KASH domain-containing protein 2 (KASH2), nuclear envelope spectrin repeat protein 2, nucleus and actin connecting element protein, or protein NUANCE, is a multi-isomeric modular protein which forms a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. SYNE-2 also acts as a component of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. SYNE-2 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409093 Cd Length: 109 Bit Score: 118.01 E-value: 1.66e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 170 MTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 249
Cdd:cd21244 4 MSARKALLLWAQEQCAKVGSISVTDFKSSWRNGLAFLAIIHALRPGLVDMEKLKGRSNRENLEEAFRIAEQELKIPRLLE 83
|
90
....*....|....*....
gi 1920237962 250 PEDVDVPQPDEKSIITYVS 268
Cdd:cd21244 84 PEDVDVVNPDEKSIMTYVA 102
|
|
| CH_UTRN_rpt2 |
cd21234 |
second calponin homology (CH) domain found in utrophin and similar proteins; Utrophin, also ... |
174-275 |
1.80e-30 |
|
second calponin homology (CH) domain found in utrophin and similar proteins; Utrophin, also called dystrophin-related protein 1 (DRP-1), is an autosomal dystrophin homolog that increases dystrophic muscle function and reduces pathology. It is broadly expressed in both the mRNA and protein levels, and occurs in the cerebrovascular endothelium. Utrophin forms the utrophin-glycoprotein complex (UGC) by interacting with dystroglycans (DGs) and sarcoglycan-dystroglycans, as well as sarcoglycan and sarcospan (SG-SSPN) subcomplexes. It may act as a scaffolding protein that stabilizes lipid microdomains and clusters mechanosensitive channel subunits, and link the F-actin cytoskeleton to the cell membrane via the associated glycoprotein complex. Like dystrophin, utrophin has a compact cluster of domains comprising four EF-hand-like motifs and a ZZ-domain, followed by a looser region with two coiled-coils. These domains are believed to be involved in protein-protein interactions. In addition, it contains two syntrophin binding sites (SBSs) and a long N-terminal extension that comprises two actin-binding calponin homology (CH) domains, up to 24 spectrin repeats (SRs), and a WW domain. However, utrophin lacks the intrinsic microtubule binding activity of dystrophin SRs. This model corresponds to the second CH domain.
Pssm-ID: 409083 [Multi-domain] Cd Length: 104 Bit Score: 117.75 E-value: 1.80e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 174 EKLLL-WSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 252
Cdd:cd21234 2 EKILLsWVRQSTRPYSQVNVLNFTTSWTDGLAFNAVLHRHKPDLFSWDKVVKMSPVERLEHAFSKAKNHLGIEKLLDPED 81
|
90 100
....*....|....*....|...
gi 1920237962 253 VDVPQPDEKSIITYVSSLYDAMP 275
Cdd:cd21234 82 VAVQLPDKKSIIMYLTSLFEVLP 104
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1335-1967 |
1.86e-30 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 134.11 E-value: 1.86e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1335 ETLRRMEEEERLAEQQRAEERERLAEVEAALE--KQRQLAEAHAQAKAQAEREAQGLQR----------RMQEEVARREE 1402
Cdd:PTZ00121 1161 EDARKAEEARKAEDAKKAEAARKAEEVRKAEElrKAEDARKAEAARKAEEERKAEEARKaedakkaeavKKAEEAKKDAE 1240
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1403 VAVEAQEQKRSIQ-EELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRlQLEATERQRGGAEGELQALRAR-AEE 1480
Cdd:PTZ00121 1241 EAKKAEEERNNEEiRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKAD-EAKKAEEKKKADEAKKKAEEAKkADE 1319
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1481 AEAQKRQAQEEAERLRRQVQdETQRKRQAEAELALRVQAEAEAAREKQRALQ-ALEELRLQAEEAERRlrqaeAERARQV 1559
Cdd:PTZ00121 1320 AKKKAEEAKKKADAAKKKAE-EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEkKKEEAKKKADAAKKK-----AEEKKKA 1393
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1560 QVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARaeaerelerwqlKANEALR 1639
Cdd:PTZ00121 1394 DEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAE------------EAKKAEE 1461
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1640 LRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELiRLRAETE 1719
Cdd:PTZ00121 1462 AKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEA-KKADEAK 1540
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1720 QGEQQR---QLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEaEAGRFRELA 1796
Cdd:PTZ00121 1541 KAEEKKkadELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAE-EAKKAEEAK 1619
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1797 EEAARLRALAEEAKRQRQLAEEDA--------VRQRAEAERVLAEKLAAISEATrlKTEAEIALKEKEAENERLRRLAED 1868
Cdd:PTZ00121 1620 IKAEELKKAEEEKKKVEQLKKKEAeekkkaeeLKKAEEENKIKAAEEAKKAEED--KKKAEEAKKAEEDEKKAAEALKKE 1697
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1869 EAFQRRLleEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVeEEILALKGSFEKAAAGKAELELELGRIR 1948
Cdd:PTZ00121 1698 AEEAKKA--EELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKA-EEAKKDEEEKKKIAHLKKEEEKKAEEIR 1774
|
650
....*....|....*....
gi 1920237962 1949 GTAEDTLRSKEQAEQEAAR 1967
Cdd:PTZ00121 1775 KEKEAVIEEELDEEDEKRR 1793
|
|
| CH_SPTBN1_rpt1 |
cd21316 |
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 1 (SPTBN1) ... |
38-154 |
2.28e-30 |
|
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 1 (SPTBN1) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN1, also called beta-II spectrin, fodrin beta chain, or spectrin, non-erythroid beta chain 1, is also a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. SPTBN1 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409165 Cd Length: 154 Bit Score: 119.38 E-value: 2.28e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 38 DERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRErdvirssrlprEKGRMRFHKLQNVQI 117
Cdd:cd21316 48 DEREAVQKKTFTKWVNSHLARVSCRITDLYMDLRDGRMLIKLLEVLSGERLPKP-----------TKGRMRIHCLENVDK 116
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237962 118 ALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIIL 154
Cdd:cd21316 117 ALQFLKEQRVHLENMGSHDIVDGNHRLTLGLIWTIIL 153
|
|
| Spectrin_like |
pfam18373 |
Spectrin like domain; Desmoplakin (DP) is an integral part of desmosomes, where it links ... |
890-967 |
4.19e-30 |
|
Spectrin like domain; Desmoplakin (DP) is an integral part of desmosomes, where it links desmosomal cadherins to the intermediate filaments. The N-terminal region of DP contains a plakin domain common to members of the plakin family. Plakin domains contain multiple copies of spectrin repeats (SRs) pfam00435. Spectrin repeats (SRs) consist of three alpha-helices (A, B, and C) that form an antiparallel triple-helical bundle. This entry describes SR6 which has a divergent structure relative to the other SRs. SR6 shows significant deviations in helices A and B where they are significantly shorter than in other repeats. Structural comparison revealed that SR6 is more similar to other three-helix-bundle proteins, including target of Myb1 and the syntaxin Habc domain, than to other SR proteins. Due to these differences with other spectrin repeats, this region is termed spectrin-like repeat.
Pssm-ID: 465730 Cd Length: 78 Bit Score: 115.78 E-value: 4.19e-30
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237962 890 LAWQSLGRDMQLIRSWSLATFRTLKPEEQRQALRSLELHYQAFLRDSQDAGGFGPEDRLQAEREYGSCSRHYQQLLQS 967
Cdd:pfam18373 1 VSWQYLLKDIQRINSWTISMLKTMRPEEYRQVLKNLETHYQDFLRDSQESEMFGAEDRRQLEREVNSAQQHYQTLLVS 78
|
|
| CH_UTRN_rpt1 |
cd21232 |
first calponin homology (CH) domain found in utrophin and similar proteins; Utrophin, also ... |
43-158 |
6.06e-30 |
|
first calponin homology (CH) domain found in utrophin and similar proteins; Utrophin, also called dystrophin-related protein 1 (DRP-1), is an autosomal dystrophin homolog that increases dystrophic muscle function and reduces pathology. It is broadly expressed in both the mRNA and protein levels, and occurs in the cerebrovascular endothelium. Utrophin forms the utrophin-glycoprotein complex (UGC) by interacting with dystroglycans (DGs) and sarcoglycan-dystroglycans, as well as sarcoglycan and sarcospan (SG-SSPN) subcomplexes. It may act as a scaffolding protein that stabilizes lipid microdomains and clusters mechanosensitive channel subunits, and link the F-actin cytoskeleton to the cell membrane via the associated glycoprotein complex. Like dystrophin, utrophin has a compact cluster of domains comprising four EF-hand-like motifs and a ZZ-domain, followed by a looser region with two coiled-coils. These domains are believed to be involved in protein-protein interactions. In addition, it contains two syntrophin binding sites (SBSs) and a long N-terminal extension that comprises two actin-binding calponin homology (CH) domains, up to 24 spectrin repeats (SRs), and a WW domain. However, utrophin lacks the intrinsic microtubule binding activity of dystrophin SRs. This model corresponds to the first CH domain.
Pssm-ID: 409081 Cd Length: 107 Bit Score: 116.26 E-value: 6.06e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 43 VQKKTFTKWVNKHLIKAQR-HISDLYEDLRDGHNLISLLEVLSGDSLPRERdvirssrlprekGRMRFHKLQNVQIALDY 121
Cdd:cd21232 2 VQKKTFTKWINARFSKSGKpPIKDMFTDLRDGRKLLDLLEGLTGKSLPKER------------GSTRVHALNNVNRVLQV 69
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237962 122 LRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQI 158
Cdd:cd21232 70 LHQNNVELVNIGGTDIVDGNHKLTLGLLWSIILHWQV 106
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1744-2455 |
1.32e-29 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 130.44 E-value: 1.32e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1744 QKRRELEAELAKVRAEMEvllaskaRAEEEsrsTSEKSKQ--RLEAEAgrfrELAEEAARLRALAEEAKRQRQLAEedav 1821
Cdd:COG1196 172 ERKEEAERKLEATEENLE-------RLEDI---LGELERQlePLERQA----EKAERYRELKEELKELEAELLLLK---- 233
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1822 RQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAqhkadiEARLAQLRKASESE 1901
Cdd:COG1196 234 LRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYEL------LAELARLEQDIARL 307
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1902 LERQkglvEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRRE 1981
Cdd:COG1196 308 EERR----RELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEE 383
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1982 AEERVQKSLAAEEEAARQRKAALEEVERLKAkvEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQEL 2061
Cdd:COG1196 384 LAEELLEALRAAAELAAQLEELEEAEEALLE--RLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEAL 461
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2062 QQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQE 2141
Cdd:COG1196 462 LELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALE 541
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2142 AARRAQAEQAALRQKQAAdaemekhkqfAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRgq 2221
Cdd:COG1196 542 AALAAALQNIVVEDDEVA----------AAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLR-- 609
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2222 vEEELFSLRVQMEELGklkarieaenRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQR 2301
Cdd:COG1196 610 -EADARYYVLGDTLLG----------RTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAE 678
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2302 ALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERL 2381
Cdd:COG1196 679 AELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPE 758
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2382 RLRVAEMSRAQARAEEDARRFRK-------QAEDIGERLyrTELATQekvmlVQTLETQRqqsdrdaERLREAIAELEHE 2454
Cdd:COG1196 759 PPDLEELERELERLEREIEALGPvnllaieEYEELEERY--DFLSEQ-----REDLEEAR-------ETLEEAIEEIDRE 824
|
.
gi 1920237962 2455 K 2455
Cdd:COG1196 825 T 825
|
|
| CH_DMD_rpt2 |
cd21233 |
second calponin homology (CH) domain found in dystrophin and similar proteins; Dystrophin, ... |
174-276 |
3.05e-29 |
|
second calponin homology (CH) domain found in dystrophin and similar proteins; Dystrophin, encoded by the DMD gene, is a large, submembrane cytoskeletal protein that is the main component of the dystrophin-glycoprotein complex (DGC) in skeletal muscles. It links the transmembrane DGC to the actin cytoskeleton through binding strongly to the cytoplasmic tail of beta-dystroglycan, the transmembrane subunit of a highly O-glycosylated cell-surface protein. It is involved in maintaining the structural integrity of cells, as well as in the formation of the blood-brain barrier (BBB). Mutations in dystrophin lead to Duchenne muscular dystrophy (DMD). Moreover, dystrophin deficiency is associated with abnormal cerebral diffusion and perfusion, as well as in acute Trypanosoma cruzi infection. The dystrophin subfamily has been characterized by a compact cluster of domains comprising four EF-hand-like motifs and a ZZ-domain, followed by a looser region with two coiled-coils. These domains are believed to be involved in protein-protein interactions. In addition, dystrophin contains two syntrophin binding sites (SBSs) and a long N-terminal extension that comprises two actin-binding calponin homology (CH) domains, approximately 24 spectrin repeats (SRs) and a WW domain. The model corresponds to the second CH domain.
Pssm-ID: 409082 Cd Length: 111 Bit Score: 114.64 E-value: 3.05e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 174 EKLLL-WSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTN-LENLDQAFSVAERDLGVTRLLDPE 251
Cdd:cd21233 2 EKILLsWVRQSTRNYPQVNVINFTSSWSDGLAFNALIHSHRPDLFDWNSVVSQQSaTERLDHAFNIARQHLGIEKLLDPE 81
|
90 100
....*....|....*....|....*
gi 1920237962 252 DVDVPQPDEKSIITYVSSLYDAMPR 276
Cdd:cd21233 82 DVATAHPDKKSILMYVTSLFQVLPQ 106
|
|
| CH_MICALL2 |
cd21253 |
calponin homology (CH) domain found in MICAL-like protein 2 and similar proteins; MICAL-like ... |
176-271 |
3.45e-29 |
|
calponin homology (CH) domain found in MICAL-like protein 2 and similar proteins; MICAL-like protein 2 (MICAL-L2), also called junctional Rab13-binding protein (JRAB), or molecule interacting with CasL-like 2, acts as an effector of small Rab GTPases which is involved in junctional complexes assembly through the regulation of cell adhesion molecule transport to the plasma membrane, and actin cytoskeleton reorganization. It regulates the endocytic recycling of occludins, claudins, and E-cadherin to the plasma membrane and may thereby regulate the establishment of tight junctions and adherens junctions. Members of this subfamily contain a single copy of CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409102 Cd Length: 106 Bit Score: 113.98 E-value: 3.45e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 176 LLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED-VD 254
Cdd:cd21253 6 LQQWCRQQTEGYRDVKVTNMTTSWRDGLAFCAIIHRFRPDLIDFDSLSKENVYENNKLAFTVAEKELGIPALLDAEDmVA 85
|
90
....*....|....*..
gi 1920237962 255 VPQPDEKSIITYVSSLY 271
Cdd:cd21253 86 LKVPDKLSILTYVSQYY 102
|
|
| CH_CLMN_rpt1 |
cd21191 |
first calponin homology (CH) domain found in calmin and similar proteins; Calmin, also called ... |
39-160 |
3.72e-29 |
|
first calponin homology (CH) domain found in calmin and similar proteins; Calmin, also called calponin-like transmembrane domain protein, is a protein with calponin homology (CH) and transmembrane domains expressed in maturing spermatogenic cells. It may be involved in the development and/or maintenance of neuronal functions. Calmin contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409040 Cd Length: 114 Bit Score: 114.21 E-value: 3.72e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 39 ERDRVQKKTFTKWVNKHLIKAQR--HISDLYEDLRDGHNLISLLEVLSGDSLPRErdvIRSSRlprekgrMRFHKLQNVQ 116
Cdd:cd21191 1 ERENVQKRTFTRWINLHLEKCNPplEVKDLFVDIQDGKILMALLEVLSGQNLLQE---YKPSS-------HRIFRLNNIA 70
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 1920237962 117 IALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISD 160
Cdd:cd21191 71 KALKFLEDSNVKLVSIDAAEIADGNPSLVLGLIWNIILFFQIKE 114
|
|
| CH_FLN-like_rpt1 |
cd21183 |
first calponin homology (CH) domain found in the filamin family; The filamin family includes ... |
42-156 |
7.20e-29 |
|
first calponin homology (CH) domain found in the filamin family; The filamin family includes filamin-A (FLN-A), filamin-B (FLN-B) and filamin-C (FLN-C). Filamins function to anchor various transmembrane proteins to the actin cytoskeleton. FLN-A is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-B is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton and may also promote orthogonal branching of actin filaments as well as link actin filaments to membrane glycoproteins. FLN-C, also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. This family also includes Drosophila melanogaster protein jitterbug (Jbug), which is an actin-meshwork organizing protein containing three copies of the CH domain. Other members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409032 Cd Length: 108 Bit Score: 113.34 E-value: 7.20e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 42 RVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRERDvirssrlprEKGRMRFHKLQNVQIALDY 121
Cdd:cd21183 3 RIQANTFTRWCNEHLKERGMQIHDLATDFSDGLCLIALLENLSTRPLKRSYN---------RRPAFQQHYLENVSTALKF 73
|
90 100 110
....*....|....*....|....*....|....*
gi 1920237962 122 LRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHF 156
Cdd:cd21183 74 IEADHIKLVNIGSGDIVNGNIKLILGLIWTLILHY 108
|
|
| CH_ACTN4_rpt2 |
cd21290 |
second calponin homology (CH) domain found in alpha-actinin-4; Alpha-actinin-4 (ACTN4), also ... |
156-273 |
3.57e-28 |
|
second calponin homology (CH) domain found in alpha-actinin-4; Alpha-actinin-4 (ACTN4), also called non-muscle alpha-actinin 4, is an F-actin cross-linking protein which is thought to anchor actin to a variety of intracellular structures. It is associated with cell motility and cancer invasion. ACTN4 is probably involved in vesicular trafficking via its association with the CART complex, which is necessary for efficient transferrin receptor recycling but not for epidermal growth factor receptor (EGFR) degradation. It contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409139 Cd Length: 125 Bit Score: 112.10 E-value: 3.57e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 156 FQISDIQVsgqsEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAF 235
Cdd:cd21290 2 FAIQDISV----EETSAKEGLLLWCQRKTAPYKNVNVQNFHISWKDGLAFNALIHRHRPELIEYDKLRKDDPVTNLNNAF 77
|
90 100 110
....*....|....*....|....*....|....*....
gi 1920237962 236 SVAERDLGVTRLLDPED-VDVPQPDEKSIITYVSSLYDA 273
Cdd:cd21290 78 EVAEKYLDIPKMLDAEDiVNTARPDEKAIMTYVSSFYHA 116
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1190-1808 |
3.76e-28 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 125.43 E-value: 3.76e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1190 QTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVpLANSQAVREQLRQEKALLED-IERHGEKveecqrfa 1268
Cdd:COG1196 219 KEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAE-LAELEAELEELRLELEELELeLEEAQAE-------- 289
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1269 kqyinaikdyELQLVTYKAQLEPVASPAKkpkvqsgsESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEEERLAE 1348
Cdd:COG1196 290 ----------EYELLAELARLEQDIARLE--------ERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEE 351
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1349 QQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAqglqrrmqeEVARREEVAVEAQEQKRSIQEELQHLRQSSEAE 1428
Cdd:COG1196 352 ELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELL---------EALRAAAELAAQLEELEEAEEALLERLERLEEE 422
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1429 IQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRqvQDETQRKRQ 1508
Cdd:COG1196 423 LEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAA--RLLLLLEAE 500
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1509 AEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAE----K 1584
Cdd:COG1196 501 ADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATflplD 580
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1585 TAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEE 1664
Cdd:COG1196 581 KIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGG 660
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1665 AEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQ 1744
Cdd:COG1196 661 SLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEE 740
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 1745 KRRELEAELAKVRAEMEvllaskaraEEESRSTSEKSKQRLEAEAGRF--------RELAEEAARLRALAEE 1808
Cdd:COG1196 741 LLEEEELLEEEALEELP---------EPPDLEELERELERLEREIEALgpvnllaiEEYEELEERYDFLSEQ 803
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1504-2219 |
4.09e-28 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 125.43 E-value: 4.09e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1504 QRKRQAEAELAlrvQAEAEAAR--------EKQralqaLEELRLQAEEAERRLRQAEAERARQVQVALetaqrsaeAELQ 1575
Cdd:COG1196 172 ERKEEAERKLE---ATEENLERledilgelERQ-----LEPLERQAEKAERYRELKEELKELEAELLL--------LKLR 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1576 SEHASFAEKTAQLERTLKEEHVAVVQLREEATRRaqqqaeaeraraeaerelerwqlkanEALRLRLQAEEvaqqksltq 1655
Cdd:COG1196 236 ELEAELEELEAELEELEAELEELEAELAELEAEL--------------------------EELRLELEELE--------- 280
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1656 aeaekqkeeaerearrrgKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARL 1735
Cdd:COG1196 281 ------------------LELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEEL 342
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1736 QREAAAATQKRRELEAELAKVRAEmevLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQL 1815
Cdd:COG1196 343 EEELEEAEEELEEAEAELAEAEEA---LLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERL 419
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1816 AEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLR 1895
Cdd:COG1196 420 EEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEA 499
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1896 KASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEE 1975
Cdd:COG1196 500 EADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPL 579
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1976 ERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQ 2055
Cdd:COG1196 580 DKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAG 659
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2056 QKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLR 2135
Cdd:COG1196 660 GSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLE 739
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2136 KEAEQEAARRAQAEQAALRQKQAADAEMEKHKqfAEQALRQKAQV----EQELTALRLQLEETDHQKSILDEELQRLK-- 2209
Cdd:COG1196 740 ELLEEEELLEEEALEELPEPPDLEELERELER--LEREIEALGPVnllaIEEYEELEERYDFLSEQREDLEEARETLEea 817
|
730
....*....|.
gi 1920237962 2210 -AEVTEAARQR 2219
Cdd:COG1196 818 iEEIDRETRER 828
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1718-2608 |
1.01e-26 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 121.79 E-value: 1.01e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1718 TEQGEQQRQLLEEELARLQREAAAATqkRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFREL-- 1795
Cdd:PTZ00121 1033 TEYGNNDDVLKEKDIIDEDIDGNHEG--KAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETgk 1110
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1796 AEEAARlralAEEAKRQrqlAEEdaVRQRAEAERvlAEKLAAISEATRLKTE--AEIALKEKEAENERLRRLAEDeafQR 1873
Cdd:PTZ00121 1111 AEEARK----AEEAKKK---AED--ARKAEEARK--AEDARKAEEARKAEDAkrVEIARKAEDARKAEEARKAED---AK 1176
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1874 RLLEEQAAqhkadIEARLA-QLRKASESelerqkglvedtlrqrRQVEEeilalkgsfekaaAGKAELELELGRIRgTAE 1952
Cdd:PTZ00121 1177 KAEAARKA-----EEVRKAeELRKAEDA----------------RKAEA-------------ARKAEEERKAEEAR-KAE 1221
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1953 DTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERlkaKVEEARRLRERAEQESARQL 2032
Cdd:PTZ00121 1222 DAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEAR---KADELKKAEEKKKADEAKKA 1298
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2033 QLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERlrseaeaarraaeeaeaareraereaaqsRRQVEEAER 2112
Cdd:PTZ00121 1299 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEE-----------------------------AKKAAEAAK 1349
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2113 LKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQaLRQKAQVEQELTALRLQLE 2192
Cdd:PTZ00121 1350 AEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE-LKKAAAAKKKADEAKKKAE 1428
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2193 ETDHQksildEELQRLKAEVTEAARQRGQVEEelfslRVQMEELGKlkaRIEAENRALVLRDKDSAQRLLQE---EAEKM 2269
Cdd:PTZ00121 1429 EKKKA-----DEAKKKAEEAKKADEAKKKAEE-----AKKAEEAKK---KAEEAKKADEAKKKAEEAKKADEakkKAEEA 1495
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2270 KQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLK----EKMQAVQEATRLKAEAELLQQQKELAQEQARRLQED 2345
Cdd:PTZ00121 1496 KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKadeaKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEED 1575
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2346 KeQMAQQLAQETQgfqktlETERQRQLEMSAEAERLRLRVAEmsraQARAEEDARrfrKQAEDIGErlyrtelaTQEKVM 2425
Cdd:PTZ00121 1576 K-NMALRKAEEAK------KAEEARIEEVMKLYEEEKKMKAE----EAKKAEEAK---IKAEELKK--------AEEEKK 1633
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2426 LVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQalqqsflsekdslLQRErci 2505
Cdd:PTZ00121 1634 KVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA-------------LKKE--- 1697
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2506 EQEKAKLEQL---FQDEVAKAQALREEQQRQQQqmqqekqqlaaSMEEARRRQHE----AEEgVRRQQEELQRLAQQQQQ 2578
Cdd:PTZ00121 1698 AEEAKKAEELkkkEAEEKKKAEELKKAEEENKI-----------KAEEAKKEAEEdkkkAEE-AKKDEEEKKKIAHLKKE 1765
|
890 900 910
....*....|....*....|....*....|....
gi 1920237962 2579 QEKLLAEENQR----LRERLQHLEEERRAALARS 2608
Cdd:PTZ00121 1766 EEKKAEEIRKEkeavIEEELDEEDEKRRMEVDKK 1799
|
|
| CH_MICAL_EHBP-like |
cd22198 |
calponin homology (CH) domain found in the MICAL and EHBP families; This group is composed of ... |
174-273 |
1.10e-26 |
|
calponin homology (CH) domain found in the MICAL and EHBP families; This group is composed of the molecule interacting with CasL protein (MICAL) and EH domain-binding protein (EHBP) families. MICAL is a large, multidomain, cytosolic protein with a single LIM domain, a calponin homology (CH) domain and a flavoprotein monooxygenase (MO) domain. In Drosophila, MICAL is expressed in axons, interacts with the neuronal A (PlexA) receptor and is required for Semaphorin 1a (Sema-1a)-PlexA-mediated repulsive axon guidance. The LIM and CH domains mediate interactions with the cytoskeleton, cytoskeletal adaptor proteins, and other signaling proteins. The flavoprotein MO is required for semaphorin-plexin repulsive axon guidance during axonal pathfinding in the Drosophila neuromuscular system. The EHBP family includes EHBP1 and EHBP1-like protein (EHBP1L1). EHBP1 is a regulator of endocytic recycling and may play a role in actin reorganization by linking clathrin-mediated endocytosis to the actin cytoskeleton. It may act as an effector of small GTPases, including RAB-10 (Rab10), and play a role in vesicle trafficking. EHBP proteins contain a single CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409188 Cd Length: 105 Bit Score: 106.99 E-value: 1.10e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 174 EKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED- 252
Cdd:cd22198 3 EELLSWCQEQTEGYRGVKVTDLTSSWRSGLALCAIIHRFRPDLIDFSSLDPENIAENNQLAFDVAEQELGIPPVMTGQEm 82
|
90 100
....*....|....*....|.
gi 1920237962 253 VDVPQPDEKSIITYVSSLYDA 273
Cdd:cd22198 83 ASLAVPDKLSMVSYLSQFYEA 103
|
|
| CH_ACTN1_rpt2 |
cd21287 |
second calponin homology (CH) domain found in alpha-actinin-1; Alpha-actinin-1 (ACTN1), also ... |
158-273 |
1.66e-26 |
|
second calponin homology (CH) domain found in alpha-actinin-1; Alpha-actinin-1 (ACTN1), also called alpha-actinin cytoskeletal isoform, or non-muscle alpha-actinin-1, is an F-actin cross-linking protein which is thought to anchor actin to a variety of intracellular structures. ACTN1 is a bundling protein. Its mutations cause congenital macrothrombocytopenia. It contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409136 Cd Length: 124 Bit Score: 107.09 E-value: 1.66e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 158 ISDIQVsgqsEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSV 237
Cdd:cd21287 1 IQDISV----EETSAKEGLLLWCQRKTAPYKNVNIQNFHISWKDGLGFCALIHRHRPELIDYGKLRKDDPLTNLNTAFDV 76
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237962 238 AERDLGVTRLLDPED-VDVPQPDEKSIITYVSSLYDA 273
Cdd:cd21287 77 AEKYLDIPKMLDAEDiVGTARPDEKAIMTYVSSFYHA 113
|
|
| CH_FLN_rpt1 |
cd21228 |
first calponin homology (CH) domain found in filamins; The filamin family includes filamin-A ... |
42-156 |
4.07e-26 |
|
first calponin homology (CH) domain found in filamins; The filamin family includes filamin-A (FLN-A), filamin-B (FLN-B) and filamin-C (FLN-C). Filamins function to anchor various transmembrane proteins to the actin cytoskeleton. FLN-A is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-B is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton and may also promote orthogonal branching of actin filaments as well as link actin filaments to membrane glycoproteins. FLN-C, also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. Members of this family contain two copies of the CH domain. The model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409077 Cd Length: 108 Bit Score: 105.65 E-value: 4.07e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 42 RVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRERDvirssrlprEKGRMRFHKLQNVQIALDY 121
Cdd:cd21228 3 KIQQNTFTRWCNEHLKCVNKRIYNLETDLSDGLRLIALLEVLSQKRMYKKYN---------KRPTFRQMKLENVSVALEF 73
|
90 100 110
....*....|....*....|....*....|....*
gi 1920237962 122 LRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHF 156
Cdd:cd21228 74 LERESIKLVSIDSSAIVDGNLKLILGLIWTLILHY 108
|
|
| CH_ACTN3_rpt2 |
cd21289 |
second calponin homology (CH) domain found in alpha-actinin-3; Alpha-actinin-3 (ACTN3), also ... |
158-273 |
8.09e-26 |
|
second calponin homology (CH) domain found in alpha-actinin-3; Alpha-actinin-3 (ACTN3), also called alpha-actinin skeletal muscle isoform 3, is an F-actin cross-linking protein which is thought to anchor actin to a variety of intracellular structures. ACTN3 is a bundling protein. It is critical in anchoring the myofibrillar actin filaments and plays a key role in muscle contraction. It contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409138 Cd Length: 124 Bit Score: 105.19 E-value: 8.09e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 158 ISDIQVsgqsEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSV 237
Cdd:cd21289 1 IQDISV----EETSAKEGLLLWCQRKTAPYRNVNVQNFHTSWKDGLALCALIHRHRPDLIDYAKLRKDDPIGNLNTAFEV 76
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237962 238 AERDLGVTRLLDPED-VDVPQPDEKSIITYVSSLYDA 273
Cdd:cd21289 77 AEKYLDIPKMLDAEDiVNTPKPDEKAIMTYVSCFYHA 113
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1884-2612 |
1.13e-25 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 118.32 E-value: 1.13e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1884 KADIEARLAQLRKASESELERQKGLVEDTlRQRRQVEEEilalKGSFE---KAAAGKAELELELGRIRGTAEDTLRSKE- 1959
Cdd:PTZ00121 1059 KAEAKAHVGQDEGLKPSYKDFDFDAKEDN-RADEATEEA----FGKAEeakKTETGKAEEARKAEEAKKKAEDARKAEEa 1133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1960 -QAE-----QEAARQRQLAAEEERRRREAEERVQKSLAAEE----EAARQ----RKAA----LEEVERLKA--KVEEARR 2019
Cdd:PTZ00121 1134 rKAEdarkaEEARKAEDAKRVEIARKAEDARKAEEARKAEDakkaEAARKaeevRKAEelrkAEDARKAEAarKAEEERK 1213
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2020 LRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAERE 2099
Cdd:PTZ00121 1214 AEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKAD 1293
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2100 AAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADA--EMEKHKQFAEQALRQK 2177
Cdd:PTZ00121 1294 EAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAadEAEAAEEKAEAAEKKK 1373
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2178 AQVEQELTALRLQLEETDHQksildEELQRLKAEVTEAARQRGQVEEElfslrvqmeelgKLKARiEAENRALVLRDKDS 2257
Cdd:PTZ00121 1374 EEAKKKADAAKKKAEEKKKA-----DEAKKKAEEDKKKADELKKAAAA------------KKKAD-EAKKKAEEKKKADE 1435
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2258 AQRLLQE--EAEKMKQVAEEAARLSVAAQEAARLRQlAEEdlAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELA 2335
Cdd:PTZ00121 1436 AKKKAEEakKADEAKKKAEEAKKAEEAKKKAEEAKK-ADE--AKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKA 1512
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2336 QEqARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRlRVAEMSRAQA--RAEEDARRFRKQAEDigerL 2413
Cdd:PTZ00121 1513 DE-AKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELK-KAEEKKKAEEakKAEEDKNMALRKAEE----A 1586
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2414 YRTELATQEKVM-LVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFL 2492
Cdd:PTZ00121 1587 KKAEEARIEEVMkLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEE 1666
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2493 SEKDsllqrerciEQEKAKLEQLFQDEVAKAQAlrEEQQRQQQQMQQEKQQLAASMEEARRRQHE---AEEGVRRQQEEL 2569
Cdd:PTZ00121 1667 AKKA---------EEDKKKAEEAKKAEEDEKKA--AEALKKEAEEAKKAEELKKKEAEEKKKAEElkkAEEENKIKAEEA 1735
|
730 740 750 760
....*....|....*....|....*....|....*....|....*.
gi 1920237962 2570 QRLAQQ-QQQQEKLLAEENQrlRERLQHL--EEERRAALARSEEIA 2612
Cdd:PTZ00121 1736 KKEAEEdKKKAEEAKKDEEE--KKKIAHLkkEEEKKAEEIRKEKEA 1779
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1410-2212 |
1.61e-25 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 117.55 E-value: 1.61e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1410 QKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEaEAQKRQAQ 1489
Cdd:PTZ00121 1043 KEKDIIDEDIDGNHEGKAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEE-ARKAEEAK 1121
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1490 EEAERLR-----RQVQD--ETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAE----AERARQ 1558
Cdd:PTZ00121 1122 KKAEDARkaeeaRKAEDarKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEelrkAEDARK 1201
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1559 VQVAletaqRSAEAELQSEHASFAEKTAQLERTLKEEHVAvvQLREEATRRAQQQAEAERARAEAERELERWQLKANEAL 1638
Cdd:PTZ00121 1202 AEAA-----RKAEEERKAEEARKAEDAKKAEAVKKAEEAK--KDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKA 1274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1639 RLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQavRQRELAEQELEKQRQLAEGTAQQrlaAEQEliRLRAET 1718
Cdd:PTZ00121 1275 EEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEA--KKADEAKKKAEEAKKKADAAKKK---AEEA--KKAAEA 1347
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1719 EQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEmEVLLA--SKARAEEESRSTSEKSKQrlEAEAGRFRELA 1796
Cdd:PTZ00121 1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAE-EKKKAdeAKKKAEEDKKKADELKKA--AAAKKKADEAK 1424
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1797 EEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVlAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLL 1876
Cdd:PTZ00121 1425 KKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK-AEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAK 1503
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1877 EEQAAQHKADiEARLAQLRKASEsELERQKglvedtlrQRRQVEEeilaLKGSFEKAAAGKAELELELGRirgtAEDTlR 1956
Cdd:PTZ00121 1504 KAAEAKKKAD-EAKKAEEAKKAD-EAKKAE--------EAKKADE----AKKAEEKKKADELKKAEELKK----AEEK-K 1564
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1957 SKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRlrerAEQESARQLQLaq 2036
Cdd:PTZ00121 1565 KAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKK----AEEEKKKVEQL-- 1638
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2037 eaaqKRLQAEEKAHAFAVQQKEQELQQTLqqeqsvlERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQs 2116
Cdd:PTZ00121 1639 ----KKKEAEEKKKAEELKKAEEENKIKA-------AEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEE- 1706
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2117 aeeQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADaemEKHKQFAEQALRQKAQVEQELTALRLQLEETDH 2196
Cdd:PTZ00121 1707 ---LKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAE---EAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAV 1780
|
810
....*....|....*.
gi 1920237962 2197 QKSILDEELQRLKAEV 2212
Cdd:PTZ00121 1781 IEEELDEEDEKRRMEV 1796
|
|
| CH_FLNC_rpt1 |
cd21310 |
first calponin homology (CH) domain found in filamin-C (FLN-C) and similar proteins; Filamin-C ... |
42-159 |
2.59e-25 |
|
first calponin homology (CH) domain found in filamin-C (FLN-C) and similar proteins; Filamin-C (FLN-C), also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. FLN-C contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409159 Cd Length: 125 Bit Score: 103.96 E-value: 2.59e-25
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 42 RVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRERDvirssrlPREKgrMRFHKLQNVQIALDY 121
Cdd:cd21310 15 KIQQNTFTRWCNEHLKCVQKRLNDLQKDLSDGLRLIALLEVLSQKKMYRKYH-------PRPN--FRQMKLENVSVALEF 85
|
90 100 110
....*....|....*....|....*....|....*...
gi 1920237962 122 LRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQIS 159
Cdd:cd21310 86 LDREHIKLVSIDSKAIVDGNLKLILGLIWTLILHYSIS 123
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1347-2558 |
1.64e-24 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 114.15 E-value: 1.64e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1347 AEQQRAEERERLAEVEAalEKQRQLAE-AHAQAKAQAEREAQGLQRRMQ--EEVARREEvAVEAQEQKRSIQEELQHLRQ 1423
Cdd:NF041483 85 ADQLRADAERELRDARA--QTQRILQEhAEHQARLQAELHTEAVQRRQQldQELAERRQ-TVESHVNENVAWAEQLRART 161
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1424 SSEA-----EIQAKARQVEAAERSRlrieeeirVVRLQLEAteRQRGGAEGElqalRARAEeAEAQKRQAQEEAERLRRQ 1498
Cdd:NF041483 162 ESQArrlldESRAEAEQALAAARAE--------AERLAEEA--RQRLGSEAE----SARAE-AEAILRRARKDAERLLNA 226
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1499 VQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQaeEAERRLRQAEAERARQVQVALETA-QRSAEAELQSE 1577
Cdd:NF041483 227 ASTQAQEATDHAEQLRSSTAAESDQARRQAAELSRAAEQRMQ--EAEEALREARAEAEKVVAEAKEAAaKQLASAESANE 304
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1578 hasfaektaQLERTLKEEhvaVVQLREEATRRAQQQAEAERARAEAERELERWQL-KANEALRLRLQAEEVAQQKSLTQA 1656
Cdd:NF041483 305 ---------QRTRTAKEE---IARLVGEATKEAEALKAEAEQALADARAEAEKLVaEAAEKARTVAAEDTAAQLAKAART 372
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1657 EAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQ-RLAAEQELIRLRAETEQgeqqrqlLEEELARL 1735
Cdd:NF041483 373 AEEVLTKASEDAKATTRAAAEEAERIRREAEAEADRLRGEAADQAEQlKGAAKDDTKEYRAKTVE-------LQEEARRL 445
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1736 QREAAaatQKRRELEAELAKVRAEmevllaskARAEeesrstsekSKQRLEAEAGRFRELAEEAarlRALAEEAKRQrql 1815
Cdd:NF041483 446 RGEAE---QLRAEAVAEGERIRGE--------ARRE---------AVQQIEEAARTAEELLTKA---KADADELRST--- 499
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1816 aeedavrQRAEAERVLAEklaAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAA-QHKADIEARLAQL 1894
Cdd:NF041483 500 -------ATAESERVRTE---AIERATTLRRQAEETLERTRAEAERLRAEAEEQAEEVRAAAERAArELREETERAIAAR 569
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1895 RKASESELERQKGLVEdtlrQRRQVEEEilALKGSFEKAAAGKAELELELGRIRGTAEDTLRS-KEQAEQEAARQRqlaa 1973
Cdd:NF041483 570 QAEAAEELTRLHTEAE----ERLTAAEE--ALADARAEAERIRREAAEETERLRTEAAERIRTlQAQAEQEAERLR---- 639
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1974 eeerrRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEE-ARRLR-------ERAEQESARQLQLAQ-EAAQKRLQ 2044
Cdd:NF041483 640 -----TEAAADASAARAEGENVAVRLRSEAAAEAERLKSEAQEsADRVRaeaaaaaERVGTEAAEALAAAQeEAARRRRE 714
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2045 AEE---KAHAFAVQQKEQELQQTLQQEQSVLERLrseaeaarraaeeaeaareraEREAAQSRRQVEEAERlkqsaeeqa 2121
Cdd:NF041483 715 AEEtlgSARAEADQERERAREQSEELLASARKRV---------------------EEAQAEAQRLVEEADR--------- 764
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2122 qaqaqaqaaaeklrkeaeqeaarraqaeqaalRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEET-DHQKSI 2200
Cdd:NF041483 765 --------------------------------RATELVSAAEQTAQQVRDSVAGLQEQAEEEIAGLRSAAEHAaERTRTE 812
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2201 LDEELQRLKAEvteAARQRGQVEEELFSLRVQ-MEELGKLKARIEAEnralVLRDKDSAQRLLQEEAEKMKQVAEEAA-R 2278
Cdd:NF041483 813 AQEEADRVRSD---AYAERERASEDANRLRREaQEETEAAKALAERT----VSEAIAEAERLRSDASEYAQRVRTEASdT 885
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2279 LSVAAQEAARLRQLAEEDLAQQRALA---EKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAq 2355
Cdd:NF041483 886 LASAEQDAARTRADAREDANRIRSDAaaqADRLIGEATSEAERLTAEARAEAERLRDEARAEAERVRADAAAQAEQLIA- 964
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2356 etqgfqktleterqrqlEMSAEAERLRLRVAE-MSRAQARAE---EDARRFRKQAEDIGERLyRTELATQEKVMLVQTLE 2431
Cdd:NF041483 965 -----------------EATGEAERLRAEAAEtVGSAQQHAErirTEAERVKAEAAAEAERL-RTEAREEADRTLDEARK 1026
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2432 TQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLL----QETQALQQSFLSEKDSLLQRERcieq 2507
Cdd:NF041483 1027 DANKRRSEAAEQADTLITEAAAEADQLTAKAQEEALRTTTEAEAQADTMVgaarKEAERIVAEATVEGNSLVEKAR---- 1102
|
1210 1220 1230 1240 1250
....*....|....*....|....*....|....*....|....*....|.
gi 1920237962 2508 ekAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLaasMEEARRRQHEA 2558
Cdd:NF041483 1103 --TDADELLVGARRDATAIRERAEELRDRITGEIEEL---HERARRESAEQ 1148
|
|
| CH_ACTN2_rpt2 |
cd21288 |
second calponin homology (CH) domain found in alpha-actinin-2; Alpha-actinin-2 (ACTN2), also ... |
158-273 |
2.36e-24 |
|
second calponin homology (CH) domain found in alpha-actinin-2; Alpha-actinin-2 (ACTN2), also called alpha-actinin skeletal muscle isoform 2, is an F-actin cross-linking protein which is thought to anchor actin to a variety of intracellular structures. ACTN2 is a bundling protein. Its mutations are associated with cardiomyopathies, as well as skeletal muscle disorder. It contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409137 Cd Length: 124 Bit Score: 100.92 E-value: 2.36e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 158 ISDIQVsgqsEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSV 237
Cdd:cd21288 1 IQDISV----EETSAKEGLLLWCQRKTAPYRNVNIQNFHTSWKDGLGLCALIHRHRPDLIDYSKLNKDDPIGNINLAMEI 76
|
90 100 110
....*....|....*....|....*....|....*..
gi 1920237962 238 AERDLGVTRLLDPED-VDVPQPDEKSIITYVSSLYDA 273
Cdd:cd21288 77 AEKHLDIPKMLDAEDiVNTPKPDERAIMTYVSCFYHA 113
|
|
| CH_SPTBN5_rpt1 |
cd21247 |
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 5 (SPTBN5) ... |
37-158 |
5.46e-24 |
|
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 5 (SPTBN5) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN5, also called beta-V spectrin, is a mammalian ortholog of Drosophila beta H spectrin that may play a crucial role as a longer actin-membrane cross-linker or to fulfill the need for greater extensible flexibility than can be provided by the other smaller conventional spectrins. SPTBN5 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409096 Cd Length: 125 Bit Score: 100.22 E-value: 5.46e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 37 KDERDRVQKKTFTKWVNKHLIKAQR--HISDLYEDLRDGHNLISLLEVLSGDSLPRErdvirssrlprEKGRMRFHKLQN 114
Cdd:cd21247 14 QEQRMTMQKKTFTKWMNNVFSKNGAkiEITDIYTELKDGIHLLRLLELISGEQLPRP-----------SRGKMRVHFLEN 82
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 1920237962 115 VQIALDYLRHR-QVKLVNIRNddIADGNPKLTLGLIWTIILHFQI 158
Cdd:cd21247 83 NSKAITFLKTKvPVKLIGPEN--IVDGDRTLILGLIWIIILRFQI 125
|
|
| CH_MICALL |
cd21197 |
calponin homology (CH) domain found in the MICAL-like protein family; The MICAL-L family ... |
176-271 |
6.36e-24 |
|
calponin homology (CH) domain found in the MICAL-like protein family; The MICAL-L family includes MICAL-L1 and MICAL-L2. MICAL-L1, also called molecule interacting with Rab13 (MIRab13), is a probable lipid-binding protein with higher affinity for phosphatidic acid, a lipid enriched in recycling endosome membranes. It is a tubular endosomal membrane hub that connects Rab35 and Arf6 with Rab8a. It may be involved in a late step of receptor-mediated endocytosis regulating endocytosed-EGF receptor trafficking. Alternatively, it may regulate slow endocytic recycling of endocytosed proteins back to the plasma membrane. MICAL-L1 may indirectly play a role in neurite outgrowth. MICAL-L2, also called junctional Rab13-binding protein (JRAB), or molecule interacting with CasL-like 2, acts as an effector of small Rab GTPases which is involved in junctional complexes assembly through the regulation of cell adhesion molecule transport to the plasma membrane, and actin cytoskeleton reorganization. It regulates the endocytic recycling of occludins, claudins, and E-cadherin to the plasma membrane and may thereby regulate the establishment of tight junctions and adherens junctions. Members of this family contain a single copy of CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409046 Cd Length: 105 Bit Score: 99.15 E-value: 6.36e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 176 LLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED-VD 254
Cdd:cd21197 5 LLRWCRRQCEGYPGVNITNLTSSFRDGLAFCAILHRHRPELIDFHSLKKDNWLENNRLAFRVAETSLGIPALLDAEDmVT 84
|
90
....*....|....*..
gi 1920237962 255 VPQPDEKSIITYVSSLY 271
Cdd:cd21197 85 MHVPDRLSIITYVSQYY 101
|
|
| CH |
smart00033 |
Calponin homology domain; Actin binding domains present in duplicate at the N-termini of ... |
46-155 |
6.44e-24 |
|
Calponin homology domain; Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
Pssm-ID: 214479 [Multi-domain] Cd Length: 101 Bit Score: 98.93 E-value: 6.44e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 46 KTFTKWVNKHLIKA-QRHISDLYEDLRDGHNLISLLEVLSGDSLPrERDVirssrlprEKGRMRFHKLQNVQIALDYLRH 124
Cdd:smart00033 1 KTLLRWVNSLLAEYdKPPVTNFSSDLKDGVALCALLNSLSPGLVD-KKKV--------AASLSRFKKIENINLALSFAEK 71
|
90 100 110
....*....|....*....|....*....|.
gi 1920237962 125 RQVKLVNIRNDDIADGnPKLTLGLIWTIILH 155
Cdd:smart00033 72 LGGKVVLFEPEDLVEG-PKLILGVIWTLISL 101
|
|
| CH |
pfam00307 |
Calponin homology (CH) domain; The CH domain is found in both cytoskeletal proteins and signal ... |
170-276 |
6.83e-24 |
|
Calponin homology (CH) domain; The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most member proteins have from two to four copies of the CH domain, however some proteins such as calponin have only a single copy.
Pssm-ID: 425596 [Multi-domain] Cd Length: 109 Bit Score: 99.28 E-value: 6.83e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 170 MTAKEKLLLWSQRMVEGC-QGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVY--RQTNLENLDQAFSVAERDLGVTR 246
Cdd:pfam00307 1 LELEKELLRWINSHLAEYgPGVRVTNFTTDLRDGLALCALLNKLAPGLVDKKKLNksEFDKLENINLALDVAEKKLGVPK 80
|
90 100 110
....*....|....*....|....*....|.
gi 1920237962 247 -LLDPEDVDvpQPDEKSIITYVSSLYDAMPR 276
Cdd:pfam00307 81 vLIEPEDLV--EGDNKSVLTYLASLFRRFQA 109
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1384-2244 |
9.82e-24 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 111.30 E-value: 9.82e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1384 REAQGLQRRMQEEVARREEVAVEAQEQKRSIQ------EELQHLR-QSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQ 1456
Cdd:TIGR02168 175 KETERKLERTRENLDRLEDILNELERQLKSLErqaekaERYKELKaELRELELALLVLRLEELREELEELQEELKEAEEE 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1457 LEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEE 1536
Cdd:TIGR02168 255 LEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDE 334
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1537 LRLQAEEAERRLrqaeaerarqvqvaletaqrsaeAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATrraqqqaea 1616
Cdd:TIGR02168 335 LAEELAELEEKL-----------------------EELKEELESLEAELEELEAELEELESRLEELEEQLE--------- 382
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1617 eraraeaerelerwQLKANEALRLRLQAEEVAQQKSLTQAEaekqkeeaerearrrgkaeEQAVRQRELAEQELEKQRQL 1696
Cdd:TIGR02168 383 --------------TLRSKVAQLELQIASLNNEIERLEARL-------------------ERLEDRRERLQQEIEELLKK 429
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1697 AEgtAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRS 1776
Cdd:TIGR02168 430 LE--EAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEG 507
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1777 TSE--KSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQlaeEDAVRQRAEAERVLAEKLAAiSEATRLKTEAEIALKE 1854
Cdd:TIGR02168 508 VKAllKNQSGLSGILGVLSELISVDEGYEAAIEAALGGRL---QAVVVENLNAAKKAIAFLKQ-NELGRVTFLPLDSIKG 583
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1855 KEAENERLRRLAEDEAFQRRL--LEEQAAQHKADIEARLAQLRKASEselerqkglVEDTLRQRRQVEEEIL-------- 1924
Cdd:TIGR02168 584 TEIQGNDREILKNIEGFLGVAkdLVKFDPKLRKALSYLLGGVLVVDD---------LDNALELAKKLRPGYRivtldgdl 654
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1925 -----ALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQlaaeeerRRREAEERVQKSLAAEEEAARQ 1999
Cdd:TIGR02168 655 vrpggVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRK-------ELEELEEELEQLRKELEELSRQ 727
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2000 RKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAfaVQQKEQELQQTLQQEQSVLERLRSEA 2079
Cdd:TIGR02168 728 ISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAE--AEAEIEELEAQIEQLKEELKALREAL 805
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2080 EAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAA 2159
Cdd:TIGR02168 806 DELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASL 885
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2160 DAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLK---AEVTEAARQRGQVEEELFSLRVQMEEL 2236
Cdd:TIGR02168 886 EEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEvriDNLQERLSEEYSLTLEEAEALENKIED 965
|
....*...
gi 1920237962 2237 GKLKARIE 2244
Cdd:TIGR02168 966 DEEEARRR 973
|
|
| CH |
pfam00307 |
Calponin homology (CH) domain; The CH domain is found in both cytoskeletal proteins and signal ... |
43-158 |
3.59e-23 |
|
Calponin homology (CH) domain; The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most member proteins have from two to four copies of the CH domain, however some proteins such as calponin have only a single copy.
Pssm-ID: 425596 [Multi-domain] Cd Length: 109 Bit Score: 96.97 E-value: 3.59e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 43 VQKKTFTKWVNKHLIKAQRH--ISDLYEDLRDGHNLISLLEVLSGDSLPrerdvirssrlPREKGRMRFHKLQNVQIALD 120
Cdd:pfam00307 2 ELEKELLRWINSHLAEYGPGvrVTNFTTDLRDGLALCALLNKLAPGLVD-----------KKKLNKSEFDKLENINLALD 70
|
90 100 110
....*....|....*....|....*....|....*....
gi 1920237962 121 YLRHRQ-VKLVNIRNDDIADGNPKLTLGLIWTIILHFQI 158
Cdd:pfam00307 71 VAEKKLgVPKVLIEPEDLVEGDNKSVLTYLASLFRRFQA 109
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1310-2046 |
6.57e-23 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 108.61 E-value: 6.57e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1310 QEYVDLRTRYSELS-TLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAAL--------EKQRQLAEA------ 1374
Cdd:TIGR02168 213 ERYKELKAELRELElALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLeelrlevsELEEEIEELqkelya 292
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1375 HAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQE------ELQHLRQSSEAEIQAKARQVEAAERSRLRIEE 1448
Cdd:TIGR02168 293 LANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDElaeelaELEEKLEELKEELESLEAELEELEAELEELES 372
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1449 EIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVqdETQRKRQAEAELALRVQAEAEAAREKQ 1528
Cdd:TIGR02168 373 RLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEI--EELLKKLEEAELKELQAELEELEEELE 450
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1529 RALQALEELRLQAEEAERRLRQAEAERarqvqvaletaqRSAEAELQsEHASFAEKTAQLERTLKEEHVAVVQLREEAtr 1608
Cdd:TIGR02168 451 ELQEELERLEEALEELREELEEAEQAL------------DAAERELA-QLQARLDSLERLQENLEGFSEGVKALLKNQ-- 515
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1609 raQQQAEAERARAEAERELERWQLKANEALRLRLQA---EEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQREL 1685
Cdd:TIGR02168 516 --SGLSGILGVLSELISVDEGYEAAIEAALGGRLQAvvvENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREI 593
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1686 AEQELEKQRQLA---EGTAQQRLAAEQELIRLR-AETEQG--EQQRQLLEEELA------------RLQREAAAATQKRR 1747
Cdd:TIGR02168 594 LKNIEGFLGVAKdlvKFDPKLRKALSYLLGGVLvVDDLDNalELAKKLRPGYRIvtldgdlvrpggVITGGSAKTNSSIL 673
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1748 ELEAELAKVRAEMEvLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEA 1827
Cdd:TIGR02168 674 ERRREIEELEEKIE-ELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQL 752
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1828 ERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAED-----EAFQRRLLEEQAAQHKADIEARLAQLRKAS-ESE 1901
Cdd:TIGR02168 753 SKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQlkeelKALREALDELRAELTLLNEEAANLRERLESlERR 832
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1902 LERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELgrirgtaEDTLRSKEQAEQEAARQRQLAAEEERRRRE 1981
Cdd:TIGR02168 833 IAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESEL-------EALLNERASLEEALALLRSELEELSEELRE 905
|
730 740 750 760 770 780
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237962 1982 AEERVQKSLAAEEEAARQRKAALEEVERLKAKVEE-ARRLRERA--EQESARQLQLAQEAAQKRLQAE 2046
Cdd:TIGR02168 906 LESKRSELRRELEELREKLAQLELRLEGLEVRIDNlQERLSEEYslTLEEAEALENKIEDDEEEARRR 973
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1702-2494 |
7.71e-23 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 108.61 E-value: 7.71e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1702 QQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAAtQKRRELEAELAKVRAEmevLLASKARAEEESRSTSEKS 1781
Cdd:TIGR02168 172 ERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKA-ERYKELKAELRELELA---LLVLRLEELREELEELQEE 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1782 KQRLEAEagrFRELAEEAARLRALAEEAKRQRQLAEEDAvrqrAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENER 1861
Cdd:TIGR02168 248 LKEAEEE---LEELTAELQELEEKLEELRLEVSELEEEI----EELQKELYALANEISRLEQQKQILRERLANLERQLEE 320
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1862 LRRLAEDEAFQRRLLEEQAAQHKADIEaRLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFekaaagkAELE 1941
Cdd:TIGR02168 321 LEAQLEELESKLDELAEELAELEEKLE-ELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKV-------AQLE 392
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1942 LELGRIRGTAEDTLRSKEQAEQEAARQRQ-LAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRL 2020
Cdd:TIGR02168 393 LQIASLNNEIERLEARLERLEDRRERLQQeIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEE 472
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2021 RERAEQESARQLQLAQE--AAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAER 2098
Cdd:TIGR02168 473 AEQALDAAERELAQLQArlDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSELISVDEGYEAAIEAALGGRLQAVVV 552
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2099 EAAQSRRQVEEAerLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAaDAEMEKHKQFAeqaLRQKA 2178
Cdd:TIGR02168 553 ENLNAAKKAIAF--LKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKF-DPKLRKALSYL---LGGVL 626
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2179 QVEQELTALRLQlEETDHQKSI--LDEELQRLKAEVTEAARQRGQVeeeLFSLRVQMEELGKLKARIEAENRAL--VLRD 2254
Cdd:TIGR02168 627 VVDDLDNALELA-KKLRPGYRIvtLDGDLVRPGGVITGGSAKTNSS---ILERRREIEELEEKIEELEEKIAELekALAE 702
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2255 KDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKE-----------KMQAVQEATRLKA 2323
Cdd:TIGR02168 703 LRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTEleaeieeleerLEEAEEELAEAEA 782
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2324 EAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRlrvaEMSRAQARAEEDARRFR 2403
Cdd:TIGR02168 783 EIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLE----DLEEQIEELSEDIESLA 858
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2404 KQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTvRQEQLLQE 2483
Cdd:TIGR02168 859 AEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL-RLEGLEVR 937
|
810
....*....|.
gi 1920237962 2484 TQALQQSFLSE 2494
Cdd:TIGR02168 938 IDNLQERLSEE 948
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1338-2049 |
9.64e-23 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 108.37 E-value: 9.64e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1338 RRMEEEERLAEQQRAEERERLAEVEAALEKQRQLA-EAHAQAKAQAEREAQGLQRRMQEEVARreeVAVEAQEQKRSIQE 1416
Cdd:NF041483 254 RQAAELSRAAEQRMQEAEEALREARAEAEKVVAEAkEAAAKQLASAESANEQRTRTAKEEIAR---LVGEATKEAEALKA 330
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1417 ELQHLRQSSEAEiqAKARQVEAAERSRLRIEEEirvvrlqlEATERQRGGAEGElQALRARAEEAEAQKRQAQEEAERLR 1496
Cdd:NF041483 331 EAEQALADARAE--AEKLVAEAAEKARTVAAED--------TAAQLAKAARTAE-EVLTKASEDAKATTRAAAEEAERIR 399
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1497 RQVQDETQRKRQAEAELALRVQAEAEAAREKQRAlqalEELRLQaEEAeRRLRqAEAERARqvqvaletaqrsaeaelqs 1576
Cdd:NF041483 400 REAEAEADRLRGEAADQAEQLKGAAKDDTKEYRA----KTVELQ-EEA-RRLR-GEAEQLR------------------- 453
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1577 ehasfAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELErwqlkANEALRLRLQAEEVAqqKSLTQA 1656
Cdd:NF041483 454 -----AEAVAEGERIRGEARREAVQQIEEAARTAEELLTKAKADADELRSTA-----TAESERVRTEAIERA--TTLRRQ 521
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1657 EAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLA-AEQELIRLRAETEQ----GEQQRQLLEEE 1731
Cdd:NF041483 522 AEETLERTRAEAERLRAEAEEQAEEVRAAAERAARELREETERAIAARQAeAAEELTRLHTEAEErltaAEEALADARAE 601
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1732 LARLQREAAAATQKRRELEAE-----LAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfrelAEEAARLRALA 1806
Cdd:NF041483 602 AERIRREAAEETERLRTEAAErirtlQAQAEQEAERLRTEAAADASAARAEGENVAVRLRSEA------AAEAERLKSEA 675
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1807 EE-AKRQRQLAEEDAVRQRAEAERVLAeklAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAfqrrllEEQAAQHKA 1885
Cdd:NF041483 676 QEsADRVRAEAAAAAERVGTEAAEALA---AAQEEAARRRREAEETLGSARAEADQERERAREQS------EELLASARK 746
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1886 DIEARLAQLRKASESELERQKGLV---EDTLRQRR--------QVEEEILALKGSFEKAAA-GKAELELELGRIRGtaeD 1953
Cdd:NF041483 747 RVEEAQAEAQRLVEEADRRATELVsaaEQTAQQVRdsvaglqeQAEEEIAGLRSAAEHAAErTRTEAQEEADRVRS---D 823
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1954 TLRSKEQAEQEAARQRQlaaeeerrrreaeERVQKSLAAEEEAARQRKAALEEVERLKAKVEE-ARRLRERAeqeSARQL 2032
Cdd:NF041483 824 AYAERERASEDANRLRR-------------EAQEETEAAKALAERTVSEAIAEAERLRSDASEyAQRVRTEA---SDTLA 887
|
730
....*....|....*..
gi 1920237962 2033 QLAQEAAQKRLQAEEKA 2049
Cdd:NF041483 888 SAEQDAARTRADAREDA 904
|
|
| CH_MICALL1 |
cd21252 |
calponin homology (CH) domain found in MICAL-like protein 1; MICAL-like protein 1 (MICAL-L1), ... |
172-271 |
9.89e-23 |
|
calponin homology (CH) domain found in MICAL-like protein 1; MICAL-like protein 1 (MICAL-L1), also called molecule interacting with Rab13 (MIRab13), is a probable lipid-binding protein with higher affinity for phosphatidic acid, a lipid enriched in recycling endosome membranes. It is a tubular endosomal membrane hub that connects Rab35 and Arf6 with Rab8a. It may be involved in a late step of receptor-mediated endocytosis regulating endocytosed-EGF receptor trafficking. Alternatively, it may regulate slow endocytic recycling of endocytosed proteins back to the plasma membrane. MICAL-L1 may indirectly play a role in neurite outgrowth. It contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409101 Cd Length: 107 Bit Score: 95.71 E-value: 9.89e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 172 AKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPE 251
Cdd:cd21252 1 ARRALQAWCRRQCEGYPGVEIRDLSSSFRDGLAFCAILHRHRPDLIDFDSLSKDNVYENNRLAFEVAERELGIPALLDPE 80
|
90 100
....*....|....*....|.
gi 1920237962 252 D-VDVPQPDEKSIITYVSSLY 271
Cdd:cd21252 81 DmVSMKVPDCLSIMTYVSQYY 101
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1873-2514 |
1.18e-22 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 107.72 E-value: 1.18e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1873 RRLLEEQA--AQHKADIEARLAQLRKASE---------SELERQKglveDTLRQRRQVEEEILALKGSFEKAaagkaELE 1941
Cdd:COG1196 158 RAIIEEAAgiSKYKERKEEAERKLEATEEnlerledilGELERQL----EPLERQAEKAERYRELKEELKEL-----EAE 228
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1942 LELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLR 2021
Cdd:COG1196 229 LLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLE 308
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2022 ERAEQESARQLQLAQEAAQKRLQAEEKAhafavqqkeqelqqtlqqeqsvlERLRSEAEAARRAAEEAEAARERAEREAA 2101
Cdd:COG1196 309 ERRRELEERLEELEEELAELEEELEELE-----------------------EELEELEEELEEAEEELEEAEAELAEAEE 365
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2102 QSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVE 2181
Cdd:COG1196 366 ALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALE 445
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2182 QELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAenrALVLRDKDSAQRL 2261
Cdd:COG1196 446 EAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEG---VKAALLLAGLRGL 522
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2262 LQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARR 2341
Cdd:COG1196 523 AGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVD 602
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2342 LQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQ 2421
Cdd:COG1196 603 LVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELE 682
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2422 EKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQR 2501
Cdd:COG1196 683 ELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDL 762
|
650
....*....|...
gi 1920237962 2502 ERcIEQEKAKLEQ 2514
Cdd:COG1196 763 EE-LERELERLER 774
|
|
| CH_CTX_rpt2 |
cd21226 |
second calponin homology (CH) domain found in cortexillin; Cortexillins are actin-bundling ... |
174-274 |
1.71e-22 |
|
second calponin homology (CH) domain found in cortexillin; Cortexillins are actin-bundling proteins that play a critical role in regulating cell morphology and actin cytoskeleton reorganization. They play a major role in cytokinesis and contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409075 Cd Length: 103 Bit Score: 94.84 E-value: 1.71e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 174 EKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPEDV 253
Cdd:cd21226 3 DGLLAWCRQTTEGYDGVNITSFKSSFNDGRAFLALLHAYDPELFKQAAIEQMDAEARLNLAFDFAEKKLGIPKLLEAEDV 82
|
90 100
....*....|....*....|.
gi 1920237962 254 DVPQPDEKSIITYVSSLYDAM 274
Cdd:cd21226 83 MTGNPDERSIVLYTSLFYHAF 103
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1334-2050 |
1.86e-22 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 107.22 E-value: 1.86e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1334 SETLRRMEEEERL---AEQQRAE---ERERLaEVEAALEKQRQLAEAHAQAK---AQAEREAQGLQRRMQEEVAR-REEV 1403
Cdd:NF041483 433 AKTVELQEEARRLrgeAEQLRAEavaEGERI-RGEARREAVQQIEEAARTAEellTKAKADADELRSTATAESERvRTEA 511
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1404 AVEAQEQKRSIQEELQHLRqsSEAEiQAKARQVEAAERSRLRIEEEIRVVRLQLE-ATERQRGGAEGELQALRARAEE-- 1480
Cdd:NF041483 512 IERATTLRRQAEETLERTR--AEAE-RLRAEAEEQAEEVRAAAERAARELREETErAIAARQAEAAEELTRLHTEAEErl 588
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1481 --AEAQKRQAQEEAERLRRQVQDETQRKRQAEAE--LALRVQAEAEAAREKQRALQALEELRLQAEEAERRLR-QAEAER 1555
Cdd:NF041483 589 taAEEALADARAEAERIRREAAEETERLRTEAAEriRTLQAQAEQEAERLRTEAAADASAARAEGENVAVRLRsEAAAEA 668
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1556 ARQVQVALETAQRsaeaeLQSEHASFAEKTAQlertlkeehvavvqlreeatrraqqqaeaeraraeaerelerwqlKAN 1635
Cdd:NF041483 669 ERLKSEAQESADR-----VRAEAAAAAERVGT---------------------------------------------EAA 698
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1636 EALrlrlqaeevaqqksltqaeaekqkeeaerearrrGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELI--- 1712
Cdd:NF041483 699 EAL----------------------------------AAAQEEAARRRREAEETLGSARAEADQERERAREQSEELLasa 744
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1713 RLRAETEQGEQQRqLLEEELARLQREAAAATQKRRELEAELAKV--RAEMEV--LLASKARAEEESRSTSEKSKQRLEAE 1788
Cdd:NF041483 745 RKRVEEAQAEAQR-LVEEADRRATELVSAAEQTAQQVRDSVAGLqeQAEEEIagLRSAAEHAAERTRTEAQEEADRVRSD 823
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1789 AGRFRELA-EEAARLRALA-EEAKRQRQLAEEDAVRQRAEAERVLAEklaAISEATRLKTEAEIALKEKEAENERLRRLA 1866
Cdd:NF041483 824 AYAERERAsEDANRLRREAqEETEAAKALAERTVSEAIAEAERLRSD---ASEYAQRVRTEASDTLASAEQDAARTRADA 900
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1867 EDEAFQRRllEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAElelelgR 1946
Cdd:NF041483 901 REDANRIR--SDAAAQADRLIGEATSEAERLTAEARAEAERLRDEARAEAERVRADAAAQAEQLIAEATGEAE------R 972
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1947 IRGTAEDTLRSkeqAEQEAARQRQLAAEEERRrreaeervqkslaAEEEAARQRKAALEEVERL--KAKVEEARRLRERA 2024
Cdd:NF041483 973 LRAEAAETVGS---AQQHAERIRTEAERVKAE-------------AAAEAERLRTEAREEADRTldEARKDANKRRSEAA 1036
|
730 740
....*....|....*....|....*.
gi 1920237962 2025 EQESARQLQLAQEAAQKRLQAEEKAH 2050
Cdd:NF041483 1037 EQADTLITEAAAEADQLTAKAQEEAL 1062
|
|
| CH_FLN-like_rpt2 |
cd21184 |
second calponin homology (CH) domain found in the filamin family; The filamin family includes ... |
171-269 |
2.09e-22 |
|
second calponin homology (CH) domain found in the filamin family; The filamin family includes filamin-A (FLN-A), filamin-B (FLN-B) and filamin-C (FLN-C). Filamins function to anchor various transmembrane proteins to the actin cytoskeleton. FLN-A is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-B is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton and may also promote orthogonal branching of actin filaments as well as link actin filaments to membrane glycoproteins. FLN-C, also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. This family also includes Drosophila melanogaster protein jitterbug (Jbug), which is an actin-meshwork organizing protein containing three copies of the CH domain. Other members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409033 Cd Length: 103 Bit Score: 94.61 E-value: 2.09e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 171 TAKEKLLLWSQRMVEGCqglRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVY-RQTNLENLDQAFSVAERDLGVTRLLD 249
Cdd:cd21184 1 SGKSLLLEWVNSKIPEY---KVKNFTTDWNDGKALAALVDALKPGLIPDNESLdKENPLENATKAMDIAEEELGIPKIIT 77
|
90 100
....*....|....*....|
gi 1920237962 250 PEDVDVPQPDEKSIITYVSS 269
Cdd:cd21184 78 PEDMVSPNVDELSVMTYLSY 97
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1681-2451 |
2.29e-22 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 107.06 E-value: 2.29e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1681 RQRELAEQELEKQRQLAEgtaqqrlaAEQELIRLRAETeqgeqqrqlLEEELARLQREAAAATQKRRELEAELAKVRAEM 1760
Cdd:TIGR02168 207 RQAEKAERYKELKAELRE--------LELALLVLRLEE---------LREELEELQEELKEAEEELEELTAELQELEEKL 269
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1761 EVLLASKARAEEEsrstsekskqrLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISE 1840
Cdd:TIGR02168 270 EELRLEVSELEEE-----------IEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEE 338
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1841 ATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQH---KADIEARLAQLRKASeSELERQKGLVEDTLRQRR 1917
Cdd:TIGR02168 339 LAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLetlRSKVAQLELQIASLN-NEIERLEARLERLEDRRE 417
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1918 QVEEEILALKGSFEKAAagKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRrreaeervQKSLAAEEEAA 1997
Cdd:TIGR02168 418 RLQQEIEELLKKLEEAE--LKELQAELEELEEELEELQEELERLEEALEELREELEEAEQA--------LDAAERELAQL 487
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1998 RQRKAALEEV-ERLKAKVEEARRLRERAEQ------------ESARQLQLAQEAA-QKRLQA------EEKAHAFAVQQK 2057
Cdd:TIGR02168 488 QARLDSLERLqENLEGFSEGVKALLKNQSGlsgilgvlseliSVDEGYEAAIEAAlGGRLQAvvvenlNAAKKAIAFLKQ 567
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2058 EQELQQTLQQEQSVLER-LRSEAEAARRAAEEAEAARERAEREAAQSRRQVEE-------AERLKQSAEEQAQAQAQAQA 2129
Cdd:TIGR02168 568 NELGRVTFLPLDSIKGTeIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYllggvlvVDDLDNALELAKKLRPGYRI 647
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2130 AAEKLRKEAEQEAARRAQAEQAALRQKQaaDAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLK 2209
Cdd:TIGR02168 648 VTLDGDLVRPGGVITGGSAKTNSSILER--RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELS 725
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2210 AEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAEnRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQ----- 2284
Cdd:TIGR02168 726 RQISALRKDLARLEAEVEQLEERIAQLSKELTELEAE-IEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKalrea 804
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2285 ------------EAARLRQLAEEDLAQQRALAEKML----KEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQ 2348
Cdd:TIGR02168 805 ldelraeltllnEEAANLRERLESLERRIAATERRLedleEQIEELSEDIESLAAEIEELEELIEELESELEALLNERAS 884
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2349 MAQQLAQ---ETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDigerLYRTELATQEKvm 2425
Cdd:TIGR02168 885 LEEALALlrsELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSE----EYSLTLEEAEA-- 958
|
810 820
....*....|....*....|....*.
gi 1920237962 2426 LVQTLETQRQQSDRDAERLREAIAEL 2451
Cdd:TIGR02168 959 LENKIEDDEEEARRRLKRLENKIKEL 984
|
|
| CH_EHBP |
cd21198 |
calponin homology (CH) domain found in the EH domain-binding protein (EHBP) family; The EHBP ... |
171-271 |
2.98e-22 |
|
calponin homology (CH) domain found in the EH domain-binding protein (EHBP) family; The EHBP family includes EHBP1 and EHBP1-like protein (EHBP1L1). EHBP1 is a regulator of endocytic recycling and may play a role in actin reorganization by linking clathrin-mediated endocytosis to the actin cytoskeleton. It may act as an effector of small GTPases, including RAB-10 (Rab10), and play a role in vesicle trafficking. EHBP1 is associated with aggressive prostate cancer and insulin-stimulated trafficking and cell migration. EHBP1L1 may also act as Rab effector protein and play a role in vesicle trafficking. It coordinates Rab8 and Bin1 to regulate apical-directed transport in polarized epithelial cells. Members of this family contain a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409047 Cd Length: 105 Bit Score: 94.41 E-value: 2.98e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 171 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLLDP 250
Cdd:cd21198 1 SSGQDLLEWCQEVTKGYRGVKITNLTTSWRNGLAFCAILHHFRPDLIDFSSLSPHDIKENCKLAFDAAAK-LGIPRLLDP 79
|
90 100
....*....|....*....|..
gi 1920237962 251 EDVDVPQ-PDEKSIITYVSSLY 271
Cdd:cd21198 80 ADMVLLSvPDKLSVMTYLHQIR 101
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1988-2603 |
4.38e-22 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 105.79 E-value: 4.38e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1988 KSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQ 2067
Cdd:COG1196 203 EPLERQAEKAERYRELKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELE 282
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2068 EQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQ 2147
Cdd:COG1196 283 LEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAE 362
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2148 AEQAALRQKQAADAEMEKHKQFAEQALRQkaqvEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELF 2227
Cdd:COG1196 363 AEEALLEAEAELAEAEEELEELAEELLEA----LRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEE 438
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2228 SLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKM 2307
Cdd:COG1196 439 EEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAG 518
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2308 LKEKMQAVQEATRLKAEAELLQQQKELA--QEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRV 2385
Cdd:COG1196 519 LRGLAGAVAVLIGVEAAYEAALEAALAAalQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIG 598
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2386 AEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLL 2465
Cdd:COG1196 599 AAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAE 678
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2466 QLKSEEMQTVRQEQLLQETQALQQsflsekdsLLQRERCIEQEKAKLEQlfqdevakaqalreeqqrqqqqmqqekqqlA 2545
Cdd:COG1196 679 AELEELAERLAEEELELEEALLAE--------EEEERELAEAEEERLEE------------------------------E 720
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237962 2546 ASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRA 2603
Cdd:COG1196 721 LEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLEREIEA 778
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1785-2600 |
6.26e-22 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 105.52 E-value: 6.26e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1785 LEAEAGRFRELAEEAA---RLRALAEEAKRQRQLAEEDAVR------------QRAEAERVLAEKLAAISEATRlKTEAE 1849
Cdd:TIGR02168 150 IEAKPEERRAIFEEAAgisKYKERRKETERKLERTRENLDRledilnelerqlKSLERQAEKAERYKELKAELR-ELELA 228
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1850 IALKEKEAENERLRRLAEDEAFQRRLLEEQAAQhKADIEARLAQLRKAS---ESELERQKGLVEDTLRQRRQVEEEILAL 1926
Cdd:TIGR02168 229 LLVLRLEELREELEELQEELKEAEEELEELTAE-LQELEEKLEELRLEVselEEEIEELQKELYALANEISRLEQQKQIL 307
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1927 KGSFEKAAAGKAELELELgrirgtaEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEE 2006
Cdd:TIGR02168 308 RERLANLERQLEELEAQL-------EELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQ 380
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2007 VERLKAKVEEARR--LRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQ-QKEQELQQTLQQEQSVLERLRSEAEAAR 2083
Cdd:TIGR02168 381 LETLRSKVAQLELqiASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEeAELKELQAELEELEEELEELQEELERLE 460
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2084 RAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQaeqaaLRQKQAADAEM 2163
Cdd:TIGR02168 461 EALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGV-----LSELISVDEGY 535
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2164 EKHKQFAEQALRQKAQVEQELTALRLQleetDHQKsilDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARI 2243
Cdd:TIGR02168 536 EAAIEAALGGRLQAVVVENLNAAKKAI----AFLK---QNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLV 608
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2244 EAENRA-----------LVLRDKDSAQRLLQEEAEKMKQVAEEAARLS---VAAQEAARLRQLAeedLAQQRALAEkmLK 2309
Cdd:TIGR02168 609 KFDPKLrkalsyllggvLVVDDLDNALELAKKLRPGYRIVTLDGDLVRpggVITGGSAKTNSSI---LERRREIEE--LE 683
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2310 EKMQAVQEAtrlkaEAELLQQQKELAQEQarrlqedkeqmaQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMS 2389
Cdd:TIGR02168 684 EKIEELEEK-----IAELEKALAELRKEL------------EELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLE 746
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2390 RAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKS 2469
Cdd:TIGR02168 747 ERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERL 826
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2470 EEMQTvRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLfQDEVAKAQALREEQQRQQQQMQQEKQQLAASME 2549
Cdd:TIGR02168 827 ESLER-RIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEEL-ESELEALLNERASLEEALALLRSELEELSEELR 904
|
810 820 830 840 850
....*....|....*....|....*....|....*....|....*....|.
gi 1920237962 2550 EARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEE 2600
Cdd:TIGR02168 905 ELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEE 955
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1012-1576 |
6.31e-22 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 105.40 E-value: 6.31e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1012 ARECAQRITEQQKAQAEVDGLGKGVARLSAEAEKvlalpepspaaptLRSELELTLGKLEQVRSLSAIYLEKLKTISLVI 1091
Cdd:COG1196 252 EAELEELEAELAELEAELEELRLELEELELELEE-------------AQAEEYELLAELARLEQDIARLEERRRELEERL 318
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1092 RSTQEAEEVLRAHEEQLKEAQAvpATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVE 1171
Cdd:COG1196 319 EELEEELAELEEELEELEEELE--ELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAA 396
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1172 RWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALL 1251
Cdd:COG1196 397 ELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLE 476
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1252 EDIERhgekveecqrfAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLtsqyir 1331
Cdd:COG1196 477 AALAE-----------LLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAA------ 539
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1332 fisetlrrMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEA----HAQAKAQAEREAQGLQRRMQEEVARREEVAVEA 1407
Cdd:COG1196 540 --------LEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRAtflpLDKIRARAALAAALARGAIGAAVDLVASDLREA 611
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1408 QEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQ 1487
Cdd:COG1196 612 DARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEE 691
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1488 AQEEAERLRRQvqdETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVA-LETA 1566
Cdd:COG1196 692 ELELEEALLAE---EEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEeLERE 768
|
570
....*....|
gi 1920237962 1567 QRSAEAELQS 1576
Cdd:COG1196 769 LERLEREIEA 778
|
|
| CH_CLMN_rpt2 |
cd21245 |
second calponin homology (CH) domain found in calmin and similar proteins; Calmin, also called ... |
171-275 |
7.21e-22 |
|
second calponin homology (CH) domain found in calmin and similar proteins; Calmin, also called calponin-like transmembrane domain protein, is a protein with calponin homology (CH) and transmembrane domains expressed in maturing spermatogenic cells. It may be involved in the development and/or maintenance of neuronal functions. Calmin contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409094 Cd Length: 106 Bit Score: 93.32 E-value: 7.21e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 171 TAKEKLLLWSQRMVEGcQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 250
Cdd:cd21245 3 KAIKALLNWVQRRTRK-YGVAVQDFGSSWRSGLAFLALIKAIDPSLVDMRQALEKSPRENLEDAFRIAQESLGIPPLLEP 81
|
90 100
....*....|....*....|....*
gi 1920237962 251 EDVDVPQPDEKSIITYVSSLYDAMP 275
Cdd:cd21245 82 EDVMVDSPDEQSIMTYVAQFLEHFP 106
|
|
| CH_FLNA_rpt1 |
cd21308 |
first calponin homology (CH) domain found in filamin-A (FLN-A) and similar proteins; Filamin-A ... |
42-159 |
4.19e-21 |
|
first calponin homology (CH) domain found in filamin-A (FLN-A) and similar proteins; Filamin-A (FLN-A) is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also anchors various transmembrane proteins to the actin cytoskeleton and serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-A contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409157 Cd Length: 129 Bit Score: 92.07 E-value: 4.19e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 42 RVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRERDvirssrlprEKGRMRFHKLQNVQIALDY 121
Cdd:cd21308 19 KIQQNTFTRWCNEHLKCVSKRIANLQTDLSDGLRLIALLEVLSQKKMHRKHN---------QRPTFRQMQLENVSVALEF 89
|
90 100 110
....*....|....*....|....*....|....*...
gi 1920237962 122 LRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQIS 159
Cdd:cd21308 90 LDRESIKLVSIDSKAIVDGNLKLILGLIWTLILHYSIS 127
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1481-2615 |
4.94e-21 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 102.60 E-value: 4.94e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1481 AEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAE--AEAAREKQRALQALEELRLQAE-EAERRLRQAEAERAR 1557
Cdd:NF041483 81 AQIQADQLRADAERELRDARAQTQRILQEHAEHQARLQAElhTEAVQRRQQLDQELAERRQTVEsHVNENVAWAEQLRAR 160
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1558 QVQVA---LETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVvqlREEATRRAQQQAEAERARAEAERELERWQLKA 1634
Cdd:NF041483 161 TESQArrlLDESRAEAEQALAAARAEAERLAEEARQRLGSEAESA---RAEAEAILRRARKDAERLLNAASTQAQEATDH 237
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1635 NEALRLRLQAE-EVAQQKSltqaeaekqkeeaereaRRRGKAEEQAVRQrelAEQELEKQRQLAEGTAQQrlAAEQELIR 1713
Cdd:NF041483 238 AEQLRSSTAAEsDQARRQA-----------------AELSRAAEQRMQE---AEEALREARAEAEKVVAE--AKEAAAKQ 295
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1714 LRAETEQGEQQRQLLEEELARLQREAAA-ATQKRRELEAELAKVRAEMEVLLAS---KARAEEESRSTSEKSKQRLEAEA 1789
Cdd:NF041483 296 LASAESANEQRTRTAKEEIARLVGEATKeAEALKAEAEQALADARAEAEKLVAEaaeKARTVAAEDTAAQLAKAARTAEE 375
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1790 GRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAI---------------SEATRLKTEAEIALKE 1854
Cdd:NF041483 376 VLTKASEDAKATTRAAAEEAERIRREAEAEADRLRGEAADQAEQLKGAAkddtkeyraktvelqEEARRLRGEAEQLRAE 455
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1855 KEAENERLRRLAEDEAFQR-----RLLEEQAAQHKADIEarlaQLRKASESELERQKG-LVEDTLRQRRQVEEeilALKG 1928
Cdd:NF041483 456 AVAEGERIRGEARREAVQQieeaaRTAEELLTKAKADAD----ELRSTATAESERVRTeAIERATTLRRQAEE---TLER 528
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1929 SFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQ------LAAEEERRRREAEERVQKSLAAEEEAARQRKA 2002
Cdd:NF041483 529 TRAEAERLRAEAEEQAEEVRAAAERAARELREETERAIAARQaeaaeeLTRLHTEAEERLTAAEEALADARAEAERIRRE 608
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2003 ALEEVERLKAKV-EEARRLRERAEQESARqlqLAQEAAQKRLQAEEKAHAFAVqqkeqelqqtlqqeqsvleRLRSEAEa 2081
Cdd:NF041483 609 AAEETERLRTEAaERIRTLQAQAEQEAER---LRTEAAADASAARAEGENVAV-------------------RLRSEAA- 665
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2082 arraaeeaeaareraereaaqsrrqvEEAERLKqsaeeqaqaqAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQaada 2161
Cdd:NF041483 666 --------------------------AEAERLK----------SEAQESADRVRAEAAAAAERVGTEAAEALAAAQ---- 705
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2162 emekhkqfaEQALRQKAQVEQELTALRLQLEEtdhqksildeelqrlkaevtEAARQRGQVEEELFSLRVQMEELGKLKA 2241
Cdd:NF041483 706 ---------EEAARRRREAEETLGSARAEADQ--------------------ERERAREQSEELLASARKRVEEAQAEAQ 756
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2242 RI--EAENRA--LVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAA-RLRQLAEEDLAQQRALAekmLKEKMQAVQ 2316
Cdd:NF041483 757 RLveEADRRAteLVSAAEQTAQQVRDSVAGLQEQAEEEIAGLRSAAEHAAeRTRTEAQEEADRVRSDA---YAERERASE 833
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2317 EATRLKAEAellQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLemsAEAERlrlrvaEMSRAQARAE 2396
Cdd:NF041483 834 DANRLRREA---QEETEAAKALAERTVSEAIAEAERLRSDASEYAQRVRTEASDTL---ASAEQ------DAARTRADAR 901
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2397 EDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREaiaelehEKDKLKQEAQLLQLKSEEMQTVR 2476
Cdd:NF041483 902 EDANRIRSDAAAQADRLIGEATSEAERLTAEARAEAERLRDEARAEAERV-------RADAAAQAEQLIAEATGEAERLR 974
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2477 QEQLLQETQALQQSflsekdsllqrerciEQEKAKLEQLFQDEVAKAQALReeqqrqqqqmqqekqqlAASMEEARRRQH 2556
Cdd:NF041483 975 AEAAETVGSAQQHA---------------ERIRTEAERVKAEAAAEAERLR-----------------TEAREEADRTLD 1022
|
1130 1140 1150 1160 1170
....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 2557 EAeegvrrQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEEIAPSR 2615
Cdd:NF041483 1023 EA------RKDANKRRSEAAEQADTLITEAAAEADQLTAKAQEEALRTTTEAEAQADTM 1075
|
|
| CH_FLNB_rpt1 |
cd21309 |
first calponin homology (CH) domain found in filamin-B (FLN-B) and similar proteins; Filamin-B ... |
42-159 |
5.85e-21 |
|
first calponin homology (CH) domain found in filamin-B (FLN-B) and similar proteins; Filamin-B (FLN-B) is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton. It may promote orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It anchors various transmembrane proteins to the actin cytoskeleton. FLN-B contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409158 Cd Length: 131 Bit Score: 91.68 E-value: 5.85e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 42 RVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRERdvirssrlpREKGRMRFHKLQNVQIALDY 121
Cdd:cd21309 16 KIQQNTFTRWCNEHLKCVNKRIGNLQTDLSDGLRLIALLEVLSQKRMYRKY---------HQRPTFRQMQLENVSVALEF 86
|
90 100 110
....*....|....*....|....*....|....*...
gi 1920237962 122 LRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQIS 159
Cdd:cd21309 87 LDRESIKLVSIDSKAIVDGNLKLILGLVWTLILHYSIS 124
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1093-1897 |
1.27e-20 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 101.29 E-value: 1.27e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1093 STQEAEEVLRAHEEQLKEAQAvpatlpELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVER 1172
Cdd:TIGR02168 247 ELKEAEEELEELTAELQELEE------KLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEE 320
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1173 WRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLE 1252
Cdd:TIGR02168 321 LEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNN 400
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1253 DIERhgekveecqrfAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLTSQyirf 1332
Cdd:TIGR02168 401 EIER-----------LEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEE---- 465
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1333 ISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQE-EVARREEVAVEAQ--- 1408
Cdd:TIGR02168 466 LREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSELiSVDEGYEAAIEAAlgg 545
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1409 -------EQKRSIQEELQHLRQSSE------AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALR 1475
Cdd:TIGR02168 546 rlqavvvENLNAAKKAIAFLKQNELgrvtflPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGV 625
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1476 ARAEEAEAQKRQAQEEAERLR-------------------RQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEE 1536
Cdd:TIGR02168 626 LVVDDLDNALELAKKLRPGYRivtldgdlvrpggvitggsAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRK 705
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1537 LRLQAEEAERRLRQAEAERARQVqvaleTAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQqqaea 1616
Cdd:TIGR02168 706 ELEELEEELEQLRKELEELSRQI-----SALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEE----- 775
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1617 eraraeaerelerwQLKANEALRLRLQAeEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELaEQELEKQRQL 1696
Cdd:TIGR02168 776 --------------ELAEAEAEIEELEA-QIEQLKEELKALREALDELRAELTLLNEEAANLRERLESL-ERRIAATERR 839
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1697 AEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAE---EE 1773
Cdd:TIGR02168 840 LEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRrelEE 919
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1774 SRSTSEKSKQRLEAEAGRFRELAEE-AARLRALAEEAKRQRQLAEEDAVRQRAEAERvLAEKLAAISEATRLkteaeiAL 1852
Cdd:TIGR02168 920 LREKLAQLELRLEGLEVRIDNLQERlSEEYSLTLEEAEALENKIEDDEEEARRRLKR-LENKIKELGPVNLA------AI 992
|
810 820 830 840
....*....|....*....|....*....|....*....|....*
gi 1920237962 1853 KEKEAENERLRRLaedeafqrrlleeqaAQHKADIEARLAQLRKA 1897
Cdd:TIGR02168 993 EEYEELKERYDFL---------------TAQKEDLTEAKETLEEA 1022
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1994-2613 |
2.90e-20 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 99.63 E-value: 2.90e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1994 EEAARQRKAALEEVERLKAKVEEARR----LRERAEQ-ESARQLQlaqeAAQKRLQAEEKAHAFAVQQKEqelqqtlqqe 2068
Cdd:COG1196 175 EEAERKLEATEENLERLEDILGELERqlepLERQAEKaERYRELK----EELKELEAELLLLKLRELEAE---------- 240
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2069 qsvLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQsaeeqaqaqaqaqAAAEKLRKEAEQEAARRAQA 2148
Cdd:COG1196 241 ---LEELEAELEELEAELEELEAELAELEAELEELRLELEELELELE-------------EAQAEEYELLAELARLEQDI 304
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2149 EQAALRQKQAADAEMEKHKQfAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFS 2228
Cdd:COG1196 305 ARLEERRRELEERLEELEEE-LAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEE 383
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2229 LRVQMEELgklkARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKML 2308
Cdd:COG1196 384 LAEELLEA----LRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEE 459
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2309 KEKmQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAER---LRLRV 2385
Cdd:COG1196 460 ALL-ELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIgveAAYEA 538
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2386 AEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLL 2465
Cdd:COG1196 539 ALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVL 618
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2466 Q--LKSEEMQTVRQEQLLQETQALQQSFLS---EKDSLLQRERCIEQEKAKLEQlfqdEVAKAQALREEQQRQQQQMQQE 2540
Cdd:COG1196 619 GdtLLGRTLVAARLEAALRRAVTLAGRLREvtlEGEGGSAGGSLTGGSRRELLA----ALLEAEAELEELAERLAEEELE 694
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237962 2541 KQQLAASMEEARRRQHEAEEGVRRQQEELQRLAqqqqqqEKLLAEENQRLRERLQHLEEERRAALARSEEIAP 2613
Cdd:COG1196 695 LEEALLAEEEEERELAEAEEERLEEELEEEALE------EQLEAEREELLEELLEEEELLEEEALEELPEPPD 761
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1095-1864 |
7.82e-20 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 98.59 E-value: 7.82e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1095 QEAEEVLRAHEEQLKEAQAVpatlpeLEATKAALKKLRAQAEAQQPvFDALRDELRgaqevgerlqqrHGERDVEVERWR 1174
Cdd:TIGR02168 175 KETERKLERTRENLDRLEDI------LNELERQLKSLERQAEKAER-YKELKAELR------------ELELALLVLRLE 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1175 ErvtlLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVRE---QLRQEKALL 1251
Cdd:TIGR02168 236 E----LREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRleqQKQILRERL 311
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1252 EDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPV-----ASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLT 1326
Cdd:TIGR02168 312 ANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEElesleAELEELEAELEELESRLEELEEQLETLRSKVAQL 391
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1327 SQYIRFISETLRRMEEE-ERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAV 1405
Cdd:TIGR02168 392 ELQIASLNNEIERLEARlERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELE 471
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1406 EAQEQKRSIQEELQHLR------QSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAE----GELQALR 1475
Cdd:TIGR02168 472 EAEQALDAAERELAQLQarldslERLQENLEGFSEGVKALLKNQSGLSGILGVLSELISVDEGYEAAIEaalgGRLQAVV 551
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1476 ARAEEAEAQKRQAQEEAERLRR----------QVQDETQRKRQAEAELALRVQAEAEAAREK-QRALQALEELRLQAEEA 1544
Cdd:TIGR02168 552 VENLNAAKKAIAFLKQNELGRVtflpldsikgTEIQGNDREILKNIEGFLGVAKDLVKFDPKlRKALSYLLGGVLVVDDL 631
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1545 ERRLRQAEAERARQVQVALE---------TAQRSAEAEL-----QSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRA 1610
Cdd:TIGR02168 632 DNALELAKKLRPGYRIVTLDgdlvrpggvITGGSAKTNSsilerRREIEELEEKIEELEEKIAELEKALAELRKELEELE 711
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1611 QQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQEL 1690
Cdd:TIGR02168 712 EELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQI 791
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1691 EKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARA 1770
Cdd:TIGR02168 792 EQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEEL 871
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1771 EEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEE------DAVRQRAEAERVLAEKLAAISEATRL 1844
Cdd:TIGR02168 872 ESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEElreklaQLELRLEGLEVRIDNLQERLSEEYSL 951
|
810 820
....*....|....*....|.
gi 1920237962 1845 KTEAEIALKEK-EAENERLRR 1864
Cdd:TIGR02168 952 TLEEAEALENKiEDDEEEARR 972
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1341-2077 |
1.40e-19 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 97.97 E-value: 1.40e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1341 EEEERLAEQQRAEERERLAEVEAALEKQRQLAEahaQAKAQAEREAQGLQRRMQEEVARreeVAVEAQEQKRSIQEELQh 1420
Cdd:NF041483 560 EETERAIAARQAEAAEELTRLHTEAEERLTAAE---EALADARAEAERIRREAAEETER---LRTEAAERIRTLQAQAE- 632
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1421 lrqsSEAEiqaKARQVEAAERSRLRIEEEIRVVRLQLEA-TERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQV 1499
Cdd:NF041483 633 ----QEAE---RLRTEAAADASAARAEGENVAVRLRSEAaAEAERLKSEAQESADRVRAEAAAAAERVGTEAAEALAAAQ 705
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1500 QDETQRKRQAEAELAlrvQAEAEAAREKQRALQALEELrlqAEEAERRLRQAEAERARQVqvalETAQRSAeaelqSEHA 1579
Cdd:NF041483 706 EEAARRRREAEETLG---SARAEADQERERAREQSEEL---LASARKRVEEAQAEAQRLV----EEADRRA-----TELV 770
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1580 SFAEKTAQLERTlkeehvAVVQLREEATRraqqqaeaeraraeaerelerwqlkanEALRLRLQAEEVAQQksLTQAEAE 1659
Cdd:NF041483 771 SAAEQTAQQVRD------SVAGLQEQAEE---------------------------EIAGLRSAAEHAAER--TRTEAQE 815
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1660 KQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQrlaAEQELIRLRAETEQGEQQ--------------- 1724
Cdd:NF041483 816 EADRVRSDAYAERERASEDANRLRREAQEETEAAKALAERTVSE---AIAEAERLRSDASEYAQRvrteasdtlasaeqd 892
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1725 ----RQLLEEELARLQREAAA-----ATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfrel 1795
Cdd:NF041483 893 aartRADAREDANRIRSDAAAqadrlIGEATSEAERLTAEARAEAERLRDEARAEAERVRADAAAQAEQLIAEA------ 966
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1796 AEEAARLRALAEEAKRQrqlAEEDAVRQRAEAERVLAEklaAISEATRLKTEAEialkekeAENERLRRLAEDEAFQRR- 1874
Cdd:NF041483 967 TGEAERLRAEAAETVGS---AQQHAERIRTEAERVKAE---AAAEAERLRTEAR-------EEADRTLDEARKDANKRRs 1033
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1875 ---------LLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELG 1945
Cdd:NF041483 1034 eaaeqadtlITEAAAEADQLTAKAQEEALRTTTEAEAQADTMVGAARKEAERIVAEATVEGNSLVEKARTDADELLVGAR 1113
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1946 R----IRGTAEDtLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLK--------AK 2013
Cdd:NF041483 1114 RdataIRERAEE-LRDRITGEIEELHERARRESAEQMKSAGERCDALVKAAEEQLAEAEAKAKELVSDANseaskvriAA 1192
|
730 740 750 760 770 780 790
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2014 VEEARRLRERAEQESARQLQLAQ------EAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRS 2077
Cdd:NF041483 1193 VKKAEGLLKEAEQKKAELVREAEkikaeaEAEAKRTVEEGKRELDVLVRRREDINAEISRVQDVLEALES 1262
|
|
| CH_MICAL2_3-like |
cd21195 |
calponin homology (CH) domain found in molecule interacting with CasL protein 2 (MICAL-2), ... |
175-272 |
1.91e-19 |
|
calponin homology (CH) domain found in molecule interacting with CasL protein 2 (MICAL-2), MICAL-3, and similar proteins; Molecule interacting with CasL protein (MICAL) is a large, multidomain, cytosolic protein with a single LIM domain, a calponin homology (CH) domain and a flavoprotein monooxygenase (MO) domain. In Drosophila, MICAL is expressed in axons, interacts with the neuronal A (PlexA) receptor and is required for Semaphorin 1a (Sema-1a)-PlexA-mediated repulsive axon guidance. The LIM and CH domains mediate interactions with the cytoskeleton, cytoskeletal adaptor proteins, and other signaling proteins. The flavoprotein MO is required for semaphorin-plexin repulsive axon guidance during axonal pathfinding in the Drosophila neuromuscular system. In addition, MICAL functions to interact with Rab13 and Rab8 to coordinate the assembly of tight junctions and adherens junctions in epithelial cells. Thus, MICAL is also called junctional Rab13-binding protein (JRAB). Members of this family, which includes MICAL-2, MICAL-3, and similar proteins, contain one CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409044 [Multi-domain] Cd Length: 110 Bit Score: 86.63 E-value: 1.91e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 175 KLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD-PEDV 253
Cdd:cd21195 8 KLLTWCQQQTEGYQHVNVTDLTTSWRSGLALCAIIHRFRPELINFDSLNEDDAVENNQLAFDVAEREFGIPPVTTgKEMA 87
|
90
....*....|....*....
gi 1920237962 254 DVPQPDEKSIITYVSSLYD 272
Cdd:cd21195 88 SAQEPDKLSMVMYLSKFYE 106
|
|
| CH_EHBP1L1 |
cd21255 |
calponin homology (CH) domain found in EH domain-binding protein 1-like protein 1 and similar ... |
171-270 |
1.40e-18 |
|
calponin homology (CH) domain found in EH domain-binding protein 1-like protein 1 and similar proteins; EHBP1L1 may act as Rab effector protein and play a role in vesicle trafficking. It coordinates Rab8 and Bin1 to regulate apical-directed transport in polarized epithelial cells. Members of this subfamily contain a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409104 Cd Length: 105 Bit Score: 83.68 E-value: 1.40e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 171 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLLDP 250
Cdd:cd21255 1 SSSQSLLEWCQEVTAGYRGVRVTNFTTSWRNGLAFCAILHHFHPDLVDYESLDPLDIKENNKKAFEAFAS-LGVPRLLEP 79
|
90 100
....*....|....*....|.
gi 1920237962 251 ED-VDVPQPDEKSIITYVSSL 270
Cdd:cd21255 80 ADmVLLPIPDKLIVMTYLCQL 100
|
|
| SH3_10 |
pfam17902 |
SH3 domain; This entry represents an SH3 domain. |
789-855 |
3.40e-18 |
|
SH3 domain; This entry represents an SH3 domain.
Pssm-ID: 407754 Cd Length: 65 Bit Score: 81.15 E-value: 3.40e-18
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237962 789 QLKPRSpaHPMRGRVPLLAVCDYKQVEVTVHKGDECQMVGPAQPFYWKVLGSSCSEAAMPSVCFLVP 855
Cdd:pfam17902 1 PLKQRR--SPVTRPIPVKALCDYKQGEVTVEKGEECTLLDNSDREKWKVQTSSGVEKLVPSVCFLIP 65
|
|
| CH_SMTN-like |
cd21200 |
calponin homology (CH) domain found in the smoothelin family; The smoothelin family includes ... |
171-271 |
4.08e-18 |
|
calponin homology (CH) domain found in the smoothelin family; The smoothelin family includes smoothelin and smoothelin-like proteins. Smoothelins are actin-binding cytoskeletal proteins that are abundantly expressed in healthy visceral (smoothelin-A) and vascular (smoothelin-B) smooth muscle. SMTNL1, also called calponin homology-associated smooth muscle protein (CHASM), plays a role in the regulation of contractile properties of both striated and smooth muscles. It can bind to calmodulin and tropomyosin. When it is unphosphorylated, SMTNL1 may inhibit myosin dephosphorylation. SMTNL2 is highly expressed in skeletal muscle and could be associated with differentiating myocytes. Members of this family contain a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409049 Cd Length: 107 Bit Score: 82.78 E-value: 4.08e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 171 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 250
Cdd:cd21200 1 SIKQMLLEWCQAKTRGYEHVDITNFSSSWSDGMAFCALIHHFFPDAFDYSSLDPKNRRKNFELAFSTAEELADIAPLLEV 80
|
90 100
....*....|....*....|...
gi 1920237962 251 EDVDV--PQPDEKSIITYVSSLY 271
Cdd:cd21200 81 EDMVRmgNRPDWKCVFTYVQSLY 103
|
|
| CH_MICAL3 |
cd21251 |
calponin homology (CH) domain found in molecule interacting with CasL protein 3; MICAL-3 is a ... |
167-272 |
4.15e-18 |
|
calponin homology (CH) domain found in molecule interacting with CasL protein 3; MICAL-3 is a [F-actin]-monooxygenase that promotes depolymerization of F-actin by mediating oxidation of specific methionine residues on actin to form methionine-sulfoxide, resulting in actin filament disassembly and preventing repolymerization. In the absence of actin, it also functions as a NADPH oxidase producing H(2)O(2). MICAL-3 seems to act as a Rab effector protein and plays a role in vesicle trafficking. It is involved in exocytic vesicle tethering and fusion. MICAL3 contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409100 [Multi-domain] Cd Length: 111 Bit Score: 82.69 E-value: 4.15e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 167 SEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTR 246
Cdd:cd21251 1 NESVARSSKLLGWCQRQTEGYAGVNVTDLTMSWKSGLALCAIIHRYRPDLIDFDSLDEQDVEKNNQLAFDIAEKEFGISP 80
|
90 100
....*....|....*....|....*..
gi 1920237962 247 LLDPEDV-DVPQPDEKSIITYVSSLYD 272
Cdd:cd21251 81 IMTGKEMaSVGEPDKLSMVMYLTQFYE 107
|
|
| CH |
smart00033 |
Calponin homology domain; Actin binding domains present in duplicate at the N-termini of ... |
174-270 |
4.62e-18 |
|
Calponin homology domain; Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
Pssm-ID: 214479 [Multi-domain] Cd Length: 101 Bit Score: 82.36 E-value: 4.62e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 174 EKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTN----LENLDQAFSVAERDLGVTRLLD 249
Cdd:smart00033 1 KTLLRWVNSLLAEYDKPPVTNFSSDLKDGVALCALLNSLSPGLVDKKKVAASLSrfkkIENINLALSFAEKLGGKVVLFE 80
|
90 100
....*....|....*....|.
gi 1920237962 250 PEDVDVPQPDEKSIITYVSSL 270
Cdd:smart00033 81 PEDLVEGPKLILGVIWTLISL 101
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1335-1923 |
5.37e-18 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 92.29 E-value: 5.37e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1335 ETLRRMEEEERLAEQQR------AEERERLAEVEAALEKQRQLAEAHAQAKAQAERE-AQGLQRRMQEEVARREEVAVEA 1407
Cdd:COG4913 235 DDLERAHEALEDAREQIellepiRELAERYAAARERLAELEYLRAALRLWFAQRRLElLEAELEELRAELARLEAELERL 314
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1408 QEQKRSIQEELQHLRQ-----------SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATE-----------RQRG 1465
Cdd:COG4913 315 EARLDALREELDELEAqirgnggdrleQLEREIERLERELEERERRRARLEALLAALGLPLPASAeefaalraeaaALLE 394
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1466 GAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELrLQAEEAE 1545
Cdd:COG4913 395 ALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIPARLLALRDALAEALGLDEAELPFVGEL-IEVRPEE 473
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1546 RRLRQAeAERarqvqvALETAQRSaeaeLQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAER 1625
Cdd:COG4913 474 ERWRGA-IER------VLGGFALT----LLVPPEHYAAALRWVNRLHLRGRLVYERVRTGLPDPERPRLDPDSLAGKLDF 542
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1626 ELERWQLKANEALRLR---LQAEEVAQ----QKSLTQAEAEKQKEEAEREARRRGKAEE-----QAVRQRELAEQELEK- 1692
Cdd:COG4913 543 KPHPFRAWLEAELGRRfdyVCVDSPEElrrhPRAITRAGQVKGNGTRHEKDDRRRIRSRyvlgfDNRAKLAALEAELAEl 622
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1693 QRQLAEGTAQ-QRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEA---ELAKVRAEMEVLLASKA 1768
Cdd:COG4913 623 EEELAEAEERlEALEAELDALQERREALQRLAEYSWDEIDVASAEREIAELEAELERLDAssdDLAALEEQLEELEAELE 702
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1769 RAEEESRSTSEKsKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRlKTEA 1848
Cdd:COG4913 703 ELEEELDELKGE-IGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRENLEERID-ALRA 780
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1849 EIALKEKEAENERLRRLAEDEAFQ-------------RRLLEEQAAQHKADIEARLAQLRKasESELERQKGLVEDTLRQ 1915
Cdd:COG4913 781 RLNRAEEELERAMRAFNREWPAETadldadleslpeyLALLDRLEEDGLPEYEERFKELLN--ENSIEFVADLLSKLRRA 858
|
....*...
gi 1920237962 1916 RRQVEEEI 1923
Cdd:COG4913 859 IREIKERI 866
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
1333-1904 |
2.26e-17 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 90.10 E-value: 2.26e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1333 ISETLRRMEEEERLAEQQR----------AEERERLAEVEAALEKQRQlaeahaqAKAQAEREAQGLQRRMQEEVARREE 1402
Cdd:PRK02224 218 LDEEIERYEEQREQARETRdeadevleehEERREELETLEAEIEDLRE-------TIAETEREREELAEEVRDLRERLEE 290
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1403 vaveaqeqkrsIQEELQHLRQSSE---AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAE 1479
Cdd:PRK02224 291 -----------LEEERDDLLAEAGlddADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAE 359
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1480 EAEAQKRQAQEEAERLRRQVqdETQRKRQAEAELALRVQAEAEAAREKQRalQALEELRLQAEEAERRLRQAEAErarqv 1559
Cdd:PRK02224 360 ELREEAAELESELEEAREAV--EDRREEIEELEEEIEELRERFGDAPVDL--GNAEDFLEELREERDELREREAE----- 430
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1560 qvaLETAQRSAEAELQSEHASFAE-KTAQLERTLKE-EHVAVVQLREEatrraqqqaeaeraraeaerelerwQLKANEA 1637
Cdd:PRK02224 431 ---LEATLRTARERVEEAEALLEAgKCPECGQPVEGsPHVETIEEDRE-------------------------RVEELEA 482
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1638 LRLRLQAEEVAQQKSLTQAEAEKqkeeaerearrrgKAEEQAVR---QRELAEQELEKQRQLAEGTAQQRLAAEQELIRL 1714
Cdd:PRK02224 483 ELEDLEEEVEEVEERLERAEDLV-------------EAEDRIERleeRREDLEELIAERRETIEEKRERAEELRERAAEL 549
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1715 RAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAeMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRE 1794
Cdd:PRK02224 550 EAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLER-IRTLLAAIADAEDEIERLREKREALAELNDERRER 628
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1795 LAEEAARLRALAEEAKRQRqLAEEDAVRQRAEA--ERVlAEKLAAISEAtRLKTEAEIALKEKEAEN-ERLR-RLAEDEA 1870
Cdd:PRK02224 629 LAEKRERKRELEAEFDEAR-IEEAREDKERAEEylEQV-EEKLDELREE-RDDLQAEIGAVENELEElEELReRREALEN 705
|
570 580 590
....*....|....*....|....*....|....*.
gi 1920237962 1871 FQRRL--LEEQAAQHKADIEARLAQLRKASESELER 1904
Cdd:PRK02224 706 RVEALeaLYDEAEELESMYGDLRAELRQRNVETLER 741
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1393-2298 |
4.94e-17 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 89.26 E-value: 4.94e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1393 MQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERsrlriEEEIRVVRLQLEATERQRGGAEGELQ 1472
Cdd:pfam02463 158 IEEEAAGSRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQA-----KKALEYYQLKEKLELEEEYLLYLDYL 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1473 ALRARAEEAEAQKRQAQEEAERLRRQVQDetqrKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAE 1552
Cdd:pfam02463 233 KLNEERIDLLQELLRDEQEEIESSKQEIE----KEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERR 308
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1553 AERARQVQVALETAQRSAEAELQSEHASFAEKtaqleRTLKEEHVAVVQLREEatrraqqqaeaeraraeaerelerwql 1632
Cdd:pfam02463 309 KVDDEEKLKESEKEKKKAEKELKKEKEEIEEL-----EKELKELEIKREAEEE--------------------------- 356
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1633 kANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELI 1712
Cdd:pfam02463 357 -EEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEE 435
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1713 RLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEA----- 1787
Cdd:pfam02463 436 EESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKvllal 515
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1788 -EAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLA 1866
Cdd:pfam02463 516 iKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIA 595
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1867 EDEAFQRRLLeeqAAQHKADIEARLAQ---------LRKASESELERQKGLVEDTLRQRRQVEEEILALKG---SFEKAA 1934
Cdd:pfam02463 596 VLEIDPILNL---AQLDKATLEADEDDkrakvvegiLKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEvkaSLSELT 672
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1935 AGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKV 2014
Cdd:pfam02463 673 KELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEE 752
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2015 EEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAhafAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARE 2094
Cdd:pfam02463 753 EKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLK---VEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEK 829
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2095 RAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQaadAEMEKHKQFAEQAL 2174
Cdd:pfam02463 830 IKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKE---EKEKEEKKELEEES 906
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2175 RQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEvteaarQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRD 2254
Cdd:pfam02463 907 QKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLE------EADEKEKEENNKEEEEERNKRLLLAKEELGKVNLMAI 980
|
890 900 910 920
....*....|....*....|....*....|....*....|....
gi 1920237962 2255 KDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLA 2298
Cdd:pfam02463 981 EEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQRLKEFLE 1024
|
|
| CH_SF |
cd00014 |
calponin homology (CH) domain superfamily; CH domains are actin filament (F-actin) binding ... |
45-154 |
5.12e-17 |
|
calponin homology (CH) domain superfamily; CH domains are actin filament (F-actin) binding motifs, which may be present as a single copy or in tandem repeats (which increase binding affinity). They either function as autonomous actin binding motifs or serve a regulatory function. CH domains are found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, as well as proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
Pssm-ID: 409031 [Multi-domain] Cd Length: 103 Bit Score: 79.30 E-value: 5.12e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 45 KKTFTKWVNKHL-IKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRErdvirssrlpREKGRMRFHKLQNVQIALDYLR 123
Cdd:cd00014 1 EEELLKWINEVLgEELPVSITDLFESLRDGVLLCKLINKLSPGSIPKI----------NKKPKSPFKKRENINLFLNACK 70
|
90 100 110
....*....|....*....|....*....|...
gi 1920237962 124 HRQV-KLVNIRNDDI-ADGNPKLTLGLIWTIIL 154
Cdd:cd00014 71 KLGLpELDLFEPEDLyEKGNLKKVLGTLWALAL 103
|
|
| CH_NAV2-like |
cd21212 |
calponin homology (CH) domain found in neuron navigator (NAV) 2, NAV3, and similar proteins; ... |
44-156 |
5.44e-17 |
|
calponin homology (CH) domain found in neuron navigator (NAV) 2, NAV3, and similar proteins; This family includes neuron navigator 2 (NAV2) and NAV3, both of which contain a single copy of the CH domain at the N-terminus. CH domains are actin filament (F-actin) binding motifs. NAV2, also called helicase APC down-regulated 1 (HELAD1), pore membrane and/or filament-interacting-like protein 2 (POMFIL2), retinoic acid inducible in neuroblastoma 1 (RAINB1), Steerin-2 (STEERIN2), or Unc-53 homolog 2 (unc53H2), possesses 3' to 5' helicase activity and exonuclease activity. It is involved in neuronal development, specifically in the development of different sensory organs. NAV3, also called pore membrane and/or filament-interacting-like protein 1 (POMFIL1), Steerin-3 (STEERIN3), or Unc-53 homolog 3 (unc53H3), may regulate IL2 production by T-cells. It may be involved in neuron regeneration.
Pssm-ID: 409061 Cd Length: 105 Bit Score: 79.16 E-value: 5.44e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 44 QKKTFTKWVNKHLIKA--QRHISDLYEDLRDGHNLISLLEVLSGDSLPRerdvirssrlPREKGRMRFHKLQNVQIALDY 121
Cdd:cd21212 1 EIEIYTDWANHYLEKGghKRIITDLQKDLGDGLTLVNLIEAVAGEKVPG----------IHSRPKTRAQKLENIQACLQF 70
|
90 100 110
....*....|....*....|....*....|....*
gi 1920237962 122 LRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHF 156
Cdd:cd21212 71 LAALGVDVQGITAEDIVDGNLKAILGLFFSLSRYK 105
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1340-2211 |
5.65e-17 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 89.26 E-value: 5.65e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1340 MEEEERLAEQQRAEERERLAEVEaaLEKQRQLAEAHAQAKAQAEREAQGLqrRMQEEVARREEVAVEAQEQKRSIQEELQ 1419
Cdd:pfam02463 179 IEETENLAELIIDLEELKLQELK--LKEQAKKALEYYQLKEKLELEEEYL--LYLDYLKLNEERIDLLQELLRDEQEEIE 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1420 HLRQSSEAEIQakarqvEAAERSRLRIEEE--IRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRR 1497
Cdd:pfam02463 255 SSKQEIEKEEE------KLAQVLKENKEEEkeKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEK 328
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1498 QVQDETQRKRQAEAELAL--RVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERA--RQVQVALETAQRSAEAE 1573
Cdd:pfam02463 329 ELKKEKEEIEELEKELKEleIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAakLKEEELELKSEEEKEAQ 408
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1574 LQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSL 1653
Cdd:pfam02463 409 LLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLEL 488
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1654 TQAEAEKQKEEAEREARRRGKAEEQAVRQRE---LAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEE 1730
Cdd:pfam02463 489 LLSRQKLEERSQKESKARSGLKVLLALIKDGvggRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVR 568
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1731 ELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEaeagrfRELAEEAARLRALAEEAK 1810
Cdd:pfam02463 569 ALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDKRAKV------VEGILKDTELTKLKESAK 642
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1811 RQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEAR 1890
Cdd:pfam02463 643 AKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEEL 722
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1891 LAQLRKaseSELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQ 1970
Cdd:pfam02463 723 LADRVQ---EAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQ 799
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1971 laaeeerrrreaeervQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAH 2050
Cdd:pfam02463 800 ----------------EEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEI 863
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2051 AFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAA 2130
Cdd:pfam02463 864 TKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLE 943
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2131 AEKLRKEAEQEAARRAQAEQAALRQKQAadAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKA 2210
Cdd:pfam02463 944 EADEKEKEENNKEEEEERNKRLLLAKEE--LGKVNLMAIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQRLKE 1021
|
.
gi 1920237962 2211 E 2211
Cdd:pfam02463 1022 F 1022
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
962-1871 |
5.69e-17 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 88.97 E-value: 5.69e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 962 QQLLQSLEQGEQEESRCQRCISELKDiRLQLEACETRTVHRLRLPLDKEPARECAQRITEQQKAQAEVDGLGKGVARLSA 1041
Cdd:TIGR02169 173 EKALEELEEVEENIERLDLIIDEKRQ-QLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEE 251
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1042 EAEKVLALpepspaaptlRSELELTLGKLEQVRSLSAIYLEKL---------KTISLVIRSTQEAEEVLRAHEEQLKEAQ 1112
Cdd:TIGR02169 252 ELEKLTEE----------ISELEKRLEEIEQLLEELNKKIKDLgeeeqlrvkEKIGELEAEIASLERSIAEKERELEDAE 321
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1113 AVPATL-PELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRERvtlllerwqavlaqT 1191
Cdd:TIGR02169 322 ERLAKLeAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDE--------------L 387
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1192 DVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAvplansqavreqlrqekalleDIERHGEKVEECQRFAKQY 1271
Cdd:TIGR02169 388 KDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNA---------------------AIAGIEAKINELEEEKEDK 446
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1272 INAIKDYELQLVTYKAQLepvaspakkpkvqsgsESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQR 1351
Cdd:TIGR02169 447 ALEIKKQEWKLEQLAADL----------------SKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGR 510
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1352 AEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGlqrRMQEEVARREEVAVEAQEQKRS-------------IQEEL 1418
Cdd:TIGR02169 511 AVEEVLKASIQGVHGTVAQLGSVGERYATAIEVAAGN---RLNNVVVEDDAVAKEAIELLKRrkagratflplnkMRDER 587
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1419 QHLRQSSEA--------------EIQAKARQ-------VEAAERSRlRIEEEIRVVRLQLEATERQ---RGGA-EGELQA 1473
Cdd:TIGR02169 588 RDLSILSEDgvigfavdlvefdpKYEPAFKYvfgdtlvVEDIEAAR-RLMGKYRMVTLEGELFEKSgamTGGSrAPRGGI 666
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1474 LRARAEEAEAQkrQAQEEAERLRRQVQDETQRKRQAEAELalrvqaeAEAAREKQRALQALEELRLQAEEAERRlRQAEA 1553
Cdd:TIGR02169 667 LFSRSEPAELQ--RLRERLEGLKRELSSLQSELRRIENRL-------DELSQELSDASRKIGEIEKEIEQLEQE-EEKLK 736
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1554 ERARQVQVALETAQRSAEAElQSEHASFAEKTAQLERTLKEEHVAVVQLREEatrraqqqaeaeraraeaeRELERWQLK 1633
Cdd:TIGR02169 737 ERLEELEEDLSSLEQEIENV-KSELKELEARIEELEEDLHKLEEALNDLEAR-------------------LSHSRIPEI 796
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1634 ANEalrLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRlAAEQELIR 1713
Cdd:TIGR02169 797 QAE---LSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKE-ELEEELEE 872
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1714 LRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRStsekskqrLEAEAGRFR 1793
Cdd:TIGR02169 873 LEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSE--------IEDPKGEDE 944
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1794 ELAEEAARLRALAEEAKR-QRQLAEEDAVRQRA--EAERVLAEKLAAISEATRLKTEAEiALKEKEAENERLRRLAEDEA 1870
Cdd:TIGR02169 945 EIPEEELSLEDVQAELQRvEEEIRALEPVNMLAiqEYEEVLKRLDELKEKRAKLEEERK-AILERIEEYEKKKREVFMEA 1023
|
.
gi 1920237962 1871 F 1871
Cdd:TIGR02169 1024 F 1024
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1311-1964 |
8.98e-17 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 88.59 E-value: 8.98e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1311 EYVDLRTR-----YSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKqrqLAEAHAQAKAQAERE 1385
Cdd:TIGR02169 212 RYQALLKEkreyeGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQL---LEELNKKIKDLGEEE 288
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1386 AQGLQRRMQEEVARREEvAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAaersrlrIEEEIRVVRLQLEATERQRG 1465
Cdd:TIGR02169 289 QLRVKEKIGELEAEIAS-LERSIAEKERELEDAEERLAKLEAEIDKLLAEIEE-------LEREIEEERKRRDKLTEEYA 360
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1466 GAEGELQALRARAEEAEAQKR--------------QAQEEAERLRRQVQDETQRKRQAEAELAlRVQAEAEAAREKQRAL 1531
Cdd:TIGR02169 361 ELKEELEDLRAELEEVDKEFAetrdelkdyrekleKLKREINELKRELDRLQEELQRLSEELA-DLNAAIAGIEAKINEL 439
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1532 QA-LEELRLQAEEAERRLRQAEAER--ARQVQVALETAQRSAEAELQS--EHASFAEKTAQLERTLKEEHVAVVQLREEa 1606
Cdd:TIGR02169 440 EEeKEDKALEIKKQEWKLEQLAADLskYEQELYDLKEEYDRVEKELSKlqRELAEAEAQARASEERVRGGRAVEEVLKA- 518
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1607 trraqQQAEAERARAEAERELERWQLKANEALRLRLQA-----EEVAQQK---------------SLTQAEAEKQKEEAE 1666
Cdd:TIGR02169 519 -----SIQGVHGTVAQLGSVGERYATAIEVAAGNRLNNvvvedDAVAKEAiellkrrkagratflPLNKMRDERRDLSIL 593
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1667 REARRRGKA---------EEQAVR---QRELAEQELEKQRQL--------------------------AEGTAQQRLAAE 1708
Cdd:TIGR02169 594 SEDGVIGFAvdlvefdpkYEPAFKyvfGDTLVVEDIEAARRLmgkyrmvtlegelfeksgamtggsraPRGGILFSRSEP 673
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1709 QELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLL----ASKARAEEESRSTSEKSKQR 1784
Cdd:TIGR02169 674 AELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEqeeeKLKERLEELEEDLSSLEQEI 753
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1785 LEAEAgrfrELAEEAARLRALaEEAKRQRQLAEED--------AVRQRAEAERVLAEKLAAISEATRlktEAEIALKEKE 1856
Cdd:TIGR02169 754 ENVKS----ELKELEARIEEL-EEDLHKLEEALNDlearlshsRIPEIQAELSKLEEEVSRIEARLR---EIEQKLNRLT 825
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1857 AENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKAsESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAG 1936
Cdd:TIGR02169 826 LEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEEL-EEELEELEAALRDLESRLGDLKKERDELEAQLRELERK 904
|
730 740
....*....|....*....|....*...
gi 1920237962 1937 KAELELELGRIRGTAEDTLRSKEQAEQE 1964
Cdd:TIGR02169 905 IEELEAQIEKKRKRLSELKAKLEALEEE 932
|
|
| CH_EHBP1 |
cd21254 |
calponin homology (CH) domain found in EH domain-binding protein 1 and similar proteins; EHBP1 ... |
171-270 |
1.21e-16 |
|
calponin homology (CH) domain found in EH domain-binding protein 1 and similar proteins; EHBP1 is a regulator of endocytic recycling and may play a role in actin reorganization by linking clathrin-mediated endocytosis to the actin cytoskeleton. It may act as an effector of small GTPases, including RAB-10 (Rab10), and play a role in vesicle trafficking. EHBP1 is associated with aggressive prostate cancer and insulin-stimulated trafficking and cell migration. Members of this subfamily contain a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409103 Cd Length: 107 Bit Score: 78.36 E-value: 1.21e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 171 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLLDP 250
Cdd:cd21254 1 NASQSLLAWCKEVTKGYRGVKITNFTTSWRNGLAFCAILHHFRPDLIDYKSLNPHDIKENNKKAYDGFAS-LGISRLLEP 79
|
90 100
....*....|....*....|.
gi 1920237962 251 ED-VDVPQPDEKSIITYVSSL 270
Cdd:cd21254 80 SDmVLLAVPDKLTVMTYLYQI 100
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1120-2046 |
4.69e-16 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 86.18 E-value: 4.69e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1120 ELEATKAALKKLRAQAEAQQPVFDALRDELRGaqevgerlqqrhgerdveverwrervtlllerwqavlaqtdvRQRELE 1199
Cdd:pfam02463 167 LKRKKKEALKKLIEETENLAELIIDLEELKLQ------------------------------------------ELKLKE 204
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1200 QLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALL-EDIERHGEKVEECQRFAKQYINAIKDY 1278
Cdd:pfam02463 205 QAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEIESSkQEIEKEEEKLAQVLKENKEEEKEKKLQ 284
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1279 ELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRtryselstltsqyIRFISETLRRMEEEERLAEQQRAEERERL 1358
Cdd:pfam02463 285 EEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKE-------------KKKAEKELKKEKEEIEELEKELKELEIKR 351
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1359 AEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQsseaeiQAKARQVEA 1438
Cdd:pfam02463 352 EAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQ------LEDLLKEEK 425
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1439 AERSRLRIEEEIRVVRLQLEATERQrggaegelqaLRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQ 1518
Cdd:pfam02463 426 KEELEILEEEEESIELKQGKLTEEK----------EELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKL 495
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1519 AEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQ-LERTLKEEHV 1597
Cdd:pfam02463 496 EERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQkLVRALTELPL 575
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1598 AVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGK--A 1675
Cdd:pfam02463 576 GARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVslE 655
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1676 EEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAK 1755
Cdd:pfam02463 656 EGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKIN 735
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1756 VRAEMEVLLASKARAEEESRSTSEKSKQRLEAEagrfRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKL 1835
Cdd:pfam02463 736 EELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSE----LSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELK 811
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1836 AAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDT-LR 1914
Cdd:pfam02463 812 EEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELeSK 891
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1915 QRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAArQRQLAAEEERRRREAEERVQKSLAAEE 1994
Cdd:pfam02463 892 EEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLE-EADEKEKEENNKEEEEERNKRLLLAKE 970
|
890 900 910 920 930
....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 1995 EAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAE 2046
Cdd:pfam02463 971 ELGKVNLMAIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQRLKEF 1022
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1426-2352 |
4.73e-16 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 86.28 E-value: 4.73e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1426 EAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEgELQALRARAEEAEA-----QKRQAQEEAERLRRQVQ 1500
Cdd:TIGR02169 169 DRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAE-RYQALLKEKREYEGyellkEKEALERQKEAIERQLA 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1501 DETQRKRQAEAELALRVQAEAEAAR------EKQRALQALEELRLQAEEAE-----RRLRQAEAERARQVQVALETaqrs 1569
Cdd:TIGR02169 248 SLEEELEKLTEEISELEKRLEEIEQlleelnKKIKDLGEEEQLRVKEKIGEleaeiASLERSIAEKERELEDAEER---- 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1570 aEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQaeaeraraeaerelerwqlkanEALRLRLQAEEVAQ 1649
Cdd:TIGR02169 324 -LAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEEL----------------------EDLRAELEEVDKEF 380
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1650 QKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQR-ELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLL 1728
Cdd:TIGR02169 381 AETRDELKDYREKLEKLKREINELKRELDRLQEElQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQL 460
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1729 EEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEE---SRSTSEKSKQRLEAEAGRFRELAEeaarlral 1805
Cdd:TIGR02169 461 AADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERvrgGRAVEEVLKASIQGVHGTVAQLGS-------- 532
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1806 aeeAKRQRQLAEEDAVRQRAEAERVLAEKLAAiseatrlktEAEIALKEKEAENER---LRRLAEDEAFQRRLLEEQAAQ 1882
Cdd:TIGR02169 533 ---VGERYATAIEVAAGNRLNNVVVEDDAVAK---------EAIELLKRRKAGRATflpLNKMRDERRDLSILSEDGVIG 600
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1883 HKADIEARLAQLRKASESELeRQKGLVEDTLRQRRQVEE-EILALKGS-FEKAAA--GKAElelelgRIRGTAEDTLRSK 1958
Cdd:TIGR02169 601 FAVDLVEFDPKYEPAFKYVF-GDTLVVEDIEAARRLMGKyRMVTLEGElFEKSGAmtGGSR------APRGGILFSRSEP 673
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1959 EQAEQEAARqrqlaaeeerrrreaeervqkslaaEEEAARQRKAALEEVERLKAKVEEARRLRERAEQ-----ESARQLQ 2033
Cdd:TIGR02169 674 AELQRLRER-------------------------LEGLKRELSSLQSELRRIENRLDELSQELSDASRkigeiEKEIEQL 728
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2034 LAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRseaeaarraAEEAEAARERAEREAAQSRRQVEEAERL 2113
Cdd:TIGR02169 729 EQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELE---------EDLHKLEEALNDLEARLSHSRIPEIQAE 799
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2114 KQSAEeqaqaqaqaqaaaeKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEE 2193
Cdd:TIGR02169 800 LSKLE--------------EEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEE 865
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2194 tdhqksiLDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRlLQEEAEKMKQVA 2273
Cdd:TIGR02169 866 -------LEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAK-LEALEEELSEIE 937
|
890 900 910 920 930 940 950
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 2274 EEAARLSVAAQEAARLRQLAEEDLAQQRALaEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQ 2352
Cdd:TIGR02169 938 DPKGEDEEIPEEELSLEDVQAELQRVEEEI-RALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKK 1015
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1138-2048 |
8.10e-16 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 85.23 E-value: 8.10e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1138 QQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTLLLERWQA----------VLAQTDVRQRELEQLGRQLRY 1207
Cdd:pfam01576 3 QEEEMQAKEEELQKVKERQQKAESELKELEKKHQQLCEEKNALQEQLQAetelcaeaeeMRARLAARKQELEEILHELES 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1208 YRESADPLGAWLRDAKQR-QEQIQAVP--LANSQAVREQLRQEKALLE-DIERHGEKVEECQRFAKQYINAIKDYELQLV 1283
Cdd:pfam01576 83 RLEEEEERSQQLQNEKKKmQQHIQDLEeqLDEEEAARQKLQLEKVTTEaKIKKLEEDILLLEDQNSKLSKERKLLEERIS 162
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1284 TYKAQLEPVASPAKK-PKVQSGSESIIQEyVDLRTRYSElstltsqyirfisETLRRMEEEERLAEQQRAEERERLAEVE 1362
Cdd:pfam01576 163 EFTSNLAEEEEKAKSlSKLKNKHEAMISD-LEERLKKEE-------------KGRQELEKAKRKLEGESTDLQEQIAELQ 228
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1363 AalekqrQLAEAHAQAkAQAEREAQGLQRRMQEEVARREevavEAQEQKRSIQEELQHLRQSSEAEIQAKARqveaAERS 1442
Cdd:pfam01576 229 A------QIAELRAQL-AKKEEELQAALARLEEETAQKN----NALKKIRELEAQISELQEDLESERAARNK----AEKQ 293
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1443 RLRIEEEIRVVRLQLEATErqrgGAEGELQALRARAE-EAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELalrvqaeA 1521
Cdd:pfam01576 294 RRDLGEELEALKTELEDTL----DTTAAQQELRSKREqEVTELKKALEEETRSHEAQLQEMRQKHTQALEEL-------T 362
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1522 EAAREKQRALQALEELRlQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEktAQLERTLKEEHVAVVQ 1601
Cdd:pfam01576 363 EQLEQAKRNKANLEKAK-QALESENAELQAELRTLQQAKQDSEHKRKKLEGQLQELQARLSE--SERQRAELAEKLSKLQ 439
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1602 LREEATRRAQQQAEAERARAEAERELERWQLKANEALRlrlqAEEVAQQKSLTQAEAEKqkeeaerearrrgkaEEQAVR 1681
Cdd:pfam01576 440 SELESVSSLLNEAEGKNIKLSKDVSSLESQLQDTQELL----QEETRQKLNLSTRLRQL---------------EDERNS 500
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1682 QRELAEQELEKQRQLaegtAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAK------ 1755
Cdd:pfam01576 501 LQEQLEEEEEAKRNV----ERQLSTLQAQLSDMKKKLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKlektkn 576
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1756 -VRAEMEVLLAS-------------------KARAEEESRSTS-EKSKQRLEAEAgrfRELAEEAARLRALAEEAKRQRQ 1814
Cdd:pfam01576 577 rLQQELDDLLVDldhqrqlvsnlekkqkkfdQMLAEEKAISARyAEERDRAEAEA---REKETRALSLARALEEALEAKE 653
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1815 LAEEDAVRQRAEAERVLAEKLAA---ISEATRLKTEAEIALKEKEAENERLR-RLAEDEAFQRRL---LEEQAAQHKADI 1887
Cdd:pfam01576 654 ELERTNKQLRAEMEDLVSSKDDVgknVHELERSKRALEQQVEEMKTQLEELEdELQATEDAKLRLevnMQALKAQFERDL 733
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1888 EARlaqlrkaSESELERQKGLVedtlRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAAR 1967
Cdd:pfam01576 734 QAR-------DEQGEEKRRQLV----KQVRELEAELEDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAVKQLKK 802
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1968 QRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESAR-QLQLAQEAAQKRLQAE 2046
Cdd:pfam01576 803 LQAQMKDLQRELEEARASRDEILAQSKESEKKLKNLEAELLQLQEDLAASERARRQAQQERDElADEIASGASGKSALQD 882
|
..
gi 1920237962 2047 EK 2048
Cdd:pfam01576 883 EK 884
|
|
| CH_MICAL2 |
cd21250 |
calponin homology (CH) domain found in molecule interacting with CasL protein 2; MICAL-2 is a ... |
175-272 |
9.75e-16 |
|
calponin homology (CH) domain found in molecule interacting with CasL protein 2; MICAL-2 is a nuclear [F-actin]-monooxygenase that promotes depolymerization of F-actin by mediating oxidation of specific methionine residues on actin to form methionine-sulfoxide, resulting in actin filament disassembly and preventing repolymerization. In the absence of actin, it also functions as a NADPH oxidase producing H(2)O(2). MICAL-2 acts as a key regulator of the serum response factor (SRF) signaling pathway elicited by nerve growth factor and serum. It mediates oxidation and subsequent depolymerization of nuclear actin, leading to the increased MKL1/MRTF-A presence in the nucleus, promoting SRF:MKL1/MRTF-A-dependent gene transcription. MICAL-2 contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409099 [Multi-domain] Cd Length: 110 Bit Score: 76.07 E-value: 9.75e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 175 KLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD-PEDV 253
Cdd:cd21250 8 KLLTWCQKQTEGYQNVNVTDLTTSWKSGLALCAIIHRFRPELIDFDSLNEDDAVKNNQLAFDVAEREFGIPPVTTgKEMA 87
|
90
....*....|....*....
gi 1920237962 254 DVPQPDEKSIITYVSSLYD 272
Cdd:cd21250 88 SAEEPDKLSMVMYLSKFYE 106
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1353-1915 |
1.11e-15 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 84.97 E-value: 1.11e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1353 EERERLAEVEAALEKQRQLAEAHAQAKAQAEreaqglQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRqsseaeIQAK 1432
Cdd:COG4913 219 EEPDTFEAADALVEHFDDLERAHEALEDARE------QIELLEPIRELAERYAAARERLAELEYLRAALR------LWFA 286
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1433 ARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAE-AQKRQAQEEAERLRRQvQDETQRKRQAEA 1511
Cdd:COG4913 287 QRRLELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIRGNGgDRLEQLEREIERLERE-LEERERRRARLE 365
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1512 ELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQS-------EHASFAEK 1584
Cdd:COG4913 366 ALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASlerrksnIPARLLAL 445
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1585 TAQLERTLKEEHVAV------VQLREEAtrraqqqaeaeraraeaerelERWQLKANEAL---RLRL--------QAEEV 1647
Cdd:COG4913 446 RDALAEALGLDEAELpfvgelIEVRPEE---------------------ERWRGAIERVLggfALTLlvppehyaAALRW 504
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1648 AQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEK--QRQLAEGTAQQRLAAEQELIRL-RAETEQG--- 1721
Cdd:COG4913 505 VNRLHLRGRLVYERVRTGLPDPERPRLDPDSLAGKLDFKPHPFRAwlEAELGRRFDYVCVDSPEELRRHpRAITRAGqvk 584
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1722 ------------------------EQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRST 1777
Cdd:COG4913 585 gngtrhekddrrrirsryvlgfdnRAKLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVA 664
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1778 S-EKSKQRLEAEagrFRELAEEAARLRALAEEAKRQRQlAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKE 1856
Cdd:COG4913 665 SaEREIAELEAE---LERLDASSDDLAALEEQLEELEA-ELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAE 740
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 1857 AENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRkaseSELERQKGLVEDTLRQ 1915
Cdd:COG4913 741 DLARLELRALLEERFAAALGDAVERELRENLEERIDALR----ARLNRAEEELERAMRA 795
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
917-1605 |
1.16e-15 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 84.72 E-value: 1.16e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 917 EQRQALRSLELHYQA----FLRDSQDAGgfgPEDRLQAEREYGSCSRHYQQLLQSLEQGEQE----ESRCQRCISELKDI 988
Cdd:TIGR02168 217 ELKAELRELELALLVlrleELREELEEL---QEELKEAEEELEELTAELQELEEKLEELRLEvselEEEIEELQKELYAL 293
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 989 RLQLEACETRTVH---RLRlPLDKEPARECAQRITEQQKAQAEVDGLGKGVARLSAEAEKVLALPEPSPAAPTLRSELEL 1065
Cdd:TIGR02168 294 ANEISRLEQQKQIlreRLA-NLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELES 372
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1066 TLGKL-EQVRSLSAIYLEKLKTISLvIRSTQEaeeVLRAHEEQLKEAQAVPATlpelEATKAALKKLRAQAEAQQPVFDA 1144
Cdd:TIGR02168 373 RLEELeEQLETLRSKVAQLELQIAS-LNNEIE---RLEARLERLEDRRERLQQ----EIEELLKKLEEAELKELQAELEE 444
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1145 LRDELRGAQEVGERLQQRHGERDVEVERWRERVTLL---LERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRD 1221
Cdd:TIGR02168 445 LEEELEELQEELERLEEALEELREELEEAEQALDAAereLAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGV 524
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1222 AKQR---------------QEQIQAVPLANSQAVR---EQLRQEK----ALLEDIERHGEKVEECQRFAKQYINAIKDYE 1279
Cdd:TIGR02168 525 LSELisvdegyeaaieaalGGRLQAVVVENLNAAKkaiAFLKQNElgrvTFLPLDSIKGTEIQGNDREILKNIEGFLGVA 604
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1280 LQLVTYKAQLEPVASP---------------AKKPKVQSGSESIIQEYVDLRTRYS--------ELSTL-TSQYIRFISE 1335
Cdd:TIGR02168 605 KDLVKFDPKLRKALSYllggvlvvddldnalELAKKLRPGYRIVTLDGDLVRPGGVitggsaktNSSILeRRREIEELEE 684
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1336 TLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQ 1415
Cdd:TIGR02168 685 KIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIE 764
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1416 EELQHLRQSSEAEiqakarqvEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERL 1495
Cdd:TIGR02168 765 ELEERLEEAEEEL--------AEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAAT 836
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1496 RRQVQDETQRKRQAEAELAlrvqaeaeaarekqRALQALEELRLQAEEAERRLRQAEAERArQVQVALETAqRSAEAELQ 1575
Cdd:TIGR02168 837 ERRLEDLEEQIEELSEDIE--------------SLAAEIEELEELIEELESELEALLNERA-SLEEALALL-RSELEELS 900
|
730 740 750
....*....|....*....|....*....|..
gi 1920237962 1576 SEHASFAEKTAQLERTLKE--EHVAVVQLREE 1605
Cdd:TIGR02168 901 EELRELESKRSELRRELEElrEKLAQLELRLE 932
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
1310-1966 |
2.32e-15 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 83.73 E-value: 2.32e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1310 QEYVDLRTRYSELSTLTSQYIRFISETLRRMEE-EERLAE--QQRAEERERLAEVEAALEKQRQLAEAhAQAKAQAEREA 1386
Cdd:pfam12128 248 QEFNTLESAELRLSHLHFGYKSDETLIASRQEErQETSAElnQLLRTLDDQWKEKRDELNGELSAADA-AVAKDRSELEA 326
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1387 QGLQRRMQEEVarREEVAVEAQEQKRSIQEELQHLRQSSEAeIQAKARQVEAA-ERSRLRIEEEIRVV--------RLQL 1457
Cdd:pfam12128 327 LEDQHGAFLDA--DIETAAADQEQLPSWQSELENLEERLKA-LTGKHQDVTAKyNRRRSKIKEQNNRDiagikdklAKIR 403
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1458 EATERQRGGAEGELQALRAR-AEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEA-----EAAREKQ--- 1528
Cdd:pfam12128 404 EARDRQLAVAEDDLQALESElREQLEAGKLEFNEEEYRLKSRLGELKLRLNQATATPELLLQLENfderiERAREEQeaa 483
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1529 -----RALQALEELRLQAEEAERRLRQAEAeRARQVQVALETAQR-------------SAEAELQSEHASFAEKTAQLER 1590
Cdd:pfam12128 484 naeveRLQSELRQARKRRDQASEALRQASR-RLEERQSALDELELqlfpqagtllhflRKEAPDWEQSIGKVISPELLHR 562
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1591 TLKEEHVAVVQLREEATRRaqqqaeaeraraeaerelerwqlkaneALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREAR 1670
Cdd:pfam12128 563 TDLDPEVWDGSVGGELNLY---------------------------GVKLDLKRIDVPEWAASEEELRERLDKAEEALQS 615
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1671 RRGKAEEQAvRQRELAEQELEKQrQLAEGTAQQRLA-AEQELIRLraeTEQGEQQRQLLEEELARLQREaaaATQKRREL 1749
Cdd:pfam12128 616 AREKQAAAE-EQLVQANGELEKA-SREETFARTALKnARLDLRRL---FDEKQSEKDKKNKALAERKDS---ANERLNSL 687
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1750 EAELAKVraEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQraeaer 1829
Cdd:pfam12128 688 EAQLKQL--DKKHQAWLEEQKEQKREARTEKQAYWQVVEGALDAQLALLKAAIAARRSGAKAELKALETWYKRD------ 759
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1830 vLAEKlaAISEATRLKTEAEIalKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASeSELERQKG-L 1908
Cdd:pfam12128 760 -LASL--GVDPDVIAKLKREI--RTLERKIERIAVRRQEVLRYFDWYQETWLQRRPRLATQLSNIERAI-SELQQQLArL 833
|
650 660 670 680 690
....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237962 1909 VEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTlrSKEQAEQEAA 1966
Cdd:pfam12128 834 IADTKLRRAKLEMERKASEKQQVRLSENLRGLRCEMSKLATLKEDA--NSEQAQGSIG 889
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
2801-2839 |
2.41e-15 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 72.36 E-value: 2.41e-15
10 20 30
....*....|....*....|....*....|....*....
gi 1920237962 2801 LLEAQIATGGIIDPVHSHRLPVDVAYQRGYFDEEMNRVL 2839
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
4038-4076 |
5.46e-15 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 71.20 E-value: 5.46e-15
10 20 30
....*....|....*....|....*....|....*....
gi 1920237962 4038 LLEAQIATGGIIDPEESHRLPVDVAYQRGLFDEEMNEIL 4076
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3460-3498 |
5.46e-15 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 71.20 E-value: 5.46e-15
10 20 30
....*....|....*....|....*....|....*....
gi 1920237962 3460 LLEAQIATGGIIDPVHSHRVPVDVAYQRGYFDEEMNRVL 3498
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| CH_PLS_FIM_rpt3 |
cd21219 |
third calponin homology (CH) domain found in the plastin/fimbrin family; This family includes ... |
38-155 |
7.77e-15 |
|
third calponin homology (CH) domain found in the plastin/fimbrin family; This family includes plastin and fimbrin. Plastin has three isoforms, plastin-1, -2, and -3. Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. Plastin-2, also called L-plastin, or LC64P, or lymphocyte cytosolic protein 1 (LCP-1), is an actin-binding protein that plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-3, also called T-plastin, is an actin-bundling protein found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Fimbrin has been found in plants and fungi. Arabidopsis thaliana fimbrin (AtFIM) includes fimbrin-1, -2, -3, -4, and -5, which cross-link actin filaments (F-actin) in a calcium independent manner. They stabilize and prevent F-actin depolymerization mediated by profilin. They act as key regulators of actin cytoarchitecture, probably involved in cell cycle, cell division, cell elongation and cytoplasmic tractus. AtFIM5 is an actin bundling factor that is required for pollen germination and pollen tube growth. Fungal fimbrin binds to actin, and functionally associates with actin structures involved in the development and maintenance of cell polarity. Members of this family contain four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409068 Cd Length: 113 Bit Score: 73.47 E-value: 7.77e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 38 DERDrvqKKTFTKWVNKHLIKAQrhISDLYEDLRDGhnlISLLEVLsgDSLprERDVIRSSRLPREKGRMRFHKLQNVQI 117
Cdd:cd21219 2 GSRE---ERAFRMWLNSLGLDPL--INNLYEDLRDG---LVLLQVL--DKI--QPGCVNWKKVNKPKPLNKFKKVENCNY 69
|
90 100 110
....*....|....*....|....*....|....*...
gi 1920237962 118 ALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILH 155
Cdd:cd21219 70 AVDLAKKLGFSLVGIGGKDIADGNRKLTLALVWQLMRY 107
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1744-2611 |
8.04e-15 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 82.04 E-value: 8.04e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1744 QKRRELEAELAKVrAEMEvllASKARAEEESrstsEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEdavRQ 1823
Cdd:TIGR02169 153 VERRKIIDEIAGV-AEFD---RKKEKALEEL----EEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKE---KR 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1824 RAEAERVLAEKLAAisEATRLKTEAEIAlkEKEAENERLRRLAEDeafqrrlLEEQAAQHKADIEARLAQLRKASESELE 1903
Cdd:TIGR02169 222 EYEGYELLKEKEAL--ERQKEAIERQLA--SLEEELEKLTEEISE-------LEKRLEEIEQLLEELNKKIKDLGEEEQL 290
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1904 RQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDtlrSKEQAEQEAARQRQLAAEEERRRREAE 1983
Cdd:TIGR02169 291 RVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEE---LEREIEEERKRRDKLTEEYAELKEELE 367
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1984 ERVQKSLAAEEEAARQR---KAALEEVERLKAKVEE----ARRLRERAEQESARQLQLAQ-----EAAQKRLQAEEKAHA 2051
Cdd:TIGR02169 368 DLRAELEEVDKEFAETRdelKDYREKLEKLKREINElkreLDRLQEELQRLSEELADLNAaiagiEAKINELEEEKEDKA 447
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2052 FAVQQKEQELQQTLQQEQSVLERLRseaeaarraaeeaeaareraerEAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAA 2131
Cdd:TIGR02169 448 LEIKKQEWKLEQLAADLSKYEQELY----------------------DLKEEYDRVEKELSKLQRELAEAEAQARASEER 505
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2132 EKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTAlrlqleetdhQKSIldEELQRLKA- 2210
Cdd:TIGR02169 506 VRGGRAVEEVLKASIQGVHGTVAQLGSVGERYATAIEVAAGNRLNNVVVEDDAVA----------KEAI--ELLKRRKAg 573
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2211 EVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENR-----------ALVLRDKDSAQRLLQ---------EEAEKMK 2270
Cdd:TIGR02169 574 RATFLPLNKMRDERRDLSILSEDGVIGFAVDLVEFDPKyepafkyvfgdTLVVEDIEAARRLMGkyrmvtlegELFEKSG 653
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2271 QVA----EEAARLSVAAQEAARLRQLAEE---------DLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQE 2337
Cdd:TIGR02169 654 AMTggsrAPRGGILFSRSEPAELQRLRERleglkrelsSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEE 733
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2338 QARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARaeEDARRFRKQAEDIGERLYRTE 2417
Cdd:TIGR02169 734 KLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSH--SRIPEIQAELSKLEEEVSRIE 811
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2418 LATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQalqqsfLSEKDS 2497
Cdd:TIGR02169 812 ARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRD------LESRLG 885
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2498 LLQRERC-IEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEG---------VRRQQE 2567
Cdd:TIGR02169 886 DLKKERDeLEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEElsledvqaeLQRVEE 965
|
890 900 910 920
....*....|....*....|....*....|....*....|....*..
gi 1920237962 2568 ELQRLAQQQQQQEKLLAEENQR---LRERLQHLEEERRAALARSEEI 2611
Cdd:TIGR02169 966 EIRALEPVNMLAIQEYEEVLKRldeLKEKRAKLEEERKAILERIEEY 1012
|
|
| CH_SMTNB |
cd21259 |
calponin homology (CH) domain found in smoothelin-B and similar proteins; Smoothelins are ... |
173-284 |
8.77e-15 |
|
calponin homology (CH) domain found in smoothelin-B and similar proteins; Smoothelins are actin-binding cytoskeletal proteins that are abundantly expressed in healthy visceral (smoothelin-A) and vascular (smoothelin-B) smooth muscle. The human SMTN gene encodes smoothelin-A and smoothelin-B. This model corresponds to the single CH domain of smoothelin-B. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409108 Cd Length: 112 Bit Score: 73.49 E-value: 8.77e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 173 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 252
Cdd:cd21259 3 KQMLLDWCRAKTRGYENVDIQNFSSSWSDGMAFCALVHNFFPEAFDYSQLSPQNRRHNFEVAFSSAEKHADCPQLLDVED 82
|
90 100 110
....*....|....*....|....*....|...
gi 1920237962 253 -VDVPQPDEKSIITYVSSLYDAMprvpdVQDGV 284
Cdd:cd21259 83 mVRMREPDWKCVYTYIQEFYRCL-----VQKGL 110
|
|
| CH_CTX_rpt1 |
cd21225 |
first calponin homology (CH) domain found in cortexillin; Cortexillins are actin-bundling ... |
41-152 |
9.48e-15 |
|
first calponin homology (CH) domain found in cortexillin; Cortexillins are actin-bundling proteins that play a critical role in regulating cell morphology and actin cytoskeleton reorganization. They play a major role in cytokinesis and contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409074 Cd Length: 111 Bit Score: 73.33 E-value: 9.48e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 41 DRVQKKTFTKWVNKHLIKAQ-RHISDLYEDLRDGHNLISLLEVLSGDSLPRERDVIRSSRLPRekgrmrfhkLQNVQIAL 119
Cdd:cd21225 2 EKVQIKAFTAWVNSVLEKRGiPKISDLATDLSDGVRLIFFLELVSGKKFPKKFDLEPKNRIQM---------IQNLHLAM 72
|
90 100 110
....*....|....*....|....*....|....
gi 1920237962 120 DYLRHR-QVKLVNIRNDDIADGNPKLTLGLIWTI 152
Cdd:cd21225 73 LFIEEDlKIRVQGIGAEDFVDNNKKLILGLLWTL 106
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
859-1589 |
1.08e-14 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 81.64 E-value: 1.08e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 859 QEAQEAIARLEAQHQALVALWHQLHTEMKSLLAWQSLGRDMQLIRSWSLATFRTLKpEEQRQALRSLELHYQAFLRDSQD 938
Cdd:TIGR02168 291 YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKL-EELKEELESLEAELEELEAELEE 369
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 939 AggfgpEDRLQA-EREYGSCSRHYQQLLQSLEQGEQEESRCQRCISELKDIRLQLEACETRTVHRLRLPLDKEPARECAQ 1017
Cdd:TIGR02168 370 L-----ESRLEElEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEE 444
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1018 RITEQQKAQAEVDGLGKGVARLSAE-AEKVLALPEPSPAAPTLRSELELTLGKLEQVRSLS---AIYLEKLKTISLVIRS 1093
Cdd:TIGR02168 445 LEEELEELQEELERLEEALEELREElEEAEQALDAAERELAQLQARLDSLERLQENLEGFSegvKALLKNQSGLSGILGV 524
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1094 TQEAEEVlrahEEQLKeaQAVPATLPE---------LEATKAALKKLrAQAEAQQPVFDALrDELRGAQEVGERLQQRHG 1164
Cdd:TIGR02168 525 LSELISV----DEGYE--AAIEAALGGrlqavvvenLNAAKKAIAFL-KQNELGRVTFLPL-DSIKGTEIQGNDREILKN 596
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1165 ERDV-----EVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYR------ESADPLGAWLRDAKQRqeqiQAVP 1233
Cdd:TIGR02168 597 IEGFlgvakDLVKFDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYRivtldgDLVRPGGVITGGSAKT----NSSI 672
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1234 LANSQAVREqLRQEKALLEDIERHGEKveecqrfakqyinAIKDYELQLVTYKAQLEpvaspakkpKVQSGSESIIQEYV 1313
Cdd:TIGR02168 673 LERRREIEE-LEEKIEELEEKIAELEK-------------ALAELRKELEELEEELE---------QLRKELEELSRQIS 729
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1314 DLRTRYSELST---LTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQ 1390
Cdd:TIGR02168 730 ALRKDLARLEAeveQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELR 809
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1391 ---RRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSE---AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQR 1464
Cdd:TIGR02168 810 aelTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEelsEDIESLAAEIEELEELIEELESELEALLNERASLEEAL 889
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1465 GGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELA-LRVQAEAEAAREKQRALQALEELRLQAEE 1543
Cdd:TIGR02168 890 ALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDnLQERLSEEYSLTLEEAEALENKIEDDEEE 969
|
730 740 750 760
....*....|....*....|....*....|....*....|....*.
gi 1920237962 1544 AERRLRQAEAERARQVQVALEtaqrsAEAELQSEHASFAEKTAQLE 1589
Cdd:TIGR02168 970 ARRRLKRLENKIKELGPVNLA-----AIEEYEELKERYDFLTAQKE 1010
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1240-1904 |
1.16e-14 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 81.50 E-value: 1.16e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1240 VREQLRQEKALLEDIERhgekveecqrfAKQYINAIKDYELQLVTYKAQ---LEPVaspakkpkvqsgsESIIQEYVDLR 1316
Cdd:COG4913 213 VREYMLEEPDTFEAADA-----------LVEHFDDLERAHEALEDAREQielLEPI-------------RELAERYAAAR 268
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1317 TRYSELSTLTSQY-IRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQ----LAEAHAQAKAQAEREAQGLQR 1391
Cdd:COG4913 269 ERLAELEYLRAALrLWFAQRRLELLEAELEELRAELARLEAELERLEARLDALREeldeLEAQIRGNGGDRLEQLEREIE 348
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1392 RMQEEVARREEVAVEAQEQKRSI--------------QEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQL 1457
Cdd:COG4913 349 RLERELEERERRRARLEALLAALglplpasaeefaalRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEI 428
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1458 EATERQRGGAEGELQALRARAEEAeaqkrqAQEEAERLR-----RQVQDETQRKRQAeAELALRVQA-----EAEAAREK 1527
Cdd:COG4913 429 ASLERRKSNIPARLLALRDALAEA------LGLDEAELPfvgelIEVRPEEERWRGA-IERVLGGFAltllvPPEHYAAA 501
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1528 QRALQALE-ELRLQAEEAERRLRQAEAER------ARQVQVALETAQRSAEAELQSehaSFAEKTAQLERTLKEEHVAV- 1599
Cdd:COG4913 502 LRWVNRLHlRGRLVYERVRTGLPDPERPRldpdslAGKLDFKPHPFRAWLEAELGR---RFDYVCVDSPEELRRHPRAIt 578
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1600 --VQLREEATrraqqqaeaERARAEAERELERWQL-KANEALRLRLQAEEVAQQKSLTQAEAEKQKEeaerearrrgKAE 1676
Cdd:COG4913 579 raGQVKGNGT---------RHEKDDRRRIRSRYVLgFDNRAKLAALEAELAELEEELAEAEERLEAL----------EAE 639
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1677 EQAVRQRELAEQELEKQRQL---AEGTAQQRLAAEQELIRLRAETEQGEQqrqlLEEELARLQREAAAATQKRRELEAEL 1753
Cdd:COG4913 640 LDALQERREALQRLAEYSWDeidVASAEREIAELEAELERLDASSDDLAA----LEEQLEELEAELEELEEELDELKGEI 715
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1754 AKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAE 1833
Cdd:COG4913 716 GRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRENLEERIDALRARLNRAEEELERAMRA 795
|
650 660 670 680 690 700 710
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237962 1834 -KLAAISEATRLKTEAEiALKEKEAENERLR--RLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELER 1904
Cdd:COG4913 796 fNREWPAETADLDADLE-SLPEYLALLDRLEedGLPEYEERFKELLNENSIEFVADLLSKLRRAIREIKERIDP 868
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
1095-1595 |
1.20e-14 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 81.24 E-value: 1.20e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1095 QEAEEVLRAHEEQLKEAQAVPATLPELEATKAALKKLRaqaeaqqpvfDALRDELRGAQEVGERLQQRHGERDVEVERWR 1174
Cdd:PRK02224 237 DEADEVLEEHEERREELETLEAEIEDLRETIAETERER----------EELAEEVRDLRERLEELEEERDDLLAEAGLDD 306
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1175 ERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQ---AVPLANSQAVREQLRQEKALL 1251
Cdd:PRK02224 307 ADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELReeaAELESELEEAREAVEDRREEI 386
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1252 EDIERHGEKVEEcqRFAKqyinaikdyelqlvtykaqlepvaSPAKKPKVQSGSESIIQEYVDLRTRYSELSTLtsqyir 1331
Cdd:PRK02224 387 EELEEEIEELRE--RFGD------------------------APVDLGNAEDFLEELREERDELREREAELEAT------ 434
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1332 fISETLRRMEEEERLAEQQR-----------------AEERERLAEVEAALE----KQRQLAEAHAQAK--AQAEREAQG 1388
Cdd:PRK02224 435 -LRTARERVEEAEALLEAGKcpecgqpvegsphvetiEEDRERVEELEAELEdleeEVEEVEERLERAEdlVEAEDRIER 513
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1389 LQRR---MQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRvvRLQLEATERQRG 1465
Cdd:PRK02224 514 LEERredLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNS--KLAELKERIESL 591
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1466 GAEGELQALRARAE---EAEAQKRQAQEEAERLRR-QVQDETQRKRQAEAELAlrvQAEAEAARE-KQRALQALeelrlq 1540
Cdd:PRK02224 592 ERIRTLLAAIADAEdeiERLREKREALAELNDERReRLAEKRERKRELEAEFD---EARIEEAREdKERAEEYL------ 662
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 1541 aEEAERRLRQAEAERAR-QVQVALETAQRSAEAELQSEHASFAEKTAQLErTLKEE 1595
Cdd:PRK02224 663 -EQVEEKLDELREERDDlQAEIGAVENELEELEELRERREALENRVEALE-ALYDE 716
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
1055-1808 |
1.64e-14 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 81.17 E-value: 1.64e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1055 AAPTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAHEEQLkeaqavPATLPELEATKAALKKLRAQ 1134
Cdd:TIGR00618 160 AKSKEKKELLMNLFPLDQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCM------PDTYHERKQVLEKELKHLRE 233
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1135 AEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEverwrervtllLERWQAVLAQTDVRQRELEQLGRQLRYYRESAdp 1214
Cdd:TIGR00618 234 ALQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRAR-----------IEELRAQEAVLEETQERINRARKAAPLAAHIK-- 300
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1215 lgAWLRDAKQRQEQIQAvpLANSQAVREQLRQEKALLEDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVAS 1294
Cdd:TIGR00618 301 --AVTQIEQQAQRIHTE--LQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHT 376
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1295 PAKKPKVQSGSESIIQEyvDLRTRYSELSTLTSQYIRFISETLRRMEEEERL--AEQQRAEERERLAEVEAALEKQRQ-- 1370
Cdd:TIGR00618 377 LTQHIHTLQQQKTTLTQ--KLQSLCKELDILQREQATIDTRTSAFRDLQGQLahAKKQQELQQRYAELCAAAITCTAQce 454
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1371 -LAEAHAQAKAQAERE-------AQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEA--EIQAKARQVEAAE 1440
Cdd:TIGR00618 455 kLEKIHLQESAQSLKEreqqlqtKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDidNPGPLTRRMQRGE 534
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1441 RSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRaRAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAE 1520
Cdd:TIGR00618 535 QTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQ-QSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACE 613
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1521 AEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVV 1600
Cdd:TIGR00618 614 QHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQLT 693
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1601 QLREEATRRAQQQAEAERARAEAERELERWQLkANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREArrrgkaEEQAV 1680
Cdd:TIGR00618 694 YWKEMLAQCQTLLRELETHIEEYDREFNEIEN-ASSSLGSDLAAREDALNQSLKELMHQARTVLKARTE------AHFNN 766
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1681 RQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQR-----QLLEEELARLQREAAAATQKRRELEAELAK 1755
Cdd:TIGR00618 767 NEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEipsdeDILNLQCETLVQEEEQFLSRLEEKSATLGE 846
|
730 740 750 760 770
....*....|....*....|....*....|....*....|....*....|...
gi 1920237962 1756 VRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEE 1808
Cdd:TIGR00618 847 ITHQLLKYEECSKQLAQLTQEQAKIIQLSDKLNGINQIKIQFDGDALIKFLHE 899
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1335-1826 |
1.66e-14 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 80.58 E-value: 1.66e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1335 ETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHA--QAKAQAEREAQGLQRRMQEEVARREEVAvEAQEQKR 1412
Cdd:COG4717 88 EEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPlyQELEALEAELAELPERLEELEERLEELR-ELEEELE 166
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1413 SIQEELQHLRQSSEaeiqakarqvEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEA 1492
Cdd:COG4717 167 ELEAELAELQEELE----------ELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENEL 236
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1493 ER--LRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSA 1570
Cdd:COG4717 237 EAaaLEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGKEAEELQALPALEEL 316
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1571 EAElqsehasfaektaQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAqq 1650
Cdd:COG4717 317 EEE-------------ELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAEAGV-- 381
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1651 ksltqaeaekqkeeaerearrrgKAEEQAVRQRELAEQELEKQRQLAEgtAQQRLAAEQELIRLRAETEQGEQqrqlLEE 1730
Cdd:COG4717 382 -----------------------EDEEELRAALEQAEEYQELKEELEE--LEEQLEELLGELEELLEALDEEE----LEE 432
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1731 ELARLQREAAAATQKRRELEAELAKVRAEMEVLlaskaraeeESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAK 1810
Cdd:COG4717 433 ELEELEEELEELEEELEELREELAELEAELEQL---------EEDGELAELLQELEELKAELRELAEEWAALKLALELLE 503
|
490
....*....|....*....
gi 1920237962 1811 RQRQLAEED---AVRQRAE 1826
Cdd:COG4717 504 EAREEYREErlpPVLERAS 522
|
|
| CH_SMTNL2 |
cd21261 |
calponin homology (CH) domain found in smoothelin-like protein 2; Smoothelin-like protein 2 ... |
173-272 |
1.95e-14 |
|
calponin homology (CH) domain found in smoothelin-like protein 2; Smoothelin-like protein 2 (SMTNL2) is highly expressed in skeletal muscle and could be associated with differentiating myocytes. It contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409110 Cd Length: 107 Bit Score: 72.31 E-value: 1.95e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 173 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 252
Cdd:cd21261 3 KQILLEWCRSKTIGYKNIDLQNFSSSWSDGMAFCALVHSFFPEAFDYDSLSPSNRKHNFELAFSMAEKLANCDRLIEVED 82
|
90 100
....*....|....*....|..
gi 1920237962 253 VDV--PQPDEKSIITYVSSLYD 272
Cdd:cd21261 83 MMVmgRKPDPMCVFTYVQSLYN 104
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
945-1484 |
3.52e-14 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 79.98 E-value: 3.52e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 945 EDRLQAEREYGSCSRHYQQLLQSLEQGEQEESRCQRCISELKDIRLQLEACETRTVHRLrlpldkeparecAQRITEQQK 1024
Cdd:COG1196 274 LELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEEL------------EELEEELEE 341
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1025 AQAEVDGLGKGVARLSAEAEKVLAlpepspaapTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAH 1104
Cdd:COG1196 342 LEEELEEAEEELEEAEAELAEAEE---------ALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEAL 412
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1105 EEQLKEAQAvpatlpELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTLLLERw 1184
Cdd:COG1196 413 LERLERLEE------ELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEE- 485
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1185 qavlaqtdvRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEkALLEDIERHGEKVEEC 1264
Cdd:COG1196 486 ---------LAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAA-LEAALAAALQNIVVED 555
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1265 QRFAKQYINAIKDYELQLVT-YKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEE 1343
Cdd:COG1196 556 DEVAAAAIEYLKAAKAGRATfLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAA 635
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1344 ERLAEQQRAEERERLAEVEAALEKQRqLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQ 1423
Cdd:COG1196 636 LRRAVTLAGRLREVTLEGEGGSAGGS-LTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEE 714
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237962 1424 SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQ 1484
Cdd:COG1196 715 ERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLERE 775
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
1329-2048 |
3.68e-14 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 80.01 E-value: 3.68e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1329 YIRFISETLRRMEEEERLAEQQRAEERERLAEVEAaLEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQ 1408
Cdd:TIGR00618 140 YKTFTRVVLLPQGEFAQFLKAKSKEKKELLMNLFP-LDQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYH 218
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1409 EQKRSIQEELQHLR--QSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQAL---RARAEEAEA 1483
Cdd:TIGR00618 219 ERKQVLEKELKHLReaLQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERInraRKAAPLAAH 298
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1484 QKRQAQEEAERLRRQVQ-DETQRKRQAEAELALRVQAEAEAAREKQRALQAL--EELRLQAEEAERRLRQAEAERARQVQ 1560
Cdd:TIGR00618 299 IKAVTQIEQQAQRIHTElQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLhsQEIHIRDAHEVATSIREISCQQHTLT 378
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1561 VALETAQRSAEAELQSEHASFAEKTaqlertlkeehvavvQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRL 1640
Cdd:TIGR00618 379 QHIHTLQQQKTTLTQKLQSLCKELD---------------ILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELC 443
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1641 RLQAEEVAQQKSLTQAEAEKQKEEAerearrrgKAEEQAVRQRELAEQELEKQRQLAEgtaqQRLAAEQELIRLRAETEQ 1720
Cdd:TIGR00618 444 AAAITCTAQCEKLEKIHLQESAQSL--------KEREQQLQTKEQIHLQETRKKAVVL----ARLLELQEEPCPLCGSCI 511
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1721 GEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLAS----KARAEEESRSTSEKSKQR---------LEA 1787
Cdd:TIGR00618 512 HPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQraslKEQMQEIQQSFSILTQCDnrskedipnLQN 591
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1788 EAGRFRELAEEAARLR-ALAEEAKRQ-RQLAEEDAVRQRAEAERVLAEKLAaiSEATRLKTEAEIALKEKEAENERLRRL 1865
Cdd:TIGR00618 592 ITVRLQDLTEKLSEAEdMLACEQHALlRKLQPEQDLQDVRLHLQQCSQELA--LKLTALHALQLTLTQERVREHALSIRV 669
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1866 AEDEAFQRRLLEEQAAQHKADieaRLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELG 1945
Cdd:TIGR00618 670 LPKELLASRQLALQKMQSEKE---QLTYWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLK 746
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1946 RIRGTAEDTLRSKEQAEQEAARQrqlaaeeerrRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAE 2025
Cdd:TIGR00618 747 ELMHQARTVLKARTEAHFNNNEE----------VTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDE 816
|
730 740
....*....|....*....|....
gi 1920237962 2026 QE-SARQLQLAQEAAQKRLQAEEK 2048
Cdd:TIGR00618 817 DIlNLQCETLVQEEEQFLSRLEEK 840
|
|
| CH_DIXDC1 |
cd21213 |
calponin homology (CH) domain found in Dixin and similar proteins; Dixin, also called ... |
44-156 |
4.32e-14 |
|
calponin homology (CH) domain found in Dixin and similar proteins; Dixin, also called coiled-coil protein DIX1, coiled-coil-DIX1, or DIX domain-containing protein 1, is a positive effector of the Wnt signaling pathway. It activates WNT3A signaling via DVL2 and regulates JNK activation by AXIN1 and DVL2. Members of this family contain a single copy of the CH domain at the N-terminus. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409062 Cd Length: 107 Bit Score: 71.17 E-value: 4.32e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 44 QKKTFTKWVNKHLIK--AQRHISDLYEDLRDGHNLISLLEVLSGDSLPrerDVIRSsrlPREKGRMRfhklQNVQIALDY 121
Cdd:cd21213 1 QLQAYVAWVNSQLKKrpGIRPVQDLRRDLRDGVALAQLIEILAGEKLP---GIDWN---PTTDAERK----ENVEKVLQF 70
|
90 100 110
....*....|....*....|....*....|....*
gi 1920237962 122 LRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHF 156
Cdd:cd21213 71 MASKRIRMHQTSAKDIVDGNLKAIMRLILALAAHF 105
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1147-1926 |
4.72e-14 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 79.73 E-value: 4.72e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1147 DELRGAQEVGERLQQRHGERDvEVERWRERVTLLLERwqaVLAQTDVRQRELEQLGRqlryYREsadplgawLRDAKQRQ 1226
Cdd:TIGR02169 160 DEIAGVAEFDRKKEKALEELE-EVEENIERLDLIIDE---KRQQLERLRREREKAER----YQA--------LLKEKREY 223
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1227 EQiqAVPLANSQAVREQLRQEKALLEDIERHGEKVEEcqrfakqyinAIKDYELQLVTYKAQLEPVASPAKKpkvqSGSE 1306
Cdd:TIGR02169 224 EG--YELLKEKEALERQKEAIERQLASLEEELEKLTE----------EISELEKRLEEIEQLLEELNKKIKD----LGEE 287
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1307 SIIQEYVDLRTRYSELSTLTSQyIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREA 1386
Cdd:TIGR02169 288 EQLRVKEKIGELEAEIASLERS-IAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEEL 366
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1387 QGLQRRMQEEVARREEVAVEAQEQKRSIqEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGG 1466
Cdd:TIGR02169 367 EDLRAELEEVDKEFAETRDELKDYREKL-EKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKED 445
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1467 AEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAlRVQAEAEAAREKQRALQALEEL---RLQA-- 1541
Cdd:TIGR02169 446 KALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELA-EAEAQARASEERVRGGRAVEEVlkaSIQGvh 524
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1542 ----------EEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFA-----EKTAQLERTLKEEHVA-------- 1598
Cdd:TIGR02169 525 gtvaqlgsvgERYATAIEVAAGNRLNNVVVEDDAVAKEAIELLKRRKAGRAtflplNKMRDERRDLSILSEDgvigfavd 604
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1599 --------------------VVQLREEATRRAQQQAEAERA--------------RAEAERELERWQLKAnEALRLRLQA 1644
Cdd:TIGR02169 605 lvefdpkyepafkyvfgdtlVVEDIEAARRLMGKYRMVTLEgelfeksgamtggsRAPRGGILFSRSEPA-ELQRLRERL 683
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1645 EEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAV---RQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQG 1721
Cdd:TIGR02169 684 EGLKRELSSLQSELRRIENRLDELSQELSDASRKIGeieKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKEL 763
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1722 EQQRQLLEEELARLQreAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTsEKSKQRLEAEAGRFRELAEEAAR 1801
Cdd:TIGR02169 764 EARIEELEEDLHKLE--EALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREI-EQKLNRLTLEKEYLEKEIQELQE 840
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1802 LRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAA 1881
Cdd:TIGR02169 841 QRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLS 920
|
810 820 830 840 850
....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1882 QHKADIEA---RLAQLRKASESELERQKGL--VEDTLRQRRQVEEEILAL 1926
Cdd:TIGR02169 921 ELKAKLEAleeELSEIEDPKGEDEEIPEEElsLEDVQAELQRVEEEIRAL 970
|
|
| CH_FLN_rpt2 |
cd21230 |
second calponin homology (CH) domain found in filamins; The filamin family includes filamin-A ... |
171-268 |
5.11e-14 |
|
second calponin homology (CH) domain found in filamins; The filamin family includes filamin-A (FLN-A), filamin-B (FLN-B) and filamin-C (FLN-C). Filamins function to anchor various transmembrane proteins to the actin cytoskeleton. FLN-A is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-B is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton and may also promote orthogonal branching of actin filaments as well as link actin filaments to membrane glycoproteins. FLN-C, also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. Members of this family contain two copies of the CH domain. The model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409079 Cd Length: 103 Bit Score: 70.87 E-value: 5.11e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 171 TAKEKLLLWSQRMVEGcqgLRCDNFTTSWRDGRLFNAIIHRHKPTLI----DMNKVYRqtnLENLDQAFSVAERDLGVTR 246
Cdd:cd21230 1 TPKQRLLGWIQNKIPQ---LPITNFTTDWNDGRALGALVDSCAPGLCpdweTWDPNDA---LENATEAMQLAEDWLGVPQ 74
|
90 100
....*....|....*....|..
gi 1920237962 247 LLDPEDVDVPQPDEKSIITYVS 268
Cdd:cd21230 75 LITPEEIINPNVDEMSVMTYLS 96
|
|
| CH_SMTNA |
cd21258 |
calponin homology (CH) domain found in smoothelin-A and similar proteins; Smoothelins are ... |
173-276 |
5.19e-14 |
|
calponin homology (CH) domain found in smoothelin-A and similar proteins; Smoothelins are actin-binding cytoskeletal proteins that are abundantly expressed in healthy visceral (smoothelin-A) and vascular (smoothelin-B) smooth muscle. This model corresponds to the single CH domain of smoothelin-A. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409107 Cd Length: 111 Bit Score: 71.23 E-value: 5.19e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 173 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 252
Cdd:cd21258 3 KQMLLDWCRAKTRGYEHVDIQNFSSSWSDGMAFCALVHNFFPDAFDYSQLSPQNRRQNFEVAFSAAEMLADCVPLVEVED 82
|
90 100
....*....|....*....|....*.
gi 1920237962 253 VDV--PQPDEKSIITYVSSLYDAMPR 276
Cdd:cd21258 83 MMImgKKPDSKCVFTYVQSLYNHLRR 108
|
|
| CH_CYTS |
cd21199 |
calponin homology (CH) domain found in the cytospin family; The cytospin family includes ... |
176-271 |
6.06e-14 |
|
calponin homology (CH) domain found in the cytospin family; The cytospin family includes cytospin-A and cytospin-B. Cytospin-A, also called renal carcinoma antigen NY-REN-22, sperm antigen with calponin homology and coiled-coil domains 1-like, or SPECC1-like (SPECC1L) protein, is involved in cytokinesis and spindle organization. It may play a role in actin cytoskeleton organization and microtubule stabilization and hence, is required for proper cell adhesion and migration. Cytospin-B, also called nuclear structure protein 5 (NSP5), sperm antigen HCMOGT-1, or sperm antigen with calponin homology and coiled-coil domains 1 (SPECC1), is a novel fusion partner to PDGFRB in juvenile myelomonocytic leukemia with translocation t(5;17)(q33;p11.2). Members of this family contain a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409048 Cd Length: 112 Bit Score: 70.85 E-value: 6.06e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 176 LLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAErDLGVTRLLDPED-VD 254
Cdd:cd21199 13 LLKWCQEKTQGYKGIDITNFSSSWNDGLAFCALLHSYLPDKIPYSELNPQDKRRNFTLAFKAAE-SVGIPTTLTIDEmVS 91
|
90
....*....|....*..
gi 1920237962 255 VPQPDEKSIITYVSSLY 271
Cdd:cd21199 92 MERPDWQSVMSYVTAIY 108
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1322-2233 |
7.34e-14 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 79.06 E-value: 7.34e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1322 LSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEV----EAALEKQRQLAEAHAQAkAQAEREAQGLQRRMQEEV 1397
Cdd:pfam01576 178 LSKLKNKHEAMISDLEERLKKEEKGRQELEKAKRKLEGEStdlqEQIAELQAQIAELRAQL-AKKEEELQAALARLEEET 256
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1398 ARREEvaveAQEQKRSIQEELQHLRQSSEAEIQAKARqveaAERSRLRIEEEIRVVRLQLEATErqrgGAEGELQALRAR 1477
Cdd:pfam01576 257 AQKNN----ALKKIRELEAQISELQEDLESERAARNK----AEKQRRDLGEELEALKTELEDTL----DTTAAQQELRSK 324
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1478 AE-EAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAlrvqaeaEAAREKQRALQALEELRlQAEEAERRLRQAEAERA 1556
Cdd:pfam01576 325 REqEVTELKKALEEETRSHEAQLQEMRQKHTQALEELT-------EQLEQAKRNKANLEKAK-QALESENAELQAELRTL 396
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1557 RQVQVALETAQRSAEAELQSEHASFAEktAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANE 1636
Cdd:pfam01576 397 QQAKQDSEHKRKKLEGQLQELQARLSE--SERQRAELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLESQLQDTQ 474
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1637 ALRlrlqAEEVAQQKSLTQAEAEKqkeeaerearrrgkaEEQAVRQRELAEQELEKQRQLAegtaQQRLAAEQELIRLRA 1716
Cdd:pfam01576 475 ELL----QEETRQKLNLSTRLRQL---------------EDERNSLQEQLEEEEEAKRNVE----RQLSTLQAQLSDMKK 531
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1717 ETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAK-------VRAEMEVLLAS-------------------KARA 1770
Cdd:pfam01576 532 KLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKlektknrLQQELDDLLVDldhqrqlvsnlekkqkkfdQMLA 611
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1771 EEESRSTS-EKSKQRLEAEAgrfRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAA---ISEATRLKT 1846
Cdd:pfam01576 612 EEKAISARyAEERDRAEAEA---REKETRALSLARALEEALEAKEELERTNKQLRAEMEDLVSSKDDVgknVHELERSKR 688
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1847 EAEIALKEKEAENERLR-RLAEDEAFQRRL---LEEQAAQHKADIEARlaqlrkaSESELERQKGLVedtlRQRRQVEEE 1922
Cdd:pfam01576 689 ALEQQVEEMKTQLEELEdELQATEDAKLRLevnMQALKAQFERDLQAR-------DEQGEEKRRQLV----KQVRELEAE 757
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1923 ILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKA 2002
Cdd:pfam01576 758 LEDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAVKQLKKLQAQMKDLQRELEEARASRDEILAQSKESEKKLKN 837
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2003 ALEEVERLKAKVEEARRLRERAEQESAR-QLQLAQEAAQKRLQAEEKAHAFA-VQQKEQELQQTLQQEQSVLERLRSEAE 2080
Cdd:pfam01576 838 LEAELLQLQEDLAASERARRQAQQERDElADEIASGASGKSALQDEKRRLEArIAQLEEELEEEQSNTELLNDRLRKSTL 917
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2081 AARR---AAEEAEAARERAEREAAQSRRQVEEAeRLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQ 2157
Cdd:pfam01576 918 QVEQlttELAAERSTSQKSESARQQLERQNKEL-KAKLQEMEGTVKSKFKSSIAALEAKIAQLEEQLEQESRERQAANKL 996
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2158 AADAE---------MEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFS 2228
Cdd:pfam01576 997 VRRTEkklkevllqVEDERRHADQYKDQAEKGNSRMKQLKRQLEEAEEEASRANAARRKLQRELDDATESNESMNREVST 1076
|
....*
gi 1920237962 2229 LRVQM 2233
Cdd:pfam01576 1077 LKSKL 1081
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3719-3757 |
1.12e-13 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 67.74 E-value: 1.12e-13
10 20 30
....*....|....*....|....*....|....*....
gi 1920237962 3719 LLEAQAATGFLLDPVKGERLAVDEAVRKGLVGPELHDRL 3757
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1071-1574 |
1.28e-13 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 78.03 E-value: 1.28e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1071 EQVRSLSAI--YLEKLKTISLVIRSTQEAEEVLRAHEEQLKEAQAVPatlpELEATKAALKKLRAQAEAQQPVFDALRDE 1148
Cdd:COG4913 249 EQIELLEPIreLAERYAAARERLAELEYLRAALRLWFAQRRLELLEA----ELEELRAELARLEAELERLEARLDALREE 324
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1149 LRGAQEV-----GERLQQRhgERDVE-VERWRERVTLLLERWQAVLAQTDVR---------------QRELEQLGRQLRY 1207
Cdd:COG4913 325 LDELEAQirgngGDRLEQL--EREIErLERELEERERRRARLEALLAALGLPlpasaeefaalraeaAALLEALEEELEA 402
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1208 YRESADPLGAWLRDAKQRQEQIQA-----------VPlANSQAVREQLRQEKALLED---------------------IE 1255
Cdd:COG4913 403 LEEALAEAEAALRDLRRELRELEAeiaslerrksnIP-ARLLALRDALAEALGLDEAelpfvgelievrpeeerwrgaIE 481
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1256 R--HGEK----VEEcQRFAK--QYINAIKDyELQLVTYKAQLEPVASPAKKPKVQS------GSESIIQEYVD--LRTRY 1319
Cdd:COG4913 482 RvlGGFAltllVPP-EHYAAalRWVNRLHL-RGRLVYERVRTGLPDPERPRLDPDSlagkldFKPHPFRAWLEaeLGRRF 559
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1320 S--------ELST----LTSQYIRFISETLRRMEEEERL---------AEQQRAEERERLAEVEAALEKQRQLAEAHAQA 1378
Cdd:COG4913 560 DyvcvdspeELRRhpraITRAGQVKGNGTRHEKDDRRRIrsryvlgfdNRAKLAALEAELAELEEELAEAEERLEALEAE 639
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1379 KAQAEREAQGLQRRmqEEVARREEVAVEAQEQKRSIQEELQHLRQSSeAEIQAKARQVEAAERSRLRIEEEIRVVRLQLE 1458
Cdd:COG4913 640 LDALQERREALQRL--AEYSWDEIDVASAEREIAELEAELERLDASS-DDLAALEEQLEELEAELEELEEELDELKGEIG 716
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1459 ATERQRGGAEGELQALRARAEEAE------------------AQKRQAQEEAERLRRQVQDETQRKRQAEAEL------- 1513
Cdd:COG4913 717 RLEKELEQAEEELDELQDRLEAAEdlarlelralleerfaaaLGDAVERELRENLEERIDALRARLNRAEEELeramraf 796
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237962 1514 -------ALRVQAEAEAAREKQRALQALEELRL---QAEEAERRLRQAEAERArQVQVALETAQRSAEAEL 1574
Cdd:COG4913 797 nrewpaeTADLDADLESLPEYLALLDRLEEDGLpeyEERFKELLNENSIEFVA-DLLSKLRRAIREIKERI 866
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1631-2491 |
1.96e-13 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 77.42 E-value: 1.96e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1631 QLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLA---- 1706
Cdd:TIGR02169 231 EKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLersi 310
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1707 --AEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQR 1784
Cdd:TIGR02169 311 aeKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDY 390
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1785 LEAEAGRFRELAEEAARLRALAEEAKRQRQlaeedavrQRAEAERVLAEKLAAISEatrLKTEAEIALKEKEAENERLRR 1864
Cdd:TIGR02169 391 REKLEKLKREINELKRELDRLQEELQRLSE--------ELADLNAAIAGIEAKINE---LEEEKEDKALEIKKQEWKLEQ 459
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1865 LAED-EAFQRRLLEEQAAQhkADIEARLAQLRKaSESELERQKGLVEDTLRQRRQVEEeilalkgsfekaaagkaELELE 1943
Cdd:TIGR02169 460 LAADlSKYEQELYDLKEEY--DRVEKELSKLQR-ELAEAEAQARASEERVRGGRAVEE-----------------VLKAS 519
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1944 LGRIRGTAEDTLRSKEQ---AEQEAARQRqlaaeeerrrrEAEERVQKSLAAEE--EAARQRK---AALEEVERLKAKVE 2015
Cdd:TIGR02169 520 IQGVHGTVAQLGSVGERyatAIEVAAGNR-----------LNNVVVEDDAVAKEaiELLKRRKagrATFLPLNKMRDERR 588
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2016 EARRLRERAEQESARQLqlaQEAAQKRlqaeEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARER 2095
Cdd:TIGR02169 589 DLSILSEDGVIGFAVDL---VEFDPKY----EPAFKYVFGDTLVVEDIEAARRLMGKYRMVTLEGELFEKSGAMTGGSRA 661
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2096 AEREAAQSRRQVEEAERL---KQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQ 2172
Cdd:TIGR02169 662 PRGGILFSRSEPAELQRLrerLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEE 741
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2173 ALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQveEELFSLRVQMEELGKLKARIEAENRALvl 2252
Cdd:TIGR02169 742 LEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSH--SRIPEIQAELSKLEEEVSRIEARLREI-- 817
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2253 rDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLaEEDLAQQRALAEKmLKEKMQAVQEatrLKAEAELLQQQK 2332
Cdd:TIGR02169 818 -EQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEI-ENLNGKKEELEEE-LEELEAALRD---LESRLGDLKKER 891
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2333 ELAQEQARRLQEDKEQMAQQlaqetqgfqktLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRfRKQAEDIGER 2412
Cdd:TIGR02169 892 DELEAQLRELERKIEELEAQ-----------IEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEE-ELSLEDVQAE 959
|
810 820 830 840 850 860 870
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 2413 LYRTELAtqekvmlVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSF 2491
Cdd:TIGR02169 960 LQRVEEE-------IRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKKKREVFMEAFEAINENF 1031
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1062-1584 |
1.97e-13 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 77.65 E-value: 1.97e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1062 ELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEvLRAHEEQLKEAQAvpaTLPELEATKAALKKLRAQAEAQQpv 1141
Cdd:COG4913 266 AARERLAELEYLRAALRLWFAQRRLELLEAELEELRAE-LARLEAELERLEA---RLDALREELDELEAQIRGNGGDR-- 339
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1142 FDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTLLLERWQAVLAQTdvrQRELEQLGRQLRYYRESADPLGAWLRD 1221
Cdd:COG4913 340 LEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEA---AALLEALEEELEALEEALAEAEAALRD 416
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1222 AKQRQEQIQA-----------VPlANSQAVREQLRQEKALLED---------------------IER--HGEK----VEE 1263
Cdd:COG4913 417 LRRELRELEAeiaslerrksnIP-ARLLALRDALAEALGLDEAelpfvgelievrpeeerwrgaIERvlGGFAltllVPP 495
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1264 cQRFAK--QYINAIKDyELQLVTYKAQLEPVASPAKKPKVQSGSEsiiqeyvdlrtrysELSTLTSQYIRFISETLRRM- 1340
Cdd:COG4913 496 -EHYAAalRWVNRLHL-RGRLVYERVRTGLPDPERPRLDPDSLAG--------------KLDFKPHPFRAWLEAELGRRf 559
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1341 -----EEEERLAEQQRAEERERLA-EVEAALEK--QRQLAEAH------AQAKAQAEREAQGLQRRMQEEVARREEVAvE 1406
Cdd:COG4913 560 dyvcvDSPEELRRHPRAITRAGQVkGNGTRHEKddRRRIRSRYvlgfdnRAKLAALEAELAELEEELAEAEERLEALE-A 638
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1407 AQEQKRSIQEELQHLRQSSEAEIQakarqVEAAERSRLRIEEEIRvvrlQLEAterqrggAEGELQALRARAEEAEAQKR 1486
Cdd:COG4913 639 ELDALQERREALQRLAEYSWDEID-----VASAEREIAELEAELE----RLDA-------SSDDLAALEEQLEELEAELE 702
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1487 QAQEEAERLRRQVQDETQRKRQAEAEL-ALRVQAEAEAAREKQRALQALEELRlqAEEAERRLRQAEAERARQVQVALET 1565
Cdd:COG4913 703 ELEEELDELKGEIGRLEKELEQAEEELdELQDRLEAAEDLARLELRALLEERF--AAALGDAVERELRENLEERIDALRA 780
|
570
....*....|....*....
gi 1920237962 1566 AQRSAEAELQSEHASFAEK 1584
Cdd:COG4913 781 RLNRAEEELERAMRAFNRE 799
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
1712-2525 |
2.81e-13 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 76.93 E-value: 2.81e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1712 IRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAE--MEVLLASKARAEEESRSTSEKSKQRLEAea 1789
Cdd:TIGR00618 94 LRCTRSHRKTEQPEQLYLEQKKGRGRILAAKKSETEEVIHDLLKLDYKtfTRVVLLPQGEFAQFLKAKSKEKKELLMN-- 171
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1790 grfrelAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAErvlAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDE 1869
Cdd:TIGR00618 172 ------LFPLDQYTQLALMEFAKKKSLHGKAELLTLRSQ---LLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSH 242
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1870 AFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEeilalkgsfeKAAAGKAELELELGRIRG 1949
Cdd:TIGR00618 243 AYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRARKAAP----------LAAHIKAVTQIEQQAQRI 312
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1950 TAEdtLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESA 2029
Cdd:TIGR00618 313 HTE--LQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQQKTT 390
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2030 rQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEE 2109
Cdd:TIGR00618 391 -LTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLK 469
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2110 AERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRL 2189
Cdd:TIGR00618 470 EREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYH 549
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2190 QLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLkarIEAENRAlvlrdKDSAQRLLQEEAEKm 2269
Cdd:TIGR00618 550 QLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDL---TEKLSEA-----EDMLACEQHALLRK- 620
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2270 KQVAEEAARLSVAAQEAARLRQLAEEDLAQqraLAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQA--RRLQEDKE 2347
Cdd:TIGR00618 621 LQPEQDLQDVRLHLQQCSQELALKLTALHA---LQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSekEQLTYWKE 697
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2348 QMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEdigERLYRTELATQEKVMLV 2427
Cdd:TIGR00618 698 MLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELMHQARTVLK---ARTEAHFNNNEEVTAAL 774
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2428 QTLeTQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQ 2507
Cdd:TIGR00618 775 QTG-AELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLK 853
|
810
....*....|....*...
gi 1920237962 2508 EKAKLEQlfQDEVAKAQA 2525
Cdd:TIGR00618 854 YEECSKQ--LAQLTQEQA 869
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
2153-2612 |
3.41e-13 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 76.63 E-value: 3.41e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2153 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQ 2232
Cdd:TIGR02168 231 VLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRER 310
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2233 MEELGKLKARIEAEnRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKM 2312
Cdd:TIGR02168 311 LANLERQLEELEAQ-LEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVA 389
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2313 QAVQEATRLKAEAELLQQQKE-LAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRA 2391
Cdd:TIGR02168 390 QLELQIASLNNEIERLEARLErLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREE 469
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2392 QARAEEDARRFRKQAEDIGERLYRTE-----------------------------LATQEKV------------------ 2424
Cdd:TIGR02168 470 LEEAEQALDAAERELAQLQARLDSLErlqenlegfsegvkallknqsglsgilgvLSELISVdegyeaaieaalggrlqa 549
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2425 MLVQTLETQRQ--QSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEE-----------------------MQTVRQEQ 2479
Cdd:TIGR02168 550 VVVENLNAAKKaiAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEgflgvakdlvkfdpklrkalsylLGGVLVVD 629
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2480 LLQETQALQ------------------------QSFLSEKDSLLQRERCIEQEKAKLEQLfQDEVAKAQALREEQQRQQQ 2535
Cdd:TIGR02168 630 DLDNALELAkklrpgyrivtldgdlvrpggvitGGSAKTNSSILERRREIEELEEKIEEL-EEKIAELEKALAELRKELE 708
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237962 2536 QMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEEIA 2612
Cdd:TIGR02168 709 ELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIE 785
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
1352-2458 |
4.12e-13 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 76.53 E-value: 4.12e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1352 AEERERLaeVEAALEKQRQLAEAHAQAKAQAEReaqglqrrmQEEVARREEvavEAQEQKRSIQEELQ----HLRQSSEA 1427
Cdd:PRK04863 278 ANERRVH--LEEALELRRELYTSRRQLAAEQYR---------LVEMARELA---ELNEAESDLEQDYQaasdHLNLVQTA 343
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1428 EIQAKA--RQVEAAERSRLRIEEEIRVVRlqlEATERQrggaegelqalraraEEAEAQKRQAQEEAERLRRQVQDETQr 1505
Cdd:PRK04863 344 LRQQEKieRYQADLEELEERLEEQNEVVE---EADEQQ---------------EENEARAEAAEEEVDELKSQLADYQQ- 404
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1506 krqaeaelALRVQAEAeaAREKQRALQALEELR-------LQAEEAERRLRQAEAERARQVQVALETAQRSAEAElqsEH 1578
Cdd:PRK04863 405 --------ALDVQQTR--AIQYQQAVQALERAKqlcglpdLTADNAEDWLEEFQAKEQEATEELLSLEQKLSVAQ---AA 471
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1579 ASFAEKTAQLERTLKEEHVavvqlREEAtrraqqqaeaeraraeaerelerWQlKANEALRlrlqaeevaqqksltqaea 1658
Cdd:PRK04863 472 HSQFEQAYQLVRKIAGEVS-----RSEA-----------------------WD-VARELLR------------------- 503
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1659 ekqkeeaerearrrgkaeeQAVRQRELAEQELEKQRQLAEgtAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQRE 1738
Cdd:PRK04863 504 -------------------RLREQRHLAEQLQQLRMRLSE--LEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEE 562
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1739 AAAAtqkRRELEAELAKVRAEMEVLlaskaRAEEESrstSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEE 1818
Cdd:PRK04863 563 LEAR---LESLSESVSEARERRMAL-----RQQLEQ---LQARIQRLAARAPAWLAAQDALARLREQSGEEFEDSQDVTE 631
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1819 dAVRQRAEAERVLAEKLAAISEA-TRLKTEAEIALKEKEAENERLRRLAEDeaFQRRLLEEQ----AAQHKADIEARLAQ 1893
Cdd:PRK04863 632 -YMQQLLERERELTVERDELAARkQALDEEIERLSQPGGSEDPRLNALAER--FGGVLLSEIyddvSLEDAPYFSALYGP 708
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1894 LRKASeseLERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAG--KAElELELGRIRGTAEDTLR-SKEQAEQ---EAAR 1967
Cdd:PRK04863 709 ARHAI---VVPDLSDAAEQLAGLEDCPEDLYLIEGDPDSFDDSvfSVE-ELEKAVVVKIADRQWRySRFPEVPlfgRAAR 784
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1968 QRQLaaeeerrrreaeervqkslaaeEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLA---------QEA 2038
Cdd:PRK04863 785 EKRI----------------------EQLRAEREELAERYATLSFDVQKLQRLHQAFSRFIGSHLAVAfeadpeaelRQL 842
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2039 AQKRLQAEEKahafavqqkeqelqqtlqqeqsvLERLRSeaeaarraaeeaeaareraerEAAQSRRQVEEAERLKQSae 2118
Cdd:PRK04863 843 NRRRVELERA-----------------------LADHES---------------------QEQQQRSQLEQAKEGLSA-- 876
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2119 eqaqaqaqaqaaaekLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRlqleeTDhqk 2198
Cdd:PRK04863 877 ---------------LNRLLPRLNLLADETLADRVEEIREQLDEAEEAKRFVQQHGNALAQLEPIVSVLQ-----SD--- 933
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2199 silDEELQRLKAEVTEAARQRGQVEEELFSLrvqmEELGKLKARIEAENRALVLRDKDSAQRLLQ---EEAEKMKQVAEE 2275
Cdd:PRK04863 934 ---PEQFEQLKQDYQQAQQTQRDAKQQAFAL----TEVVQRRAHFSYEDAAEMLAKNSDLNEKLRqrlEQAEQERTRARE 1006
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2276 AARlsvaaQEAARLRQLAE--EDLAQQRALAEKMLKEKMQAVQEAT-RLKAEAE-LLQQQKELAQEQARRLQEDKEQMAQ 2351
Cdd:PRK04863 1007 QLR-----QAQAQLAQYNQvlASLKSSYDAKRQMLQELKQELQDLGvPADSGAEeRARARRDELHARLSANRSRRNQLEK 1081
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2352 QLaqetqgfqkTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEkvmlvqtLE 2431
Cdd:PRK04863 1082 QL---------TFCEAEMDNLTKKLRKLERDYHEMREQVVNAKAGWCAVLRLVKDNGVERRLHRRELAYLS-------AD 1145
|
1130 1140
....*....|....*....|....*..
gi 1920237962 2432 TQRQQSDRDAERLREAIAELEHEKDKL 2458
Cdd:PRK04863 1146 ELRSMSDKALGALRLAVADNEHLRDVL 1172
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1118-1577 |
5.92e-13 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 75.57 E-value: 5.92e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1118 LPELEATKAALKKLRAQAEAqqpvFDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTL--LLERWQAVLAQTDVRQ 1195
Cdd:COG4717 70 LKELKELEEELKEAEEKEEE----YAELQEELEELEEELEELEAELEELREELEKLEKLLQLlpLYQELEALEAELAELP 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1196 RELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQekaLLEDIERHGEKVEECQRFAKQYINAI 1275
Cdd:COG4717 146 ERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQD---LAEELEELQQRLAELEEELEEAQEEL 222
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1276 KDYELQLVTYKAQLEPVASPAK--KPKVQSGSESII-------QEYVDLRTRYSELSTLTSQYIRFISETLRRMEE--EE 1344
Cdd:COG4717 223 EELEEELEQLENELEAAALEERlkEARLLLLIAAALlallglgGSLLSLILTIAGVLFLVLGLLALLFLLLAREKAslGK 302
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1345 RLAEQQRAEERERL--AEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKrsIQEELQHLR 1422
Cdd:COG4717 303 EAEELQALPALEELeeEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQE--IAALLAEAG 380
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1423 QSSEAEIQAKARQVEAAErsrlRIEEEIRVVRLQLEA--TERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQ 1500
Cdd:COG4717 381 VEDEEELRAALEQAEEYQ----ELKEELEELEEQLEEllGELEELLEALDEEELEEELEELEEELEELEEELEELREELA 456
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 1501 DETQRKRQAEAElalrvqaeaeaarekqralQALEELRLQAEEAERRLRQAEAERARQ--VQVALETAQRSAEAELQSE 1577
Cdd:COG4717 457 ELEAELEQLEED-------------------GELAELLQELEELKAELRELAEEWAALklALELLEEAREEYREERLPP 516
|
|
| CH_PLS_rpt3 |
cd21298 |
third calponin homology (CH) domain found in the plastin family; The plastin family includes ... |
46-159 |
6.83e-13 |
|
third calponin homology (CH) domain found in the plastin family; The plastin family includes plastin-1, -2, and -3. Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. Plastin-2, also called L-plastin, or LC64P, or lymphocyte cytosolic protein 1 (LCP-1), is an actin-binding protein that plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-3, also called T-plastin, is an actin-bundling protein found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Members of this family contain four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409147 Cd Length: 117 Bit Score: 68.03 E-value: 6.83e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 46 KTFTKWVNKhlIKAQRHISDLYEDLRDGHNLISLLEVLsgdslprERDVIRSSRLPREKGRMR--FHKLQNVQIALDYLR 123
Cdd:cd21298 9 KTYRNWMNS--LGVNPFVNHLYSDLRDGLVLLQLYDKI-------KPGVVDWSRVNKPFKKLGanMKKIENCNYAVELGK 79
|
90 100 110
....*....|....*....|....*....|....*.
gi 1920237962 124 HRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQIS 159
Cdd:cd21298 80 KLKFSLVGIGGKDIYDGNRTLTLALVWQLMRAYTLS 115
|
|
| CH_jitterbug-like_rpt2 |
cd21229 |
second calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and ... |
173-268 |
6.83e-13 |
|
second calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and similar proteins; Protein jitterbug (Jbug) is an actin-meshwork organizing protein. It is required to maintain the shape and cell orientation of the Drosophila notum epithelium during flight muscle attachment to tendon cells. Jbug contains three copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409078 Cd Length: 105 Bit Score: 67.80 E-value: 6.83e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 173 KEKLLLWSQRMVEGCqglRCDNFTTSWRDGRLFNAIIHRHKPTLI-DMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPE 251
Cdd:cd21229 5 KKLMLAWLQAVLPEL---KITNFSTDWNDGIALSALLDYCKPGLCpNWRKLDPSNSLENCRRAMDLAKREFNIPMVLSPE 81
|
90
....*....|....*..
gi 1920237962 252 DVDVPQPDEKSIITYVS 268
Cdd:cd21229 82 DLSSPHLDELSGMTYLS 98
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3053-3091 |
7.92e-13 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 65.04 E-value: 7.92e-13
10 20 30
....*....|....*....|....*....|....*....
gi 1920237962 3053 LLEAQAGTGHIIDPTTSARLTVDEAVRAGLVGPELHEKL 3091
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| CH_SMTNL1 |
cd21260 |
calponin homology (CH) domain found in smoothelin-like protein 1; Smoothelin-like protein 1 ... |
173-274 |
9.27e-13 |
|
calponin homology (CH) domain found in smoothelin-like protein 1; Smoothelin-like protein 1 (SMTNL1), also called calponin homology-associated smooth muscle protein (CHASM), plays a role in the regulation of contractile properties of both striated and smooth muscles. It can bind to calmodulin and tropomyosin. When it is unphosphorylated, SMTNL1 may inhibit myosin dephosphorylation. SMTNL1 contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409109 Cd Length: 116 Bit Score: 67.80 E-value: 9.27e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 173 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 252
Cdd:cd21260 3 KNMLLEWCRAKTRGYEHVDIQNFSSSWSSGMAFCALIHKFFPDAFDYAELDPANRRHNFTLAFSTAEKHADCAPLLEVED 82
|
90 100
....*....|....*....|...
gi 1920237962 253 -VDVPQPDEKSIITYVSSLYDAM 274
Cdd:cd21260 83 mVRMSVPDSKCVYTYIQELYRSL 105
|
|
| CH_CYTSA |
cd21256 |
calponin homology (CH) domain found in cytospin-A; Cytospin-A, also called renal carcinoma ... |
171-271 |
1.20e-12 |
|
calponin homology (CH) domain found in cytospin-A; Cytospin-A, also called renal carcinoma antigen NY-REN-22, or sperm antigen with calponin homology and coiled-coil domains 1-like, or SPECC1-like protein (SPECC1L), is involved in cytokinesis and spindle organization. It may play a role in actin cytoskeleton organization and microtubule stabilization and hence, is required for proper cell adhesion and migration. Cytospin-A contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409105 Cd Length: 119 Bit Score: 67.41 E-value: 1.20e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 171 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAErDLGVTRLLDP 250
Cdd:cd21256 14 SKRNALLKWCQKKTEGYQNIDITNFSSSWNDGLAFCALLHTYLPAHIPYQELNSQDKRRNFTLAFQAAE-SVGIKSTLDI 92
|
90 100
....*....|....*....|..
gi 1920237962 251 ED-VDVPQPDEKSIITYVSSLY 271
Cdd:cd21256 93 NEmVRTERPDWQSVMTYVTAIY 114
|
|
| YqiK |
COG2268 |
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown]; |
1321-1538 |
1.21e-12 |
|
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
Pssm-ID: 441869 [Multi-domain] Cd Length: 439 Bit Score: 73.75 E-value: 1.21e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1321 ELSTLTSQYIR-----FISETLRRMEEEERLAEQQRAEeRERLAEVEAAlEKQRQLAEAHAQAKAQAER----EAQGLQR 1391
Cdd:COG2268 170 ELESVAITDLEdennyLDALGRRKIAEIIRDARIAEAE-AERETEIAIA-QANREAEEAELEQEREIETariaEAEAELA 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1392 RMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEirvvrlqLEATERQRGGAEGEL 1471
Cdd:COG2268 248 KKKAEERREAETARAEAEAAYEIAEANAEREVQRQLEIAEREREIELQEKEAEREEAE-------LEADVRKPAEAEKQA 320
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237962 1472 QALRARAeEAEAQKRQAQEEAERLRRQVqdETQRKRQAEAELALRVQAEAEAAREKQRALQALEELR 1538
Cdd:COG2268 321 AEAEAEA-EAEAIRAKGLAEAEGKRALA--EAWNKLGDAAILLMLIEKLPEIAEAAAKPLEKIDKIT 384
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
2154-2685 |
1.28e-12 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 75.18 E-value: 1.28e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2154 RQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRV-Q 2232
Cdd:PTZ00121 1110 KAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVrK 1189
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2233 MEELGKLKA--RIEAENRALVLRDKDSAQRllQEEAEKMKQV--AEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKML 2308
Cdd:PTZ00121 1190 AEELRKAEDarKAEAARKAEEERKAEEARK--AEDAKKAEAVkkAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFAR 1267
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2309 KEKMQAVQEATrlKAEaELLQQQKELAQEQARRLQEDKEqmAQQLAQETQGFQKTLETERQRQlEMSAEAERLRLRVAEM 2388
Cdd:PTZ00121 1268 RQAAIKAEEAR--KAD-ELKKAEEKKKADEAKKAEEKKK--ADEAKKKAEEAKKADEAKKKAE-EAKKKADAAKKKAEEA 1341
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2389 SRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLEtQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQlK 2468
Cdd:PTZ00121 1342 KKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK-KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKK-K 1419
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2469 SEEMQTvRQEQLLQETQALQQSFLSEKDSLLQRErciEQEKAKLEQLFQ--DEVAKAQALREEQQRqqqqmqqekqqlAA 2546
Cdd:PTZ00121 1420 ADEAKK-KAEEKKKADEAKKKAEEAKKADEAKKK---AEEAKKAEEAKKkaEEAKKADEAKKKAEE------------AK 1483
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2547 SMEEARRRQHEAeegvRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALAR-SEEIAPSRA-AAARALPN 2624
Cdd:PTZ00121 1484 KADEAKKKAEEA----KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKkAEEKKKADElKKAEELKK 1559
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237962 2625 GQDAADGPAAAAEPEHAFDGLRR-----KVPAQRLQEVGVL-------SAEELQQLAQGRTTVAELAQREDVR 2685
Cdd:PTZ00121 1560 AEEKKKAEEAKKAEEDKNMALRKaeeakKAEEARIEEVMKLyeeekkmKAEEAKKAEEAKIKAEELKKAEEEK 1632
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1096-1592 |
1.36e-12 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 74.57 E-value: 1.36e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1096 EAEEVLRAHEEQLKEAQAVPATLPELEATKAALKKLRAQAEAQQPVFDALRDELrgAQEVGERLQQRHGERDVEVERWRE 1175
Cdd:COG4913 239 RAHEALEDAREQIELLEPIRELAERYAAARERLAELEYLRAALRLWFAQRRLEL--LEAELEELRAELARLEAELERLEA 316
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1176 RVTLLLERWQAVLAQ-TDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLEDI 1254
Cdd:COG4913 317 RLDALREELDELEAQiRGNGGDRLEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEAL 396
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1255 ERHGEKVEECQRFAKQyinAIKDYELQLVTYKAQLEpvaspakkpKVQSGSESIIQEYVDLRTRYSELSTLTSQYIRFIS 1334
Cdd:COG4913 397 EEELEALEEALAEAEA---ALRDLRRELRELEAEIA---------SLERRKSNIPARLLALRDALAEALGLDEAELPFVG 464
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1335 ETLRRMEEEERLaeqQRAEER-------------ERLAEVEAALEKQR-------QLAEAHAQAKAQAEREAQGLQRRM- 1393
Cdd:COG4913 465 ELIEVRPEEERW---RGAIERvlggfaltllvppEHYAAALRWVNRLHlrgrlvyERVRTGLPDPERPRLDPDSLAGKLd 541
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1394 ----------QEEVARREEVA-VEAQEQ----KRSIQEELQ--------------HLRQ------SSEAEIQAKARQVEA 1438
Cdd:COG4913 542 fkphpfrawlEAELGRRFDYVcVDSPEElrrhPRAITRAGQvkgngtrhekddrrRIRSryvlgfDNRAKLAALEAELAE 621
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1439 AERSRLRIEEEIRVVRLQLEATERQRGGAEG---------ELQALRARAEEAEAQKRQAQE---EAERLRRQVQDETQRK 1506
Cdd:COG4913 622 LEEELAEAEERLEALEAELDALQERREALQRlaeyswdeiDVASAEREIAELEAELERLDAssdDLAALEEQLEELEAEL 701
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1507 RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQV--QVALETAQRSAEAELQSEHASFAEK 1584
Cdd:COG4913 702 EELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERfaAALGDAVERELRENLEERIDALRAR 781
|
....*...
gi 1920237962 1585 TAQLERTL 1592
Cdd:COG4913 782 LNRAEEEL 789
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
1338-1840 |
1.61e-12 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 74.60 E-value: 1.61e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1338 RRMEEEERLAEQQRAEERERLAE----VEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRS 1413
Cdd:COG3096 242 RMTLEAIRVTQSDRDLFKHLITEatnyVAADYMRHANERRELSERALELRRELFGARRQLAEEQYRLVEMARELEELSAR 321
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1414 iQEELQHLRQSSE---AEIQAKARQVEAAERSRLRIEEeirvVRLQLEATERQRGGAEGELqalraraEEAEAQKRQAQE 1490
Cdd:COG3096 322 -ESDLEQDYQAASdhlNLVQTALRQQEKIERYQEDLEE----LTERLEEQEEVVEEAAEQL-------AEAEARLEAAEE 389
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1491 EAERLRRQVQDETQrkrqaeaelALRVQAEAeaAREKQRALQALEELR-------LQAEEAERRLRQAEAERARQVQVAL 1563
Cdd:COG3096 390 EVDSLKSQLADYQQ---------ALDVQQTR--AIQYQQAVQALEKARalcglpdLTPENAEDYLAAFRAKEQQATEEVL 458
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1564 ETAQRSAEAElqsEHASFAEKTAQLERTLKEEHVavvqlREEAtrraqqqaeaeraraeaerelerWQlKANEALR---- 1639
Cdd:COG3096 459 ELEQKLSVAD---AARRQFEKAYELVCKIAGEVE-----RSQA-----------------------WQ-TARELLRryrs 506
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1640 LRLQAEEVAQQKSltqaeaekqkeeaerearRRGKAEEQAVRQRELAEQelekQRQLAEGTAQQRLAAEQelirLRAETE 1719
Cdd:COG3096 507 QQALAQRLQQLRA------------------QLAELEQRLRQQQNAERL----LEEFCQRIGQQLDAAEE----LEELLA 560
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1720 QGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAE----------EESRSTSEKSKQRLEAEA 1789
Cdd:COG3096 561 ELEAQLEELEEQAAEAVEQRSELRQQLEQLRARIKELAARAPAWLAAQDALErlreqsgealADSQEVTAAMQQLLERER 640
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|.
gi 1920237962 1790 GRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISE 1840
Cdd:COG3096 641 EATVERDELAARKQALESQIERLSQPGGAEDPRLLALAERLGGVLLSEIYD 691
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1339-2463 |
1.90e-12 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 74.06 E-value: 1.90e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1339 RMEEEERLAEQQRAEERERLAEVEAAL----EKQRQLAEAHAQAKAQAEREAqglqrrmqEEVARREEVAVEAQEQKRSI 1414
Cdd:pfam01576 2 RQEEEMQAKEEELQKVKERQQKAESELkeleKKHQQLCEEKNALQEQLQAET--------ELCAEAEEMRARLAARKQEL 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1415 QEELQHLrqssEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRggaegelQALRARAEEAEAQKRQAQEEAER 1494
Cdd:pfam01576 74 EEILHEL----ESRLEEEEERSQQLQNEKKKMQQHIQDLEEQLDEEEAAR-------QKLQLEKVTTEAKIKKLEEDILL 142
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1495 LRRQVQDETQRKRQAE---AELALRVQAEAEAA------REKQRALQALEELRLQAEEAERRlrqaEAERARQVQVALET 1565
Cdd:pfam01576 143 LEDQNSKLSKERKLLEeriSEFTSNLAEEEEKAkslsklKNKHEAMISDLEERLKKEEKGRQ----ELEKAKRKLEGEST 218
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1566 AQRSAEAELQsehASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAeaeraraeaerelerwQLKANEALRLRLQaE 1645
Cdd:pfam01576 219 DLQEQIAELQ---AQIAELRAQLAKKEEELQAALARLEEETAQKNNALK----------------KIRELEAQISELQ-E 278
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1646 EVAQQKSltqaeaekqkeeaerearrrgkAEEQAVRQRELAEQELEKQRQLAEGT-----AQQRLAA--EQELIRLR--- 1715
Cdd:pfam01576 279 DLESERA----------------------ARNKAEKQRRDLGEELEALKTELEDTldttaAQQELRSkrEQEVTELKkal 336
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1716 -AETEQGEQQRQ-----------LLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEEsRSTSEKSKQ 1783
Cdd:pfam01576 337 eEETRSHEAQLQemrqkhtqaleELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQAKQDSEHK-RKKLEGQLQ 415
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1784 RLEA---EAGRFR-ELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERV---LAEKLAAISEATRLKTEAEIALKEKE 1856
Cdd:pfam01576 416 ELQArlsESERQRaELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLesqLQDTQELLQEETRQKLNLSTRLRQLE 495
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1857 AENERLRRLAEDEAFQRRLLEEQAAQHkadiEARLAQLRKASESELERQKGLVEDtlrqRRQVEEEILALKGSFEKAAAG 1936
Cdd:pfam01576 496 DERNSLQEQLEEEEEAKRNVERQLSTL----QAQLSDMKKKLEEDAGTLEALEEG----KKRLQRELEALTQQLEEKAAA 567
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1937 KAELELELGRIRGTAEDTLRSKEQaeqeaarQRQLAAEEERRRREAEErvqksLAAEEEAARQRKAALEEVERLKAKVEE 2016
Cdd:pfam01576 568 YDKLEKTKNRLQQELDDLLVDLDH-------QRQLVSNLEKKQKKFDQ-----MLAEEKAISARYAEERDRAEAEAREKE 635
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2017 ARRLRERAEQESARQLQLAQEAAQKRLQAEekahafavqqkeqelqqtlqqeqsvLERLRSEAEAARRAAEEAEAARERA 2096
Cdd:pfam01576 636 TRALSLARALEEALEAKEELERTNKQLRAE-------------------------MEDLVSSKDDVGKNVHELERSKRAL 690
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2097 EREAAQSRRQVEEAERLKQSaeeqaqaqaqaqAAAEKLRKEAEQEAARRAQAeqaalRQKQAADAEMEKHKQfaeQALRQ 2176
Cdd:pfam01576 691 EQQVEEMKTQLEELEDELQA------------TEDAKLRLEVNMQALKAQFE-----RDLQARDEQGEEKRR---QLVKQ 750
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2177 KAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKD 2256
Cdd:pfam01576 751 VRELEAELEDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAVKQLKKLQAQMKDLQRELEEARASRDEILAQSKE 830
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2257 SAQRLLQEEAEKMkQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQ 2336
Cdd:pfam01576 831 SEKKLKNLEAELL-QLQEDLAASERARRQAQQERDELADEIASGASGKSALQDEKRRLEARIAQLEEELEEEQSNTELLN 909
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2337 EQARRLQEDKEQMAQQLAQETQGFQKtLETERQrqlEMSAEAERLRLRVAEMSRAQ---------------ARAEEDARR 2401
Cdd:pfam01576 910 DRLRKSTLQVEQLTTELAAERSTSQK-SESARQ---QLERQNKELKAKLQEMEGTVkskfkssiaaleakiAQLEEQLEQ 985
|
1130 1140 1150 1160 1170 1180
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 2402 FRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQ 2463
Cdd:pfam01576 986 ESRERQAANKLVRRTEKKLKEVLLQVEDERRHADQYKDQAEKGNSRMKQLKRQLEEAEEEAS 1047
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
1469-2047 |
2.57e-12 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 73.54 E-value: 2.57e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1469 GELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRqaEAELALRVQAEAEAAREKQRALQALEELRLQAEEA--ER 1546
Cdd:PRK02224 162 GKLEEYRERASDARLGVERVLSDQRGSLDQLKAQIEEKE--EKDLHERLNGLESELAELDEEIERYEEQREQARETrdEA 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1547 RLRQAEAERARQVQVALETA---QRSAEAELQSEHASFAEKTAQLERTLKEehvavvqLREEATRRAQQQAEAERARAEA 1623
Cdd:PRK02224 240 DEVLEEHEERREELETLEAEiedLRETIAETEREREELAEEVRDLRERLEE-------LEEERDDLLAEAGLDDADAEAV 312
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1624 ERELERWQlKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEaerearrrgKAEEQAVRQRELA---EQELEKQRQLAEGT 1700
Cdd:PRK02224 313 EARREELE-DRDEELRDRLEECRVAAQAHNEEAESLREDAD---------DLEERAEELREEAaelESELEEAREAVEDR 382
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1701 AQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVR---AEMEVLLA------------ 1765
Cdd:PRK02224 383 REEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARervEEAEALLEagkcpecgqpve 462
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1766 --SKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAE--EDAVRQRAEAERVLAEKLAAISEA 1841
Cdd:PRK02224 463 gsPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEErrEDLEELIAERRETIEEKRERAEEL 542
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1842 TRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIE---------ARLAQLRKASESELERQKGLVE-- 1910
Cdd:PRK02224 543 RERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIEslerirtllAAIADAEDEIERLREKREALAEln 622
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1911 ----DTLRQRRqveEEILALKGSFEKAAAGKAELELElgrirgTAEDTLRSKEQAEQEAARQRqlaaeeerrrreaeERV 1986
Cdd:PRK02224 623 derrERLAEKR---ERKRELEAEFDEARIEEAREDKE------RAEEYLEQVEEKLDELREER--------------DDL 679
|
570 580 590 600 610 620
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237962 1987 QKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEE 2047
Cdd:PRK02224 680 QAEIGAVENELEELEELRERREALENRVEALEALYDEAEELESMYGDLRAELRQRNVETLE 740
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3795-3833 |
3.11e-12 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 63.50 E-value: 3.11e-12
10 20 30
....*....|....*....|....*....|....*....
gi 1920237962 3795 LLDAQLATGGIVDPRLGFHLPLDVAYQRGYLDKDTHDQL 3833
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3384-3422 |
4.68e-12 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 63.12 E-value: 4.68e-12
10 20 30
....*....|....*....|....*....|....*....
gi 1920237962 3384 LLEAQAATGFLVDPVRNQRLYVHEAVKAGVVGPELHEKL 3422
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
4307-4345 |
4.87e-12 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 63.12 E-value: 4.87e-12
10 20 30
....*....|....*....|....*....|....*....
gi 1920237962 4307 LLEAQACTGGIIDPSTGERFPVTDAVNKGLVDKIMVDRI 4345
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| CH_SF |
cd00014 |
calponin homology (CH) domain superfamily; CH domains are actin filament (F-actin) binding ... |
173-272 |
5.96e-12 |
|
calponin homology (CH) domain superfamily; CH domains are actin filament (F-actin) binding motifs, which may be present as a single copy or in tandem repeats (which increase binding affinity). They either function as autonomous actin binding motifs or serve a regulatory function. CH domains are found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, as well as proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
Pssm-ID: 409031 [Multi-domain] Cd Length: 103 Bit Score: 65.05 E-value: 5.96e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 173 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTN---LENLDQAFSVAER-DLGVTRLL 248
Cdd:cd00014 1 EEELLKWINEVLGEELPVSITDLFESLRDGVLLCKLINKLSPGSIPKINKKPKSPfkkRENINLFLNACKKlGLPELDLF 80
|
90 100
....*....|....*....|....
gi 1920237962 249 DPEDVdVPQPDEKSIITYVSSLYD 272
Cdd:cd00014 81 EPEDL-YEKGNLKKVLGTLWALAL 103
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1674-1895 |
7.10e-12 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 70.56 E-value: 7.10e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1674 KAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAEL 1753
Cdd:COG4942 20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAEL 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1754 AKVRAE----------------MEVLLASKARAEEESRSTSEKS-KQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLA 1816
Cdd:COG4942 100 EAQKEElaellralyrlgrqppLALLLSPEDFLDAVRRLQYLKYlAPARREQAEELRADLAELAALRAELEAERAELEAL 179
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 1817 EEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEdeafqrRLLEEQAAQHKADIEARLAQLR 1895
Cdd:COG4942 180 LAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIA------RLEAEAAAAAERTPAAGFAALK 252
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
2171-2495 |
7.77e-12 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 72.08 E-value: 7.77e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2171 EQALRQKAQVEQELTALRLQLEETDHQKSIlDEELQRLKAEVTEAARQRGQVEEELFSL--RVQMEELGKLK-------A 2241
Cdd:pfam17380 255 EYTVRYNGQTMTENEFLNQLLHIVQHQKAV-SERQQQEKFEKMEQERLRQEKEEKAREVerRRKLEEAEKARqaemdrqA 333
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2242 RIEAENRALVLRDKDSAQRLLQEEaekmKQVAEEAARLSVAAQEAARLRQLaeEDLAQQRALAEKMLKEKMQAVQEATRL 2321
Cdd:pfam17380 334 AIYAEQERMAMERERELERIRQEE----RKRELERIRQEEIAMEISRMREL--ERLQMERQQKNERVRQELEAARKVKIL 407
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2322 KAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQgfQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARR 2401
Cdd:pfam17380 408 EEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEER--AREMERVRLEEQERQQQVERLRQQEEERKRKKLELEKEKRD 485
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2402 fRKQAEDIGERLYRTELATQEKVM--------LVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQ 2473
Cdd:pfam17380 486 -RKRAEEQRRKILEKELEERKQAMieeerkrkLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRRIQEQMRKATEE 564
|
330 340
....*....|....*....|..
gi 1920237962 2474 TVRQEQLLQETQALQQSFLSEK 2495
Cdd:pfam17380 565 RSRLEAMEREREMMRQIVESEK 586
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1872-2612 |
8.57e-12 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 72.31 E-value: 8.57e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1872 QRRLLEEQAA---QHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIR 1948
Cdd:pfam02463 153 ERRLEIEEEAagsRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYL 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1949 GTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQES 2028
Cdd:pfam02463 233 KLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDD 312
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2029 ARQLQLAQE---AAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRR 2105
Cdd:pfam02463 313 EEKLKESEKekkKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKL 392
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2106 QVEEAErlKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELT 2185
Cdd:pfam02463 393 KEEELE--LKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKS 470
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2186 ALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEE 2265
Cdd:pfam02463 471 EDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVI 550
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2266 AEKMKQVAEEAARLSVAAQEAARLRQLAEedlaqqralaekmlkekmqavqeaTRLKAEAELLQQQKELAQEQARRLQED 2345
Cdd:pfam02463 551 VEVSATADEVEERQKLVRALTELPLGARK------------------------LRLLIPKLKLPLKSIAVLEIDPILNLA 606
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2346 KEQMAQQLAQETQGFQKTLETERQRQLEMSaeaERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVM 2425
Cdd:pfam02463 607 QLDKATLEADEDDKRAKVVEGILKDTELTK---LKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQE 683
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2426 LVQTLETQRQQSdrdaERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCI 2505
Cdd:pfam02463 684 KAESELAKEEIL----RRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKK 759
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2506 EQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGvrrQQEELQRLAQQQQQQEKLLAE 2585
Cdd:pfam02463 760 EEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAE---LLEEEQLLIEQEEKIKEEELE 836
|
730 740
....*....|....*....|....*..
gi 1920237962 2586 ENQRLRERLQHLEEERRAALARSEEIA 2612
Cdd:pfam02463 837 ELALELKEEQKLEKLAEEELERLEEEI 863
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
1013-1598 |
9.16e-12 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 72.29 E-value: 9.16e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1013 RECAQRITEQQKAQAEVDGLGKGVARLSAEAEkvlalpepspaaptlrsELELTLGKLEqvrslsaiylEKLKTISLVIR 1092
Cdd:COG3096 522 AELEQRLRQQQNAERLLEEFCQRIGQQLDAAE-----------------ELEELLAELE----------AQLEELEEQAA 574
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1093 STQEAEEVLRAHEEQLKE-AQAVPATLPELEATKAALKKLRAQAEAQqpvFDALRDELRGAQEVGERLQQRHGERDvEVE 1171
Cdd:COG3096 575 EAVEQRSELRQQLEQLRArIKELAARAPAWLAAQDALERLREQSGEA---LADSQEVTAAMQQLLEREREATVERD-ELA 650
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1172 RWRERVTLLLERWQAVLAQTDVRQREL-EQLGRQL--RYYR----ESADPLGAWLRDAKQrqeqiqAVPLANSQAVREQL 1244
Cdd:COG3096 651 ARKQALESQIERLSQPGGAEDPRLLALaERLGGVLlsEIYDdvtlEDAPYFSALYGPARH------AIVVPDLSAVKEQL 724
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1245 RQEKALLED---IERHGEKVEECQRFAKQYINAI--KDYELQLVTYKAQLEPV----ASPAKKPKVQSGSESIIQEYVDL 1315
Cdd:COG3096 725 AGLEDCPEDlylIEGDPDSFDDSVFDAEELEDAVvvKLSDRQWRYSRFPEVPLfgraAREKRLEELRAERDELAEQYAKA 804
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1316 RTRYSELSTLTSQYIRFISETLRRMEEEErlAEQQRAEERERLAEVEAALEKQRQlAEAHAQAKAQAEREAQGLQRRMQE 1395
Cdd:COG3096 805 SFDVQKLQRLHQAFSQFVGGHLAVAFAPD--PEAELAALRQRRSELERELAQHRA-QEQQLRQQLDQLKEQLQLLNKLLP 881
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1396 EV---------ARREEVAVE---AQEQKRSIQEELQHLRQSSE--AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATE 1461
Cdd:COG3096 882 QAnlladetlaDRLEELREEldaAQEAQAFIQQHGKALAQLEPlvAVLQSDPEQFEQLQADYLQAKEQQRRLKQQIFALS 961
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1462 --RQR------GGAEGEL-------QALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAARE 1526
Cdd:COG3096 962 evVQRrphfsyEDAVGLLgensdlnEKLRARLEQAEEARREAREQLRQAQAQYSQYNQVLASLKSSRDAKQQTLQELEQE 1041
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237962 1527 -KQRALQALEELRLQAEEaERRLRQAEAERARQVQVALETAQRSAEAELQSEHASF--AEKTAQLERTLKEEHVA 1598
Cdd:COG3096 1042 lEELGVQADAEAEERARI-RRDELHEELSQNRSRRSQLEKQLTRCEAEMDSLQKRLrkAERDYKQEREQVVQAKA 1115
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3962-4000 |
1.12e-11 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 61.96 E-value: 1.12e-11
10 20 30
....*....|....*....|....*....|....*....
gi 1920237962 3962 LLEAQAATGYVIDPIKGLKLTVEEAVRMGIVGPEFKDKL 4000
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3129-3167 |
1.24e-11 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 61.96 E-value: 1.24e-11
10 20 30
....*....|....*....|....*....|....*....
gi 1920237962 3129 LLDAQLSTGGIVDPSKSHRVPLDVACARGYLDKETSAAL 3167
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1370-1740 |
1.41e-11 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 70.92 E-value: 1.41e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1370 QLAEAHAQAKAQAEREAQGLQRRMQEEVARREevaveaqeqKRSIQEELQHLRQSSEAEiqaKARQVEA-------AERS 1442
Cdd:pfam17380 273 QLLHIVQHQKAVSERQQQEKFEKMEQERLRQE---------KEEKAREVERRRKLEEAE---KARQAEMdrqaaiyAEQE 340
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1443 RLRIEEEIRVVRLQLEatERQRggaegELQALRaraEEAEAQKRQAQEEAERLRRQVQDETQRKRQaEAELALRVQ-AEA 1521
Cdd:pfam17380 341 RMAMERERELERIRQE--ERKR-----ELERIR---QEEIAMEISRMRELERLQMERQQKNERVRQ-ELEAARKVKiLEE 409
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1522 EAAREKQRALQALEELRLQAEEA-ERRLRQAEAERARQVQ-VALETAQRSAEAE-LQSEHASFAEKTAQLERTLKEEHVA 1598
Cdd:pfam17380 410 ERQRKIQQQKVEMEQIRAEQEEArQREVRRLEEERAREMErVRLEEQERQQQVErLRQQEEERKRKKLELEKEKRDRKRA 489
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1599 VVQLREeatrraqqqaeaeraraeaereLERWQLKANEalrlRLQAEEVAQQKSLTQAEAEKQKEEAEREARRrgKAEEQ 1678
Cdd:pfam17380 490 EEQRRK----------------------ILEKELEERK----QAMIEEERKRKLLEKEMEERQKAIYEEERRR--EAEEE 541
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 1679 AVRQrelaeQELEKQRQLAEgtaqQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAA 1740
Cdd:pfam17380 542 RRKQ-----QEMEERRRIQE----QMRKATEERSRLEAMEREREMMRQIVESEKARAEYEAT 594
|
|
| CH_CYTSB |
cd21257 |
calponin homology (CH) domain found in cytospin-B; Cytospin-B, also called nuclear structure ... |
171-271 |
1.56e-11 |
|
calponin homology (CH) domain found in cytospin-B; Cytospin-B, also called nuclear structure protein 5 (NSP5), or sperm antigen HCMOGT-1, or sperm antigen with calponin homology and coiled-coil domains 1 (SPECC1), is a novel fusion Cytospin-B that contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409106 Cd Length: 112 Bit Score: 63.90 E-value: 1.56e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 171 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAErDLGVTRLLDP 250
Cdd:cd21257 8 SKRNALLKWCQKKTEGYPNIDITNFSSSWSDGLAFCALLHTYLPAHIPYQELSSQDKKRNLLLAFQAAE-SVGIKPSLEL 86
|
90 100
....*....|....*....|..
gi 1920237962 251 ED-VDVPQPDEKSIITYVSSLY 271
Cdd:cd21257 87 SEmMYTDRPDWQSVMQYVAQIY 108
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
2197-2604 |
1.64e-11 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 70.57 E-value: 1.64e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2197 QKSILDEELQRLKAEVTEAARQRG---QVEEELFSLRVQMEELGKLKARIEAENRAL--------VLRDKDSAQRLLQEE 2265
Cdd:COG4717 65 KPELNLKELKELEEELKEAEEKEEeyaELQEELEELEEELEELEAELEELREELEKLekllqllpLYQELEALEAELAEL 144
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2266 AEKMKQVAEEAARLSVAAQEAARLRQLAEEdlaQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQED 2345
Cdd:COG4717 145 PERLEELEERLEELRELEEELEELEAELAE---LQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEE 221
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2346 KEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEED---------------ARRFRKQAEDIG 2410
Cdd:COG4717 222 LEELEEELEQLENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTiagvlflvlgllallFLLLAREKASLG 301
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2411 ERLYRTELATQEkvmlvQTLETQRQQSDRDAERLREAI--AELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQAL- 2487
Cdd:COG4717 302 KEAEELQALPAL-----EELEEEELEELLAALGLPPDLspEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALl 376
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2488 QQSFLSEKDSLLQRERCIEQEKAKLEQLfqdEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQE 2567
Cdd:COG4717 377 AEAGVEDEEELRAALEQAEEYQELKEEL---EELEEQLEELLGELEELLEALDEEELEEELEELEEELEELEEELEELRE 453
|
410 420 430 440
....*....|....*....|....*....|....*....|..
gi 1920237962 2568 ELQRLAQQQQQQEK-----LLAEENQRLRERLQHLEEERRAA 2604
Cdd:COG4717 454 ELAELEAELEQLEEdgelaELLQELEELKAELRELAEEWAAL 495
|
|
| CH_FIMB_rpt3 |
cd21300 |
third calponin homology (CH) domain found in Saccharomyces cerevisiae fimbrin and similar ... |
39-150 |
1.72e-11 |
|
third calponin homology (CH) domain found in Saccharomyces cerevisiae fimbrin and similar proteins; Fimbrin binds to actin, and functionally associates with actin structures involved in the development and maintenance of cell polarity. Members of this family contain four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409149 Cd Length: 119 Bit Score: 63.98 E-value: 1.72e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 39 ERDRvQKKTFTKWVNKhlIKAQRHISDLYEDLRDGHNLISLLEVLSgdslPRERDVIRSSRLPREKGRMRFHKLQNVQIA 118
Cdd:cd21300 4 EGER-EARVFTLWLNS--LDVEPAVNDLFEDLRDGLILLQAYDKVI----PGSVNWKKVNKAPASAEISRFKAVENTNYA 76
|
90 100 110
....*....|....*....|....*....|..
gi 1920237962 119 LDYLRHRQVKLVNIRNDDIADGNPKLTLGLIW 150
Cdd:cd21300 77 VELGKQLGFSLVGIQGADITDGSRTLTLALVW 108
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
1309-1807 |
1.89e-11 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 71.14 E-value: 1.89e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1309 IQEYVDLRTRYSELSTLTSQYirFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAqg 1388
Cdd:COG3096 248 IRVTQSDRDLFKHLITEATNY--VAADYMRHANERRELSERALELRRELFGARRQLAEEQYRLVEMARELEELSARES-- 323
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1389 lqrrmqeevarreevAVEAQEQKRSiqeelQHLrqsseAEIQAKARQVEAAERSRLRIEEeirvVRLQLEATERQRGGAE 1468
Cdd:COG3096 324 ---------------DLEQDYQAAS-----DHL-----NLVQTALRQQEKIERYQEDLEE----LTERLEEQEEVVEEAA 374
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1469 GELqalraraEEAEAQKRQAQEEAERLRRQVQDETQrkrqaeaelALRVQaeAEAAREKQRALQALEELR-------LQA 1541
Cdd:COG3096 375 EQL-------AEAEARLEAAEEEVDSLKSQLADYQQ---------ALDVQ--QTRAIQYQQAVQALEKARalcglpdLTP 436
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1542 EEAERRLRQAEAERARQVQVALETAQRSAEAElqsEHASFAEKTAQLERTLKEEHVavvqlREEATRRAQQQAEAERARA 1621
Cdd:COG3096 437 ENAEDYLAAFRAKEQQATEEVLELEQKLSVAD---AARRQFEKAYELVCKIAGEVE-----RSQAWQTARELLRRYRSQQ 508
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1622 EAERELERWQLKANEA-LRLRLQAEEVAQQKSLtqaeaekqkeeaerearrrGKAEEQAVRQRELAEQELEKQRQLAEGT 1700
Cdd:COG3096 509 ALAQRLQQLRAQLAELeQRLRQQQNAERLLEEF-------------------CQRIGQQLDAAEELEELLAELEAQLEEL 569
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1701 AQQRLAAEQELIRLRAETEQGEQQRQLLeEELARLQREAAAATQKRRELEAE----LAKVRAEMEVLLaSKARAEEESRS 1776
Cdd:COG3096 570 EEQAAEAVEQRSELRQQLEQLRARIKEL-AARAPAWLAAQDALERLREQSGEaladSQEVTAAMQQLL-EREREATVERD 647
|
490 500 510
....*....|....*....|....*....|....*
gi 1920237962 1777 TSEKSKQRLEAEAgrfRELA----EEAARLRALAE 1807
Cdd:COG3096 648 ELAARKQALESQI---ERLSqpggAEDPRLLALAE 679
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1403-1606 |
1.93e-11 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 69.41 E-value: 1.93e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1403 VAVEAQEQKRSIQEELQHLRQsseaEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAE 1482
Cdd:COG4942 14 AAAAQADAAAEAEAELEQLQQ----EIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELE 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1483 AQKRQAQEEAERLRRQVQDET----QRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRL-----RQAEA 1553
Cdd:COG4942 90 KEIAELRAELEAQKEELAELLralyRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLaelaaLRAEL 169
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|...
gi 1920237962 1554 ERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEA 1606
Cdd:COG4942 170 EAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEA 222
|
|
| SPEC |
cd00176 |
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ... |
614-792 |
2.00e-11 |
|
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here
Pssm-ID: 238103 [Multi-domain] Cd Length: 213 Bit Score: 66.70 E-value: 2.00e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 614 LHGFVAAATKELMWLSDREEEEVGFDWSDRNTNMAAKKEGYSALMHELELKEKKIKEIQSTGDRLLREDHPARPTAESFQ 693
Cdd:cd00176 2 LQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAEEIQERL 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 694 AALQTQWSWMLQLCCCIEAHLKENTAYFQFFSDVREAEEQLRKLQETLRRKYTCDrsiTATRLEDLLQDAQDEKEQLSEY 773
Cdd:cd00176 82 EELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALASEDLGK---DLESVEELLKKHKELEEELEAH 158
|
170
....*....|....*....
gi 1920237962 774 RGHLSGLAKRAKAIVQLKP 792
Cdd:cd00176 159 EPRLKSLNELAEELLEEGH 177
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1082-1967 |
2.16e-11 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 70.77 E-value: 2.16e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1082 EKLKTISLVIRSTQEAEEVLRAHEEQLKEAQAVPATLPELEATKAALKKLRaqaeaqqpvfdalrdelrgaqevgerlqq 1161
Cdd:pfam02463 180 EETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLD----------------------------- 230
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1162 rhgERDVEVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESAdplgawlrDAKQRQEQIQAVPLANSQAVR 1241
Cdd:pfam02463 231 ---YLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEE--------KEKKLQEEELKLLAKEEEELK 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1242 EQLRQEKALLEDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSgsesiIQEYVDLRTRYSE 1321
Cdd:pfam02463 300 SELLKLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEEL-----EKLQEKLEQLEEE 374
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1322 LSTLTSQYIRFISETLRRMEEEERLA-----EQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEE 1396
Cdd:pfam02463 375 LLAKKKLESERLSSAAKLKEEELELKseeekEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELE 454
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1397 VARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLR-IEEEIRVVRLQLEATERQRGGAEGELQALR 1475
Cdd:pfam02463 455 KQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESkARSGLKVLLALIKDGVGGRIISAHGRLGDL 534
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1476 ARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQ-AEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAE 1554
Cdd:pfam02463 535 GVAVENYKVAISTAVIVEVSATADEVEERQKLVrALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLE 614
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1555 RARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHvavvQLREEATRRAQQQAEAERARAEAERELERWQLKA 1634
Cdd:pfam02463 615 ADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEE----GLAEKSEVKASLSELTKELLEIQELQEKAESELA 690
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1635 NEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRL 1714
Cdd:pfam02463 691 KEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELS 770
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1715 RAETEQGEQQRQLLEEELARLQREAAAATQKrrELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRE 1794
Cdd:pfam02463 771 LKEKELAEEREKTEKLKVEEEKEEKLKAQEE--ELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKL 848
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1795 LAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRR 1874
Cdd:pfam02463 849 EKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAE 928
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1875 LLEEQAAQHKADIEARLAQLRKASESELERQKglvEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDT 1954
Cdd:pfam02463 929 ILLKYEEEPEELLLEEADEKEKEENNKEEEEE---RNKRLLLAKEELGKVNLMAIEEFEEKEERYNKDELEKERLEEEKK 1005
|
890
....*....|...
gi 1920237962 1955 LRSKEQAEQEAAR 1967
Cdd:pfam02463 1006 KLIRAIIEETCQR 1018
|
|
| COG3899 |
COG3899 |
Predicted ATPase [General function prediction only]; |
1511-2047 |
2.28e-11 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443106 [Multi-domain] Cd Length: 1244 Bit Score: 70.66 E-value: 2.28e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1511 AELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAE-RARQVQVALETAQRSAEAELQSEHAS--------F 1581
Cdd:COG3899 712 ARRALARGAYAEALRYLERALELLPPDPEEEYRLALLLELAEALyLAGRFEEAEALLERALAARALAALAAlrhgnppaS 791
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1582 AEKTAQLERTLKEEHVAVVQLREEATRRAQQ--QAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQkslTQAEAE 1659
Cdd:COG3899 792 ARAYANLGLLLLGDYEEAYEFGELALALAERlgDRRLEARALFNLGFILHWLGPLREALELLREALEAGLE---TGDAAL 868
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1660 KQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREA 1739
Cdd:COG3899 869 ALLALAAAAAAAAAAAALAAAAAAAARLLAAAAAALAAAAAAAALAAAELARLAAAAAAAAALALAAAAAAAAAAALAAA 948
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1740 AAATQKRRELEAELAKVRAEMEVLLASKARAeeesrstsekskqRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEED 1819
Cdd:COG3899 949 AAAAALAAALALAAAAAAAAAAALAAAAAAA-------------AAAAAAAAAAALEAAAAALLALLAAAAAAAAAAAAL 1015
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1820 AVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASE 1899
Cdd:COG3899 1016 AAALLAAALAALAAAAAAAALLAAAAALALLAALAAAAAAAAAAAALAAAAALLAAAAAAAAAAAAAAAAAALAAALAAA 1095
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1900 SELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRR 1979
Cdd:COG3899 1096 ALAAAAAAALALAAALAALALAAALAALALAAAARAAAALLLLAAALALALAALLLLAALLLALALLLLALAALALAAAL 1175
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237962 1980 REAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEE 2047
Cdd:COG3899 1176 AALAAALLAAAAAAAAAAALLAALLALAARLAALLALALLALEAAALLLLLLLAALALAAALLALRLL 1243
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1684-2604 |
2.37e-11 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 70.59 E-value: 2.37e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1684 ELAEQELEKQR-QLAEGTAQQRLAA-EQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAeME 1761
Cdd:pfam01576 111 QLDEEEAARQKlQLEKVTTEAKIKKlEEDILLLEDQNSKLSKERKLLEERISEFTSNLAEEEEKAKSLSKLKNKHEA-MI 189
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1762 VLLASKARAEEESRSTSEKSKQRLEAEAGrfrELAEEAARLRALAEEAKRQRQLAEEDavrqraeaervLAEKLAAISEA 1841
Cdd:pfam01576 190 SDLEERLKKEEKGRQELEKAKRKLEGEST---DLQEQIAELQAQIAELRAQLAKKEEE-----------LQAALARLEEE 255
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1842 TRLKTEAEIALKEKEAENERLRRLAEDEAFQRrlleEQAAQHKADIEARLAQLRKASESELERQKGLVEdtLRQRRqvEE 1921
Cdd:pfam01576 256 TAQKNNALKKIRELEAQISELQEDLESERAAR----NKAEKQRRDLGEELEALKTELEDTLDTTAAQQE--LRSKR--EQ 327
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1922 EILALKGSFEKAAAGKAELELELGRIRGTAEDTLrsKEQAEQeAARQRQLAAEEERRRREAEERVQKSL----AAEEEAA 1997
Cdd:pfam01576 328 EVTELKKALEEETRSHEAQLQEMRQKHTQALEEL--TEQLEQ-AKRNKANLEKAKQALESENAELQAELrtlqQAKQDSE 404
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1998 RQRKAALEEVERLKAKVEEARRLRERAEQESARqLQLAQEAAQKRLQAEEKahafavqqKEQELQQTLQQEQSVLERLRS 2077
Cdd:pfam01576 405 HKRKKLEGQLQELQARLSESERQRAELAEKLSK-LQSELESVSSLLNEAEG--------KNIKLSKDVSSLESQLQDTQE 475
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2078 EAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQ 2157
Cdd:pfam01576 476 LLQEETRQKLNLSTRLRQLEDERNSLQEQLEEEEEAKRNVERQLSTLQAQLSDMKKKLEEDAGTLEALEEGKKRLQRELE 555
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2158 AADAEMEKHKQFAEQALRQKAQVEQELTALRLQLeetDHQKSILDEELQRLKAEVTEAArqrgqvEEELFSLRVQMEelg 2237
Cdd:pfam01576 556 ALTQQLEEKAAAYDKLEKTKNRLQQELDDLLVDL---DHQRQLVSNLEKKQKKFDQMLA------EEKAISARYAEE--- 623
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2238 klKARIEAENRalvlrDKDSAQRLLQEEAEKMKQVAEEAARlsvaaqeAARLRQLAEEDLAQQRALAEKMLKE----KMQ 2313
Cdd:pfam01576 624 --RDRAEAEAR-----EKETRALSLARALEEALEAKEELER-------TNKQLRAEMEDLVSSKDDVGKNVHElersKRA 689
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2314 AVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLEtERQRQL-----EMSAEAERLRLRVAEM 2388
Cdd:pfam01576 690 LEQQVEEMKTQLEELEDELQATEDAKLRLEVNMQALKAQFERDLQARDEQGE-EKRRQLvkqvrELEAELEDERKQRAQA 768
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2389 SRAQARAEEDARRFRKQAEDIGERlyRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLK 2468
Cdd:pfam01576 769 VAAKKKLELDLKELEAQIDAANKG--REEAVKQLKKLQAQMKDLQRELEEARASRDEILAQSKESEKKLKNLEAELLQLQ 846
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2469 SEEMQTVRQE-QLLQETQALQQ---SFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQL 2544
Cdd:pfam01576 847 EDLAASERARrQAQQERDELADeiaSGASGKSALQDEKRRLEARIAQLEEELEEEQSNTELLNDRLRKSTLQVEQLTTEL 926
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2545 AAS---------------------------MEEARRRQHEA-----EEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRE 2592
Cdd:pfam01576 927 AAErstsqksesarqqlerqnkelkaklqeMEGTVKSKFKSsiaalEAKIAQLEEQLEQESRERQAANKLVRRTEKKLKE 1006
|
970
....*....|..
gi 1920237962 2593 RLQHLEEERRAA 2604
Cdd:pfam01576 1007 VLLQVEDERRHA 1018
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1715-2031 |
2.49e-11 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 70.15 E-value: 2.49e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1715 RAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAElAKVRAEMEVLLASKARAEEESRSTSEKSkqrlEAEAGRFRE 1794
Cdd:pfam17380 295 KMEQERLRQEKEEKAREVERRRKLEEAEKARQAEMDRQ-AAIYAEQERMAMERERELERIRQEERKR----ELERIRQEE 369
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1795 LAEEAARLRALaEEAKRQRQLAEEdAVRQRAEAERV--LAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEA-- 1870
Cdd:pfam17380 370 IAMEISRMREL-ERLQMERQQKNE-RVRQELEAARKvkILEEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAre 447
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1871 FQRRLLEEQAAQHKADIEARLAQLRKASESELERQKglvedtlRQRRQVEEEilaLKGSFEKAAAGKAELELELGRIRGT 1950
Cdd:pfam17380 448 MERVRLEEQERQQQVERLRQQEEERKRKKLELEKEK-------RDRKRAEEQ---RRKILEKELEERKQAMIEEERKRKL 517
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1951 AEDTLRSKEQAEQEAARQRQlaaeeerrrREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESAR 2030
Cdd:pfam17380 518 LEKEMEERQKAIYEEERRRE---------AEEERRKQQEMEERRRIQEQMRKATEERSRLEAMEREREMMRQIVESEKAR 588
|
.
gi 1920237962 2031 Q 2031
Cdd:pfam17380 589 A 589
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
1422-2047 |
3.11e-11 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 70.25 E-value: 3.11e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1422 RQSSEAEIQAKARQVEA--AERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAE----RL 1495
Cdd:pfam12128 209 DGVVPPKSRLNRQQVEHwiRDIQAIAGIMKIRPEFTKLQQEFNTLESAELRLSHLHFGYKSDETLIASRQEERQetsaEL 288
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1496 RRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAeLQ 1575
Cdd:pfam12128 289 NQLLRTLDDQWKEKRDELNGELSAADAAVAKDRSELEALEDQHGAFLDADIETAAADQEQLPSWQSELENLEERLKA-LT 367
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1576 SEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAerelerwqlkANEALRLRLQAEEVAQQKSLTQ 1655
Cdd:pfam12128 368 GKHQDVTAKYNRRRSKIKEQNNRDIAGIKDKLAKIREARDRQLAVAED----------DLQALESELREQLEAGKLEFNE 437
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1656 AEAEKQKEEAEREARRRG-KAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELAR 1734
Cdd:pfam12128 438 EEYRLKSRLGELKLRLNQaTATPELLLQLENFDERIERAREEQEAANAEVERLQSELRQARKRRDQASEALRQASRRLEE 517
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1735 LQREAAAATQ----KRRELEAELAKVRAEMEVLLASKARAEEESRS------TSEKSKQRLEAEAGRFRELAEEAARLRA 1804
Cdd:pfam12128 518 RQSALDELELqlfpQAGTLLHFLRKEAPDWEQSIGKVISPELLHRTdldpevWDGSVGGELNLYGVKLDLKRIDVPEWAA 597
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1805 LAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHK 1884
Cdd:pfam12128 598 SEEELRERLDKAEEALQSAREKQAAAEEQLVQANGELEKASREETFARTALKNARLDLRRLFDEKQSEKDKKNKALAERK 677
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1885 A-------DIEARLAQLRKASESELERQKG---------------LVEDTLRQRRQVEEEILALKGSFE----------- 1931
Cdd:pfam12128 678 DsanerlnSLEAQLKQLDKKHQAWLEEQKEqkreartekqaywqvVEGALDAQLALLKAAIAARRSGAKaelkaletwyk 757
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1932 --------------KAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAA 1997
Cdd:pfam12128 758 rdlaslgvdpdviaKLKREIRTLERKIERIAVRRQEVLRYFDWYQETWLQRRPRLATQLSNIERAISELQQQLARLIADT 837
|
650 660 670 680 690
....*....|....*....|....*....|....*....|....*....|....*
gi 1920237962 1998 RQRKAALEevERLKAKVEEARRLRE-----RAEQESARQLQLAQEAAQKRLQAEE 2047
Cdd:pfam12128 838 KLRRAKLE--MERKASEKQQVRLSEnlrglRCEMSKLATLKEDANSEQAQGSIGE 890
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
2725-2763 |
3.30e-11 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 60.42 E-value: 3.30e-11
10 20 30
....*....|....*....|....*....|....*....
gi 1920237962 2725 LLEAQAASGFLLDPVRNRRLAVNEAVKEGIVGPELHHKL 2763
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1392-2490 |
3.44e-11 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 70.20 E-value: 3.44e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1392 RMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEA--AERSRLRIEEEIRvVRLQLEATErqrggAEG 1469
Cdd:pfam01576 2 RQEEEMQAKEEELQKVKERQQKAESELKELEKKHQQLCEEKNALQEQlqAETELCAEAEEMR-ARLAARKQE-----LEE 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1470 ELQALRARAEEAEAQKRQAQEEAERLRRQVQDetqrkrqAEAELalrvqAEAEAAREKqralqaleeLRLQAEEAERRLR 1549
Cdd:pfam01576 76 ILHELESRLEEEEERSQQLQNEKKKMQQHIQD-------LEEQL-----DEEEAARQK---------LQLEKVTTEAKIK 134
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1550 QAEAErarqvqVALETAQRSaeaELQSEHASFAEKTAQLERTLKEEHVAVVQL-----REEATRRAQQQAEAERARAEAE 1624
Cdd:pfam01576 135 KLEED------ILLLEDQNS---KLSKERKLLEERISEFTSNLAEEEEKAKSLsklknKHEAMISDLEERLKKEEKGRQE 205
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1625 RELERWQLKAnEALRLRlqaEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEgtAQQR 1704
Cdd:pfam01576 206 LEKAKRKLEG-ESTDLQ---EQIAELQAQIAELRAQLAKKEEELQAALARLEEETAQKNNALKKIRELEAQISE--LQED 279
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1705 LAAEqelirlRAETEQGEQQRQLLEEELARLQRE---AAAATQKRRELEAelakvRAEMEVLLASKArAEEESRSTSEKS 1781
Cdd:pfam01576 280 LESE------RAARNKAEKQRRDLGEELEALKTEledTLDTTAAQQELRS-----KREQEVTELKKA-LEEETRSHEAQL 347
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1782 KQRLEAEAGRFRELAEE---AARLRALAEEAK--------------RQRQLAEEDAVRQRAEAERVLAEKLAAISEATRL 1844
Cdd:pfam01576 348 QEMRQKHTQALEELTEQleqAKRNKANLEKAKqalesenaelqaelRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQ 427
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1845 KTEAEIALKEKEAENERLRRLAEDeafqrrlLEEQAAQHKADIEARLAQLRKASE--SELERQKGLVEDTLrqrRQVEEE 1922
Cdd:pfam01576 428 RAELAEKLSKLQSELESVSSLLNE-------AEGKNIKLSKDVSSLESQLQDTQEllQEETRQKLNLSTRL---RQLEDE 497
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1923 ILALKGSFEKAAAGKAELELELGRIRGTAEDTlrsKEQAEQEAARQRQLaaeeerrrreaeERVQKSLAAEEEAARQRka 2002
Cdd:pfam01576 498 RNSLQEQLEEEEEAKRNVERQLSTLQAQLSDM---KKKLEEDAGTLEAL------------EEGKKRLQRELEALTQQ-- 560
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2003 aLEEVERLKAKVEEAR-RLRERAE-----QESARQLQLAQEAAQKR---LQAEEKA-HAFAVQQKEQELQQTLQQEQSVL 2072
Cdd:pfam01576 561 -LEEKAAAYDKLEKTKnRLQQELDdllvdLDHQRQLVSNLEKKQKKfdqMLAEEKAiSARYAEERDRAEAEAREKETRAL 639
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2073 ERLRSeaeaarraaeeaeaareraereAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAA 2152
Cdd:pfam01576 640 SLARA----------------------LEEALEAKEELERTNKQLRAEMEDLVSSKDDVGKNVHELERSKRALEQQVEEM 697
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2153 LRQKQaadaEMEKHKQFAEQA-LR-------QKAQVEQELTALRLQLEEtdhQKSILDEELQRLKAEVTEAARQRGQvee 2224
Cdd:pfam01576 698 KTQLE----ELEDELQATEDAkLRlevnmqaLKAQFERDLQARDEQGEE---KRRQLVKQVRELEAELEDERKQRAQ--- 767
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2225 eLFSLRVQME-ELGKLKARIEAENRAlvlrdKDSAQRLLQEEAEKMKQVAeeaarlsvaaqeaarlRQLAEEDLAQQRAL 2303
Cdd:pfam01576 768 -AVAAKKKLElDLKELEAQIDAANKG-----REEAVKQLKKLQAQMKDLQ----------------RELEEARASRDEIL 825
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2304 AEKMLKEKmqavqeatRLKA-EAELLQQQKELA-QEQARR-LQEDKEQMAQQLAQETQGfqKTLETERQRQLEmsaeaER 2380
Cdd:pfam01576 826 AQSKESEK--------KLKNlEAELLQLQEDLAaSERARRqAQQERDELADEIASGASG--KSALQDEKRRLE-----AR 890
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2381 LRLRVAEMSRAQARAEEDARRFRKQAEDIgERLyRTELATQEKvmLVQTLETQRQQSDRDAERLREAIAELEHE-KDKLK 2459
Cdd:pfam01576 891 IAQLEEELEEEQSNTELLNDRLRKSTLQV-EQL-TTELAAERS--TSQKSESARQQLERQNKELKAKLQEMEGTvKSKFK 966
|
1130 1140 1150
....*....|....*....|....*....|.
gi 1920237962 2460 QEAQLLQLKSEEMqtvrQEQLLQETQALQQS 2490
Cdd:pfam01576 967 SSIAALEAKIAQL----EEQLEQESRERQAA 993
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
2159-2515 |
3.91e-11 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 69.68 E-value: 3.91e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2159 ADAEMEKHKQFAEQA--LRQKA-QVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEE 2235
Cdd:PRK02224 344 AESLREDADDLEERAeeLREEAaELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDE 423
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2236 LGKLKARIEAENRALVLRDKDsAQRLLQE-EAEKMKQVAEEAARLSVAAQEAARLRQLAEE--DLAQQRALAEKMLKEKM 2312
Cdd:PRK02224 424 LREREAELEATLRTARERVEE-AEALLEAgKCPECGQPVEGSPHVETIEEDRERVEELEAEleDLEEEVEEVEERLERAE 502
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2313 QAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQmAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSraQ 2392
Cdd:PRK02224 503 DLVEAEDRIERLEERREDLEELIAERRETIEEKRER-AEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELN--S 579
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2393 ARAEEDARrfRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERL---REAIAELEHEKDklkqEAQLLQLKS 2469
Cdd:PRK02224 580 KLAELKER--IESLERIRTLLAAIADAEDEIERLREKREALAELNDERRERLaekRERKRELEAEFD----EARIEEARE 653
|
330 340 350 360
....*....|....*....|....*....|....*....|....*..
gi 1920237962 2470 EEMQTVR-QEQLLQETQALQQsflsEKDSLLQRERCIEQEKAKLEQL 2515
Cdd:PRK02224 654 DKERAEEyLEQVEEKLDELRE----ERDDLQAEIGAVENELEELEEL 696
|
|
| CH_ASPM_rpt1 |
cd21223 |
first calponin homology (CH) domain found in abnormal spindle-like microcephaly-associated ... |
62-153 |
4.57e-11 |
|
first calponin homology (CH) domain found in abnormal spindle-like microcephaly-associated protein (ASPM) and similar proteins; ASPM, also called abnormal spindle protein homolog, or Asp homolog, is involved in mitotic spindle regulation and coordination of mitotic processes. It may also have a preferential role in regulating neurogenesis. Members of this family contain two copies of the CH domain in the middle region. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409072 Cd Length: 113 Bit Score: 62.61 E-value: 4.57e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 62 HISDLYEDLRDGHNLISLLEVLSGDSLPRERdvirsSRLPrEKGRMRfhKLQNVQIALDYLRHRQV----KLVNIRNDDI 137
Cdd:cd21223 25 AVTNLAVDLRDGVRLCRLVELLTGDWSLLSK-----LRVP-AISRLQ--KLHNVEVALKALKEAGVlrggDGGGITAKDI 96
|
90
....*....|....*.
gi 1920237962 138 ADGNPKLTLGLIWTII 153
Cdd:cd21223 97 VDGHREKTLALLWRII 112
|
|
| COG3899 |
COG3899 |
Predicted ATPase [General function prediction only]; |
1355-1880 |
5.08e-11 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443106 [Multi-domain] Cd Length: 1244 Bit Score: 69.50 E-value: 5.08e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1355 RERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKAR 1434
Cdd:COG3899 722 AEALRYLERALELLPPDPEEEYRLALLLELAEALYLAGRFEEAEALLERALAARALAALAALRHGNPPASARAYANLGLL 801
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1435 QVEAAERSRLRIEEEIRVV-RLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERL--RRQVQDETQRKRQAEA 1511
Cdd:COG3899 802 LLGDYEEAYEFGELALALAeRLGDRRLEARALFNLGFILHWLGPLREALELLREALEAGLETgdAALALLALAAAAAAAA 881
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1512 ELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERT 1591
Cdd:COG3899 882 AAAALAAAAAAAARLLAAAAAALAAAAAAAALAAAELARLAAAAAAAAALALAAAAAAAAAAALAAAAAAAALAAALALA 961
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1592 LKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARR 1671
Cdd:COG3899 962 AAAAAAAAAALAAAAAAAAAAAAAAAAAALEAAAAALLALLAAAAAAAAAAAALAAALLAAALAALAAAAAAAALLAAAA 1041
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1672 RGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEA 1751
Cdd:COG3899 1042 ALALLAALAAAAAAAAAAAALAAAAALLAAAAAAAAAAAAAAAAAALAAALAAAALAAAAAAALALAAALAALALAAALA 1121
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1752 ELAKVRAEMEVLLASKARAEEESRSTsekskqRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVL 1831
Cdd:COG3899 1122 ALALAAAARAAAALLLLAAALALALA------ALLLLAALLLALALLLLALAALALAAALAALAAALLAAAAAAAAAAAL 1195
|
490 500 510 520
....*....|....*....|....*....|....*....|....*....
gi 1920237962 1832 AEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQA 1880
Cdd:COG3899 1196 LAALLALAARLAALLALALLALEAAALLLLLLLAALALAAALLALRLLA 1244
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
2154-2356 |
6.12e-11 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 67.87 E-value: 6.12e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2154 RQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQM 2233
Cdd:COG4942 34 QEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAELLRAL 113
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2234 EELGK---LKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKE 2310
Cdd:COG4942 114 YRLGRqppLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALEAL 193
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 1920237962 2311 KMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQE 2356
Cdd:COG4942 194 KAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAA 239
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
993-1548 |
9.18e-11 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 69.01 E-value: 9.18e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 993 EACETRTVHRLRLPLDKEPARECAQRITEQQKAqaevDGLGKGV--ARLSAEAEKVLAlPEPSPAAPTLRSELELTLGKL 1070
Cdd:PTZ00121 1285 KAEEKKKADEAKKAEEKKKADEAKKKAEEAKKA----DEAKKKAeeAKKKADAAKKKA-EEAKKAAEAAKAEAEAAADEA 1359
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1071 EQVRSLS-AIYLEKLKTISLVIRSTQEAEEVLRAhEEQLKEAQAVPATLPELEATKAALKK---LRAQAEAQQPVfdalr 1146
Cdd:PTZ00121 1360 EAAEEKAeAAEKKKEEAKKKADAAKKKAEEKKKA-DEAKKKAEEDKKKADELKKAAAAKKKadeAKKKAEEKKKA----- 1433
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1147 DELRGAQEvgERLQQRHGERDVEVERWRERVTLLLERwqavlaqtdvrQRELEQLGRQLRYYREsADPLGAWLRDAKQRQ 1226
Cdd:PTZ00121 1434 DEAKKKAE--EAKKADEAKKKAEEAKKAEEAKKKAEE-----------AKKADEAKKKAEEAKK-ADEAKKKAEEAKKKA 1499
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1227 EQIQAVPLANSQAVREQLRQEKALLEDIERHGE--KVEEcqrfAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSG 1304
Cdd:PTZ00121 1500 DEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEakKADE----AKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEED 1575
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1305 SESIIQEYVDLR----TRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERlaeveaalEKQRQLAEAHAQAKA 1380
Cdd:PTZ00121 1576 KNMALRKAEEAKkaeeARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEK--------KKVEQLKKKEAEEKK 1647
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1381 QAEReaqgLQRRMQEEVARREEVAVEAQEQKRSIQEelqhLRQSSEAEiqakarqvEAAERSRLRIEEEIRVVRLQLEAT 1460
Cdd:PTZ00121 1648 KAEE----LKKAEEENKIKAAEEAKKAEEDKKKAEE----AKKAEEDE--------KKAAEALKKEAEEAKKAEELKKKE 1711
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1461 ERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAElalrvQAEAEAAREKQRALQALEELRLQ 1540
Cdd:PTZ00121 1712 AEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLK-----KEEEKKAEEIRKEKEAVIEEELD 1786
|
....*...
gi 1920237962 1541 AEEAERRL 1548
Cdd:PTZ00121 1787 EEDEKRRM 1794
|
|
| CH_MICAL1 |
cd21196 |
calponin homology (CH) domain found in molecule interacting with CasL protein 1; MICAL-1, also ... |
173-273 |
9.73e-11 |
|
calponin homology (CH) domain found in molecule interacting with CasL protein 1; MICAL-1, also called NEDD9-interacting protein with calponin homology and LIM domains, acts as a [F-actin]-monooxygenase that promotes depolymerization of F-actin by mediating oxidation of specific methionine residues on actin to form methionine-sulfoxide, resulting in actin filament disassembly and preventing repolymerization. In the absence of actin, it also functions as a NADPH oxidase producing H(2)O(2). MICAL-1 acts as a cytoskeletal regulator that connects NEDD9 to intermediate filaments. It also acts as a negative regulator of apoptosis via its interaction with STK38 and STK38L. MICAL-1 is a Rab effector protein that plays a role in vesicle trafficking. It contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409045 Cd Length: 106 Bit Score: 61.60 E-value: 9.73e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 173 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 252
Cdd:cd21196 5 QEELLRWCQEQTAGYPGVHVSDLSSSWADGLALCALVYRLQPGLLEPSELQGLGALEATAWALKVAENELGITPVVSAQA 84
|
90 100
....*....|....*....|.
gi 1920237962 253 VdVPQPDEKSIITYVSSLYDA 273
Cdd:cd21196 85 V-VAGSDPLGLIAYLSHFHSA 104
|
|
| COG3899 |
COG3899 |
Predicted ATPase [General function prediction only]; |
1341-1841 |
1.13e-10 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443106 [Multi-domain] Cd Length: 1244 Bit Score: 68.35 E-value: 1.13e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1341 EEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAER-----EAQGLQRRMQEEVARREEVAVEAQEQKRSIQ 1415
Cdd:COG3899 741 EEYRLALLLELAEALYLAGRFEEAEALLERALAARALAALAALRhgnppASARAYANLGLLLLGDYEEAYEFGELALALA 820
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1416 EELQHLRQSSEAEIqAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERL 1495
Cdd:COG3899 821 ERLGDRRLEARALF-NLGFILHWLGPLREALELLREALEAGLETGDAALALLALAAAAAAAAAAAALAAAAAAAARLLAA 899
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1496 RRQVQDETQRKRQAEAELALRVQAEAEAAREkqRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQ 1575
Cdd:COG3899 900 AAAALAAAAAAAALAAAELARLAAAAAAAAA--LALAAAAAAAAAAALAAAAAAAALAAALALAAAAAAAAAAALAAAAA 977
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1576 SEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQ 1655
Cdd:COG3899 978 AAAAAAAAAAAAALEAAAAALLALLAAAAAAAAAAAALAAALLAAALAALAAAAAAAALLAAAAALALLAALAAAAAAAA 1057
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1656 AEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARL 1735
Cdd:COG3899 1058 AAAALAAAAALLAAAAAAAAAAAAAAAAAALAAALAAAALAAAAAAALALAAALAALALAAALAALALAAAARAAAALLL 1137
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1736 QREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQL 1815
Cdd:COG3899 1138 LAAALALALAALLLLAALLLALALLLLALAALALAAALAALAAALLAAAAAAAAAAALLAALLALAARLAALLALALLAL 1217
|
490 500
....*....|....*....|....*.
gi 1920237962 1816 AEEDAVRQRAEAERVLAEKLAAISEA 1841
Cdd:COG3899 1218 EAAALLLLLLLAALALAAALLALRLL 1243
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
2251-2607 |
1.60e-10 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 67.77 E-value: 1.60e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2251 VLRDKDSAQRLLQEEA---EKMKQVAEEAARLSVAAQEA-ARLRQLAEEDLAQQRAL---AEKMlkEKMQAVQEATRlKA 2323
Cdd:TIGR02168 149 IIEAKPEERRAIFEEAagiSKYKERRKETERKLERTRENlDRLEDILNELERQLKSLerqAEKA--ERYKELKAELR-EL 225
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2324 EAELLQQQKELAQEQARRLQEDKEQMAQQLAQET---QGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDAR 2400
Cdd:TIGR02168 226 ELALLVLRLEELREELEELQEELKEAEEELEELTaelQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQ 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2401 RFRKQAEDIGERLYRtelatqekvmlvqtLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQL 2480
Cdd:TIGR02168 306 ILRERLANLERQLEE--------------LEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELE 371
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2481 LQEtQALQQSFLSEKDSLLQRERCIEQEKAKLEQLfqdeVAKAQALreeqQRQQQQMQQEKQQLAASMEEARRRQHEAE- 2559
Cdd:TIGR02168 372 SRL-EELEEQLETLRSKVAQLELQIASLNNEIERL----EARLERL----EDRRERLQQEIEELLKKLEEAELKELQAEl 442
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 1920237962 2560 EGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALAR 2607
Cdd:TIGR02168 443 EELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQAR 490
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1677-2051 |
1.86e-10 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 67.10 E-value: 1.86e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1677 EQAVRQRELAEQELEKQRQL--AEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELA 1754
Cdd:COG4717 105 EELEAELEELREELEKLEKLlqLLPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLE 184
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1755 KVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAV------------- 1821
Cdd:COG4717 185 QLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERLKEARLLlliaaallallgl 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1822 --RQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASE 1899
Cdd:COG4717 265 ggSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLD 344
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1900 SELERQKGLVE-DTLRQRRQVEEEILALKGSFEKAAAGKAElelelgRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERR 1978
Cdd:COG4717 345 RIEELQELLREaEELEEELQLEELEQEIAALLAEAGVEDEE------ELRAALEQAEEYQELKEELEELEEQLEELLGEL 418
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237962 1979 RREAEERVQKSLAAE-EEAARQRKAALEEVERLKAKVEEAR-RLRERAEQESARQLQLAQEAAQKRLQAEEKAHA 2051
Cdd:COG4717 419 EELLEALDEEELEEElEELEEELEELEEELEELREELAELEaELEQLEEDGELAELLQELEELKAELRELAEEWA 493
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1408-1597 |
2.02e-10 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 66.37 E-value: 2.02e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1408 QEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQL--EATERQRGGAEGELQALRARAEEAEAQK 1485
Cdd:PRK09510 67 QQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQkkQAEEAAKQAALKQKQAEEAAAKAAAAAK 146
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1486 RQAQEEAERLRRQV-QDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALE 1564
Cdd:PRK09510 147 AKAEAEAKRAAAAAkKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAA 226
|
170 180 190
....*....|....*....|....*....|...
gi 1920237962 1565 TAQRSAEAELQSEHASFAEKTAQLERTLKEEHV 1597
Cdd:PRK09510 227 AAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEV 259
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
1328-2045 |
2.09e-10 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 67.45 E-value: 2.09e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1328 QYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEA 1407
Cdd:pfam15921 82 EYSHQVKDLQRRLNESNELHEKQKFYLRQSVIDLQTKLQEMQMERDAMADIRRRESQSQEDLRNQLQNTVHELEAAKCLK 161
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1408 QEQKRSIQEELQHLRQ---SSEAEIQA-KARQVEAAERSRLRIEEEIRVVRLQLE----ATERQRGGAEGELQALRAR-- 1477
Cdd:pfam15921 162 EDMLEDSNTQIEQLRKmmlSHEGVLQEiRSILVDFEEASGKKIYEHDSMSTMHFRslgsAISKILRELDTEISYLKGRif 241
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1478 --AEEAEAQKRQAQEEAERLRRQVQDET-QRKRQAEAELAlRVQAEAEAAREKQRALQAleelrlQAEEAERRLRQAEAE 1554
Cdd:pfam15921 242 pvEDQLEALKSESQNKIELLLQQHQDRIeQLISEHEVEIT-GLTEKASSARSQANSIQS------QLEIIQEQARNQNSM 314
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1555 RARQVQvALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAeaerelerwQLKA 1634
Cdd:pfam15921 315 YMRQLS-DLESTVSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTERDQFSQESGNLDDQLQ---------KLLA 384
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1635 NEALRLRLQAEEVAQQKSLtqaeaekqkeeaerEARRRGKAEEQAVRQRELAEQELEKQRQLA---------EGTAQQRL 1705
Cdd:pfam15921 385 DLHKREKELSLEKEQNKRL--------------WDRDTGNSITIDHLRRELDDRNMEVQRLEAllkamksecQGQMERQM 450
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1706 AAEQ----ELIRLRAETEQGEQQRQLLEEELARLqreaaaaTQKRRELEAELAKVrAEMEVLLASKARAEEESRSTSEKS 1781
Cdd:pfam15921 451 AAIQgkneSLEKVSSLTAQLESTKEMLRKVVEEL-------TAKKMTLESSERTV-SDLTASLQEKERAIEATNAEITKL 522
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1782 KQRLEAEAGRFRELAEEAARLRALAEEAKRQR-QLAEEDAV----RQ----------------------RAEAERVLAEK 1834
Cdd:pfam15921 523 RSRVDLKLQELQHLKNEGDHLRNVQTECEALKlQMAEKDKVieilRQqienmtqlvgqhgrtagamqveKAQLEKEINDR 602
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1835 LAAISEATRLKTEAEIALKEKEAE---------------NERLRRLAEDEAFQRRLLEEQAAQHK------ADIEARLAQ 1893
Cdd:pfam15921 603 RLELQEFKILKDKKDAKIRELEARvsdlelekvklvnagSERLRAVKDIKQERDQLLNEVKTSRNelnslsEDYEVLKRN 682
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1894 LRKASE------SELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAG-KAELELELGRIrgtaeDTLRSK----EQAE 1962
Cdd:pfam15921 683 FRNKSEemetttNKLKMQLKSAQSELEQTRNTLKSMEGSDGHAMKVAMGmQKQITAKRGQI-----DALQSKiqflEEAM 757
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1963 QEAARQRQLAAEEERRRREAEERV---QKSLAAEEEAARQRKAALEE--------VERLKAKVEEARRLRERAEQESARq 2031
Cdd:pfam15921 758 TNANKEKHFLKEEKNKLSQELSTVateKNKMAGELEVLRSQERRLKEkvanmevaLDKASLQFAECQDIIQRQEQESVR- 836
|
810
....*....|....
gi 1920237962 2032 LQLAQEAAQKRLQA 2045
Cdd:pfam15921 837 LKLQHTLDVKELQG 850
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
2251-2606 |
2.72e-10 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 67.00 E-value: 2.72e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2251 VLRDKDSAQRLLQEEAEKMKQ-----VAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKM--LKEKMQAVQEA-TRLK 2322
Cdd:TIGR02168 194 ILNELERQLKSLERQAEKAERykelkAELRELELALLVLRLEELREELEELQEELKEAEEELeeLTAELQELEEKlEELR 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2323 AEAELLQQQKELAQE---QARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDA 2399
Cdd:TIGR02168 274 LEVSELEEEIEELQKelyALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2400 RRFRKQAEdigerlyrtelatqEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKqeAQLLQLKSE-EMQTVRQE 2478
Cdd:TIGR02168 354 ESLEAELE--------------ELEAELEELESRLEELEEQLETLRSKVAQLELQIASLN--NEIERLEARlERLEDRRE 417
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2479 QLLQETQALQQSFLSEKDSLLQRErcIEQEKAKLEQLfQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHeA 2558
Cdd:TIGR02168 418 RLQQEIEELLKKLEEAELKELQAE--LEELEEELEEL-QEELERLEEALEELREELEEAEQALDAAERELAQLQARLD-S 493
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 1920237962 2559 EEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLqHLEEERRAALA 2606
Cdd:TIGR02168 494 LERLQENLEGFSEGVKALLKNQSGLSGILGVLSELI-SVDEGYEAAIE 540
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1677-2423 |
3.40e-10 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 66.86 E-value: 3.40e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1677 EQAVRQRELAEQELEKQRQLAEGTAQqrlAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV 1756
Cdd:COG4913 245 EDAREQIELLEPIRELAERYAAARER---LAELEYLRAALRLWFAQRRLELLEAELEELRAELARLEAELERLEARLDAL 321
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1757 RAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLA 1836
Cdd:COG4913 322 REELDELEAQIRGNGGDRLEQLEREIERLEREL---EERERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEE 398
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1837 AISEATRLKTEAEIALKEKEAENERLRrlAEDEAFQRRlleeqaaqhKADIEARLAQLRKAseseLERQKGLVEDTLR-- 1914
Cdd:COG4913 399 ELEALEEALAEAEAALRDLRRELRELE--AEIASLERR---------KSNIPARLLALRDA----LAEALGLDEAELPfv 463
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1915 -QRRQVEEEILALKGSFEKAaagkaelelelgrIRGTAEDTLRSKEQAEQ--EAARQRQLAAEeerrrreaeerVQKSLA 1991
Cdd:COG4913 464 gELIEVRPEEERWRGAIERV-------------LGGFALTLLVPPEHYAAalRWVNRLHLRGR-----------LVYERV 519
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1992 AEEEAARQRKAALEE--VERLKAKVEEARrlrERAEQESARQLQLAQEAAQKRLQAEEKA--------HAFAVQQKEQEL 2061
Cdd:COG4913 520 RTGLPDPERPRLDPDslAGKLDFKPHPFR---AWLEAELGRRFDYVCVDSPEELRRHPRAitragqvkGNGTRHEKDDRR 596
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2062 QQTLQqeqSVLerlrseaeaarraaeeaeaareraereAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEaeqe 2141
Cdd:COG4913 597 RIRSR---YVL---------------------------GFDNRAKLAALEAELAELEEELAEAEERLEALEAELDA---- 642
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2142 aarraqaeqaaLRQKQAADAEMEKHkQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQ 2221
Cdd:COG4913 643 -----------LQERREALQRLAEY-SWDEIDVASAEREIAELEAELERLDASSDDLAALEEQLEELEAELEELEEELDE 710
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2222 VEEELFSLRvqmEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQR 2301
Cdd:COG4913 711 LKGEIGRLE---KELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRENLEERIDALRARLNRAEE 787
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2302 ALAEKMLKEKMQAVQEATRLKAEAELLQQ-QKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLeterqrQLEMSAEAER 2380
Cdd:COG4913 788 ELERAMRAFNREWPAETADLDADLESLPEyLALLDRLEEDGLPEYEERFKELLNENSIEFVADL------LSKLRRAIRE 861
|
730 740 750 760 770
....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 2381 LRLRVAEMSRA----------------QARAEEDARRFRKQAEDIGERLYRTELATQEK 2423
Cdd:COG4913 862 IKERIDPLNDSlkripfgpgrylrleaRPRPDPEVREFRQELRAVTSGASLFDEELSEA 920
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
1013-1601 |
4.72e-10 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 66.52 E-value: 4.72e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1013 RECAQRITEQQKAQaevdglgkgvaRLSAEAEKVLALPEPSPA-APTLRSELEltlgkleqvrslsaiylEKLKTISLVI 1091
Cdd:PRK04863 523 SELEQRLRQQQRAE-----------RLLAEFCKRLGKNLDDEDeLEQLQEELE-----------------ARLESLSESV 574
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1092 RSTQEAEEVLRAHEEQLK-EAQAVPATLPELEATKAALKKLRAQAEaqqpvfdalrDELRGAQEVGERLQQrHGERDVEV 1170
Cdd:PRK04863 575 SEARERRMALRQQLEQLQaRIQRLAARAPAWLAAQDALARLREQSG----------EEFEDSQDVTEYMQQ-LLEREREL 643
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1171 ERWRERVTlllERWQAVLAQTD-VRQRELEQLGRQLRYyresADPLGAWL----------RDAKQRQ----EQIQAVPLA 1235
Cdd:PRK04863 644 TVERDELA---ARKQALDEEIErLSQPGGSEDPRLNAL----AERFGGVLlseiyddvslEDAPYFSalygPARHAIVVP 716
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1236 NSQAVREQLRQEKALLEDI-------ERHGEKVEECQRFAKQYINAIKDYELQLVTYKAqlEPVASPAKKPK----VQSG 1304
Cdd:PRK04863 717 DLSDAAEQLAGLEDCPEDLyliegdpDSFDDSVFSVEELEKAVVVKIADRQWRYSRFPE--VPLFGRAAREKrieqLRAE 794
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1305 SESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEEErlAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAER 1384
Cdd:PRK04863 795 REELAERYATLSFDVQKLQRLHQAFSRFIGSHLAVAFEAD--PEAELRQLNRRRVELERALADHESQEQQQRSQLEQAKE 872
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1385 EAQGLQR-----------RMQEEVARREEVAVEAQEQKRSIQ---------EELQHLRQSSEAEIQAKARQVEAAE---- 1440
Cdd:PRK04863 873 GLSALNRllprlnlladeTLADRVEEIREQLDEAEEAKRFVQqhgnalaqlEPIVSVLQSDPEQFEQLKQDYQQAQqtqr 952
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1441 --RSRLRIEEEIRVVRLQLEATERQR-GGAEGELQ-ALRARAEEAEAQKRQAQEEAerlrRQVQDETQRKRQAEAELALR 1516
Cdd:PRK04863 953 daKQQAFALTEVVQRRAHFSYEDAAEmLAKNSDLNeKLRQRLEQAEQERTRAREQL----RQAQAQLAQYNQVLASLKSS 1028
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1517 VQAEAEAAREKQRALQAL---------EELRLQAEEAERRLRQAEAERArqvqvALETAQRSAEAELQsehaSFAEKTAQ 1587
Cdd:PRK04863 1029 YDAKRQMLQELKQELQDLgvpadsgaeERARARRDELHARLSANRSRRN-----QLEKQLTFCEAEMD----NLTKKLRK 1099
|
650
....*....|....
gi 1920237962 1588 LERTLKEEHVAVVQ 1601
Cdd:PRK04863 1100 LERDYHEMREQVVN 1113
|
|
| PRK05035 |
PRK05035 |
electron transport complex protein RnfC; Provisional |
1766-2037 |
5.38e-10 |
|
electron transport complex protein RnfC; Provisional
Pssm-ID: 235334 [Multi-domain] Cd Length: 695 Bit Score: 65.74 E-value: 5.38e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1766 SKARAEEESRSTSEKSKQRLEAEAGRF-RELAEEAARlralAEEAKRQRQLAEEDAVrqrAEAERVLAEKLAAISEATRL 1844
Cdd:PRK05035 436 AEIRAIEQEKKKAEEAKARFEARQARLeREKAAREAR----HKKAAEARAAKDKDAV---AAALARVKAKKAAATQPIVI 508
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1845 KTEAEIALKEKEAEnERLRRLAEDEAFQRRLLEEQAAQHKADIEARL--AQLRKASESELERQKGLVEDTlrQRRQVEEE 1922
Cdd:PRK05035 509 KAGARPDNSAVIAA-REARKAQARARQAEKQAAAAADPKKAAVAAAIarAKAKKAAQQAANAEAEEEVDP--KKAAVAAA 585
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1923 ILALKGSFEKAAAGKAELELELgrirgTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKA 2002
Cdd:PRK05035 586 IARAKAKKAAQQAASAEPEEQV-----AEVDPKKAAVAAAIARAKAKKAEQQANAEPEEPVDPRKAAVAAAIARAKARKA 660
|
250 260 270
....*....|....*....|....*....|....*
gi 1920237962 2003 ALEEVERLKAKVEEARRLRERAEQESARQLQLAQE 2037
Cdd:PRK05035 661 AQQQANAEPEEAEDPKKAAVAAAIARAKAKKAAQQ 695
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
1531-2026 |
5.52e-10 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 65.83 E-value: 5.52e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1531 LQALEELRLQAEEAE---RRLRQAEAERARQVQVALE-TAQRSAEAELQSEHASFAEKTAQLERtLKEEHVAVVQLREEA 1606
Cdd:PRK02224 161 LGKLEEYRERASDARlgvERVLSDQRGSLDQLKAQIEeKEEKDLHERLNGLESELAELDEEIER-YEEQREQARETRDEA 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1607 TRRAQQQAEAERARAEAERELERWQLKANEALRLRLQ-AEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQREL 1685
Cdd:PRK02224 240 DEVLEEHEERREELETLEAEIEDLRETIAETEREREElAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREEL 319
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1686 AEQELEKQRQLAEGTAQQRlAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLA 1765
Cdd:PRK02224 320 EDRDEELRDRLEECRVAAQ-AHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRE 398
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1766 SKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEE----------------DAVRQRAEAER 1829
Cdd:PRK02224 399 RFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALLEAgkcpecgqpvegsphvETIEEDRERVE 478
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1830 VLAEKLAAIsEATRLKTEAEI----ALKEKEAENERLRRlaedeafQRRLLEEQAAQHKADIEA---RLAQLRKAS---E 1899
Cdd:PRK02224 479 ELEAELEDL-EEEVEEVEERLeraeDLVEAEDRIERLEE-------RREDLEELIAERRETIEEkreRAEELRERAaelE 550
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1900 SELERQKglvEDTLRQRRQVEEEILALKGSFEKAAAGKAELElELGRIRgtaeDTLRSKEQAEQEAARQRQLAAEEERRR 1979
Cdd:PRK02224 551 AEAEEKR---EAAAEAEEEAEEAREEVAELNSKLAELKERIE-SLERIR----TLLAAIADAEDEIERLREKREALAELN 622
|
490 500 510 520
....*....|....*....|....*....|....*....|....*..
gi 1920237962 1980 REAEERVQkslaaeeeAARQRKAALEEvERLKAKVEEARRLRERAEQ 2026
Cdd:PRK02224 623 DERRERLA--------EKRERKRELEA-EFDEARIEEAREDKERAEE 660
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1688-2049 |
6.75e-10 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 65.56 E-value: 6.75e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1688 QELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQR--EAAAATQKRRELEAELAKVRAEMEvlla 1765
Cdd:COG4717 74 KELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKllQLLPLYQELEALEAELAELPERLE---- 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1766 sKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLK 1845
Cdd:COG4717 150 -ELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEE 228
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1846 TEAEIALKEKEAENERLRRL---------------AEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVE 1910
Cdd:COG4717 229 LEQLENELEAAALEERLKEArlllliaaallallgLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGKEAEELQ 308
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1911 DTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERrrreaeerVQKSL 1990
Cdd:COG4717 309 ALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAAL--------LAEAG 380
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1991 AAEEEAARQRKAALEEVERLKAKVEEA-RRLRERAEQESARQLQLAQEAAQKRLQAEEKA 2049
Cdd:COG4717 381 VEDEEELRAALEQAEEYQELKEELEELeEQLEELLGELEELLEALDEEELEEELEELEEE 440
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1095-1546 |
7.48e-10 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 65.93 E-value: 7.48e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1095 QEAEEVLRAHEEQLK--EAQAVPATLPELEATKAALKKLRAQAEAQQPVFDAlrDELRGAQEVGERLQQRHGErdvEVER 1172
Cdd:PTZ00121 1497 KKADEAKKAAEAKKKadEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKA--DELKKAEELKKAEEKKKAE---EAKK 1571
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1173 WRERVTLLLERWQavlaqtDVRQRELEQLGRQLRYYRESADPLGAWLRdaKQRQEQIQAvplansqavrEQLRQEKALLE 1252
Cdd:PTZ00121 1572 AEEDKNMALRKAE------EAKKAEEARIEEVMKLYEEEKKMKAEEAK--KAEEAKIKA----------EELKKAEEEKK 1633
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1253 DIERHGEKVEECQRFAKQyinaIKDYELQLVTYKAQLEPVASPAKKPkvqsgSESIIQEYVDLRtRYSELSTLTSQYIRF 1332
Cdd:PTZ00121 1634 KVEQLKKKEAEEKKKAEE----LKKAEEENKIKAAEEAKKAEEDKKK-----AEEAKKAEEDEK-KAAEALKKEAEEAKK 1703
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1333 ISETLRRMEEEERLAEQQRAEERERLAEVEaalekqrqlaeahaQAKAQAEREaqglqRRMQEEVARREEVAVEAQEQKR 1412
Cdd:PTZ00121 1704 AEELKKKEAEEKKKAEELKKAEEENKIKAE--------------EAKKEAEED-----KKKAEEAKKDEEEKKKIAHLKK 1764
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1413 SIQEELQHLRQSSEAEIQAKARqvEAAERSRLRIEEEIRVVRLQLEATerQRGGAEGELQALRARAEEAEAQKRQAqEEA 1492
Cdd:PTZ00121 1765 EEEKKAEEIRKEKEAVIEEELD--EEDEKRRMEVDKKIKDIFDNFANI--IEGGKEGNLVINDSKEMEDSAIKEVA-DSK 1839
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....
gi 1920237962 1493 ERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRaLQALEELRLQAEEAER 1546
Cdd:PTZ00121 1840 NMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDL-KEDDEEEIEEADEIEK 1892
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
1099-1918 |
7.96e-10 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 65.75 E-value: 7.96e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1099 EVLRAHEEQLKEAQAVPATLPELEATKAALKKLRAQAEAqqpvfdaLRDELRGAQEvGERLQQRHGERDVEVERWRERvt 1178
Cdd:PRK04863 294 ELYTSRRQLAAEQYRLVEMARELAELNEAESDLEQDYQA-------ASDHLNLVQT-ALRQQEKIERYQADLEELEER-- 363
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1179 llLERWQAVLAQTDVRQRELEqlgRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVReQLRQEKALLE----DI 1254
Cdd:PRK04863 364 --LEEQNEVVEEADEQQEENE---ARAEAAEEEVDELKSQLADYQQALDVQQTRAIQYQQAVQ-ALERAKQLCGlpdlTA 437
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1255 ERHGEKVEECQRFAKQYINAIKDYELQLVTYKA---QLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLTSQY-- 1329
Cdd:PRK04863 438 DNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAahsQFEQAYQLVRKIAGEVSRSEAWDVARELLRRLREQRHLAEQLqq 517
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1330 IRFISETLRRMEEEERLAEQQRAEERERL-------AEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREE 1402
Cdd:PRK04863 518 LRMRLSELEQRLRQQQRAERLLAEFCKRLgknlddeDELEQLQEELEARLESLSESVSEARERRMALRQQLEQLQARIQR 597
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1403 VAVEAQEQkRSIQEELQHLRQSSEAE----------IQAKARQVEAAERSRLRIEEEIRVVRLQLEATErQRGGAEGE-L 1471
Cdd:PRK04863 598 LAARAPAW-LAAQDALARLREQSGEEfedsqdvteyMQQLLERERELTVERDELAARKQALDEEIERLS-QPGGSEDPrL 675
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1472 QALRAR-----------------AEEAEA---QKRQA--QEEAERLRRQVQDET-------------QRKRQA-----EA 1511
Cdd:PRK04863 676 NALAERfggvllseiyddvsledAPYFSAlygPARHAivVPDLSDAAEQLAGLEdcpedlyliegdpDSFDDSvfsveEL 755
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1512 ELALRVQ-AEAE--------------AAREKQralqaLEELRLQAEEAERRLRQAEAERaRQVQVALETAQR-------- 1568
Cdd:PRK04863 756 EKAVVVKiADRQwrysrfpevplfgrAAREKR-----IEQLRAEREELAERYATLSFDV-QKLQRLHQAFSRfigshlav 829
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1569 SAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVA 1648
Cdd:PRK04863 830 AFEADPEAELRQLNRRRVELERALADHESQEQQQRSQLEQAKEGLSALNRLLPRLNLLADETLADRVEEIREQLDEAEEA 909
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1649 QQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQ--------------LAEGTAQQRLAAEQEL-IR 1713
Cdd:PRK04863 910 KRFVQQHGNALAQLEPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQqafaltevvqrrahFSYEDAAEMLAKNSDLnEK 989
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1714 LRAETEQGEQQRQLLEEEL--------------ARLQREAAAATQKRRELEAELAK--VRAEMEVLLASKARAEE----- 1772
Cdd:PRK04863 990 LRQRLEQAEQERTRAREQLrqaqaqlaqynqvlASLKSSYDAKRQMLQELKQELQDlgVPADSGAEERARARRDElharl 1069
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1773 ----ESRSTSEKSKQRLEAE----AGRFRELAEEAARLRALAEEAKRQRQLAeEDAVRQRAEAERVLAEKLAAIS--EAT 1842
Cdd:PRK04863 1070 sanrSRRNQLEKQLTFCEAEmdnlTKKLRKLERDYHEMREQVVNAKAGWCAV-LRLVKDNGVERRLHRRELAYLSadELR 1148
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1843 RLKTEAEIALKEKEAENERLR---RLAEDEAFQ----------RRLLEEQAAQhkaDIeARLAQLRKASEsELERQKGLV 1909
Cdd:PRK04863 1149 SMSDKALGALRLAVADNEHLRdvlRLSEDPKRPerkvqfyiavYQHLRERIRQ---DI-IRTDDPVEAIE-QMEIELSRL 1223
|
....*....
gi 1920237962 1910 EDTLRQRRQ 1918
Cdd:PRK04863 1224 TEELTSREQ 1232
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
2264-2685 |
9.07e-10 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 65.55 E-value: 9.07e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2264 EEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLK-AEAELLQQQKELAQ--EQAR 2340
Cdd:PTZ00121 1091 EATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEdAKRVEIARKAEDARkaEEAR 1170
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2341 RLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRlRVAEMSRAQ-ARAEEDARRF---RKQAEDIGERLYRT 2416
Cdd:PTZ00121 1171 KAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEER-KAEEARKAEdAKKAEAVKKAeeaKKDAEEAKKAEEER 1249
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2417 ELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQllqlKSEEMQTVrqEQLLQETQALQQSFLSEKD 2496
Cdd:PTZ00121 1250 NNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAK----KAEEKKKA--DEAKKKAEEAKKADEAKKK 1323
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2497 SLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASME-----------EARRRQHEAE---EGV 2562
Cdd:PTZ00121 1324 AEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAkkkadaakkkaEEKKKADEAKkkaEED 1403
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2563 RRQQEELQRLAQQQQQQEKLL--AEENQRLRERLQHLEEERRAalarseEIAPSRAAAARALPNGQDAADGPAAAAEPEH 2640
Cdd:PTZ00121 1404 KKKADELKKAAAAKKKADEAKkkAEEKKKADEAKKKAEEAKKA------DEAKKKAEEAKKAEEAKKKAEEAKKADEAKK 1477
|
410 420 430 440
....*....|....*....|....*....|....*....|....*
gi 1920237962 2641 AFDGLRRKVPAQRLQEVGVLSAEELQQLAQGRTTVAELAQREDVR 2685
Cdd:PTZ00121 1478 KAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAK 1522
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1203-1873 |
1.19e-09 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 65.09 E-value: 1.19e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1203 RQLRYYRESADPLGAWLRDAKQRQEQIQAvpLANSQAVREQLRQEKALLEDIERHGEKVEECQRFAKQYINAIKDY---- 1278
Cdd:PRK03918 111 SSVREWVERLIPYHVFLNAIYIRQGEIDA--ILESDESREKVVRQILGLDDYENAYKNLGEVIKEIKRRIERLEKFikrt 188
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1279 ---ELQLVTYKAQLEPVASPAKK-----PKVQSGSESIIQEYVDLRTRYSELSTLTSQyIRFISETLRRMEEEERLAEQQ 1350
Cdd:PRK03918 189 eniEELIKEKEKELEEVLREINEisselPELREELEKLEKEVKELEELKEEIEELEKE-LESLEGSKRKLEEKIRELEER 267
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1351 RAEERERLAEVE---AALEKQRQLAEAHAQAK-----------------AQAEREAQGLQRRMQEEVARREEVAvEAQEQ 1410
Cdd:PRK03918 268 IEELKKEIEELEekvKELKELKEKAEEYIKLSefyeeyldelreiekrlSRLEEEINGIEERIKELEEKEERLE-ELKKK 346
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1411 KRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQE 1490
Cdd:PRK03918 347 LKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKK 426
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1491 EAERLRRQVQDETQRKRQAEAELALRVQAEAEAarEKQRALQALEELRLQAEEAERRLRQAEAERARQ--VQVALETAQ- 1567
Cdd:PRK03918 427 AIEELKKAKGKCPVCGRELTEEHRKELLEEYTA--ELKRIEKELKEIEEKERKLRKELRELEKVLKKEseLIKLKELAEq 504
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1568 -RSAEAELQSEHASFAEKTAQLERTLKEEhvaVVQLREEATRRAQqqaeaeraraeaerelerwQLKANEALRLRLQAEE 1646
Cdd:PRK03918 505 lKELEEKLKKYNLEELEKKAEEYEKLKEK---LIKLKGEIKSLKK-------------------ELEKLEELKKKLAELE 562
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1647 VAQQKsltqaeaekqkeeaerearrrgKAEEQAVRQRELAEQELEKQRQLaEGTAQQRLAAEQELIRLRaeteQGEQQRQ 1726
Cdd:PRK03918 563 KKLDE----------------------LEEELAELLKELEELGFESVEEL-EERLKELEPFYNEYLELK----DAEKELE 615
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1727 LLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLasKARAEEESRSTSEKskqrleaeagrFRELAEEAARLRALA 1806
Cdd:PRK03918 616 REEKELKKLEEELDKAFEELAETEKRLEELRKELEELE--KKYSEEEYEELREE-----------YLELSRELAGLRAEL 682
|
650 660 670 680 690 700 710
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1807 EEAKRQRQLAEEDAvrqraeaeRVLAEKLAAISEATRLKTEAEIALKEKEAENERLRR---LAEDEAFQR 1873
Cdd:PRK03918 683 EELEKRREEIKKTL--------EKLKEELEEREKAKKELEKLEKALERVEELREKVKKykaLLKERALSK 744
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1508-1875 |
1.38e-09 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 64.76 E-value: 1.38e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1508 QAEAELALRVQAEAEAAREKQRALQALEELrlqAEEAERRLRQAEAERARQvqvaletaqrsaeAELQSEHASFAEktaq 1587
Cdd:pfam17380 279 QHQKAVSERQQQEKFEKMEQERLRQEKEEK---AREVERRRKLEEAEKARQ-------------AEMDRQAAIYAE---- 338
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1588 lertlkEEHVAVVQLREeatrraqqqaeaeraraeaereLERWQLKANEALRLRLQAEEVAQQksLTQAEAEKQKEEAER 1667
Cdd:pfam17380 339 ------QERMAMERERE----------------------LERIRQEERKRELERIRQEEIAME--ISRMRELERLQMERQ 388
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1668 EARRRGKAEEQAVRQRELAEQE-----LEKQRQLAEGTAQQRLAAEQELIRLraETEQGEQQRQLLEEELARLQReaaaa 1742
Cdd:pfam17380 389 QKNERVRQELEAARKVKILEEErqrkiQQQKVEMEQIRAEQEEARQREVRRL--EEERAREMERVRLEEQERQQQ----- 461
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1743 TQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKskqrlEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAvR 1822
Cdd:pfam17380 462 VERLRQQEEERKRKKLELEKEKRDRKRAEEQRRKILEK-----ELEERKQAMIEEERKRKLLEKEMEERQKAIYEEER-R 535
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 1823 QRAEAER---VLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRL 1875
Cdd:pfam17380 536 REAEEERrkqQEMEERRRIQEQMRKATEERSRLEAMEREREMMRQIVESEKARAEY 591
|
|
| CH_PLS_FIM_rpt1 |
cd21217 |
first calponin homology (CH) domain found in the plastin/fimbrin family; This family includes ... |
45-153 |
1.42e-09 |
|
first calponin homology (CH) domain found in the plastin/fimbrin family; This family includes plastin and fimbrin. Plastin has three isoforms, plastin-1, -2, and -3, which are all actin-bundling proteins. Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. Plastin-2, also called L-plastin, LC64P, or lymphocyte cytosolic protein 1 (LCP-1), plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-3, also called T-plastin, is found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Fimbrin has been found in plants and fungi. Arabidopsis thaliana fimbrin (AtFIM) includes fimbrin-1, -2, -3, -4, and -5; they cross-link actin filaments (F-actin) in a calcium independent manner. They stabilize and prevent F-actin depolymerization mediated by profilin. They act as key regulators of actin cytoarchitecture, probably involved in cell cycle, cell division, cell elongation and cytoplasmic tractus. AtFIM5 is an actin bundling factor that is required for pollen germination and pollen tube growth. Fungal fimbrin binds to actin, and functionally associates with actin structures involved in the development and maintenance of cell polarity. Members of this family contain four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409066 [Multi-domain] Cd Length: 114 Bit Score: 58.35 E-value: 1.42e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 45 KKTFTKWVN---------KHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPrERdvirssRLPREKGRMRFHKLQNV 115
Cdd:cd21217 3 KEAFVEHINslladdpdlKHLLPIDPDGDDLFEALRDGVLLCKLINKIVPGTID-ER------KLNKKKPKNIFEATENL 75
|
90 100 110
....*....|....*....|....*....|....*...
gi 1920237962 116 QIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTII 153
Cdd:cd21217 76 NLALNAAKKIGCKVVNIGPQDILDGNPHLVLGLLWQII 113
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
946-1595 |
1.49e-09 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 64.70 E-value: 1.49e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 946 DRLQAEREYgscSRHYQQLLQSLEQGEQEESrcqrcISELKDIRLQLEACEtrtvhrlrlpldkepaRECAQRITEQQKA 1025
Cdd:TIGR02169 201 ERLRREREK---AERYQALLKEKREYEGYEL-----LKEKEALERQKEAIE----------------RQLASLEEELEKL 256
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1026 QAEVDGLGKGVA----RLSAEAEKVLALPEPSPAAptLRSELELTLGKLEQVRSLSAIYLEKL--------KTISLVIRS 1093
Cdd:TIGR02169 257 TEEISELEKRLEeieqLLEELNKKIKDLGEEEQLR--VKEKIGELEAEIASLERSIAEKERELedaeerlaKLEAEIDKL 334
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1094 TQEAEEVLRAHEEQLKEAQAVPAtlpELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGE-------- 1165
Cdd:TIGR02169 335 LAEIEELEREIEEERKRRDKLTE---EYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINElkreldrl 411
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1166 ------RDVEVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQ--------- 1230
Cdd:TIGR02169 412 qeelqrLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEkelsklqre 491
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1231 -----AVPLANSQAVREQLRQEKALLEDI--------------ERHGEKVE-------------------ECQRFAKQY- 1271
Cdd:TIGR02169 492 laeaeAQARASEERVRGGRAVEEVLKASIqgvhgtvaqlgsvgERYATAIEvaagnrlnnvvveddavakEAIELLKRRk 571
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1272 --------INAIK---------------DYELQLVTYKAQLEPVASPAKKPKV-----QSGSESIIQ-----------EY 1312
Cdd:TIGR02169 572 agratflpLNKMRderrdlsilsedgviGFAVDLVEFDPKYEPAFKYVFGDTLvvediEAARRLMGKyrmvtlegelfEK 651
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1313 VDLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRR 1392
Cdd:TIGR02169 652 SGAMTGGSRAPRGGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQE 731
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1393 MQEEVARREEVAVEAQEQKRSIQEELQHLrQSSEAEIQAKARQVEAAERSRLRIE-----EEIRVVRLQLEATERQRGGA 1467
Cdd:TIGR02169 732 EEKLKERLEELEEDLSSLEQEIENVKSEL-KELEARIEELEEDLHKLEEALNDLEarlshSRIPEIQAELSKLEEEVSRI 810
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1468 EGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRK---RQAEAELALRVQAEAEAAREKQRALQALEE----LRLQ 1540
Cdd:TIGR02169 811 EARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIksiEKEIENLNGKKEELEEELEELEAALRDLESrlgdLKKE 890
|
730 740 750 760 770
....*....|....*....|....*....|....*....|....*....|....*
gi 1920237962 1541 AEEAERRLRQAEaERARQVQVALETAqRSAEAELQSEHASFAEKTAQLERTLKEE 1595
Cdd:TIGR02169 891 RDELEAQLRELE-RKIEELEAQIEKK-RKRLSELKAKLEALEEELSEIEDPKGED 943
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1693-1899 |
2.31e-09 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 62.86 E-value: 2.31e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1693 QRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEE 1772
Cdd:COG4942 18 QADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRA 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1773 ESRSTSEKSKQRLEA--------------EAGRFRELAEEAARLRALAEEAKRQ-----RQLAEEDAVRQRAEAERvlAE 1833
Cdd:COG4942 98 ELEAQKEELAELLRAlyrlgrqpplalllSPEDFLDAVRRLQYLKYLAPARREQaeelrADLAELAALRAELEAER--AE 175
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 1834 KLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASE 1899
Cdd:COG4942 176 LEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAE 241
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1347-1545 |
2.38e-09 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 62.90 E-value: 2.38e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1347 AEQQRAEE-RERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARReevavEAQEQKrsiQEELQHLRQSS 1425
Cdd:PRK09510 72 KSAKRAEEqRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQ-----AALKQK---QAEEAAAKAAA 143
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1426 EAEIQAKARQVEAAERSRlRIEEEIRvVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERlrrqVQDETQR 1505
Cdd:PRK09510 144 AAKAKAEAEAKRAAAAAK-KAAAEAK-KKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKK----AAAEAKK 217
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 1920237962 1506 KRQAEAELAL-RVQAEAEAAREKQRALQALEELRLQAEEAE 1545
Cdd:PRK09510 218 KAAAEAKAAAaKAAAEAKAAAEKAAAAKAAEKAAAAKAAAE 258
|
|
| SPEC |
cd00176 |
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ... |
1092-1288 |
2.62e-09 |
|
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here
Pssm-ID: 238103 [Multi-domain] Cd Length: 213 Bit Score: 60.54 E-value: 2.62e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1092 RSTQEAEEVLRAHEEQLKEAQaVPATLPELEATKAALKKLRAQAEAQQPVFDALrdelrgaQEVGERLQQRHGERDVEVe 1171
Cdd:cd00176 7 RDADELEAWLSEKEELLSSTD-YGDDLESVEALLKKHEALEAELAAHEERVEAL-------NELGEQLIEEGHPDAEEI- 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1172 rwRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADpLGAWLRDAKQRQEQIQavPLANSQAVREQLRQEKALL 1251
Cdd:cd00176 78 --QERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADD-LEQWLEEKEAALASED--LGKDLESVEELLKKHKELE 152
|
170 180 190
....*....|....*....|....*....|....*..
gi 1920237962 1252 EDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQ 1288
Cdd:cd00176 153 EELEAHEPRLKSLNELAEELLEEGHPDADEEIEEKLE 189
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1240-1830 |
2.66e-09 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 63.93 E-value: 2.66e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1240 VREQLRQEKALLEDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDlrtRY 1319
Cdd:PRK03918 219 LREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAE---EY 295
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1320 SELSTLTSQYIrfisETLRRMEEEERLAEQQRAEERERLAEVEaalEKQRQLaeahaqakaqaeREAQGLQRRMQEEVAR 1399
Cdd:PRK03918 296 IKLSEFYEEYL----DELREIEKRLSRLEEEINGIEERIKELE---EKEERL------------EELKKKLKELEKRLEE 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1400 REEvAVEAQEQKRSIQEELQHLRQS-SEAEIQAKARQVEAAERSRLRIEEEIRVV---RLQLEATERQRGGAEGELQALR 1475
Cdd:PRK03918 357 LEE-RHELYEEAKAKKEELERLKKRlTGLTPEKLEKELEELEKAKEEIEEEISKItarIGELKKEIKELKKAIEELKKAK 435
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1476 ARA---------EEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAlrvqaEAEAAREKQRALQALEELRLQAEEAER 1546
Cdd:PRK03918 436 GKCpvcgrelteEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELR-----ELEKVLKKESELIKLKELAEQLKELEE 510
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1547 RLRQAEAERarqvqvaLETAQRSAEaELQSEHASFAEKTAQLERTLKEEHvavvQLREEATRRAQQQAEAERARAEAERE 1626
Cdd:PRK03918 511 KLKKYNLEE-------LEKKAEEYE-KLKEKLIKLKGEIKSLKKELEKLE----ELKKKLAELEKKLDELEEELAELLKE 578
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1627 LERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEgtAQQRLA 1706
Cdd:PRK03918 579 LEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELE--ELEKKY 656
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1707 AEQELIRLRAETEQgeqqrqlLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLlaSKARAEEESrstSEKSKQRLE 1786
Cdd:PRK03918 657 SEEEYEELREEYLE-------LSRELAGLRAELEELEKRREEIKKTLEKLKEELEER--EKAKKELEK---LEKALERVE 724
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1787 AEAGRFRELAEEAARlRALAEEAKRQRQLAEE------DAVRQRAEAERV 1830
Cdd:PRK03918 725 ELREKVKKYKALLKE-RALSKVGEIASEIFEEltegkySGVRVKAEENKV 773
|
|
| CH_FLNC_rpt2 |
cd21314 |
second calponin homology (CH) domain found in filamin-C (FLN-C) and similar proteins; ... |
171-273 |
2.80e-09 |
|
second calponin homology (CH) domain found in filamin-C (FLN-C) and similar proteins; Filamin-C (FLN-C), also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. FLN-C contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409163 Cd Length: 115 Bit Score: 57.77 E-value: 2.80e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 171 TAKEKLLLWSQRMVEGcqgLRCDNFTTSWRDGRLFNAIIHRHKPTLI-DMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 249
Cdd:cd21314 11 TPKQRLLGWIQNKVPQ---LPITNFNRDWQDGKALGALVDNCAPGLCpDWESWDPNQPVQNAREAMQQADDWLGVPQVIA 87
|
90 100
....*....|....*....|....
gi 1920237962 250 PEDVDVPQPDEKSIITYVSSLYDA 273
Cdd:cd21314 88 PEEIVDPNVDEHSVMTYLSQFPKA 111
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
2250-2601 |
3.12e-09 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 63.60 E-value: 3.12e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2250 LVLRDKDSAQRLLQEEAEKM-----KQVAEEAARlsvaaqEAARLRQLAEEDLAQQRALAEkmlkekmqavqeatrlkaE 2324
Cdd:pfam17380 277 IVQHQKAVSERQQQEKFEKMeqerlRQEKEEKAR------EVERRRKLEEAEKARQAEMDR------------------Q 332
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2325 AELLQQQKELAQEQARRL----QEDKEQMAQQLAQETQGFQKTLETERQR-QLEMSAEAERLRLRVAEMSRAQARAEEDA 2399
Cdd:pfam17380 333 AAIYAEQERMAMERERELerirQEERKRELERIRQEEIAMEISRMRELERlQMERQQKNERVRQELEAARKVKILEEERQ 412
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2400 RRFRKQAEDIGERLYRTELATQEKVmlvQTLETQRQqsdRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQ 2479
Cdd:pfam17380 413 RKIQQQKVEMEQIRAEQEEARQREV---RRLEEERA---REMERVRLEEQERQQQVERLRQQEEERKRKKLELEKEKRDR 486
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2480 LLQETQalqqsflsekdsllqRERCIEQEKAKLEQLFQDEVAKAQALREEqqrqqqqmqqekqqlaasMEEarRRQHEAE 2559
Cdd:pfam17380 487 KRAEEQ---------------RRKILEKELEERKQAMIEEERKRKLLEKE------------------MEE--RQKAIYE 531
|
330 340 350 360
....*....|....*....|....*....|....*....|..
gi 1920237962 2560 EGVRRQQEELQRlaqqqqqqEKLLAEENQRLRERLQHLEEER 2601
Cdd:pfam17380 532 EERRREAEEERR--------KQQEMEERRRIQEQMRKATEER 565
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
2153-2611 |
3.12e-09 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 63.83 E-value: 3.12e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2153 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILdeelqRLKAEVTEAARQRGQVEEELFSLRVQ 2232
Cdd:TIGR00618 245 LTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRARKAA-----PLAAHIKAVTQIEQQAQRIHTELQSK 319
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2233 MEELGKLKARIEA--ENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAA---RLRQLAEE---DLAQQRALA 2304
Cdd:TIGR00618 320 MRSRAKLLMKRAAhvKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTltqHIHTLQQQkttLTQKLQSLC 399
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2305 EKMLKEKMQAVQEATRLKAEAELLQQ------QKELAQEQARRLQEDKEQMAQQLAQETQGFQK-------------TLE 2365
Cdd:TIGR00618 400 KELDILQREQATIDTRTSAFRDLQGQlahakkQQELQQRYAELCAAAITCTAQCEKLEKIHLQEsaqslkereqqlqTKE 479
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2366 TERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLR 2445
Cdd:TIGR00618 480 QIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRA 559
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2446 EAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEqlLQETQALQQSFLSEKDSLLQRERCIE---------QEKAKLEQLF 2516
Cdd:TIGR00618 560 SLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNI--TVRLQDLTEKLSEAEDMLACEQHALLrklqpeqdlQDVRLHLQQC 637
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2517 QDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEAR---------------------------------RRQHEAEEGVR 2563
Cdd:TIGR00618 638 SQELALKLTALHALQLTLTQERVREHALSIRVLPKEllasrqlalqkmqsekeqltywkemlaqcqtllRELETHIEEYD 717
|
490 500 510 520
....*....|....*....|....*....|....*....|....*...
gi 1920237962 2564 RQQEELQRLAQQQQQQeklLAEENQRLRERLQHLEEERRAALARSEEI 2611
Cdd:TIGR00618 718 REFNEIENASSSLGSD---LAAREDALNQSLKELMHQARTVLKARTEA 762
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
1195-1573 |
3.62e-09 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 63.43 E-value: 3.62e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1195 QRELEQLGRQLRYYRESADplgawlrDAKQRQEQIQA-VPLANSQAvREQLRQekaLLEDIERHGEKVEECQRFAKQYIN 1273
Cdd:COG3096 849 ERELAQHRAQEQQLRQQLD-------QLKEQLQLLNKlLPQANLLA-DETLAD---RLEELREELDAAQEAQAFIQQHGK 917
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1274 AIkdyelqlvtykAQLEPVASPAKKPKVQSgsESIIQEYVDLRTRYSELStltsQYIRFISETLRRM------EEEERLA 1347
Cdd:COG3096 918 AL-----------AQLEPLVAVLQSDPEQF--EQLQADYLQAKEQQRRLK----QQIFALSEVVQRRphfsyeDAVGLLG 980
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1348 EQQRAEE--RERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEevarreevaveAQEQKRSIQEELQHLrqss 1425
Cdd:COG3096 981 ENSDLNEklRARLEQAEEARREAREQLRQAQAQYSQYNQVLASLKSSRDA-----------KQQTLQELEQELEEL---- 1045
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1426 eaEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAE-------RLRRQ 1498
Cdd:COG3096 1046 --GVQADAEAEERARIRRDELHEELSQNRSRRSQLEKQLTRCEAEMDSLQKRLRKAERDYKQEREQVVqakagwcAVLRL 1123
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237962 1499 VQDETQRKRQAEAELalrvqaeaeaarekqrALQALEELRLQAEEAERRLRQAEAERArQVQVALETAQRSAEAE 1573
Cdd:COG3096 1124 ARDNDVERRLHRREL----------------AYLSADELRSMSDKALGALRLAVADNE-HLRDALRLSEDPRRPE 1181
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1716-2305 |
3.77e-09 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 63.16 E-value: 3.77e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1716 AETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTS--EKSKQRLEAEAG--- 1790
Cdd:PRK03918 186 KRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELEslEGSKRKLEEKIRele 265
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1791 --------RFRELAEEAARLRALAEEAKRQRQLAEEdaVRQRAEAERVLAEKLAAISEATRlktEAEIALKEKEAENERL 1862
Cdd:PRK03918 266 erieelkkEIEELEEKVKELKELKEKAEEYIKLSEF--YEEYLDELREIEKRLSRLEEEIN---GIEERIKELEEKEERL 340
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1863 RRLAEDEAFQRRLLEEQAAQHKA--DIEARLAQLRKASES----ELERQKGLVEDTLRQRRQVEEEILAL---KGSFEKA 1933
Cdd:PRK03918 341 EELKKKLKELEKRLEELEERHELyeEAKAKKEELERLKKRltglTPEKLEKELEELEKAKEEIEEEISKItarIGELKKE 420
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1934 AAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLaaeeerrrrEAEERVQKSLAAEEEAARQRKAALEEVERLKAK 2013
Cdd:PRK03918 421 IKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYT---------AELKRIEKELKEIEEKERKLRKELRELEKVLKK 491
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2014 VEEARRLRERAEQESARQLQLAQEAAQKrlqAEEKAHAF-AVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAeeaeaa 2092
Cdd:PRK03918 492 ESELIKLKELAEQLKELEEKLKKYNLEE---LEKKAEEYeKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELE------ 562
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2093 reraereaaqsrRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQ 2172
Cdd:PRK03918 563 ------------KKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDK 630
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2173 ALRQKAQVEQELTALRLQLEETdhQKSILDEELQRLKAEVTEaarqrgqVEEELFSLRVQMEELGKLKARIEAEnralvL 2252
Cdd:PRK03918 631 AFEELAETEKRLEELRKELEEL--EKKYSEEEYEELREEYLE-------LSRELAGLRAELEELEKRREEIKKT-----L 696
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|....*
gi 1920237962 2253 RDkdsaqrlLQEEAEKMKQVAEEAARLSVAAQEAARLRQ--LAEEDLAQQRALAE 2305
Cdd:PRK03918 697 EK-------LKEELEEREKAKKELEKLEKALERVEELREkvKKYKALLKERALSK 744
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1109-1530 |
4.13e-09 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 62.83 E-value: 4.13e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1109 KEAQAVPATLPELEA-----------------TKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERdVEVE 1171
Cdd:pfam17380 221 KEVQGMPHTLAPYEKmerrkesfnlaedvttmTPEYTVRYNGQTMTENEFLNQLLHIVQHQKAVSERQQQEKFEK-MEQE 299
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1172 RWR---ERVTLLLERWQAVLAQTDVRQRELEqlgRQLRYYRESAdplgawlRDAKQRQEQIQAVPLANSQAVREQLRQEK 1248
Cdd:pfam17380 300 RLRqekEEKAREVERRRKLEEAEKARQAEMD---RQAAIYAEQE-------RMAMERERELERIRQEERKRELERIRQEE 369
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1249 ALLEDierhgEKVEECQRFakqyinaikdyelqlvtykaQLEPvaspakkpkvQSGSESIIQEYVDLRtrysELSTLTSQ 1328
Cdd:pfam17380 370 IAMEI-----SRMRELERL--------------------QMER----------QQKNERVRQELEAAR----KVKILEEE 410
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1329 YIRFISETLRRMEEEERLAEQQRAEERERLAEveaalEKQRQLAEAHAQakaQAEREAQGLQRRMQEEVARREEVAVEAQ 1408
Cdd:pfam17380 411 RQRKIQQQKVEMEQIRAEQEEARQREVRRLEE-----ERAREMERVRLE---EQERQQQVERLRQQEEERKRKKLELEKE 482
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1409 EQKRSIQEELQhlRQSSEAEIQAKARQVEAAERSRLRIEEEIRvvrlqleatERQRGGAEGElqalRARAEEAEAQKRQA 1488
Cdd:pfam17380 483 KRDRKRAEEQR--RKILEKELEERKQAMIEEERKRKLLEKEME---------ERQKAIYEEE----RRREAEEERRKQQE 547
|
410 420 430 440
....*....|....*....|....*....|....*....|..
gi 1920237962 1489 QEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRA 1530
Cdd:pfam17380 548 MEERRRIQEQMRKATEERSRLEAMEREREMMRQIVESEKARA 589
|
|
| CH_dFLNA-like_rpt2 |
cd21315 |
second calponin homology (CH) domain found in Drosophila melanogaster filamin-A (dFLNA) and ... |
166-268 |
4.36e-09 |
|
second calponin homology (CH) domain found in Drosophila melanogaster filamin-A (dFLNA) and similar proteins; Drosophila melanogaster filamin-A (dFLNA or dFLN-A), also called actin-binding protein 280 (ABP-280) or filamin-1, is involved in germline ring canal formation. It may tether actin microfilaments within the ovarian ring canal to the cell membrane and contributes to actin microfilament organization. dFLNA contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409164 Cd Length: 118 Bit Score: 57.10 E-value: 4.36e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 166 QSEDMTAKEKLLLWSQRMVEGcqgLRCDNFTTSWRDGRLFNAIIHRHKPTLI-DMNKVYRQTNLENLDQAFSVAERDLGV 244
Cdd:cd21315 11 DGKGPTPKQRLLGWIQSKVPD---LPITNFTNDWNDGKAIGALVDALAPGLCpDWEDWDPKDAVKNAKEAMDLAEDWLDV 87
|
90 100
....*....|....*....|....
gi 1920237962 245 TRLLDPEDVDVPQPDEKSIITYVS 268
Cdd:cd21315 88 PQLIKPEEMVNPKVDELSMMTYLS 111
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
1685-1916 |
4.39e-09 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 61.79 E-value: 4.39e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1685 LAEQELEKQRQLAEGTAQqrlAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAelakvraemevll 1764
Cdd:TIGR02794 47 AVAQQANRIQQQKKPAAK---KEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQA------------- 110
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1765 askARAEEESRSTSEKSKQRLEAEAGRfrelAEEAARLRALAEEAKRQrqlAEEDAVRQRAEAervlAEKLAaisEATRL 1844
Cdd:TIGR02794 111 ---AKQAEEKQKQAEEAKAKQAAEAKA----KAEAEAERKAKEEAAKQ---AEEEAKAKAAAE----AKKKA---EEAKK 173
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 1845 KTEAEiALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIE----ARLAQLRKASESELERQKGLVEDTLRQR 1916
Cdd:TIGR02794 174 KAEAE-AKAKAEAEAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEaaaaAAAEAERKADEAELGDIFGLASGSNAEK 248
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
4383-4421 |
4.51e-09 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 54.64 E-value: 4.51e-09
10 20 30
....*....|....*....|....*....|....*....
gi 1920237962 4383 FLEVQYLTGGLIEPDTPGRVALDEALQRGTVDARTAQKL 4421
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1489-1970 |
4.59e-09 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 62.86 E-value: 4.59e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1489 QEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQR 1568
Cdd:COG4717 52 EKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLY 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1569 SAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREeatrraqqqaeaeraraeaerelerwqlKANEALRLRLQAEEVA 1648
Cdd:COG4717 132 QELEALEAELAELPERLEELEERLEELRELEEELEE----------------------------LEAELAELQEELEELL 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1649 QQKSLTQAeaekqkeeaerearrrgKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLL 1728
Cdd:COG4717 184 EQLSLATE-----------------EELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERLK 246
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1729 EEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRStseKSKQRLEAEAGRFRELAEEAARLRALAEE 1808
Cdd:COG4717 247 EARLLLLIAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLA---REKASLGKEAEELQALPALEELEEEELEE 323
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1809 AKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLE--EQAAQHKAD 1886
Cdd:COG4717 324 LLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAEAGVEDEEELRAALEqaEEYQELKEE 403
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1887 IEARLAQLRKASESELERQKGLVEDTLRQR-RQVEEEILALKGSFEKAAAGKAELELELGRIRGtaEDTLRSKEQAEQEA 1965
Cdd:COG4717 404 LEELEEQLEELLGELEELLEALDEEELEEElEELEEELEELEEELEELREELAELEAELEQLEE--DGELAELLQELEEL 481
|
....*
gi 1920237962 1966 ARQRQ 1970
Cdd:COG4717 482 KAELR 486
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1372-1595 |
4.63e-09 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 61.70 E-value: 4.63e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1372 AEAHAQAKAQAEREAQGLQRRMQEEVARREEvaveAQEQKRSIQEELQHLRQ---SSEAEIQAKARQVEAAERSRLRIEE 1448
Cdd:COG4942 15 AAAQADAAAEAEAELEQLQQEIAELEKELAA----LKKEEKALLKQLAALERriaALARRIRALEQELAALEAELAELEK 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1449 EIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQ 1528
Cdd:COG4942 91 EIAELRAELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELE 170
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 1529 RALQALEELRLQAEEAERRLRQAEAERARQVQV--ALETAQRSAEAELQSEHASFAEKTAQLERTLKEE 1595
Cdd:COG4942 171 AERAELEALLAELEEERAALEALKAERQKLLARleKELAELAAELAELQQEAEELEALIARLEAEAAAA 239
|
|
| PLEC |
smart00250 |
Plectin repeat; |
4305-4342 |
6.86e-09 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 54.03 E-value: 6.86e-09
10 20 30
....*....|....*....|....*....|....*...
gi 1920237962 4305 QRLLEAQACTGGIIDPSTGERFPVTDAVNKGLVDKIMV 4342
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| CH_PLS3_rpt3 |
cd21331 |
third calponin homology (CH) domain found in plastin-3; Plastin-3, also called T-plastin, is ... |
39-158 |
7.30e-09 |
|
third calponin homology (CH) domain found in plastin-3; Plastin-3, also called T-plastin, is an actin-bundling protein found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Plastin-3 contains four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409180 Cd Length: 134 Bit Score: 56.93 E-value: 7.30e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 39 ERDRVQKKTFTKWVNKhlIKAQRHISDLYEDLRDGHNLISLLEVLSgdsLPRERDVIRSSRLPREKGRMRfhKLQNVQIA 118
Cdd:cd21331 18 EGETREERTFRNWMNS--LGVNPHVNHLYGDLQDALVILQLYEKIK---VPVDWNKVNKPPYPKLGANMK--KLENCNYA 90
|
90 100 110 120
....*....|....*....|....*....|....*....|.
gi 1920237962 119 LDYLRHR-QVKLVNIRNDDIADGNPKLTLGLIWTIILHFQI 158
Cdd:cd21331 91 VELGKHPaKFSLVGIGGQDLNDGNPTLTLALVWQLMRRYTL 131
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1363-1587 |
7.73e-09 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 61.36 E-value: 7.73e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1363 AALEKQRQLAEAHAQAKAQAEREAQGLQRrmQEEVarREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAK---ARQVEAA 1439
Cdd:PRK09510 60 VVEQYNRQQQQQKSAKRAEEQRKKKEQQQ--AEEL--QQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKqaaLKQKQAE 135
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1440 ERsrlrieeeirvvrlQLEATERQRGGAEGELQALRARAEEAEAQ-KRQAQEEAerlrrQVQDETQRKRQAEAELALRVQ 1518
Cdd:PRK09510 136 EA--------------AAKAAAAAKAKAEAEAKRAAAAAKKAAAEaKKKAEAEA-----AKKAAAEAKKKAEAEAAAKAA 196
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 1519 AEAEAAREKQRALQAleelrlqAEEAErrlRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQ 1587
Cdd:PRK09510 197 AEAKKKAEAEAKKKA-------AAEAK---KKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKA 255
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
2160-2610 |
8.10e-09 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 62.29 E-value: 8.10e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2160 DAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKaevtEAARQRGQVEEELFSLRVQMEELGKL 2239
Cdd:TIGR00618 200 TLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKR----EAQEEQLKKQQLLKQLRARIEELRAQ 275
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2240 KARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEA-ARLSVAAQEAARLRQLA------EEDLAQQRALAEKMLKEKM 2312
Cdd:TIGR00618 276 EAVLEETQERINRARKAAPLAAHIKAVTQIEQQAQRIhTELQSKMRSRAKLLMKRaahvkqQSSIEEQRRLLQTLHSQEI 355
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2313 QAVQEATRLKAEAELLQQQKELAQeQARRLQEDKEQMAQQLaqetQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQ 2392
Cdd:TIGR00618 356 HIRDAHEVATSIREISCQQHTLTQ-HIHTLQQQKTTLTQKL----QSLCKELDILQREQATIDTRTSAFRDLQGQLAHAK 430
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2393 ARAEEDARRFRKQAEDIGER----------LYRTELATQEKVMLVQTLETQRQQSDR-----DAERLREAIAELEHEKDK 2457
Cdd:TIGR00618 431 KQQELQQRYAELCAAAITCTaqceklekihLQESAQSLKEREQQLQTKEQIHLQETRkkavvLARLLELQEEPCPLCGSC 510
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2458 LKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQM 2537
Cdd:TIGR00618 511 IHPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQ 590
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 2538 QQEKQQLAASMEEARRR------QHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEE 2610
Cdd:TIGR00618 591 NITVRLQDLTEKLSEAEdmlaceQHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRV 669
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1337-1557 |
8.65e-09 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 60.93 E-value: 8.65e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1337 LRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQE 1416
Cdd:COG4942 29 LEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAE 108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1417 ---ELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAE 1493
Cdd:COG4942 109 llrALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERA 188
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237962 1494 RLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERAR 1557
Cdd:COG4942 189 ALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAAGFAALK 252
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1816-2349 |
9.54e-09 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 62.00 E-value: 9.54e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1816 AEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEdEAFQRRLLEEQAAQHKADIEARLAQLR 1895
Cdd:PRK03918 187 RTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKE-EIEELEKELESLEGSKRKLEEKIRELE 265
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1896 KASESELERQKGLVEDT--LRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAA 1973
Cdd:PRK03918 266 ERIEELKKEIEELEEKVkeLKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKK 345
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1974 EEERrrreaeerVQKSLAAEEEAArqrkaalEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAfA 2053
Cdd:PRK03918 346 KLKE--------LEKRLEELEERH-------ELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEIS-K 409
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2054 VQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEReaaqsRRQVEEAERLKQSAEEQAQAQAQAQAAAEK 2133
Cdd:PRK03918 410 ITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELL-----EEYTAELKRIEKELKEIEEKERKLRKELRE 484
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2134 LRKEAEQEAARraqaeqaaLRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKA--- 2210
Cdd:PRK03918 485 LEKVLKKESEL--------IKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEElkk 556
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2211 EVTEAARQRGQVEEELFSLRVQMEELG---------KLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEA-ARLS 2280
Cdd:PRK03918 557 KLAELEKKLDELEEELAELLKELEELGfesveeleeRLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAfEELA 636
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2281 VAAQEAARLR-QLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQM 2349
Cdd:PRK03918 637 ETEKRLEELRkELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEER 706
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
1343-1557 |
1.05e-08 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 60.63 E-value: 1.05e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1343 EERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSiQEELQHLR 1422
Cdd:TIGR02794 49 AQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQK-QAEEAKAK 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1423 QSSEAEIQAKA-RQVEAAERSRLRIEEEirvvRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEE----AERLRR 1497
Cdd:TIGR02794 128 QAAEAKAKAEAeAERKAKEEAAKQAEEE----AKAKAAAEAKKKAEEAKKKAEAEAKAKAEAEAKAKAEEakakAEAAKA 203
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1498 QVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERAR 1557
Cdd:TIGR02794 204 KAAAEAAAKAEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEKQGGARGAAAGSEVDK 263
|
|
| COG3903 |
COG3903 |
Predicted ATPase [General function prediction only]; |
1374-1833 |
1.14e-08 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443109 [Multi-domain] Cd Length: 933 Bit Score: 61.57 E-value: 1.14e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1374 AHAQAKAQAEREAQGLQRRMQEEVARreeVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVV 1453
Cdd:COG3903 475 EYAAERLAEAGERAAARRRHADYYLA---LAERAAAELRGPDQLAWLARLDAEHDNLRAALRWALAHGDAELALRLAAAL 551
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1454 RLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQA 1533
Cdd:COG3903 552 APFWFLRGLLREGRRWLERALAAAGEAAAALAAAAALAAAAAAARAAAAAAAAAAAAAAAAAAAAAAAAAALLLLAALAA 631
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1534 LEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQ 1613
Cdd:COG3903 632 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAALAAAAAALAAAAAAAALAAAAAAAL 711
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1614 AEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQ 1693
Cdd:COG3903 712 AAAAAAAAAAAAAAALLAAAAAAALAAAAAAAALALAAAAAAAAAAAAAAALAAAAAAAALAALLLALAAAAAALAAAAA 791
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1694 RQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEE 1773
Cdd:COG3903 792 AAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAALAAALAAAAAAAAAAAAAAAAAAALAAALAAAAAAAAAAALA 871
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1774 SRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAE 1833
Cdd:COG3903 872 AAAAAAAAAAAALLAAAAAAAAAAAAAAAAAAALAAAAAAAAAAALAAAAAAAAAAAAAA 931
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
1091-2028 |
1.17e-08 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 61.89 E-value: 1.17e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1091 IRSTQEAEEVLRAHEEQLKEAQAvpatlpeleatkaALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQ--RHGERdv 1168
Cdd:COG3096 284 SERALELRRELFGARRQLAEEQY-------------RLVEMARELEELSARESDLEQDYQAASDHLNLVQTalRQQEK-- 348
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1169 eVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVReqlrqek 1248
Cdd:COG3096 349 -IERYQEDLEELTERLEEQEEVVEEAAEQLAEAEARLEAAEEEVDSLKSQLADYQQALDVQQTRAIQYQQAVQ------- 420
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1249 ALledierhgEKVEECQRFAKQYINAIKDYelqLVTYKAQLEpvaspakkpkvqsgseSIIQEYVDLRTRYSELSTLTSQ 1328
Cdd:COG3096 421 AL--------EKARALCGLPDLTPENAEDY---LAAFRAKEQ----------------QATEEVLELEQKLSVADAARRQ 473
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1329 YIRFIsETLRRMEEE-ERLAEQQRAEERER-------LAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARR 1400
Cdd:COG3096 474 FEKAY-ELVCKIAGEvERSQAWQTARELLRryrsqqaLAQRLQQLRAQLAELEQRLRQQQNAERLLEEFCQRIGQQLDAA 552
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1401 EEVAVEAQEQKRSIQEELQHLRQSSEAEIQakarqveaaersrlrieeeirvvrlqleaTERQRGGAEGELQALRARAEE 1480
Cdd:COG3096 553 EELEELLAELEAQLEELEEQAAEAVEQRSE-----------------------------LRQQLEQLRARIKELAARAPA 603
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1481 AeaqkRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAeeaeRRLRQAE-AERARQV 1559
Cdd:COG3096 604 W----LAAQDALERLREQSGEALADSQEVTAAMQQLLEREREATVERDELAARKQALESQI----ERLSQPGgAEDPRLL 675
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1560 Q----------------VALETAQR-------SAEAELQSEHASFAEKTAQLERTLkeEHVAVVQLREEATRRAQQQAEA 1616
Cdd:COG3096 676 AlaerlggvllseiyddVTLEDAPYfsalygpARHAIVVPDLSAVKEQLAGLEDCP--EDLYLIEGDPDSFDDSVFDAEE 753
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1617 ERARAEAERELERWQL-------------KANEALRLRLQAEEVAQQKSltqaeaekqkeeaerearrrgkaeEQAVRQR 1683
Cdd:COG3096 754 LEDAVVVKLSDRQWRYsrfpevplfgraaREKRLEELRAERDELAEQYA------------------------KASFDVQ 809
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1684 ELaeQELEKQ-RQLAEGTAQQRLAAEQElirlrAETEQGEQQRQLLEEELARL----QREAAAATQKRRELEAeLAKVRA 1758
Cdd:COG3096 810 KL--QRLHQAfSQFVGGHLAVAFAPDPE-----AELAALRQRRSELERELAQHraqeQQLRQQLDQLKEQLQL-LNKLLP 881
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1759 EMEVL----LASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRA--LAEEAKRQRQL---AEEDAVRQRAEAER 1829
Cdd:COG3096 882 QANLLadetLADRLEELREELDAAQEAQAFIQQHGKALAQLEPLVAVLQSdpEQFEQLQADYLqakEQQRRLKQQIFALS 961
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1830 VLAEKLAAISEAtrlktEAEIALKEKEAENERLR---RLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQK 1906
Cdd:COG3096 962 EVVQRRPHFSYE-----DAVGLLGENSDLNEKLRarlEQAEEARREAREQLRQAQAQYSQYNQVLASLKSSRDAKQQTLQ 1036
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1907 GLvedtlrQRRQVEEEILALKGSFEKAAAGKAELELELGRIRG--TAEDTLRSKEQAEQEAARQRqlaaeeerrrreaEE 1984
Cdd:COG3096 1037 EL------EQELEELGVQADAEAEERARIRRDELHEELSQNRSrrSQLEKQLTRCEAEMDSLQKR-------------LR 1097
|
970 980 990 1000
....*....|....*....|....*....|....*....|....*.
gi 1920237962 1985 RVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRL--RERAEQES 2028
Cdd:COG3096 1098 KAERDYKQEREQVVQAKAGWCAVLRLARDNDVERRLhrRELAYLSA 1143
|
|
| CH_FLNB_rpt2 |
cd21313 |
second calponin homology (CH) domain found in filamin-B (FLN-B) and similar proteins; ... |
166-273 |
1.31e-08 |
|
second calponin homology (CH) domain found in filamin-B (FLN-B) and similar proteins; Filamin-B (FLN-B) is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton. It may promote orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It anchors various transmembrane proteins to the actin cytoskeleton. FLN-B contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409162 Cd Length: 110 Bit Score: 55.48 E-value: 1.31e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 166 QSEDMTAKEKLLLWSQRMVEGcqgLRCDNFTTSWRDGRLFNAIIHRHKPTLI-DMNKVYRQTNLENLDQAFSVAERDLGV 244
Cdd:cd21313 3 DAKKQTPKQRLLGWIQNKIPY---LPITNFNQNWQDGKALGALVDSCAPGLCpDWESWDPQKPVDNAREAMQQADDWLGV 79
|
90 100
....*....|....*....|....*....
gi 1920237962 245 TRLLDPEDVDVPQPDEKSIITYVSSLYDA 273
Cdd:cd21313 80 PQVITPEEIIHPDVDEHSVMTYLSQFPKA 108
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
2153-2609 |
1.39e-08 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 61.47 E-value: 1.39e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2153 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEET-----DHQKSILDEELQRLKAEVTEAARQRGQVEEELF 2227
Cdd:COG4913 247 AREQIELLEPIRELAERYAAARERLAELEYLRAALRLWFAQRrlellEAELEELRAELARLEAELERLEARLDALREELD 326
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2228 SLRVQMEELG-----KLKARIEAENRALVLRDKDSAQrlLQEEAEKMK-QVAEEAARLSVAAQEAARLRQLAEEDLAQQR 2301
Cdd:COG4913 327 ELEAQIRGNGgdrleQLEREIERLERELEERERRRAR--LEALLAALGlPLPASAEEFAALRAEAAALLEALEEELEALE 404
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2302 ALAEKMLKEKMQAVQEATRLKAEAELLQQQK---ELAQEQARR-----LQEDKEQM------------------------ 2349
Cdd:COG4913 405 EALAEAEAALRDLRRELRELEAEIASLERRKsniPARLLALRDalaeaLGLDEAELpfvgelievrpeeerwrgaiervl 484
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2350 ---AQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLyRTELATQEKVML 2426
Cdd:COG4913 485 ggfALTLLVPPEHYAAALRWVNRLHLRGRLVYERVRTGLPDPERPRLDPDSLAGKLDFKPHPFRAWL-EAELGRRFDYVC 563
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2427 VQTLE-------------------TQRQQSDRDAERL--------REAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQ-- 2477
Cdd:COG4913 564 VDSPEelrrhpraitragqvkgngTRHEKDDRRRIRSryvlgfdnRAKLAALEAELAELEEELAEAEERLEALEAELDal 643
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2478 EQLLQETQALQQSFLSEKDsLLQRERCIEQEKAKLEQL--FQDEVAKAQALREEqqrqqqqmqqekqqLAASMEEARRRQ 2555
Cdd:COG4913 644 QERREALQRLAEYSWDEID-VASAEREIAELEAELERLdaSSDDLAALEEQLEE--------------LEAELEELEEEL 708
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|....
gi 1920237962 2556 HEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSE 2609
Cdd:COG4913 709 DELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDA 762
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1486-2006 |
1.76e-08 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 61.08 E-value: 1.76e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1486 RQAQEEAERLRRQVQD----ETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARqvqv 1561
Cdd:COG4913 238 ERAHEALEDAREQIELlepiRELAERYAAARERLAELEYLRAALRLWFAQRRLELLEAELEELRAELARLEAELER---- 313
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1562 aLETAQRSAEAELQSEHASFAEKTAQLERTLKEEhvavvqlreeatrraqqqaeaeraraeaereLERWQLKANEALRLR 1641
Cdd:COG4913 314 -LEARLDALREELDELEAQIRGNGGDRLEQLERE-------------------------------IERLERELEERERRR 361
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1642 LQAEEVAQQKSLTQAEAEKQKEeaerearrrgKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQG 1721
Cdd:COG4913 362 ARLEALLAALGLPLPASAEEFA----------ALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASL 431
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1722 EQQRQLLEEELARLQREAAAATQ-KRRELE--AELAKVRAE-------MEVLLASKAR---------------------- 1769
Cdd:COG4913 432 ERRKSNIPARLLALRDALAEALGlDEAELPfvGELIEVRPEeerwrgaIERVLGGFALtllvppehyaaalrwvnrlhlr 511
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1770 ------------AEEESRSTSEKS-KQRLEAEAGRFR-----ELAEEAARLRALAEEA-----------------KRQRQ 1814
Cdd:COG4913 512 grlvyervrtglPDPERPRLDPDSlAGKLDFKPHPFRawleaELGRRFDYVCVDSPEElrrhpraitragqvkgnGTRHE 591
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1815 LAEEDAVRQR----AEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEE-----QAAQHKA 1885
Cdd:COG4913 592 KDDRRRIRSRyvlgFDNRAKLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDeidvaSAEREIA 671
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1886 DIEARLAQLRKASeSELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRirgtAEDTLRSKEQAEQEA 1965
Cdd:COG4913 672 ELEAELERLDASS-DDLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDE----LQDRLEAAEDLARLE 746
|
570 580 590 600
....*....|....*....|....*....|....*....|.
gi 1920237962 1966 ARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEE 2006
Cdd:COG4913 747 LRALLEERFAAALGDAVERELRENLEERIDALRARLNRAEE 787
|
|
| EntF |
COG1020 |
EntF, seryl-AMP synthase component of non-ribosomal peptide synthetase [Secondary metabolites ... |
1358-1770 |
1.92e-08 |
|
EntF, seryl-AMP synthase component of non-ribosomal peptide synthetase [Secondary metabolites biosynthesis, transport and catabolism];
Pssm-ID: 440643 [Multi-domain] Cd Length: 1329 Bit Score: 61.03 E-value: 1.92e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1358 LAEVEAALEKQRQLAEAHAQAkaqaeREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSE----------- 1426
Cdd:COG1020 885 LGEIEAALLQHPGVREAVVVA-----REDAPGDKRLVAYVVPEAGAAAAAALLRLALALLLPPYMVPAAvvlllplpltg 959
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1427 --------AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQ 1498
Cdd:COG1020 960 ngkldrlaLPAPAAAAAAAAAAPPAEEEEEEAALALLLLLVVVVGDDDFFFFGGGLGLLLLLALARAARLLLLLLLLLLL 1039
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1499 VQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEH 1578
Cdd:COG1020 1040 FLAAAAAAAAAAAAAAAAAAAAPLAAAAAPLPLPPLLLSLLALLLALLLLLALLALLALLLLLLLLLLLLALLLLLALLL 1119
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1579 ASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEA 1658
Cdd:COG1020 1120 ALLAALRARRAVRQEGPRLRLLVALAAALALAALLALLLAAAAAAAELLAAAALLLLLALLLLALLLLLLLLLLLLLLLL 1199
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1659 EKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQRE 1738
Cdd:COG1020 1200 LLLLLLLLLLLLLLLLLLLLLLLLLAAAAAALLALALLLALLALAALLALAALAALAAALLALALALLALALLLLALALL 1279
|
410 420 430
....*....|....*....|....*....|..
gi 1920237962 1739 AAAATQKRRELEAELAKVRAEMEVLLASKARA 1770
Cdd:COG1020 1280 LPALARARAARTARALALLLLLALLLLLALAL 1311
|
|
| CH_AtFIM_like_rpt3 |
cd21299 |
third calponin homology (CH) domain found in the Arabidopsis thaliana fimbrin family; The ... |
44-155 |
3.50e-08 |
|
third calponin homology (CH) domain found in the Arabidopsis thaliana fimbrin family; The Arabidopsis thaliana fimbrin (AtFIM) family includes Fimbrin-1, -2, -3, -4, and -5, which cross-link actin filaments (F-actin) in a calcium independent manner. They stabilize and prevent F-actin depolymerization mediated by profilin. They act as key regulators of actin cytoarchitecture, probably involved in cell cycle, cell division, cell elongation and cytoplasmic tractus. AtFIM5 is an actin bundling factor that is required for pollen germination and pollen tube growth. Members of this family contain four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409148 Cd Length: 114 Bit Score: 54.43 E-value: 3.50e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 44 QKKTFTKWVNKhlIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRERdvirSSRLPRekgRMRFHKLQNVQIALDYLR 123
Cdd:cd21299 5 EERCFRLWINS--LGIDTYVNNVFEDVRDGWVLLEVLDKVSPGSVNWKH----ANKPPI---KMPFKKVENCNQVVKIGK 75
|
90 100 110
....*....|....*....|....*....|..
gi 1920237962 124 HRQVKLVNIRNDDIADGNPKLTLGLIWTIILH 155
Cdd:cd21299 76 QLKFSLVNVAGNDIVQGNKKLILALLWQLMRY 107
|
|
| SPEC |
cd00176 |
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ... |
518-707 |
3.60e-08 |
|
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here
Pssm-ID: 238103 [Multi-domain] Cd Length: 213 Bit Score: 57.07 E-value: 3.60e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 518 LRYLQDLLAWVEENQRRLDSAEWGVDLPSVEAQLGSHRGLHQSVEEFRTKIERARTDEGQLSPATRGAY---RDCLGRLD 594
Cdd:cd00176 6 LRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAeeiQERLEELN 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 595 LQYAKLLSSSKARLRSLE---SLHGFVAAATKELMWLSDREEEEVGFDWSDRNTNMAAKKEGYSALMHELELKEKKIKEI 671
Cdd:cd00176 86 QRWEELRELAEERRQRLEealDLQQFFRDADDLEQWLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSL 165
|
170 180 190
....*....|....*....|....*....|....*..
gi 1920237962 672 QSTGDRLLREDHP-ARPTAESFQAALQTQWSWMLQLC 707
Cdd:cd00176 166 NELAEELLEEGHPdADEEIEEKLEELNERWEELLELA 202
|
|
| CH_PARV_rpt2 |
cd21222 |
second calponin homology (CH) domain found in the parvin family; The parvin family includes ... |
38-156 |
3.84e-08 |
|
second calponin homology (CH) domain found in the parvin family; The parvin family includes alpha-parvin, beta-parvin, and gamma-parvin. Alpha-parvin, also called actopaxin, calponin-like integrin-linked kinase-binding protein (CH-ILKBP), or matrix-remodeling-associated protein 2, plays a role in sarcomere organization and in smooth muscle cell contraction. It is required for normal development of the embryonic cardiovascular system, and for normal septation of the heart outflow tract. Beta-parvin, also called affixin, is an adapter protein that plays a role in integrin signaling via ILK and in activation of the GTPases Cdc42 and Rac1 by guanine exchange factors, such as ARHGEF6. Both alpha-parvin and beta-parvin are involved in the reorganization of the actin cytoskeleton and the formation of lamellipodia, and both play roles in cell adhesion, cell spreading, establishment or maintenance of cell polarity, and cell migration. Gamma-parvin probably plays a role in the regulation of cell adhesion and cytoskeleton organization. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409071 Cd Length: 121 Bit Score: 54.52 E-value: 3.84e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 38 DERDRVQ--KKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRERDVIRSSRLPrekgrmrfHKLQNV 115
Cdd:cd21222 9 EAPEKLAevKELLLQFVNKHLAKLNIEVTDLATQFHDGVYLILLIGLLEGFFVPLHEYHLTPSTDD--------EKLHNV 80
|
90 100 110 120
....*....|....*....|....*....|....*....|.
gi 1920237962 116 QIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHF 156
Cdd:cd21222 81 KLALELMEDAGISTPKIRPEDIVNGDLKSILRVLYSLFSKY 121
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1686-1873 |
4.62e-08 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 58.66 E-value: 4.62e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1686 AEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAA--AATQKRRELEAELAKVRAEMEVL 1763
Cdd:PRK09510 77 AEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQkqAEEAAAKAAAAAKAKAEAEAKRA 156
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1764 LASKARAEEESrstseksKQRLEAEAgrfRELAEEAARLRALAEEAKrqrQLAEEDAVRQRAEAERVlAEKLAAISEATR 1843
Cdd:PRK09510 157 AAAAKKAAAEA-------KKKAEAEA---AKKAAAEAKKKAEAEAAA---KAAAEAKKKAEAEAKKK-AAAEAKKKAAAE 222
|
170 180 190
....*....|....*....|....*....|
gi 1920237962 1844 LKTEAEIALKEKEAENERLRRLAEDEAFQR 1873
Cdd:PRK09510 223 AKAAAAKAAAEAKAAAEKAAAAKAAEKAAA 252
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1096-1451 |
4.63e-08 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 59.69 E-value: 4.63e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1096 EAEEVLRAHEEQLKEAQAVPATL-PELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWR 1174
Cdd:TIGR02168 681 ELEEKIEELEEKIAELEKALAELrKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELE 760
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1175 ERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLrDAKQRQEQIQAVPLANSQAVREQLRQEKALLED- 1253
Cdd:TIGR02168 761 AEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREAL-DELRAELTLLNEEAANLRERLESLERRIAATERr 839
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1254 IERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEpvaspakkpKVQSGSESIIQEYVDLRTRYSELstltsqyirfi 1333
Cdd:TIGR02168 840 LEDLEEQIEELSEDIESLAAEIEELEELIEELESELE---------ALLNERASLEEALALLRSELEEL----------- 899
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1334 SETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQ------------------LAEAHAQAKAQAEREAQGLQRRMQE 1395
Cdd:TIGR02168 900 SEELRELESKRSELRRELEELREKLAQLELRLEGLEVridnlqerlseeysltleEAEALENKIEDDEEEARRRLKRLEN 979
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 1396 EVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAErsrlRIEEEIR 1451
Cdd:TIGR02168 980 KIKELGPVNLAAIEEYEELKERYDFLTAQKEDLTEAKETLEEAIE----EIDREAR 1031
|
|
| WEMBL |
pfam05701 |
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required ... |
1408-1948 |
5.29e-08 |
|
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required for the chloroplast avoidance response under high intensity blue light. This avoidance response consists in the relocation of chloroplasts on the anticlinal side of exposed cells. Acts in association with PMI2 to maintain the velocity of chloroplast photo-relocation movement via the regulation of cp-actin filaments. Thus several member-sequences are described as "myosin heavy chain-like".
Pssm-ID: 461718 [Multi-domain] Cd Length: 562 Bit Score: 59.27 E-value: 5.29e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1408 QEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEeirvVRLQLE--ATERQRGGAEGELQALRARaeeaEAQK 1485
Cdd:pfam05701 41 ELELEKVQEEIPEYKKQSEAAEAAKAQVLEELESTKRLIEE----LKLNLEraQTEEAQAKQDSELAKLRVE----EMEQ 112
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1486 RQAQEEAERLRRQVQDETQRKRQAEAELALrVQAE--------AEAAREKQRALQALEELRLQAEEAERRLRQAEAERAr 1557
Cdd:pfam05701 113 GIADEASVAAKAQLEVAKARHAAAVAELKS-VKEEleslrkeyASLVSERDIAIKRAEEAVSASKEIEKTVEELTIELI- 190
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1558 QVQVALETAQRS-AEAELQSEHASFA--EKTAQLERTLKEEHVAVVQLREEATRRAQQQAeaeraraeaerelerwQLKA 1634
Cdd:pfam05701 191 ATKESLESAHAAhLEAEEHRIGAALAreQDKLNWEKELKQAEEELQRLNQQLLSAKDLKS----------------KLET 254
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1635 NEALRLRLQAEEVAQQKSltqaeaekqkeEAEREARRRGKAEEQAVRQRE---LAEQELEKQRQLAEgtaqqRLAAEQEL 1711
Cdd:pfam05701 255 ASALLLDLKAELAAYMES-----------KLKEEADGEGNEKKTSTSIQAalaSAKKELEEVKANIE-----KAKDEVNC 318
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1712 IRLRAETEQGEQQRQllEEELARLQREAAAATQKRRELEAELAKVRAEMEVLlasKARAEEESRSTSEKSKQRLEAeagr 1791
Cdd:pfam05701 319 LRVAAASLRSELEKE--KAELASLRQREGMASIAVSSLEAELNRTKSEIALV---QAKEKEAREKMVELPKQLQQA---- 389
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1792 frelAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEA--ENERLRRLAEDE 1869
Cdd:pfam05701 390 ----AQEAEEAKSLAQAAREELRKAKEEAEQAKAAASTVESRLEAVLKEIEAAKASEKLALAAIKAlqESESSAESTNQE 465
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1870 AFQR----------------RLLEEQAaqhKADIEARLAQLRKASESELERQKGLvEDTLRQRRQVEEEILALKGSFEKA 1933
Cdd:pfam05701 466 DSPRgvtlsleeyyelskraHEAEELA---NKRVAEAVSQIEEAKESELRSLEKL-EEVNREMEERKEALKIALEKAEKA 541
|
570
....*....|....*
gi 1920237962 1934 AAGKAELELELGRIR 1948
Cdd:pfam05701 542 KEGKLAAEQELRKWR 556
|
|
| CH_PLS3_rpt1 |
cd21325 |
first calponin homology (CH) domain found in plastin-3; Plastin-3, also called T-plastin, is ... |
32-164 |
5.48e-08 |
|
first calponin homology (CH) domain found in plastin-3; Plastin-3, also called T-plastin, is an actin-bundling protein found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Plastin- 3 contains four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409174 Cd Length: 148 Bit Score: 55.06 E-value: 5.48e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 32 ASEGKKDERDRVQKKTFTKWVNK---------HLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPrERDVIRSSRLPr 102
Cdd:cd21325 13 SSEGTQHSYSEEEKYAFVNWINKalendpdcrHVIPMNPNTDDLFKAVGDGIVLCKMINLSVPDTID-ERAINKKKLTP- 90
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 103 ekgrmrFHKLQNVQIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVS 164
Cdd:cd21325 91 ------FIIQENLNLALNSASAIGCHVVNIGAEDLRAGKPHLVLGLLWQIIKIGLFADIELS 146
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
2171-2611 |
5.50e-08 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 59.47 E-value: 5.50e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2171 EQALRQKAQVEQELTALRLQL-EETDHQKSILDEELQRLKAEVTEAARQRGQV---EEELFSLRVQMEELGKLKARIEAE 2246
Cdd:pfam12128 404 EARDRQLAVAEDDLQALESELrEQLEAGKLEFNEEEYRLKSRLGELKLRLNQAtatPELLLQLENFDERIERAREEQEAA 483
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2247 NRAlvlrdkdsaQRLLQEEAEKMKQVAEEAARlsvaaqeaaRLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAE 2326
Cdd:pfam12128 484 NAE---------VERLQSELRQARKRRDQASE---------ALRQASRRLEERQSALDELELQLFPQAGTLLHFLRKEAP 545
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2327 LLQQQ--KELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRA-------QARAEE 2397
Cdd:pfam12128 546 DWEQSigKVISPELLHRTDLDPEVWDGSVGGELNLYGVKLDLKRIDVPEWAASEEELRERLDKAEEAlqsarekQAAAEE 625
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2398 DARRFRKQAE--DIGERLYRTELaTQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTV 2475
Cdd:pfam12128 626 QLVQANGELEkaSREETFARTAL-KNARLDLRRLFDEKQSEKDKKNKALAERKDSANERLNSLEAQLKQLDKKHQAWLEE 704
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2476 RQEQLLQETQALQQSFL---SEKDSLLQR-----ERCIEQEKAKLEQLfQDEVAKAQALREEQQRQQQQMQQEKQQLAAS 2547
Cdd:pfam12128 705 QKEQKREARTEKQAYWQvveGALDAQLALlkaaiAARRSGAKAELKAL-ETWYKRDLASLGVDPDVIAKLKREIRTLERK 783
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237962 2548 MEEARRRQHEAEEGVRRQQE----ELQRLAQQQQQQEKLLAEENQRL-------RERLQHLEEERRAALARSEEI 2611
Cdd:pfam12128 784 IERIAVRRQEVLRYFDWYQEtwlqRRPRLATQLSNIERAISELQQQLarliadtKLRRAKLEMERKASEKQQVRL 858
|
|
| COG3903 |
COG3903 |
Predicted ATPase [General function prediction only]; |
1492-1951 |
5.67e-08 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443109 [Multi-domain] Cd Length: 933 Bit Score: 59.65 E-value: 5.67e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1492 AERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQAleelRLQAEEAERRLRQAEAERARQVQVALETAqrSAE 1571
Cdd:COG3903 478 AERLAEAGERAAARRRHADYYLALAERAAAELRGPDQLAWLA----RLDAEHDNLRAALRWALAHGDAELALRLA--AAL 551
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1572 AELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQK 1651
Cdd:COG3903 552 APFWFLRGLLREGRRWLERALAAAGEAAAALAAAAALAAAAAAARAAAAAAAAAAAAAAAAAAAAAAAAAALLLLAALAA 631
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1652 SLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEE 1731
Cdd:COG3903 632 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAALAAAAAALAAAAAAAALAAAAAAAL 711
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1732 LARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKR 1811
Cdd:COG3903 712 AAAAAAAAAAAAAAALLAAAAAAALAAAAAAAALALAAAAAAAAAAAAAAALAAAAAAAALAALLLALAAAAAALAAAAA 791
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1812 QRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARL 1891
Cdd:COG3903 792 AAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAALAAALAAAAAAAAAAAAAAAAAAALAAALAAAAAAAAAAALA 871
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1892 AQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTA 1951
Cdd:COG3903 872 AAAAAAAAAAAALLAAAAAAAAAAAAAAAAAAALAAAAAAAAAAALAAAAAAAAAAAAAA 931
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
2162-2610 |
6.55e-08 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 59.31 E-value: 6.55e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2162 EMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEE------ELFSLRVQMEE 2235
Cdd:PRK03918 232 ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEyiklseFYEEYLDELRE 311
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2236 LGKLKARIEAENRAL--VLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEdlaqqralAEKMLKEKmq 2313
Cdd:PRK03918 312 IEKRLSRLEEEINGIeeRIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEE--------LERLKKRL-- 381
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2314 AVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQ-----RQLEMSAEAERLRLRVAEM 2388
Cdd:PRK03918 382 TGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKcpvcgRELTEEHRKELLEEYTAEL 461
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2389 SRAQ---ARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKL-KQEAQL 2464
Cdd:PRK03918 462 KRIEkelKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLiKLKGEI 541
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2465 LQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRER-----CIEQEKAKLEQL--FQDEVAKAQALREEQQRQQQQM 2537
Cdd:PRK03918 542 KSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEelgfeSVEELEERLKELepFYNEYLELKDAEKELEREEKEL 621
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237962 2538 QQEKQQLAASMEEARRRQHEAEEgVRRQQEELQRlaqqqqqqeKLLAEENQRLRERLQHLEEERRAALARSEE 2610
Cdd:PRK03918 622 KKLEEELDKAFEELAETEKRLEE-LRKELEELEK---------KYSEEEYEELREEYLELSRELAGLRAELEE 684
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1649-1844 |
7.15e-08 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 58.28 E-value: 7.15e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1649 QQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLL 1728
Cdd:PRK09510 70 QQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKA 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1729 EEELARLQREAAAATQKRRELEAELAKVRAEMEvllASKARAEEESRSTSEKSKQRLEAEAgrfRELAEEAARLRALAEE 1808
Cdd:PRK09510 150 EAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAE---AKKKAEAEAAAKAAAEAKKKAEAEA---KKKAAAEAKKKAAAEA 223
|
170 180 190
....*....|....*....|....*....|....*.
gi 1920237962 1809 AKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRL 1844
Cdd:PRK09510 224 KAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEV 259
|
|
| CH_ASPM_rpt2 |
cd21224 |
second calponin homology (CH) domain found in abnormal spindle-like microcephaly-associated ... |
175-270 |
7.18e-08 |
|
second calponin homology (CH) domain found in abnormal spindle-like microcephaly-associated protein (ASPM) and similar proteins; ASPM, also called abnormal spindle protein homolog, or Asp homolog, is involved in mitotic spindle regulation and coordination of mitotic processes. It may also have a preferential role in regulating neurogenesis. Members of this family contain two copies of CH domain in the middle region. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409073 [Multi-domain] Cd Length: 138 Bit Score: 54.23 E-value: 7.18e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 175 KLLL-WSQrMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNL-----------------------EN 230
Cdd:cd21224 3 SLLLkWCQ-AVCAHYGVKVENFTVSFADGRALCYLIHHYLPSLLPLDAIRQPTTQtvdraqdeaedfwvaefspstgdSG 81
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 231 LDQAFSVAER-----------DLG-VTRLLDPEDVDVPQPDEKSIITYVSSL 270
Cdd:cd21224 82 LSSELLANEKrnfklvqqavaELGgVPALLRASDMSNTIPDEKVVILFLSYL 133
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
2203-2471 |
7.48e-08 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 59.16 E-value: 7.48e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2203 EELQRLKAEVTEAARQRGQVE------EELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEkmkQVAEEA 2276
Cdd:COG4913 235 DDLERAHEALEDAREQIELLEpirelaERYAAARERLAELEYLRAALRLWFAQRRLELLEAELEELRAELA---RLEAEL 311
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2277 ARLSVAAQEAARLRQLAEEDLAQQralaekmlkekmqAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQE 2356
Cdd:COG4913 312 ERLEARLDALREELDELEAQIRGN-------------GGDRLEQLEREIERLERELEERERRRARLEALLAALGLPLPAS 378
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2357 TQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQaedigerlyrtelatqekvmlVQTLETQRQQ 2436
Cdd:COG4913 379 AEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAE---------------------IASLERRKSN 437
|
250 260 270
....*....|....*....|....*....|....*.
gi 1920237962 2437 SDRDAERLREAIAE-LEHEKDKLKQEAQLLQLKSEE 2471
Cdd:COG4913 438 IPARLLALRDALAEaLGLDEAELPFVGELIEVRPEE 473
|
|
| CH_PLS1_rpt1 |
cd21323 |
first calponin homology (CH) domain found in plastin-1; Plastin-1, also called ... |
32-163 |
9.32e-08 |
|
first calponin homology (CH) domain found in plastin-1; Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. It contains four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409172 Cd Length: 145 Bit Score: 54.28 E-value: 9.32e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 32 ASEGKKDERDRVQKKTFTKWVNK---------HLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPrERDVIRSSRLPr 102
Cdd:cd21323 13 SSEGTQHSYSEEEKVAFVNWINKalegdpdckHVVPMNPTDESLFKSLADGILLCKMINLSQPDTID-ERAINKKKLTP- 90
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237962 103 ekgrmrFHKLQNVQIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQV 163
Cdd:cd21323 91 ------FTISENLNLALNSASAIGCTVVNIGSLDLKEGKPHLVLGLLWQIIKVGLFADIEI 145
|
|
| YqiK |
COG2268 |
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown]; |
1427-1579 |
9.39e-08 |
|
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
Pssm-ID: 441869 [Multi-domain] Cd Length: 439 Bit Score: 57.96 E-value: 9.39e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1427 AEIQAKARQVEA-AERsrlriEEEIRVVRLQLEATERQrggAEGELQALRARAEEAEAQKRQAQEEAERlrrqvqdETQR 1505
Cdd:COG2268 195 AEIIRDARIAEAeAER-----ETEIAIAQANREAEEAE---LEQEREIETARIAEAEAELAKKKAEERR-------EAET 259
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237962 1506 KRqAEAELALRVQaEAEAAREKQRALQALE---ELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHA 1579
Cdd:COG2268 260 AR-AEAEAAYEIA-EANAEREVQRQLEIAErerEIELQEKEAEREEAELEADVRKPAEAEKQAAEAEAEAEAEAIRA 334
|
|
| MutS2 |
COG1193 |
dsDNA-specific endonuclease/ATPase MutS2 [Replication, recombination and repair]; |
1396-1561 |
1.01e-07 |
|
dsDNA-specific endonuclease/ATPase MutS2 [Replication, recombination and repair];
Pssm-ID: 440806 [Multi-domain] Cd Length: 784 Bit Score: 58.61 E-value: 1.01e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1396 EVARR----EEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRggaegel 1471
Cdd:COG1193 490 EIARRlglpEEIIERARELLGEESIDVEKLIEELERERRELEEEREEAERLREELEKLREELEEKLEELEEEK------- 562
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1472 QALRARA-EEAEAQKRQAQEEAERLRRQVQDEtqrkrqaeaelalrvQAEAEAAREKQRALQALEElRLQAEEAERRLRQ 1550
Cdd:COG1193 563 EEILEKArEEAEEILREARKEAEELIRELREA---------------QAEEEELKEARKKLEELKQ-ELEEKLEKPKKKA 626
|
170
....*....|.
gi 1920237962 1551 AEAERARQVQV 1561
Cdd:COG1193 627 KPAKPPEELKV 637
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1476-1927 |
1.02e-07 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 58.24 E-value: 1.02e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1476 ARAEEAEAQKRQAQEEAERLRRQVQD-ETQRKRQAEAELAL----RVQAEAEAAREKQRALQALEELRLQAEEAERRLRQ 1550
Cdd:COG4717 71 KELKELEEELKEAEEKEEEYAELQEElEELEEELEELEAELeelrEELEKLEKLLQLLPLYQELEALEAELAELPERLEE 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1551 AEAERARQVQV-----ALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEaer 1625
Cdd:COG4717 151 LEERLEELRELeeeleELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEE--- 227
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1626 elerwQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAvrqrELAEQELEKQRQLAEGTAQQRL 1705
Cdd:COG4717 228 -----ELEQLENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIA----GVLFLVLGLLALLFLLLAREKA 298
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1706 AAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRstsEKSKQRL 1785
Cdd:COG4717 299 SLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEEL---EQEIAAL 375
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1786 EAEAGrfrelAEEAARLRALAEEAKRQRQLAEEdavrqRAEAERVLAEKLAAISEATRLKTEAEiaLKEKEAENERLRRL 1865
Cdd:COG4717 376 LAEAG-----VEDEEELRAALEQAEEYQELKEE-----LEELEEQLEELLGELEELLEALDEEE--LEEELEELEEELEE 443
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 1866 AEDEafqrrllEEQAAQHKADIEARLAQLrkASESELERQKGLVEDTLRQRRQVEEEILALK 1927
Cdd:COG4717 444 LEEE-------LEELREELAELEAELEQL--EEDGELAELLQELEELKAELRELAEEWAALK 496
|
|
| PLEC |
smart00250 |
Plectin repeat; |
4036-4072 |
1.14e-07 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 50.56 E-value: 1.14e-07
10 20 30
....*....|....*....|....*....|....*..
gi 1920237962 4036 IRLLEAQIATGGIIDPEESHRLPVDVAYQRGLFDEEM 4072
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
|
|
| CH_NAV3 |
cd21286 |
calponin homology (CH) domain found in neuron navigator 3; Neuron navigator 3 (NAV3), also ... |
46-152 |
1.29e-07 |
|
calponin homology (CH) domain found in neuron navigator 3; Neuron navigator 3 (NAV3), also called pore membrane and/or filament-interacting-like protein 1 (POMFIL1), Steerin-3 (STEERIN3), or Unc-53 homolog 3 (unc53H3), may regulate IL2 production by T-cells. It may be involved in neuron regeneration. NAV3 contains a single copy of the CH domain at the N-terminus. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409135 Cd Length: 105 Bit Score: 52.72 E-value: 1.29e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 46 KTFTKWVNKHLIKA--QRHISDLYEDLRDGHNLISLLEVLSGDSLpreRDVirsSRLPREKGRMrfhkLQNVQIALDYLR 123
Cdd:cd21286 3 KIYTDWANHYLAKSghKRLIKDLQQDIADGVLLAEIIQIIANEKV---EDI---NGCPRSQSQM----IENVDVCLSFLA 72
|
90 100
....*....|....*....|....*....
gi 1920237962 124 HRQVKLVNIRNDDIADGNPKLTLGLIWTI 152
Cdd:cd21286 73 ARGVNVQGLSAEEIRNGNLKAILGLFFSL 101
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1062-1510 |
1.43e-07 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 57.86 E-value: 1.43e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1062 ELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAHEEQLKEAQAVPATLPELEATKA-------ALKKLRAQ 1134
Cdd:COG4717 75 ELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAelaelpeRLEELEER 154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1135 AEAQQPVFDALRDELRGAQEVGERLQQrhgERDVEVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADP 1214
Cdd:COG4717 155 LEELRELEEELEELEAELAELQEELEE---LLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQ 231
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1215 LGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLEDIERHGEKVeecqrfakQYINAIKDYELQLVTYKAQLEPVAS 1294
Cdd:COG4717 232 LENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGV--------LFLVLGLLALLFLLLAREKASLGKE 303
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1295 PAKKPKVQSGSESIIQEYVDLRTRYSELSTLTSQYIRfiseTLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEA 1374
Cdd:COG4717 304 AEELQALPALEELEEEELEELLAALGLPPDLSPEELL----ELLDRIEELQELLREAEELEEELQLEELEQEIAALLAEA 379
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1375 HAQAKAQAEREAQGLQRRmQEEVARREEVAVEAQEQKRSIQEELQHLRQSS-EAEIQAKARQVEAAERSRLRIEEEIRVV 1453
Cdd:COG4717 380 GVEDEEELRAALEQAEEY-QELKEELEELEEQLEELLGELEELLEALDEEElEEELEELEEELEELEEELEELREELAEL 458
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237962 1454 RLQLEATERqrggaEGELQALRARAEEAEAQKRQAQEEAERLR------RQVQDETQRKRQAE 1510
Cdd:COG4717 459 EAELEQLEE-----DGELAELLQELEELKAELRELAEEWAALKlalellEEAREEYREERLPP 516
|
|
| TPH |
pfam13868 |
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ... |
1335-1595 |
1.48e-07 |
|
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.
Pssm-ID: 464007 [Multi-domain] Cd Length: 341 Bit Score: 56.85 E-value: 1.48e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1335 ETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSI 1414
Cdd:pfam13868 46 DEMMEEERERALEEEEEKEEERKEERKRYRQELEEQIEEREQKRQEEYEEKLQEREQMDEIVERIQEEDQAEAEEKLEKQ 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1415 QEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAER 1494
Cdd:pfam13868 126 RQLREEIDEFNEEQAEWKELEKEEEREEDERILEYLKEKAEREEEREAEREEIEEEKEREIARLRAQQEKAQDEKAERDE 205
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1495 LR-RQVQDETQRK-RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEA 1572
Cdd:pfam13868 206 LRaKLYQEEQERKeRQKEREEAEKKARQRQELQQAREEQIELKERRLAEEAEREEEEFERMLRKQAEDEEIEQEEAEKRR 285
|
250 260
....*....|....*....|...
gi 1920237962 1573 ELQSEHASFAEKTAQLERTLKEE 1595
Cdd:pfam13868 286 MKRLEHRRELEKQIEEREEQRAA 308
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1043-1605 |
1.90e-07 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 57.77 E-value: 1.90e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1043 AEKVLALPEPSPAAPTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAHEEQLKEaqaVPATLPELE 1122
Cdd:PRK03918 203 EEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEE---LKKEIEELE 279
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1123 ATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLqqrhgerdvevERWRERvtllLERWQAVLAQTDVRQRELEQLG 1202
Cdd:PRK03918 280 EKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRL-----------SRLEEE----INGIEERIKELEEKEERLEELK 344
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1203 RQLRYYRESADPLGAWLR---DAKQRQEQIqavplansqavrEQLRQEKALL--EDIERHGEKVEECQRFAKQYINAIKD 1277
Cdd:PRK03918 345 KKLKELEKRLEELEERHElyeEAKAKKEEL------------ERLKKRLTGLtpEKLEKELEELEKAKEEIEEEISKITA 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1278 YELQLVTYKAQLEPVASPAKKPKVQS---GSESIIQEYVDLRTRYSElstltsqYIRFISETLRRMEEEERlaeqqraEE 1354
Cdd:PRK03918 413 RIGELKKEIKELKKAIEELKKAKGKCpvcGRELTEEHRKELLEEYTA-------ELKRIEKELKEIEEKER-------KL 478
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1355 RERLAEVEAALEKQRQLAEAHAQAKaqaereaqglQRRMQEEvaRREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKAR 1434
Cdd:PRK03918 479 RKELRELEKVLKKESELIKLKELAE----------QLKELEE--KLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKK 546
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1435 QVEAAERsrlrIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELA 1514
Cdd:PRK03918 547 ELEKLEE----LKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELK 622
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1515 lRVQAEAEAAREK-QRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERtLK 1593
Cdd:PRK03918 623 -KLEEELDKAFEElAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEK-LK 700
|
570
....*....|..
gi 1920237962 1594 EEHVAVVQLREE 1605
Cdd:PRK03918 701 EELEEREKAKKE 712
|
|
| PRK10246 |
PRK10246 |
exonuclease subunit SbcC; Provisional |
1343-1970 |
2.12e-07 |
|
exonuclease subunit SbcC; Provisional
Pssm-ID: 182330 [Multi-domain] Cd Length: 1047 Bit Score: 57.50 E-value: 2.12e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1343 EERLAEQQRAeeRERLAEVEAALEK-QRQLA------------------EAHAQAKAQAEREAQGLQRRMQEEVARREEV 1403
Cdd:PRK10246 253 DELQQEASRR--QQALQQALAAEEKaQPQLAalslaqparqlrphweriQEQSAALAHTRQQIEEVNTRLQSTMALRARI 330
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1404 AVEAQEQKRSIQEELQHLRQ-SSEAEIQAKARQVEAAERSRL----RIEEEIRVVRLQLEATERQRGGAEGELQALRARa 1478
Cdd:PRK10246 331 RHHAAKQSAELQAQQQSLNTwLAEHDRFRQWNNELAGWRAQFsqqtSDREQLRQWQQQLTHAEQKLNALPAITLTLTAD- 409
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1479 EEAEAQKRQAQEEAER-----LRRQVQDETQRKRQAEAelalrvqAEAEAAREKQRALQALEELRLQAEEAERRLRQAEA 1553
Cdd:PRK10246 410 EVAAALAQHAEQRPLRqrlvaLHGQIVPQQKRLAQLQV-------AIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVKT 482
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1554 ERARQVQVALETAQRsaeAELQS----------EHASFAEKTAqLERTlkeehvaVVQLREEATRRAQQQAeaeraraea 1623
Cdd:PRK10246 483 ICEQEARIKDLEAQR---AQLQAgqpcplcgstSHPAVEAYQA-LEPG-------VNQSRLDALEKEVKKL--------- 542
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1624 erelerwqlkANEALRLRLQAEEVAQQKSltqaeaekqkeeaerearrrgKAEEQAVRQRElAEQELEKQRQLAEGTAQQ 1703
Cdd:PRK10246 543 ----------GEEGAALRGQLDALTKQLQ---------------------RDESEAQSLRQ-EEQALTQQWQAVCASLNI 590
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1704 RLAAEQELIRLRAETEQGEQQRQLLEEELArLQREAAAATQKRRELEAELAKVRAEMEVLLASKA------RAEEESRST 1777
Cdd:PRK10246 591 TLQPQDDIQPWLDAQEEHERQLRLLSQRHE-LQGQIAAHNQQIIQYQQQIEQRQQQLLTALAGYAltlpqeDEEASWLAT 669
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1778 SEKSKQRLEAEAGRFRELAEEAARLRALAEeakrqrQLAEEDAVrqRAEAERVLAEKLAAISEATrLKTEAEIALKEKEA 1857
Cdd:PRK10246 670 RQQEAQSWQQRQNELTALQNRIQQLTPLLE------TLPQSDDL--PHSEETVALDNWRQVHEQC-LSLHSQLQTLQQQD 740
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1858 ENERLR---------------RLAEDEAFQRRLLEEQAAQhkadieaRLAQLRKASESELERQKGLVEdtlrQRRQVEEE 1922
Cdd:PRK10246 741 VLEAQRlqkaqaqfdtalqasVFDDQQAFLAALLDEETLT-------QLEQLKQNLENQRQQAQTLVT----QTAQALAQ 809
|
650 660 670 680 690
....*....|....*....|....*....|....*....|....*....|...
gi 1920237962 1923 ILALKGSFEKAAAGKAELELELGRIRGT-AEDTLRSKE---QAEQEA-ARQRQ 1970
Cdd:PRK10246 810 HQQHRPDGLDLTVTVEQIQQELAQLAQQlRENTTRQGEirqQLKQDAdNRQQQ 862
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1707-1907 |
2.31e-07 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 56.38 E-value: 2.31e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1707 AEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLE 1786
Cdd:COG3883 14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERAR 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1787 A---------------EAGRFRELAEEAARLRALAEEAKR---QRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEA 1848
Cdd:COG3883 94 AlyrsggsvsyldvllGSESFSDFLDRLSALSKIADADADlleELKADKAELEAKKAELEAKLAELEALKAELEAAKAEL 173
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 1849 EIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKG 1907
Cdd:COG3883 174 EAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAA 232
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1686-1938 |
2.54e-07 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 56.38 E-value: 2.54e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1686 AEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLA 1765
Cdd:COG3883 14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERAR 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1766 SKARAEE---------ESRSTSE--KSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEK 1834
Cdd:COG3883 94 ALYRSGGsvsyldvllGSESFSDflDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAEL 173
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1835 LAAISEATRLKteAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLR 1914
Cdd:COG3883 174 EAQQAEQEALL--AQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGA 251
|
250 260
....*....|....*....|....
gi 1920237962 1915 QRRQVEEEILALKGSFEKAAAGKA 1938
Cdd:COG3883 252 AGAAGAAAGSAGAAGAAAGAAGAG 275
|
|
| PRK05035 |
PRK05035 |
electron transport complex protein RnfC; Provisional |
1328-1589 |
2.63e-07 |
|
electron transport complex protein RnfC; Provisional
Pssm-ID: 235334 [Multi-domain] Cd Length: 695 Bit Score: 57.27 E-value: 2.63e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1328 QYIRFISETLRRMEEEERLAEQ--QRAEER-ERLAEVEAA-LEKQRQLAEAHAQAKAQA---------EREAQGLQRRMQ 1394
Cdd:PRK05035 429 QYYRQAKAEIRAIEQEKKKAEEakARFEARqARLEREKAArEARHKKAAEARAAKDKDAvaaalarvkAKKAAATQPIVI 508
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1395 EEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAA-ERSRLRIEEeirvvrlQLEATERQRGGAEGELQA 1473
Cdd:PRK05035 509 KAGARPDNSAVIAAREARKAQARARQAEKQAAAAADPKKAAVAAAiARAKAKKAA-------QQAANAEAEEEVDPKKAA 581
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1474 LRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAelalrvqaeAEAAREKQRALQALEELRLQAEEAERRLRQAEA 1553
Cdd:PRK05035 582 VAAAIARAKAKKAAQQAASAEPEEQVAEVDPKKAAVAA---------AIARAKAKKAEQQANAEPEEPVDPRKAAVAAAI 652
|
250 260 270
....*....|....*....|....*....|....*.
gi 1920237962 1554 ERARQVQVALETAQRSAEAELQSEHASFAEKTAQLE 1589
Cdd:PRK05035 653 ARAKARKAAQQQANAEPEEAEDPKKAAVAAAIARAK 688
|
|
| EmrA |
COG1566 |
Multidrug resistance efflux pump EmrA [Defense mechanisms]; |
1439-1559 |
2.75e-07 |
|
Multidrug resistance efflux pump EmrA [Defense mechanisms];
Pssm-ID: 441174 [Multi-domain] Cd Length: 331 Bit Score: 55.82 E-value: 2.75e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1439 AERSRLRIEEEIRVVRLQLEATERQRGgAEGELQALRARAEEAEAQKRQAQEEAERLR---------RQVQDETQRKR-Q 1508
Cdd:COG1566 81 LQAALAQAEAQLAAAEAQLARLEAELG-AEAEIAAAEAQLAAAQAQLDLAQRELERYQalykkgavsQQELDEARAALdA 159
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 1920237962 1509 AEAELAlRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQV 1559
Cdd:COG1566 160 AQAQLE-AAQAQLAQAQAGLREEEELAAAQAQVAQAEAALAQAELNLARTT 209
|
|
| YqiK |
COG2268 |
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown]; |
1693-1893 |
3.15e-07 |
|
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
Pssm-ID: 441869 [Multi-domain] Cd Length: 439 Bit Score: 56.42 E-value: 3.15e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1693 QRQLAEGTAQQRLAAEQelIRLRAETEQGEQQRqllEEELARLQREAAAATQKRRELEAELAKVraemevllaskaraEE 1772
Cdd:COG2268 191 RRKIAEIIRDARIAEAE--AERETEIAIAQANR---EAEEAELEQEREIETARIAEAEAELAKK--------------KA 251
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1773 ESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEA-----ERVLAEKLAAISEAtrlKTE 1847
Cdd:COG2268 252 EERREAETARAEAEAAYEIAEANAEREVQRQLEIAEREREIELQEKEAEREEAELeadvrKPAEAEKQAAEAEA---EAE 328
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 1920237962 1848 AEIALKEKEAENERLRRLAE-DEAFQRRLLEEQAAQHKADIEARLAQ 1893
Cdd:COG2268 329 AEAIRAKGLAEAEGKRALAEaWNKLGDAAILLMLIEKLPEIAEAAAK 375
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1722-1906 |
4.29e-07 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 55.58 E-value: 4.29e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1722 EQQRQLLEEELAR-LQREAAAATQKRRELEAELAKVRAEmevllasKARAEEESRSTSEKSKQRLEAEAGrfrelAEEAA 1800
Cdd:PRK09510 78 EEQRKKKEQQQAEeLQQKQAAEQERLKQLEKERLAAQEQ-------KKQAEEAAKQAALKQKQAEEAAAK-----AAAAA 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1801 RLRALAEeakrqrQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRrlAEDEAFQRrllEEQA 1880
Cdd:PRK09510 146 KAKAEAE------AKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKK--AEAEAKKK---AAAE 214
|
170 180
....*....|....*....|....*.
gi 1920237962 1881 AQHKADIEARLAQLRKASESELERQK 1906
Cdd:PRK09510 215 AKKKAAAEAKAAAAKAAAEAKAAAEK 240
|
|
| COG4995 |
COG4995 |
Uncharacterized conserved protein, contains CHAT domain [Function unknown]; |
2271-2724 |
4.36e-07 |
|
Uncharacterized conserved protein, contains CHAT domain [Function unknown];
Pssm-ID: 444019 [Multi-domain] Cd Length: 711 Bit Score: 56.52 E-value: 4.36e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2271 QVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMA 2350
Cdd:COG4995 9 LLAALLAALALALLALALLLLLAALAAAALLLLALLALLLALAAAAAAALAAAALALALLAAAALALLLLALALAALALA 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2351 QQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTL 2430
Cdd:COG4995 89 LLAAALALALAAAALAALALLAALLALAAAAALLALLAALALLALLAALAAALAAAAAAALAAALAAAAAAAAAAALLAL 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2431 ETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKA 2510
Cdd:COG4995 169 ALALAAAALALLALLLAALAAALAAAAAALALLLALLLLAALAAALAAALAALLLALLALAAALLALLLLALLALAAAAA 248
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2511 KLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRL 2590
Cdd:COG4995 249 ALAAAAAALLALAAALLLLAALAALAAAAAAAALAALALAAALALAAAALALALLLAAAAAAALAALALLLLAALLLLLA 328
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2591 RERLQHLEEERRAALARSEEIAPSRAAAARALPNGQDAADGPAAAAEPEHAFDGLRRKVPAQRLQEVGVLSAEELQQLAQ 2670
Cdd:COG4995 329 ALALLALLLLLAAAALLAAALAAALALAAALALALLAALLLLLAALLALLLEALLLLLLALLAALLLLAAALLALAAAQL 408
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*
gi 1920237962 2671 GRTTVAELAQREDVRHYLQGRSSIAGLL-LKPADEKLTIYAALRRQLLSPGTALI 2724
Cdd:COG4995 409 LRLLLAALALLLALAAYAAARLALLALIeYIILPDRLYAFVQLYQLLIAPIEAEL 463
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
1674-1891 |
4.72e-07 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 55.24 E-value: 4.72e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1674 KAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRreleAEL 1753
Cdd:TIGR02794 72 KLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAERKA----KEE 147
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1754 AKVRAEMEvllaSKARAEEESRSTSEKSKQRLEAEAgrfreLAEEAARLRALAEEAKRQRQLAEEDA---VRQRAEAERV 1830
Cdd:TIGR02794 148 AAKQAEEE----AKAKAAAEAKKKAEEAKKKAEAEA-----KAKAEAEAKAKAEEAKAKAEAAKAKAaaeAAAKAEAEAA 218
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 1831 LAEKLAAISEATRLKTEAEIAL-KEKEAENERLRRLAEDEAFQRRLleeqAAQHKADIEARL 1891
Cdd:TIGR02794 219 AAAAAEAERKADEAELGDIFGLaSGSNAEKQGGARGAAAGSEVDKY----AAIIQQAIQQNL 276
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1746-2452 |
4.98e-07 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 56.46 E-value: 4.98e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1746 RRELEAELAKVRAEMEVLLASKARAEEESRStsEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRA 1825
Cdd:COG4913 220 EPDTFEAADALVEHFDDLERAHEALEDAREQ--IELLEPIRELAERYAAARERLAELEYLRAALRLWFAQRRLELLEAEL 297
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1826 EAERvlaeklaaiSEATRLKTEAEIALKEKEAENERLRRLaedeafqrrllEEQAAQHKADIEARLAQLRKASESELERQ 1905
Cdd:COG4913 298 EELR---------AELARLEAELERLEARLDALREELDEL-----------EAQIRGNGGDRLEQLEREIERLERELEER 357
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1906 KGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQlaaeeerrrreaeer 1985
Cdd:COG4913 358 ERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELR--------------- 422
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1986 vqkSLAAEEEAARQRKAAL-EEVERLKAKVEEARRLRE------------RAEQES-------------------ARQLQ 2033
Cdd:COG4913 423 ---ELEAEIASLERRKSNIpARLLALRDALAEALGLDEaelpfvgelievRPEEERwrgaiervlggfaltllvpPEHYA 499
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2034 LAQEAAQkRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAAREraereaaqSRRQVEEAERL 2113
Cdd:COG4913 500 AALRWVN-RLHLRGRLVYERVRTGLPDPERPRLDPDSLAGKLDFKPHPFRAWLEAELGRRF--------DYVCVDSPEEL 570
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2114 KQSAEEQAQAQaqaqaaaekLRKeaeqeaarraqaEQAALRQKQAADAEMEKHkQFAEQALRQKAQVEQELTALRLQLEE 2193
Cdd:COG4913 571 RRHPRAITRAG---------QVK------------GNGTRHEKDDRRRIRSRY-VLGFDNRAKLAALEAELAELEEELAE 628
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2194 TDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQmEELGKLKARIEAenralvLRDKDSAQRLLQEEAEKMKQVA 2273
Cdd:COG4913 629 AEERLEALEAELDALQERREALQRLAEYSWDEIDVASAE-REIAELEAELER------LDASSDDLAALEEQLEELEAEL 701
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2274 EEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQ-QKELAQEQARRLQEDKEQMAQQ 2352
Cdd:COG4913 702 EELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAAlGDAVERELRENLEERIDALRAR 781
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2353 LAQETQGFQKTleterqrqleMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERL------YRTELATQEKVML 2426
Cdd:COG4913 782 LNRAEEELERA----------MRAFNREWPAETADLDADLESLPEYLALLDRLEEDGLPEYeerfkeLLNENSIEFVADL 851
|
730 740
....*....|....*....|....*.
gi 1920237962 2427 VQTLETQRQQSDRDAERLREAIAELE 2452
Cdd:COG4913 852 LSKLRRAIREIKERIDPLNDSLKRIP 877
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
2159-2359 |
5.29e-07 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 55.22 E-value: 5.29e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2159 ADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEE--- 2235
Cdd:COG3883 14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGErar 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2236 --------LGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKM 2307
Cdd:COG3883 94 alyrsggsVSYLDVLLGSESFSDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAEL 173
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 2308 LKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQG 2359
Cdd:COG3883 174 EAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAA 225
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
2166-2469 |
5.74e-07 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 56.50 E-value: 5.74e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2166 HKQFAEQALRQKAQVEQ---ELTALRLQLEEtdhQKSILDEelqrLKAEVTEAARQRGQVEEELFSLRVQME-------- 2234
Cdd:PRK04863 336 HLNLVQTALRQQEKIERyqaDLEELEERLEE---QNEVVEE----ADEQQEENEARAEAAEEEVDELKSQLAdyqqaldv 408
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2235 ---------------------------ELGKLKARIEAenraLVLRDKDSAQRLLQEE-----AEKMKQVAEEAARL--- 2279
Cdd:PRK04863 409 qqtraiqyqqavqalerakqlcglpdlTADNAEDWLEE----FQAKEQEATEELLSLEqklsvAQAAHSQFEQAYQLvrk 484
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2280 ---SVAAQEAARLRQLAEEDLAQQRALAEKM---------LKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKE 2347
Cdd:PRK04863 485 iagEVSRSEAWDVARELLRRLREQRHLAEQLqqlrmrlseLEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEELE 564
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2348 QMAQQLAQEtqgfqktLETERQRQLEMSAEAERLRLRVAE-MSRAQA--RAEEDARRFRKQAEDigerlyrtELATQEKV 2424
Cdd:PRK04863 565 ARLESLSES-------VSEARERRMALRQQLEQLQARIQRlAARAPAwlAAQDALARLREQSGE--------EFEDSQDV 629
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 2425 M--LVQTLETQRQQSdRDAERLREAIAELEHEKDKLKQ-----EAQLLQLKS 2469
Cdd:PRK04863 630 TeyMQQLLERERELT-VERDELAARKQALDEEIERLSQpggseDPRLNALAE 680
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
1473-1871 |
6.25e-07 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 56.11 E-value: 6.25e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1473 ALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAeaelalrvqaeAEAAREKQRALQAL---------EELRLQAEE 1543
Cdd:COG3096 829 AFAPDPEAELAALRQRRSELERELAQHRAQEQQLRQQ-----------LDQLKEQLQLLNKLlpqanlladETLADRLEE 897
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1544 AERRLRQAEAERA--RQVQVALETAQRSAEAeLQSEHASFAEktaqlertLKEEHVAVVQLREEATRRAQQQAEAERARA 1621
Cdd:COG3096 898 LREELDAAQEAQAfiQQHGKALAQLEPLVAV-LQSDPEQFEQ--------LQADYLQAKEQQRRLKQQIFALSEVVQRRP 968
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1622 EAERELERWQLKANEALRLRLQAeevaqqksltqaeaekqkeeaerearrrgkaeeqavrQRELAEQELEKQRQLAEGTA 1701
Cdd:COG3096 969 HFSYEDAVGLLGENSDLNEKLRA-------------------------------------RLEQAEEARREAREQLRQAQ 1011
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1702 QQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREA-----AAATQKRRELEAELAKVRAEmevllaskaraeeesRS 1776
Cdd:COG3096 1012 AQYSQYNQVLASLKSSRDAKQQTLQELEQELEELGVQAdaeaeERARIRRDELHEELSQNRSR---------------RS 1076
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1777 TSEKSKQRLEAE----AGRFRELAEEAARLRALAEEAK----RQRQLAEEDAVRQRAEAERVL---AEKLAAISEatrlk 1845
Cdd:COG3096 1077 QLEKQLTRCEAEmdslQKRLRKAERDYKQEREQVVQAKagwcAVLRLARDNDVERRLHRRELAylsADELRSMSD----- 1151
|
410 420
....*....|....*....|....*....
gi 1920237962 1846 tEAEIALKEKEAENERLR---RLAEDEAF 1871
Cdd:COG3096 1152 -KALGALRLAVADNEHLRdalRLSEDPRR 1179
|
|
| YqiK |
COG2268 |
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown]; |
1478-1595 |
6.57e-07 |
|
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
Pssm-ID: 441869 [Multi-domain] Cd Length: 439 Bit Score: 55.26 E-value: 6.57e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1478 AEEAEAQKRQAQEEAErLRRQVQDETQRKRQAEAELAlRVQAEAEAAREKQRAlQALEELRLQAEEAERRLRQA--EAER 1555
Cdd:COG2268 212 TEIAIAQANREAEEAE-LEQEREIETARIAEAEAELA-KKKAEERREAETARA-EAEAAYEIAEANAEREVQRQleIAER 288
|
90 100 110 120
....*....|....*....|....*....|....*....|
gi 1920237962 1556 ARQVQVALETAQRsAEAELQSEHASFAEktAQLERTLKEE 1595
Cdd:COG2268 289 EREIELQEKEAER-EEAELEADVRKPAE--AEKQAAEAEA 325
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
2154-2595 |
7.37e-07 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 55.95 E-value: 7.37e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2154 RQKQAADAEMEKHKQFAEQALRqkaqvEQELTALRLQLEE--TDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRV 2231
Cdd:pfam01576 92 QQLQNEKKKMQQHIQDLEEQLD-----EEEAARQKLQLEKvtTEAKIKKLEEDILLLEDQNSKLSKERKLLEERISEFTS 166
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2232 QMEE-------LGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMK----------QVAEEAARLS-VAAQEAARLRQLA 2293
Cdd:pfam01576 167 NLAEeeekaksLSKLKNKHEAMISDLEERLKKEEKGRQELEKAKRKlegestdlqeQIAELQAQIAeLRAQLAKKEEELQ 246
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2294 E-----EDLAQQRALAEKMLKEKMQAVQEatrLKAEAELLQQQKELAQEQARRLQEDKEQMAQQL--AQETQGFQKTLET 2366
Cdd:pfam01576 247 AalarlEEETAQKNNALKKIRELEAQISE---LQEDLESERAARNKAEKQRRDLGEELEALKTELedTLDTTAAQQELRS 323
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2367 ERQRQLEMSAEAERLRLRVAEMSRAQARaeedaRRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLRE 2446
Cdd:pfam01576 324 KREQEVTELKKALEEETRSHEAQLQEMR-----QKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQ 398
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2447 AIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQL-------------------------------------LQETQALQQ 2489
Cdd:pfam01576 399 AKQDSEHKRKKLEGQLQELQARLSESERQRAELAeklsklqselesvssllneaegkniklskdvsslesqLQDTQELLQ 478
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2490 SFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQekqqLAASMEEARRRQHEAEEGVRRQQEEL 2569
Cdd:pfam01576 479 EETRQKLNLSTRLRQLEDERNSLQEQLEEEEEAKRNVERQLSTLQAQLSD----MKKKLEEDAGTLEALEEGKKRLQREL 554
|
490 500 510
....*....|....*....|....*....|
gi 1920237962 2570 ----QRLAQQQQQQEKLlaeenQRLRERLQ 2595
Cdd:pfam01576 555 ealtQQLEEKAAAYDKL-----EKTKNRLQ 579
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1098-1760 |
7.37e-07 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 55.95 E-value: 7.37e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1098 EEVLRAHEEQLKE-----AQAVPATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVER 1172
Cdd:pfam01576 337 EEETRSHEAQLQEmrqkhTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQAKQDSEHKRKKLEGQLQE 416
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1173 WRERVT-------LLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAK-QRQEQIQAvPLANSQAVReQL 1244
Cdd:pfam01576 417 LQARLSeserqraELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLESQLQDTQeLLQEETRQ-KLNLSTRLR-QL 494
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1245 RQEKALLEdiERHGEKVEECQRFAKQyinaIKDYELQLVTYKAQLEPVASPAK-----KPKVQSGSESIIQEYVDLRTRY 1319
Cdd:pfam01576 495 EDERNSLQ--EQLEEEEEAKRNVERQ----LSTLQAQLSDMKKKLEEDAGTLEaleegKKRLQRELEALTQQLEEKAAAY 568
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1320 SELS-------------TLTSQYIRFISETLRR-------MEEEERLAEQQRAEERERlAEVEAALEKQRQLAEAHA--- 1376
Cdd:pfam01576 569 DKLEktknrlqqelddlLVDLDHQRQLVSNLEKkqkkfdqMLAEEKAISARYAEERDR-AEAEAREKETRALSLARAlee 647
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1377 --QAKAQAEREAQGLQRRMQEEVARREEV------------AVEAQEQKRSIQEE-------------------LQHLRQ 1423
Cdd:pfam01576 648 alEAKEELERTNKQLRAEMEDLVSSKDDVgknvhelerskrALEQQVEEMKTQLEeledelqatedaklrlevnMQALKA 727
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1424 SSEAEIQAKArqvEAAERSRLRIEEEIRVVRLQLEATERQRGGA-------EGELQALRARAEEAEAQKRQAQEEAERLR 1496
Cdd:pfam01576 728 QFERDLQARD---EQGEEKRRQLVKQVRELEAELEDERKQRAQAvaakkklELDLKELEAQIDAANKGREEAVKQLKKLQ 804
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1497 RQVQDetqrkRQAEAELALRVQAEAEA-AREKQRALQALEELRLQAEE----AERRLRQAEAERAR-QVQVALETAQRSA 1570
Cdd:pfam01576 805 AQMKD-----LQRELEEARASRDEILAqSKESEKKLKNLEAELLQLQEdlaaSERARRQAQQERDElADEIASGASGKSA 879
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1571 eaeLQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAE-----AERARAEAERELERWQL-KANEALRLRLQA 1644
Cdd:pfam01576 880 ---LQDEKRRLEARIAQLEEELEEEQSNTELLNDRLRKSTLQVEQlttelAAERSTSQKSESARQQLeRQNKELKAKLQE 956
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1645 EEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQA-----VRQRELAEQEL----EKQRQLAEGTAQQRLAAEQELIRLR 1715
Cdd:pfam01576 957 MEGTVKSKFKSSIAALEAKIAQLEEQLEQESRERQaanklVRRTEKKLKEVllqvEDERRHADQYKDQAEKGNSRMKQLK 1036
|
730 740 750 760
....*....|....*....|....*....|....*....|....*
gi 1920237962 1716 AETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEM 1760
Cdd:pfam01576 1037 RQLEEAEEEASRANAARRKLQRELDDATESNESMNREVSTLKSKL 1081
|
|
| PLEC |
smart00250 |
Plectin repeat; |
2799-2835 |
8.57e-07 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 48.25 E-value: 8.57e-07
10 20 30
....*....|....*....|....*....|....*..
gi 1920237962 2799 IRLLEAQIATGGIIDPVHSHRLPVDVAYQRGYFDEEM 2835
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
2252-2689 |
9.57e-07 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 55.36 E-value: 9.57e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2252 LRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQ 2331
Cdd:TIGR00618 158 LKAKSKEKKELLMNLFPLDQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQ 237
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2332 KELAQEQARRLQEDKEQMA--QQLAQETQGFQKTLETERQRqLEMSAEAERLRLRVAEMSRAQARAEEdarrFRKQAEDI 2409
Cdd:TIGR00618 238 TQQSHAYLTQKREAQEEQLkkQQLLKQLRARIEELRAQEAV-LEETQERINRARKAAPLAAHIKAVTQ----IEQQAQRI 312
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2410 GERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKS---EEMQTVRQEQLLQETQA 2486
Cdd:TIGR00618 313 HTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCqqhTLTQHIHTLQQQKTTLT 392
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2487 LQQSFLSEKDSLLQRErciEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQ 2566
Cdd:TIGR00618 393 QKLQSLCKELDILQRE---QATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLK 469
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2567 EELQRLAQQQQQQEKlLAEENQRLRERLQHLEEERRAALARSEEIAPSRAAAARALPNGQDAADGPAAAAEPEHAFDGLR 2646
Cdd:TIGR00618 470 EREQQLQTKEQIHLQ-ETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVY 548
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|...
gi 1920237962 2647 RKVPA-----QRLQEVGVLSAEELQQLAQGRTTVAELAQR-----EDVRHYLQ 2689
Cdd:TIGR00618 549 HQLTSerkqrASLKEQMQEIQQSFSILTQCDNRSKEDIPNlqnitVRLQDLTE 601
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
2257-2473 |
9.92e-07 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 54.38 E-value: 9.92e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2257 SAQRLLQEEAEKMKQVAEEAARLSVAAQEAARlrqlAEEDLAQQRALAEKMLKEKMQAVQEatrLKAEAELLQQQKELAQ 2336
Cdd:COG4942 17 AQADAAAEAEAELEQLQQEIAELEKELAALKK----EEKALLKQLAALERRIAALARRIRA---LEQELAALEAELAELE 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2337 EQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRT 2416
Cdd:COG4942 90 KEIAELRAELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAEL 169
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237962 2417 ELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQ 2473
Cdd:COG4942 170 EAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELE 226
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1870-2359 |
1.03e-06 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 55.16 E-value: 1.03e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1870 AFQRRLLEEQAAQHKADIEARLAQLRKASESELERQkglvEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRg 1949
Cdd:COG4717 41 AFIRAMLLERLEKEADELFKPQGRKPELNLKELKEL----EEELKEAEEKEEEYAELQEELEELEEELEELEAELEELR- 115
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1950 tAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESA 2029
Cdd:COG4717 116 -EELEKLEKLLQLLPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEEL 194
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2030 RQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVL-ERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVE 2108
Cdd:COG4717 195 QDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEaAALEERLKEARLLLLIAAALLALLGLGGSLLSLILT 274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2109 EAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQA------LRQKAQVEQ 2182
Cdd:COG4717 275 IAGVLFLVLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELlelldrIEELQELLR 354
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2183 ELTALRLQLEETDHQKSIlDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEEL-GKLKARIEAENRALVLRDKDSAQRL 2261
Cdd:COG4717 355 EAEELEEELQLEELEQEI-AALLAEAGVEDEEELRAALEQAEEYQELKEELEELeEQLEELLGELEELLEALDEEELEEE 433
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2262 LQEEAEKMKQVAEEAARLSVAAQEA-ARLRQLAEEDLAQQRALAEKMLKEKMQ-AVQEATRLKAEAELLQQQKELAQEqa 2339
Cdd:COG4717 434 LEELEEELEELEEELEELREELAELeAELEQLEEDGELAELLQELEELKAELReLAEEWAALKLALELLEEAREEYRE-- 511
|
490 500
....*....|....*....|
gi 1920237962 2340 RRLQEDKEQMAQQLAQETQG 2359
Cdd:COG4717 512 ERLPPVLERASEYFSRLTDG 531
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
4139-4167 |
1.12e-06 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 47.71 E-value: 1.12e-06
10 20
....*....|....*....|....*....
gi 1920237962 4139 IVDPETGKEMSVYEAYRKGLIDHQTYLEL 4167
Cdd:pfam00681 11 IIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| DUF4670 |
pfam15709 |
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ... |
1475-1605 |
1.17e-06 |
|
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.
Pssm-ID: 464815 [Multi-domain] Cd Length: 522 Bit Score: 54.57 E-value: 1.17e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1475 RARAEEAEAQ------KRQAQEEAerlRRQVQDETQRKRQAEAELALRVQAEAEAAREKQralQALEELRLQAEEAER-- 1546
Cdd:pfam15709 337 RLRAERAEMRrleverKRREQEEQ---RRLQQEQLERAEKMREELELEQQRRFEEIRLRK---QRLEEERQRQEEEERkq 410
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237962 1547 -RLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQ----LERTLKEEHVAVVQLREE 1605
Cdd:pfam15709 411 rLQLQAAQERARQQQEEFRRKLQELQRKKQQEEAERAEAEKQrqkeLEMQLAEEQKRLMEMAEE 474
|
|
| TPH |
pfam13868 |
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ... |
1338-1606 |
1.30e-06 |
|
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.
Pssm-ID: 464007 [Multi-domain] Cd Length: 341 Bit Score: 53.77 E-value: 1.30e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1338 RRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVeaQEQKRSIQEE 1417
Cdd:pfam13868 36 AEEKEEERRLDEMMEEERERALEEEEEKEEERKEERKRYRQELEEQIEEREQKRQEEYEEKLQEREQM--DEIVERIQEE 113
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1418 LQHLRQSSEAEIQAKARQVEAAERSRLRI---------EEEIRVVRLQLEATERQRggAEGELQALRARAEEAEAQKRQA 1488
Cdd:pfam13868 114 DQAEAEEKLEKQRQLREEIDEFNEEQAEWkelekeeerEEDERILEYLKEKAEREE--EREAEREEIEEEKEREIARLRA 191
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1489 QEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQR 1568
Cdd:pfam13868 192 QQEKAQDEKAERDELRAKLYQEEQERKERQKEREEAEKKARQRQELQQAREEQIELKERRLAEEAEREEEEFERMLRKQA 271
|
250 260 270
....*....|....*....|....*....|....*...
gi 1920237962 1569 SAEAELQSEhasfAEKTAQLERTLKEEHVAVVQLREEA 1606
Cdd:pfam13868 272 EDEEIEQEE----AEKRRMKRLEHRRELEKQIEEREEQ 305
|
|
| Caldesmon |
pfam02029 |
Caldesmon; |
1674-2047 |
1.76e-06 |
|
Caldesmon;
Pssm-ID: 460421 [Multi-domain] Cd Length: 495 Bit Score: 54.10 E-value: 1.76e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1674 KAEEQAVRQRElaEQELEKQRQLAEGTAQQRLAAEQelIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAEL 1753
Cdd:pfam02029 14 RAREERRRQKE--EEEPSGQVTESVEPNEHNSYEED--SELKPSGQGGLDEEEAFLDRTAKREERRQKRLQEALERQKEF 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1754 AKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRElAEEAARLRALAEEAKRQ--RQLAEEDAVRQRAEAERVL 1831
Cdd:pfam02029 90 DPTIADEKESVAERKENNEEEENSSWEKEEKRDSRLGRYKE-EETEIREKEYQENKWSTevRQAEEEGEEEEDKSEEAEE 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1832 AEKLAAISEatrlKTEAEIALKEKEAENERLRRLAedeafQRRLLEEQAAQhkadiearlaqlrkASESELERQKGLVED 1911
Cdd:pfam02029 169 VPTENFAKE----EVKDEKIKKEKKVKYESKVFLD-----QKRGHPEVKSQ--------------NGEEEVTKLKVTTKR 225
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1912 TLRQRRQVEEEilalkgsfEKAAAGKAELELELGRIRGTAEDtlrsKEQAEQEAARQRQLAAEEERRRREAEERVQKSLA 1991
Cdd:pfam02029 226 RQGGLSQSQER--------EEEAEVFLEAEQKLEELRRRRQE----KESEEFEKLRQKQQEAELELEELKKKREERRKLL 293
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 1992 AEEEaaRQRKAalEEVERLKAKVEEARRLRERAEQESArqlqlaqEAAQKRLQAEE 2047
Cdd:pfam02029 294 EEEE--QRRKQ--EEAERKLREEEEKRRMKEEIERRRA-------EAAEKRQKLPE 338
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1342-1526 |
1.82e-06 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 53.66 E-value: 1.82e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1342 EEERLAEQQRAEERERLAEveAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEvaveaqEQKRSIQEELQhl 1421
Cdd:PRK09510 107 EKERLAAQEQKKQAEEAAK--QAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAA------EAKKKAEAEAA-- 176
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1422 rQSSEAEIQAKARQVEAAersrlrieeeirvvrlQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAerlrrqvqd 1501
Cdd:PRK09510 177 -KKAAAEAKKKAEAEAAA----------------KAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKA--------- 230
|
170 180
....*....|....*....|....*
gi 1920237962 1502 ETQRKRQAEAELALRVQAEAEAARE 1526
Cdd:PRK09510 231 AAEAKAAAEKAAAAKAAEKAAAAKA 255
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
2324-2610 |
1.85e-06 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 54.36 E-value: 1.85e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2324 EAELLQQQKELAQEQA----RRLQEDKEQMAQQ-LAQETQgfQKTLETERQRQLEMS-----AEAERLRLRVAEMSRAQA 2393
Cdd:pfam17380 267 ENEFLNQLLHIVQHQKavseRQQQEKFEKMEQErLRQEKE--EKAREVERRRKLEEAekarqAEMDRQAAIYAEQERMAM 344
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2394 RAEEDARRFRKQAEDI-GERLYRTELATQ-EKVMLVQTLETQRQQSDrdaERLREAIAELEHEKDKLKQEAQLLQLKSEE 2471
Cdd:pfam17380 345 ERERELERIRQEERKReLERIRQEEIAMEiSRMRELERLQMERQQKN---ERVRQELEAARKVKILEEERQRKIQQQKVE 421
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2472 MQTVRQEQLLQETQALQQsflsekdslLQRERCIEQEKAKLEQLfqdevakaqalreeqqRqqqqMQQEKQQLAASMEEA 2551
Cdd:pfam17380 422 MEQIRAEQEEARQREVRR---------LEEERAREMERVRLEEQ----------------E----RQQQVERLRQQEEER 472
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 2552 RRRQHEAEEGVRRQQ--EELQRLAQQQQQQEKLLAE-ENQRLRERLQHLEEERRAALARSEE 2610
Cdd:pfam17380 473 KRKKLELEKEKRDRKraEEQRRKILEKELEERKQAMiEEERKRKLLEKEMEERQKAIYEEER 534
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3458-3494 |
2.05e-06 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 47.09 E-value: 2.05e-06
10 20 30
....*....|....*....|....*....|....*..
gi 1920237962 3458 IRLLEAQIATGGIIDPVHSHRVPVDVAYQRGYFDEEM 3494
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
2163-2387 |
2.16e-06 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 54.25 E-value: 2.16e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2163 MEKHKQFAEQALR----QKAQVEQELTALRLQLEE--TDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEEL 2236
Cdd:COG3206 166 LELRREEARKALEfleeQLPELRKELEEAEAALEEfrQKNGLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAAL 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2237 GKLKARIEAENRALVlrDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEkmqAVQ 2316
Cdd:COG3206 246 RAQLGSGPDALPELL--QSPVIQQLRAQLAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQQEAQRILAS---LEA 320
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237962 2317 EATRLKAEAELLQQQKELAQEQARRLQEdKEQMAQQLAQETQGFQKTLET--ERQRQLEMSAEAERLRLRVAE 2387
Cdd:COG3206 321 ELEALQAREASLQAQLAQLEARLAELPE-LEAELRRLEREVEVARELYESllQRLEEARLAEALTVGNVRVID 392
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
2251-2604 |
2.16e-06 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 54.28 E-value: 2.16e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2251 VLRDKDSAQRLLQEEAEKmKQVAEEAARLSVAAQEAARLRQLAE------EDLAQQRALAEKMLKEKMQAVQEATRLKAE 2324
Cdd:PRK02224 181 VLSDQRGSLDQLKAQIEE-KEEKDLHERLNGLESELAELDEEIEryeeqrEQARETRDEADEVLEEHEERREELETLEAE 259
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2325 AELLQQQKELAQ---EQARRLQEDKEQMAQQLAQETQGFQKTLETER-------QRQLEMSAEAERLRLRVAEMSRAQAR 2394
Cdd:PRK02224 260 IEDLRETIAETErerEELAEEVRDLRERLEELEEERDDLLAEAGLDDadaeaveARREELEDRDEELRDRLEECRVAAQA 339
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2395 AEEDARRFRKQAEDIGERlyrtelaTQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQT 2474
Cdd:PRK02224 340 HNEEAESLREDADDLEER-------AEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAED 412
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2475 VRQEqllqetqalqqsFLSEKDSLLQRERCIEqekAKLEQLfQDEVAKAQALREE--------------QQRQQQQMQQE 2540
Cdd:PRK02224 413 FLEE------------LREERDELREREAELE---ATLRTA-RERVEEAEALLEAgkcpecgqpvegspHVETIEEDRER 476
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2541 KQQLAASMEEARRRQHEAEEGVRRQQE------ELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAA 2604
Cdd:PRK02224 477 VEELEAELEDLEEEVEEVEERLERAEDlveaedRIERLEERREDLEELIAERRETIEEKRERAEELRERA 546
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1396-2023 |
2.20e-06 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 54.30 E-value: 2.20e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1396 EVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQ----------LEATERQRG 1465
Cdd:PRK03918 169 EVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEvkeleelkeeIEELEKELE 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1466 GAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEaEAAREKQRALQALEELRLQAEEAE 1545
Cdd:PRK03918 249 SLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYE-EYLDELREIEKRLSRLEEEINGIE 327
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1546 RRLRQAEAERARQVQValetaqRSAEAELQSEHASFaEKTAQLERTLKEEHVAVVQLREEatrraqqqaeaeraraeaer 1625
Cdd:PRK03918 328 ERIKELEEKEERLEEL------KKKLKELEKRLEEL-EERHELYEEAKAKKEELERLKKR-------------------- 380
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1626 elerwqLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREarrrgkaeEQAVRQRELAEQELEKqrqlAEGTAQ--Q 1703
Cdd:PRK03918 381 ------LTGLTPEKLEKELEELEKAKEEIEEEISKITARIGEL--------KKEIKELKKAIEELKK----AKGKCPvcG 442
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1704 RLAAEQELIRLRAEteqgeqqrqlLEEELARLQREAAAATQKRRELEAELAKVraEMEVLLASKARAE----EESRSTSE 1779
Cdd:PRK03918 443 RELTEEHRKELLEE----------YTAELKRIEKELKEIEEKERKLRKELREL--EKVLKKESELIKLkelaEQLKELEE 510
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1780 KSK----QRLEAEAGRFRELAEEAARLRalaeeaKRQRQLAEEdaVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEK 1855
Cdd:PRK03918 511 KLKkynlEELEKKAEEYEKLKEKLIKLK------GEIKSLKKE--LEKLEELKKKLAELEKKLDELEEELAELLKELEEL 582
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1856 --EAENERLRRLAEDEAFQRRLLEEQAAQHkaDIEARLAQLRKAsESELERQKGLVEDTLRQRRQVEEEILALKGSF--- 1930
Cdd:PRK03918 583 gfESVEELEERLKELEPFYNEYLELKDAEK--ELEREEKELKKL-EEELDKAFEELAETEKRLEELRKELEELEKKYsee 659
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1931 --EKAAAGKAELELELGRIRGTAEDTLRSKEQAEqeaarqrqlaaeeerrrreaeervqKSLAAEEEAARQRKAALEEVE 2008
Cdd:PRK03918 660 eyEELREEYLELSRELAGLRAELEELEKRREEIK-------------------------KTLEKLKEELEEREKAKKELE 714
|
650
....*....|....*
gi 1920237962 2009 RLKAKVEEARRLRER 2023
Cdd:PRK03918 715 KLEKALERVEELREK 729
|
|
| DUF4670 |
pfam15709 |
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ... |
1673-1893 |
2.53e-06 |
|
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.
Pssm-ID: 464815 [Multi-domain] Cd Length: 522 Bit Score: 53.80 E-value: 2.53e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1673 GKAEEQAVRQRELAEQELEKQRQlAEGTAQQRLAAEQ-ELIRLRAETEQGEQ--QRQLLEEELARlqreaaaATQKRREL 1749
Cdd:pfam15709 307 GNMESEEERSEEDPSKALLEKRE-QEKASRDRLRAERaEMRRLEVERKRREQeeQRRLQQEQLER-------AEKMREEL 378
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1750 EAELAKVRAEMEvlLASKARAEEESRSTSEKSKQRLEaeagrfrelaEEAARLRALAEEAKRQRQLAEEDAVRQRAEAER 1829
Cdd:pfam15709 379 ELEQQRRFEEIR--LRKQRLEEERQRQEEEERKQRLQ----------LQAAQERARQQQEEFRRKLQELQRKKQQEEAER 446
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237962 1830 VLAEKLAAISEATRLKTEAEIALkeKEAENERLRRLAE-DEAFQRRLLEEQAAQHKADIEARLAQ 1893
Cdd:pfam15709 447 AEAEKQRQKELEMQLAEEQKRLM--EMAEEERLEYQRQkQEAEEKARLEAEERRQKEEEAARLAL 509
|
|
| DUF4659 |
pfam15558 |
Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins ... |
1338-1599 |
2.58e-06 |
|
Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins in this family are typically between 427 and 674 amino acids in length. There are two completely conserved residues (D and I) that may be functionally important.
Pssm-ID: 464768 [Multi-domain] Cd Length: 374 Bit Score: 53.12 E-value: 2.58e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1338 RRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVAR--REEVAVEAQEQKRSIQ 1415
Cdd:pfam15558 21 QRMRELQQQAALAWEELRRRDQKRQETLERERRLLLQQSQEQWQAEKEQRKARLGREERRRAdrREKQVIEKESRWREQA 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1416 EELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRvvrlqleaterqRGGAEGELQALRARAEEAEaQKRQAQEEAERL 1495
Cdd:pfam15558 101 EDQENQRQEKLERARQEAEQRKQCQEQRLKEKEEEL------------QALREQNSLQLQERLEEAC-HKRQLKEREEQK 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1496 RRQVQDETQRKRQAEAELALRVQAEAEAAREK----QRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAE 1571
Cdd:pfam15558 168 KVQENNLSELLNHQARKVLVDCQAKAEELLRRlsleQSLQRSQENYEQLVEERHRELREKAQKEEEQFQRAKWRAEEKEE 247
|
250 260
....*....|....*....|....*...
gi 1920237962 1572 AELQSEHASFAEKTAQLERTLKEEHVAV 1599
Cdd:pfam15558 248 ERQEHKEALAELADRKIQQARQVAHKTV 275
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1471-1745 |
2.66e-06 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 53.23 E-value: 2.66e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1471 LQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAlRVQAEAEAAREKQRALQA--------LEELRLQAE 1542
Cdd:COG4942 15 AAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLA-ALERRIAALARRIRALEQelaaleaeLAELEKEIA 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1543 EAERRLRQAEAERARQVQVALETAQRSAEAELQSehasfAEKTAQLERTLKEEHVAVVQLREEATRRAQqqaeaerarae 1622
Cdd:COG4942 94 ELRAELEAQKEELAELLRALYRLGRQPPLALLLS-----PEDFLDAVRRLQYLKYLAPARREQAEELRA----------- 157
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1623 aerelerwQLKANEALRLRLQAEEVAQQKSLtqaeaekqkeeaerearrrgkaEEQAVRQRELAEQELEKQRQLAEgtaq 1702
Cdd:COG4942 158 --------DLAELAALRAELEAERAELEALL----------------------AELEEERAALEALKAERQKLLAR---- 203
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 1920237962 1703 qrlaAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQK 1745
Cdd:COG4942 204 ----LEKELAELAAELAELQQEAEELEALIARLEAEAAAAAER 242
|
|
| TPH |
pfam13868 |
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ... |
2258-2585 |
2.94e-06 |
|
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.
Pssm-ID: 464007 [Multi-domain] Cd Length: 341 Bit Score: 52.61 E-value: 2.94e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2258 AQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQE 2337
Cdd:pfam13868 33 RIKAEEKEEERRLDEMMEEERERALEEEEEKEEERKEERKRYRQELEEQIEEREQKRQEEYEEKLQEREQMDEIVERIQE 112
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2338 QARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERlrlRVAEMSRAQARAEEDARRFRKQAEDIGERLYrte 2417
Cdd:pfam13868 113 EDQAEAEEKLEKQRQLREEIDEFNEEQAEWKELEKEEEREEDE---RILEYLKEKAEREEEREAEREEIEEEKEREI--- 186
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2418 latqeKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEaqllqlksEEMQTVRQEQLLQETQALQQSFLSEKDS 2497
Cdd:pfam13868 187 -----ARLRAQQEKAQDEKAERDELRAKLYQEEQERKERQKERE--------EAEKKARQRQELQQAREEQIELKERRLA 253
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2498 LLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQ 2577
Cdd:pfam13868 254 EEAEREEEEFERMLRKQAEDEEIEQEEAEKRRMKRLEHRRELEKQIEEREEQRAAEREEELEEGERLREEEAERRERIEE 333
|
....*...
gi 1920237962 2578 QQEKLLAE 2585
Cdd:pfam13868 334 ERQKKLKE 341
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
2202-2407 |
3.08e-06 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 52.91 E-value: 3.08e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2202 DEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVlRDKDSAQRLLQEEAEKMKQVAEEAARLSV 2281
Cdd:COG3883 15 DPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQ-AEIDKLQAEIAEAEAEIEERREELGERAR 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2282 AAQEAARLRQLAE--------EDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQL 2353
Cdd:COG3883 94 ALYRSGGSVSYLDvllgsesfSDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAEL 173
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1920237962 2354 AQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAE 2407
Cdd:COG3883 174 EAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAA 227
|
|
| EmrA |
COG1566 |
Multidrug resistance efflux pump EmrA [Defense mechanisms]; |
1368-1496 |
3.09e-06 |
|
Multidrug resistance efflux pump EmrA [Defense mechanisms];
Pssm-ID: 441174 [Multi-domain] Cd Length: 331 Bit Score: 52.74 E-value: 3.09e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1368 QRQLAEAHAQ-AKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKArQVEAAERSRLRI 1446
Cdd:COG1566 82 QAALAQAEAQlAAAEAQLARLEAELGAEAEIAAAEAQLAAAQAQLDLAQRELERYQALYKKGAVSQQ-ELDEARAALDAA 160
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1447 EEEIRVVRLQLEATERQRGGAEgELQALRARAEEAEAQKRQAQEEAERLR 1496
Cdd:COG1566 161 QAQLEAAQAQLAQAQAGLREEE-ELAAAQAQVAQAEAALAQAELNLARTT 209
|
|
| HCR |
pfam07111 |
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ... |
2175-2606 |
3.29e-06 |
|
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.
Pssm-ID: 284517 [Multi-domain] Cd Length: 749 Bit Score: 53.60 E-value: 3.29e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2175 RQKAQVEQELTALRLQLEETDHQ---KSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALV 2251
Cdd:pfam07111 190 KQLAEAQKEAELLRKQLSKTQEEleaQVTLVESLRKYVGEQVPPEVHSQTWELERQELLDTMQHLQEDRADLQATVELLQ 269
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2252 LRDKDSAQRLLQEEAEKMKQVAEEAarlSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAE-AELLQQ 2330
Cdd:pfam07111 270 VRVQSLTHMLALQEEELTRKIQPSD---SLEPEFPKKCRSLLNRWREKVFALMVQLKAQDLEHRDSVKQLRGQvAELQEQ 346
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2331 QKELAQEQA--RRLQEDKEQMAQQLAQETQGFQKTL----ETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRK 2404
Cdd:pfam07111 347 VTSQSQEQAilQRALQDKAAEVEVERMSAKGLQMELsraqEARRRQQQQTASAEEQLKFVVNAMSSTQIWLETTMTRVEQ 426
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2405 QAEDIGERLYRTELATQE----------KVMLVQTLETQRQQSDRDAERLREAIAELEH---EKDKLKQEAQL-LQLKSE 2470
Cdd:pfam07111 427 AVARIPSLSNRLSYAVRKvhtikglmarKVALAQLRQESCPPPPPAPPVDADLSLELEQlreERNRLDAELQLsAHLIQQ 506
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2471 EMQTVRQE-------------QLLQETQALQQSFLS---EKDSLLQRERCIEQEKAKLEQ-LFQDEVAKAQALREEQQRQ 2533
Cdd:pfam07111 507 EVGRAREQgeaerqqlsevaqQLEQELQRAQESLASvgqQLEVARQGQQESTEEAASLRQeLTQQQEIYGQALQEKVAEV 586
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237962 2534 QQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQ----EELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALA 2606
Cdd:pfam07111 587 ETRLREQLSDTKRRLNEARREQAKAVVSLRQIQhratQEKERNQELRRLQDEARKEEGQRLARRVQELERDKNLMLA 663
|
|
| PRK10929 |
PRK10929 |
putative mechanosensitive channel protein; Provisional |
2260-2508 |
3.29e-06 |
|
putative mechanosensitive channel protein; Provisional
Pssm-ID: 236798 [Multi-domain] Cd Length: 1109 Bit Score: 53.90 E-value: 3.29e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2260 RLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQ-EATRLKA---EAELLQQQKELA 2335
Cdd:PRK10929 123 RQAQQEQDRAREISDSLSQLPQQQTEARRQLNEIERRLQTLGTPNTPLAQAQLTALQaESAALKAlvdELELAQLSANNR 202
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2336 QEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAE-AERLRLRVAEMSRAQARA----EEDARRFRKQAEDIG 2410
Cdd:PRK10929 203 QELARLRSELAKKRSQQLDAYLQALRNQLNSQRQREAERALEsTELLAEQSGDLPKSIVAQfkinRELSQALNQQAQRMD 282
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2411 ERLYRTELATQEKVMLVQTLETQRQQ------SDRDAERLREAIAELEhEKDKLKQ----EAQLL--QLKSEEMQTvRQE 2478
Cdd:PRK10929 283 LIASQQRQAASQTLQVRQALNTLREQsqwlgvSNALGEALRAQVARLP-EMPKPQQldteMAQLRvqRLRYEDLLN-KQP 360
|
250 260 270
....*....|....*....|....*....|
gi 1920237962 2479 QLLQETQALQQSFLSEKDSLLQRERCIEQE 2508
Cdd:PRK10929 361 QLRQIRQADGQPLTAEQNRILDAQLRTQRE 390
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
1341-1506 |
3.51e-06 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 52.54 E-value: 3.51e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1341 EEEERLAEQQRAEERERLAEVEAAleKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQH 1420
Cdd:TIGR02794 101 EKAAKQAEQAAKQAEEKQKQAEEA--KAKQAAEAKAKAEAEAERKAKEEAAKQAEEEAKAKAAAEAKKKAEEAKKKAEAE 178
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1421 LRQSSEAEIQAKARQVEA-AERSRLRIEEEI----RVVRLQLEATERQRGGAEGELQALRARAEEAEAQK------RQAQ 1489
Cdd:TIGR02794 179 AKAKAEAEAKAKAEEAKAkAEAAKAKAAAEAaakaEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEKqggargAAAG 258
|
170
....*....|....*..
gi 1920237962 1490 EEAERLRRQVQDETQRK 1506
Cdd:TIGR02794 259 SEVDKYAAIIQQAIQQN 275
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
1405-2240 |
3.56e-06 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 53.51 E-value: 3.56e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1405 VEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAersrLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQ 1484
Cdd:TIGR00606 185 IKALETLRQVRQTQGQKVQEHQMELKYLKQYKEKA----CEIRDQITSKEAQLESSREIVKSYENELDPLKNRLKEIEHN 260
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1485 KRQAQEEAERLRRQVQDETQRKRQaEAELALRVQAEAEAAREKqraLQALEELR-LQAEEAERRLRQAEAERARQVQVAL 1563
Cdd:TIGR00606 261 LSKIMKLDNEIKALKSRKKQMEKD-NSELELKMEKVFQGTDEQ---LNDLYHNHqRTVREKERELVDCQRELEKLNKERR 336
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1564 ETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRL--R 1641
Cdd:TIGR00606 337 LLNQEKTELLVEQGRLQLQADRHQEHIRARDSLIQSLATRLELDGFERGPFSERQIKNFHTLVIERQEDEAKTAAQLcaD 416
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1642 LQAEEVAQQKSLTQAEAEKQKEEAEREArrrgKAEEQAVRQRELaeQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQG 1721
Cdd:TIGR00606 417 LQSKERLKQEQADEIRDEKKGLGRTIEL----KKEILEKKQEEL--KFVIKELQQLEGSSDRILELDQELRKAERELSKA 490
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1722 EQQR--QLLEEELARLQREAAAATQKRRELEAELAKV------RAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFR 1793
Cdd:TIGR00606 491 EKNSltETLKKEVKSLQNEKADLDRKLRKLDQEMEQLnhhtttRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLGYFP 570
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1794 ELAEEAARLRALAEEAKRQRQ-LAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEI----ALKEKEAENERLRRLAED 1868
Cdd:TIGR00606 571 NKKQLEDWLHSKSKEINQTRDrLAKLNKELASLEQNKNHINNELESKEEQLSSYEDKLfdvcGSQDEESDLERLKEEIEK 650
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1869 EAFQRRLLEEQAAQHKADIEAR---------LAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKaaagkaE 1939
Cdd:TIGR00606 651 SSKQRAMLAGATAVYSQFITQLtdenqsccpVCQRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEK------R 724
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1940 LELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRR--EAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEA 2017
Cdd:TIGR00606 725 RDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNdiEEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDV 804
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2018 RRLRERAEQESaRQLQLAQEAAQKRLQAEEKAHAF-AVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERA 2096
Cdd:TIGR00606 805 ERKIAQQAAKL-QGSDLDRTVQQVNQEKQEKQHELdTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRR 883
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2097 EREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKL----------RKEAEQEAARRAQAEQAALRQKQAADAEMEKH 2166
Cdd:TIGR00606 884 QQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDqqekeelissKETSNKKAQDKVNDIKEKVKNIHGYMKDIENK 963
|
810 820 830 840 850 860 870
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237962 2167 KQfaEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVtEAARQRGQVEEELFSLRVQMEELGKLK 2240
Cdd:TIGR00606 964 IQ--DGKDDYLKQKETELNTVNAQLEECEKHQEKINEDMRLMRQDI-DTQKIQERWLQDNLTLRKRENELKEVE 1034
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
1884-2602 |
3.58e-06 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 53.69 E-value: 3.58e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1884 KADIEARLAQlRKASESELERQKGLVEDTLRQRrQVEEEILALKGSFEKAAAG-----KAELELELGRIRGTAEDTLRSK 1958
Cdd:pfam12128 199 KSMIVAILED-DGVVPPKSRLNRQQVEHWIRDI-QAIAGIMKIRPEFTKLQQEfntleSAELRLSHLHFGYKSDETLIAS 276
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1959 EQAEQEA--ARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQ 2036
Cdd:pfam12128 277 RQEERQEtsAELNQLLRTLDDQWKEKRDELNGELSAADAAVAKDRSELEALEDQHGAFLDADIETAAADQEQLPSWQSEL 356
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2037 EAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQS 2116
Cdd:pfam12128 357 ENLEERLKALTGKHQDVTAKYNRRRSKIKEQNNRDIAGIKDKLAKIREARDRQLAVAEDDLQALESELREQLEAGKLEFN 436
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2117 AEEQAQaqaqaqaaaeKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDh 2196
Cdd:pfam12128 437 EEEYRL----------KSRLGELKLRLNQATATPELLLQLENFDERIERAREEQEAANAEVERLQSELRQARKRRDQAS- 505
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2197 qksildEELQRLKAEVTEAARQRGQVEEELFS--------LRVQM----EELGKLKARieaenrALVLR-------DKDS 2257
Cdd:pfam12128 506 ------EALRQASRRLEERQSALDELELQLFPqagtllhfLRKEApdweQSIGKVISP------ELLHRtdldpevWDGS 573
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2258 AQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQE 2337
Cdd:pfam12128 574 VGGELNLYGVKLDLKRIDVPEWAASEEELRERLDKAEEALQSAREKQAAAEEQLVQANGELEKASREETFARTALKNARL 653
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2338 QARRLQEDKEQMAQQLAQETQGFQKTLETERQrqlEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQA--EDIGERLYR 2415
Cdd:pfam12128 654 DLRRLFDEKQSEKDKKNKALAERKDSANERLN---SLEAQLKQLDKKHQAWLEEQKEQKREARTEKQAYwqVVEGALDAQ 730
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2416 TELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEqLLQETQALQQSFLSEK 2495
Cdd:pfam12128 731 LALLKAAIAARRSGAKAELKALETWYKRDLASLGVDPDVIAKLKREIRTLERKIERIAVRRQE-VLRYFDWYQETWLQRR 809
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2496 DSLLQRERCIEQEKAKLEQlfqdevakaqalreEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQ 2575
Cdd:pfam12128 810 PRLATQLSNIERAISELQQ--------------QLARLIADTKLRRAKLEMERKASEKQQVRLSENLRGLRCEMSKLATL 875
|
730 740
....*....|....*....|....*..
gi 1920237962 2576 QQQQEKllAEENQRLRERLQHLEEERR 2602
Cdd:pfam12128 876 KEDANS--EQAQGSIGERLAQLEDLKL 900
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
2163-2466 |
3.67e-06 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 53.49 E-value: 3.67e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2163 MEKHKQFAEQALRQKAQVEQELTAlrlQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKAR 2242
Cdd:TIGR04523 389 LESQINDLESKIQNQEKLNQQKDE---QIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRES 465
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2243 IEAENRALvlrdKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAarlRQLAEE--DLAQQRALaekmLKEKMQavqeatr 2320
Cdd:TIGR04523 466 LETQLKVL----SRSINKIKQNLEQKQKELKSKEKELKKLNEEK---KELEEKvkDLTKKISS----LKEKIE------- 527
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2321 lKAEAELLQQQKELAQeqarrLQEDKEQMAQQLAQETqgfqktLETERQRQLEmsaEAERLRLRVAEMSRAQARAEEDAR 2400
Cdd:TIGR04523 528 -KLESEKKEKESKISD-----LEDELNKDDFELKKEN------LEKEIDEKNK---EIEELKQTQKSLKKKQEEKQELID 592
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 2401 RFRKQAEDIGERLyrtelatQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQ 2466
Cdd:TIGR04523 593 QKEKEKKDLIKEI-------EEKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIK 651
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
1239-1971 |
3.70e-06 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 53.57 E-value: 3.70e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1239 AVREQLRQEKALLEDierhGEKVEECQRFAKQyinaikdyELQLVTYKAQLepvaspakkpKVQSGsesiIQEYVDLRTR 1318
Cdd:pfam05483 96 SIEAELKQKENKLQE----NRKIIEAQRKAIQ--------ELQFENEKVSL----------KLEEE----IQENKDLIKE 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1319 yselSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKqrqLAEAHAQAKAQAEREAQGLQRRMQEEva 1398
Cdd:pfam05483 150 ----NNATRHLCNLLKETCARSAEKTKKYEYEREETRQVYMDLNNNIEK---MILAFEELRVQAENARLEMHFKLKED-- 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1399 rreevaveaqeqkrsiQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARA 1478
Cdd:pfam05483 221 ----------------HEKIQHLEEEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKTKLQDENL 284
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1479 EEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELR----LQAEEAERRLRQAEaE 1554
Cdd:pfam05483 285 KELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKaahsFVVTEFEATTCSLE-E 363
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1555 RARQVQVALEtaqrSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATrraqqqaeaerarAEAERELERWQLKA 1634
Cdd:pfam05483 364 LLRTEQQRLE----KNEDQLKIITMELQKKSSELEEMTKFKNNKEVELEELKK-------------ILAEDEKLLDEKKQ 426
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1635 NEALRLRLQAEEvaQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQR-QLAEGTAQ--------QRL 1705
Cdd:pfam05483 427 FEKIAEELKGKE--QELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEKlKNIELTAHcdklllenKEL 504
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1706 AAEQE--LIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEA---ELAKVRAEMEVLL---ASKARAEEESRST 1777
Cdd:pfam05483 505 TQEASdmTLELKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDELESvreEFIQKGDEVKCKLdksEENARSIEYEVLK 584
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1778 SEKSKQRLEAEAGRFRELAEEAAR-LRALAEEAKRQRQlaEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKE 1856
Cdd:pfam05483 585 KEKQMKILENKCNNLKKQIENKNKnIEELHQENKALKK--KGSAENKQLNAYEIKVNKLELELASAKQKFEEIIDNYQKE 662
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1857 AENERL--RRLAEdEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRrqvEEEILALKGSFEKAA 1934
Cdd:pfam05483 663 IEDKKIseEKLLE-EVEKAKAIADEAVKLQKEIDKRCQHKIAEMVALMEKHKHQYDKIIEER---DSELGLYKNKEQEQS 738
|
730 740 750
....*....|....*....|....*....|....*..
gi 1920237962 1935 AGKAELELELGRIRGtaeDTLRSKEQAEQEAARQRQL 1971
Cdd:pfam05483 739 SAKAALEIELSNIKA---ELLSLKKQLEIEKEEKEKL 772
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
1322-1577 |
4.08e-06 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 53.10 E-value: 4.08e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1322 LSTLTSQYIRFISEtLRRMEEEERLA--EQQRAEERERLAEVEAALEKQRQlaeahAQAKAQAEREAQGLQRRMQEEVAR 1399
Cdd:COG3206 154 ANALAEAYLEQNLE-LRREEARKALEflEEQLPELRKELEEAEAALEEFRQ-----KNGLVDLSEEAKLLLQQLSELESQ 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1400 ReevaVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRieEEIRVVRLQLEATERQRGGAEGELQALRARAE 1479
Cdd:COG3206 228 L----AEARAELAEAEARLAALRAQLGSGPDALPELLQSPVIQQLR--AQLAELEAELAELSARYTPNHPDVIALRAQIA 301
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1480 EAEAQKRQaqeEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAeaeRARQV 1559
Cdd:COG3206 302 ALRAQLQQ---EAQRILASLEAELEALQAREASLQAQLAQLEARLAELPELEAELRRLEREVEVARELYESL---LQRLE 375
|
250
....*....|....*...
gi 1920237962 1560 QVALETAQRSAEAELQSE 1577
Cdd:COG3206 376 EARLAEALTVGNVRVIDP 393
|
|
| hsdR |
PRK11448 |
type I restriction enzyme EcoKI subunit R; Provisional |
1358-1452 |
4.30e-06 |
|
type I restriction enzyme EcoKI subunit R; Provisional
Pssm-ID: 236912 [Multi-domain] Cd Length: 1123 Bit Score: 53.42 E-value: 4.30e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1358 LAEVEAALEKQRQLAEAHAQAKAQAEREAqglqRRMQEEVARREEVAVEAQEQKRSIQEELQHLR-QSSEAEIQAKARQV 1436
Cdd:PRK11448 144 LHALQQEVLTLKQQLELQAREKAQSQALA----EAQQQELVALEGLAAELEEKQQELEAQLEQLQeKAAETSQERKQKRK 219
|
90
....*....|....*....
gi 1920237962 1437 EAAERSRLRI---EEEIRV 1452
Cdd:PRK11448 220 EITDQAAKRLelsEEETRI 238
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
1679-2421 |
4.36e-06 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 53.42 E-value: 4.36e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1679 AVRQRELAEQELEKQRQLAeGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEE-------LARLQrEAAAATQKrrelea 1751
Cdd:COG3096 277 ANERRELSERALELRRELF-GARRQLAEEQYRLVEMARELEELSARESDLEQDyqaasdhLNLVQ-TALRQQEK------ 348
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1752 eLAKVRAEMEVLlasKARAEEESRSTSEKSKQRLEAEAgRFRELAEEAARLRAlaEEAKRQRQLaeeDAVRQRAEAERvl 1831
Cdd:COG3096 349 -IERYQEDLEEL---TERLEEQEEVVEEAAEQLAEAEA-RLEAAEEEVDSLKS--QLADYQQAL---DVQQTRAIQYQ-- 416
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1832 aEKLAAISEATRLKTEAEIALKEKEAENERLRRlAEDEAFQRRLleeQAAQHKADIEARLAQLRKAseseLERQKGLVED 1911
Cdd:COG3096 417 -QAVQALEKARALCGLPDLTPENAEDYLAAFRA-KEQQATEEVL---ELEQKLSVADAARRQFEKA----YELVCKIAGE 487
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1912 TLRQRR-QVEEEILALKGSFEKAAAGKAELELELGrirgtaedtlrskeQAEQEAARQRQLAAEEERRRREAEERVQKSL 1990
Cdd:COG3096 488 VERSQAwQTARELLRRYRSQQALAQRLQQLRAQLA--------------ELEQRLRQQQNAERLLEEFCQRIGQQLDAAE 553
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1991 AAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEA-----AQKRLQ--AEEKAHAFAVQQkeqelqQ 2063
Cdd:COG3096 554 ELEELLAELEAQLEELEEQAAEAVEQRSELRQQLEQLRARIKELAARApawlaAQDALErlREQSGEALADSQ------E 627
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2064 TLQQEQSVLERLRseaeaarraaeeaeaareraerEAAQSRRQVEEAERlkqsaeeqaqaqaqaqaaaeklrkeaeqeaa 2143
Cdd:COG3096 628 VTAAMQQLLERER----------------------EATVERDELAARKQ------------------------------- 654
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2144 rraQAEQAALRQKQAADAEMEKHKQFAEQ----------------------AL--------------------------- 2174
Cdd:COG3096 655 ---ALESQIERLSQPGGAEDPRLLALAERlggvllseiyddvtledapyfsALygparhaivvpdlsavkeqlagledcp 731
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2175 ---------------------------------RQ-------------KAQVEQELTALRLQLEETD---HQKSILDEEL 2205
Cdd:COG3096 732 edlyliegdpdsfddsvfdaeeledavvvklsdRQwrysrfpevplfgRAAREKRLEELRAERDELAeqyAKASFDVQKL 811
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2206 QRL--------------------KAEVTEAARQRGQVEEELFSLRVQM----EELGKLKARIEAENRAL----VLRDKDS 2257
Cdd:COG3096 812 QRLhqafsqfvgghlavafapdpEAELAALRQRRSELERELAQHRAQEqqlrQQLDQLKEQLQLLNKLLpqanLLADETL 891
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2258 AQRL--LQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAekmlKEKMQAVQEATRLKAEA---------- 2325
Cdd:COG3096 892 ADRLeeLREELDAAQEAQAFIQQHGKALAQLEPLVAVLQSDPEQFEQLQ----ADYLQAKEQQRRLKQQIfalsevvqrr 967
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2326 -------------------ELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQkTLETERQRQLEMSAEAERlrlRVA 2386
Cdd:COG3096 968 phfsyedavgllgensdlnEKLRARLEQAEEARREAREQLRQAQAQYSQYNQVLA-SLKSSRDAKQQTLQELEQ---ELE 1043
|
890 900 910
....*....|....*....|....*....|....*...
gi 1920237962 2387 EMS-RAQARAEEDA--RRFRKQAEDIGERLYRTELATQ 2421
Cdd:COG3096 1044 ELGvQADAEAEERAriRRDELHEELSQNRSRRSQLEKQ 1081
|
|
| CCDC22 |
pfam05667 |
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 ... |
2155-2340 |
4.39e-06 |
|
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 (CCDC22) is involved in regulation of NF-kappa-B signalling; the function may involve association with COMMD8 and a CUL1-dependent E3 ubiquitin ligase complex. It is part of the OMMD/CCDC22/CCDC93 (CCC) complex, which interacts with the multisubunit WASH complex required for endosomal deposition of F-actin and cargo trafficking in conjunction with the retromer. This entry also includes CCDC22 homologs from animals and plants.
Pssm-ID: 461708 [Multi-domain] Cd Length: 600 Bit Score: 53.11 E-value: 4.39e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2155 QKQAADAEMEKHKQFAEQALRQKAQvEQELTALRLQLEETDHQKSILDEELQRLKAEVTeaarqrgQVEEELFSLRVQME 2234
Cdd:pfam05667 309 TNEAPAATSSPPTKVETEEELQQQR-EEELEELQEQLEDLESSIQELEKEIKKLESSIK-------QVEEELEELKEQNE 380
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2235 ELGK---LKARI-----EAENRALVLrdkdsaQRLLQEEAEKMKQVAE--EAARLSVAAQEAARLRQLAEEDLAQQRALA 2304
Cdd:pfam05667 381 ELEKqykVKKKTldllpDAEENIAKL------QALVDASAQRLVELAGqwEKHRVPLIEEYRALKEAKSNKEDESQRKLE 454
|
170 180 190
....*....|....*....|....*....|....*....
gi 1920237962 2305 E-KMLKEKMQAVQEATRLKAE--AELLQQQKELAQEQAR 2340
Cdd:pfam05667 455 EiKELREKIKEVAEEAKQKEElyKQLVAEYERLPKDVSR 493
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
1427-1606 |
4.41e-06 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 52.16 E-value: 4.41e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1427 AEIQAKARQVEAAERSRLRIEEEirvvrlQLEATERQRGGAEGELQALRARAEEAEAqKRQAQEEAERLRRQVQDETQRK 1506
Cdd:TIGR02794 53 NRIQQQKKPAAKKEQERQKKLEQ------QAEEAEKQRAAEQARQKELEQRAAAEKA-AKQAEQAAKQAEEKQKQAEEAK 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1507 RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTA 1586
Cdd:TIGR02794 126 AKQAAEAKAKAEAEAERKAKEEAAKQAEEEAKAKAAAEAKKKAEEAKKKAEAEAKAKAEAEAKAKAEEAKAKAEAAKAKA 205
|
170 180
....*....|....*....|
gi 1920237962 1587 QLERTLKEEHVAVVQLREEA 1606
Cdd:TIGR02794 206 AAEAAAKAEAEAAAAAAAEA 225
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3717-3752 |
4.49e-06 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 45.94 E-value: 4.49e-06
10 20 30
....*....|....*....|....*....|....*.
gi 1920237962 3717 RQLLEAQAATGFLLDPVKGERLAVDEAVRKGLVGPE 3752
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE 36
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
2155-2339 |
4.62e-06 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 52.50 E-value: 4.62e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2155 QKQAADAEMEKHKQFAEQA--LRQKAQVEQEltalRLQLEETDHQKSildeelQRLKAEVTEAARQRGQVEEELFSLRVQ 2232
Cdd:PRK09510 71 QKSAKRAEEQRKKKEQQQAeeLQQKQAAEQE----RLKQLEKERLAA------QEQKKQAEEAAKQAALKQKQAEEAAAK 140
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2233 MEELGKLKARIEAEN-RALVLRDKDSAQRLLQEEAEK-----MKQVAEEAARLSVAAQEAARLRQLAEE---DLAQQRAL 2303
Cdd:PRK09510 141 AAAAAKAKAEAEAKRaAAAAKKAAAEAKKKAEAEAAKkaaaeAKKKAEAEAAAKAAAEAKKKAEAEAKKkaaAEAKKKAA 220
|
170 180 190
....*....|....*....|....*....|....*.
gi 1920237962 2304 AEKmlKEKMQAVQEATRLKAEAELLQQQKELAQEQA 2339
Cdd:PRK09510 221 AEA--KAAAAKAAAEAKAAAEKAAAAKAAEKAAAAK 254
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
2203-2611 |
5.00e-06 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 53.12 E-value: 5.00e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2203 EELQRLKAEVteAARQRGQVEEELFSLRVQMEELGKLKARIEaENRALVLRDKDSAQRLLQEEAEKMKQ---VAEEAARL 2279
Cdd:PRK02224 187 GSLDQLKAQI--EEKEEKDLHERLNGLESELAELDEEIERYE-EQREQARETRDEADEVLEEHEERREEletLEAEIEDL 263
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2280 SVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLaqetQG 2359
Cdd:PRK02224 264 RETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAA----QA 339
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2360 FQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLyrtelatqekvmlvQTLETQRQQSDR 2439
Cdd:PRK02224 340 HNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEI--------------EELRERFGDAPV 405
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2440 DAERLREAIAELEHEKDKLKQEAQLLQ--LKSEEMQTVRQEQLLQE------TQALQQSflSEKDSLLQRERCIEQEKAK 2511
Cdd:PRK02224 406 DLGNAEDFLEELREERDELREREAELEatLRTARERVEEAEALLEAgkcpecGQPVEGS--PHVETIEEDRERVEELEAE 483
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2512 LEQL------FQDEVAKAQALREEQQR--QQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLL 2583
Cdd:PRK02224 484 LEDLeeeveeVEERLERAEDLVEAEDRieRLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEA 563
|
410 420 430
....*....|....*....|....*....|....
gi 1920237962 2584 AEENQRLRERLQHLEE------ERRAALARSEEI 2611
Cdd:PRK02224 564 EEEAEEAREEVAELNSklaelkERIESLERIRTL 597
|
|
| CH_PLS2_rpt1 |
cd21324 |
first calponin homology (CH) domain found in plastin-2; Plastin-2, also called L-plastin, or ... |
31-163 |
5.18e-06 |
|
first calponin homology (CH) domain found in plastin-2; Plastin-2, also called L-plastin, or LC64P, or lymphocyte cytosolic protein 1 (LCP-1), is an actin-binding protein that plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-2 contains four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409173 Cd Length: 145 Bit Score: 49.24 E-value: 5.18e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 31 RASEGKKDERDRVQKKTFTKWVNK---------HLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPrERDVIRSSRLP 101
Cdd:cd21324 12 QSSAGTQHSYSEEEKYAFVNWINKalendpdckHVIPMNPNTDDLFKAVGDGIVLCKMINFSVPDTID-ERTINKKKLTP 90
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 102 rekgrmrFHKLQNVQIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQV 163
Cdd:cd21324 91 -------FTIQENLNLALNSASAIGCHVVNIGAEDLKEGKPYLVLGLLWQVIKIGLFADIEL 145
|
|
| DUF4659 |
pfam15558 |
Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins ... |
1335-1568 |
5.38e-06 |
|
Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins in this family are typically between 427 and 674 amino acids in length. There are two completely conserved residues (D and I) that may be functionally important.
Pssm-ID: 464768 [Multi-domain] Cd Length: 374 Bit Score: 51.96 E-value: 5.38e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1335 ETLRRMEEEERLAEQQRAEERERLA--EVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEvavEAQEQKR 1412
Cdd:pfam15558 51 ERRLLLQQSQEQWQAEKEQRKARLGreERRRADRREKQVIEKESRWREQAEDQENQRQEKLERARQEAEQ---RKQCQEQ 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1413 SIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEA------EAQKR 1486
Cdd:pfam15558 128 RLKEKEEELQALREQNSLQLQERLEEACHKRQLKEREEQKKVQENNLSELLNHQARKVLVDCQAKAEELlrrlslEQSLQ 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1487 QAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQ----RALQALEELRL-QAEEAERRLRQAEAERARQVQV 1561
Cdd:pfam15558 208 RSQENYEQLVEERHRELREKAQKEEEQFQRAKWRAEEKEEERqehkEALAELADRKIqQARQVAHKTVQDKAQRARELNL 287
|
....*..
gi 1920237962 1562 ALETAQR 1568
Cdd:pfam15558 288 EREKNHH 294
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3922-3959 |
5.69e-06 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 45.94 E-value: 5.69e-06
10 20 30
....*....|....*....|....*....|....*...
gi 1920237962 3922 QRFLEGTSSIAGVLVDATKERLSVYQAMKKGIIRPGTA 3959
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
1376-1598 |
5.84e-06 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 51.77 E-value: 5.84e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1376 AQAKAQAEREAQGLQ---RRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKArqveaAERSRLRIEEEIRV 1452
Cdd:TIGR02794 46 GAVAQQANRIQQQKKpaaKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQ-----AEQAAKQAEEKQKQ 120
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1453 vrlQLEATERQRggaegelqALRARAEEAEAqKRQAQEEAerlRRQVQDETQRKRQAEAelalrvQAEAEAAREKQRAL- 1531
Cdd:TIGR02794 121 ---AEEAKAKQA--------AEAKAKAEAEA-ERKAKEEA---AKQAEEEAKAKAAAEA------KKKAEEAKKKAEAEa 179
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 1532 --QALEELRLQAEEAERRLRQAE----------AERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVA 1598
Cdd:TIGR02794 180 kaKAEAEAKAKAEEAKAKAEAAKakaaaeaaakAEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEKQGGARGAAAG 258
|
|
| CH_FLNA_rpt2 |
cd21312 |
second calponin homology (CH) domain found in filamin-A (FLN-A) and similar proteins; ... |
166-273 |
5.86e-06 |
|
second calponin homology (CH) domain found in filamin-A (FLN-A) and similar proteins; Filamin-A (FLN-A) is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also anchors various transmembrane proteins to the actin cytoskeleton and serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-A contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409161 Cd Length: 114 Bit Score: 48.26 E-value: 5.86e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 166 QSEDMTAKEKLLLWSQRMVEGcqgLRCDNFTTSWRDGRLFNAIIHRHKPTLI-DMNKVYRQTNLENLDQAFSVAERDLGV 244
Cdd:cd21312 7 EAKKQTPKQRLLGWIQNKLPQ---LPITNFSRDWQSGRALGALVDSCAPGLCpDWDSWDASKPVTNAREAMQQADDWLGI 83
|
90 100
....*....|....*....|....*....
gi 1920237962 245 TRLLDPEDVDVPQPDEKSIITYVSSLYDA 273
Cdd:cd21312 84 PQVITPEEIVDPNVDEHSVMTYLSQFPKA 112
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
1773-2614 |
5.89e-06 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 52.81 E-value: 5.89e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1773 ESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIAL 1852
Cdd:pfam15921 81 EEYSHQVKDLQRRLNESNELHEKQKFYLRQSVIDLQTKLQEMQMERDAMADIRRRESQSQEDLRNQLQNTVHELEAAKCL 160
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1853 KEKEAENERlrrlAEDEAFQRRLLEEQAAQHkaDIEARLAQLRKASESELERQKGLVEDTLRQR--------RQVEEEIL 1924
Cdd:pfam15921 161 KEDMLEDSN----TQIEQLRKMMLSHEGVLQ--EIRSILVDFEEASGKKIYEHDSMSTMHFRSLgsaiskilRELDTEIS 234
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1925 ALKGSF----EKAAAGKAELELELGRIRGTAEDTLrskEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQR 2000
Cdd:pfam15921 235 YLKGRIfpveDQLEALKSESQNKIELLLQQHQDRI---EQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQARNQ 311
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2001 KAA----LEEVE----RLKAKVEEARRLRERAEQESARQLQLAQ----EAAQKRLQAEEKAHAFAVQQKEQELQQTLQQE 2068
Cdd:pfam15921 312 NSMymrqLSDLEstvsQLRSELREAKRMYEDKIEELEKQLVLANseltEARTERDQFSQESGNLDDQLQKLLADLHKREK 391
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2069 QSVLERlrseaeaarraaeeaeaareraereaAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQA 2148
Cdd:pfam15921 392 ELSLEK--------------------------EQNKRLWDRDTGNSITIDHLRRELDDRNMEVQRLEALLKAMKSECQGQ 445
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2149 EQAALRQKQAADAEMEKHKQFAEQALRQKA---QVEQELTALRLQLEETDHQKSILDEELQR----LKAEVTEAARQRGQ 2221
Cdd:pfam15921 446 MERQMAAIQGKNESLEKVSSLTAQLESTKEmlrKVVEELTAKKMTLESSERTVSDLTASLQEkeraIEATNAEITKLRSR 525
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2222 VE---EELFSLRVQMEELGKLKAriEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQ-EAArlrQLAEEDL 2297
Cdd:pfam15921 526 VDlklQELQHLKNEGDHLRNVQT--ECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQvEKA---QLEKEIN 600
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2298 AQQRALAE-KMLKEKMQAvqEATRLKAEAELLQQQK-ELAQEQARRLQEDKEqmaqqLAQETQGFQKTLETERQRQLEMS 2375
Cdd:pfam15921 601 DRRLELQEfKILKDKKDA--KIRELEARVSDLELEKvKLVNAGSERLRAVKD-----IKQERDQLLNEVKTSRNELNSLS 673
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2376 AEAERLRlrvaemsraqaraeedaRRFRKQAEDIGERLYRTELATQE-KVMLVQTLETQRQQSDRDAERLREA------I 2448
Cdd:pfam15921 674 EDYEVLK-----------------RNFRNKSEEMETTTNKLKMQLKSaQSELEQTRNTLKSMEGSDGHAMKVAmgmqkqI 736
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2449 AELEHEKDKLKQEAQLLQlksEEMQTVRQEQ-LLQEtqalqqsflsEKDSLLQRERCIEQEKAKLEQlfQDEVAKAQALR 2527
Cdd:pfam15921 737 TAKRGQIDALQSKIQFLE---EAMTNANKEKhFLKE----------EKNKLSQELSTVATEKNKMAG--ELEVLRSQERR 801
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2528 eeqqrqqqqMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLA----EENQRLRERLQhleeeRRA 2603
Cdd:pfam15921 802 ---------LKEKVANMEVALDKASLQFAECQDIIQRQEQESVRLKLQHTLDVKELQgpgyTSNSSMKPRLL-----QPA 867
|
890
....*....|.
gi 1920237962 2604 ALARSEEIAPS 2614
Cdd:pfam15921 868 SFTRTHSNVPS 878
|
|
| CH_PLS1_rpt3 |
cd21329 |
third calponin homology (CH) domain found in plastin-1; Plastin-1, also called ... |
39-159 |
6.06e-06 |
|
third calponin homology (CH) domain found in plastin-1; Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. It contains four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409178 Cd Length: 118 Bit Score: 48.44 E-value: 6.06e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 39 ERDRVQKKTFTKWVNKhlIKAQRHISDLYEDLRDGHNLISLLEVLSgdsLPRERDVIRSSRLPREKGRMRfhKLQNVQIA 118
Cdd:cd21329 2 EGESSEERTFRNWMNS--LGVNPYVNHLYSDLCDALVIFQLYEMTR---VPVDWGHVNKPPYPALGGNMK--KIENCNYA 74
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 1920237962 119 LDYLRHR-QVKLVNIRNDDIADGNPKLTLGLIWTIILHFQIS 159
Cdd:cd21329 75 VELGKNKaKFSLVGIAGSDLNEGNKTLTLALIWQLMRRYTLN 116
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
2153-2599 |
7.91e-06 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 52.42 E-value: 7.91e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2153 LRQKQAADAEMEKHKQFAEQALRQ-----KAQVEQELTA---LRLQLEETDHQKSI-LDEELQRLKAEVTEAARQRGQVE 2223
Cdd:pfam05483 160 LKETCARSAEKTKKYEYEREETRQvymdlNNNIEKMILAfeeLRVQAENARLEMHFkLKEDHEKIQHLEEEYKKEINDKE 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2224 EELFSLRVQM-EELGKLKARI----EAENRALVLRDKDSAQ-RLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDL 2297
Cdd:pfam05483 240 KQVSLLLIQItEKENKMKDLTflleESRDKANQLEEKTKLQdENLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDL 319
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2298 AQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQ--------KELAQEQARRLQEDKEQMaQQLAQETQgfQKTLETERQ 2369
Cdd:pfam05483 320 QIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEfeattcslEELLRTEQQRLEKNEDQL-KIITMELQ--KKSSELEEM 396
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2370 RQLEMSAEAERLRLRVAeMSRAQARAEEdarrfRKQAEDIGERLYRTElatQEKVMLVQTLETQRQQSDRDAERLREAIA 2449
Cdd:pfam05483 397 TKFKNNKEVELEELKKI-LAEDEKLLDE-----KKQFEKIAEELKGKE---QELIFLLQAREKEIHDLEIQLTAIKTSEE 467
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2450 ELEHEKDKLKQEAQLLQLKSEEMqTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQdevaKAQALREE 2529
Cdd:pfam05483 468 HYLKEVEDLKTELEKEKLKNIEL-TAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQEERMLK----QIENLEEK 542
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2530 QQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEE 2599
Cdd:pfam05483 543 EMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKCNNLKKQIENKNKNIEE 612
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1673-1955 |
7.95e-06 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 51.44 E-value: 7.95e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1673 GKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAE 1752
Cdd:COG4372 23 GILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEE 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1753 LAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQ----LAEEDAVRQRAEAE 1828
Cdd:COG4372 103 LESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEelaaLEQELQALSEAEAE 182
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1829 RVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGL 1908
Cdd:COG4372 183 QALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEE 262
|
250 260 270 280
....*....|....*....|....*....|....*....|....*..
gi 1920237962 1909 VEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTL 1955
Cdd:COG4372 263 LELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSL 309
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
2181-2610 |
8.34e-06 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 52.48 E-value: 8.34e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2181 EQELTALRLQLEETDHQKSILDEElQRLKAEvtEAARQRGQVEeelfslRVQMEelGKLKariEAENRALVLRDKDSaqR 2260
Cdd:pfam01576 86 EEEERSQQLQNEKKKMQQHIQDLE-EQLDEE--EAARQKLQLE------KVTTE--AKIK---KLEEDILLLEDQNS--K 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2261 LLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKE-KMQAVQEATRLKAEAELLQqqkelAQEQA 2339
Cdd:pfam01576 150 LSKERKLLEERISEFTSNLAEEEEKAKSLSKLKNKHEAMISDLEERLKKEeKGRQELEKAKRKLEGESTD-----LQEQI 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2340 RRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQA--RAEEDAR-RFRKQAEDIGERL--Y 2414
Cdd:pfam01576 225 AELQAQIAELRAQLAKKEEELQAALARLEEETAQKNNALKKIRELEAQISELQEdlESERAARnKAEKQRRDLGEELeaL 304
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2415 RTELA-TQEKVMLVQTLETQRQQsdrDAERLREAIaelehEKDKLKQEAQLLQLKSEEMQTVR--QEQLLQ--------- 2482
Cdd:pfam01576 305 KTELEdTLDTTAAQQELRSKREQ---EVTELKKAL-----EEETRSHEAQLQEMRQKHTQALEelTEQLEQakrnkanle 376
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2483 -ETQALQQSFL---SEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQalreEQQRQQQQMQQEKQQLAASMEEARRRQHEA 2558
Cdd:pfam01576 377 kAKQALESENAelqAELRTLQQAKQDSEHKRKKLEGQLQELQARLS----ESERQRAELAEKLSKLQSELESVSSLLNEA 452
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*
gi 1920237962 2559 EEGVRRQQEELQRLAQQQQQQEKLLAEENQR---LRERLQHLEEERRAALARSEE 2610
Cdd:pfam01576 453 EGKNIKLSKDVSSLESQLQDTQELLQEETRQklnLSTRLRQLEDERNSLQEQLEE 507
|
|
| PTZ00491 |
PTZ00491 |
major vault protein; Provisional |
2312-2457 |
8.99e-06 |
|
major vault protein; Provisional
Pssm-ID: 240439 [Multi-domain] Cd Length: 850 Bit Score: 52.33 E-value: 8.99e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2312 MQAVQEATRlkaeaELLQQQKELA-------QEQARRLQ-EDKEQMAQ-QLAQetQGFQKTLETERQRQLEMSAEAERLR 2382
Cdd:PTZ00491 638 VEPVDERTR-----DSLQKSVQLAieittksQEAAARHQaELLEQEARgRLER--QKMHDKAKAEEQRTKLLELQAESAA 710
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 2383 LRVAEMSRAQARAEEDARRFRKQAEdigerLYRTEL-ATQEKVMLVQTLETQRQQSDRDAERlREAIAELEHEKDK 2457
Cdd:PTZ00491 711 VESSGQSRAEALAEAEARLIEAEAE-----VEQAELrAKALRIEAEAELEKLRKRQELELEY-EQAQNELEIAKAK 780
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1300-1514 |
9.00e-06 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 51.30 E-value: 9.00e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1300 KVQSGSESIIQEYVDLRTRYSELSTL---TSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEA-- 1374
Cdd:COG4942 45 ALKKEEKALLKQLAALERRIAALARRiraLEQELAALEAELAELEKEIAELRAELEAQKEELAELLRALYRLGRQPPLal 124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1375 --HAQAKAQAEREAQGLQRRMQeevARREEVaveaqEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIrv 1452
Cdd:COG4942 125 llSPEDFLDAVRRLQYLKYLAP---ARREQA-----EELRADLAELAALRAELEAERAELEALLAELEEERAALEALK-- 194
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 1453 vrlqleaTERQRggaegELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELA 1514
Cdd:COG4942 195 -------AERQK-----LLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTP 244
|
|
| PRK10929 |
PRK10929 |
putative mechanosensitive channel protein; Provisional |
1223-1542 |
9.28e-06 |
|
putative mechanosensitive channel protein; Provisional
Pssm-ID: 236798 [Multi-domain] Cd Length: 1109 Bit Score: 52.36 E-value: 9.28e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1223 KQRQEQIQAVPLANSQAVREQLRQEKALLEDIERHGEKveecqrfAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQ 1302
Cdd:PRK10929 29 TQELEQAKAAKTPAQAEIVEALQSALNWLEERKGSLER-------AKQYQQVIDNFPKLSAELRQQLNNERDEPRSVPPN 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1303 SGSESIIQEYVDLRTRYSELSTLTSQ---YIRFISETLRRMEeeerlaeQQRAEERERLAEVEAALEKQRQLAEAHAQAK 1379
Cdd:PRK10929 102 MSTDALEQEILQVSSQLLEKSRQAQQeqdRAREISDSLSQLP-------QQQTEARRQLNEIERRLQTLGTPNTPLAQAQ 174
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1380 AQAereaqglqrrMQEEVARReevaveaqeqkRSIQEELQhLRQSSEAEIQAKAR-QVEAAERSRLRIEEEIRVVRLQLE 1458
Cdd:PRK10929 175 LTA----------LQAESAAL-----------KALVDELE-LAQLSANNRQELARlRSELAKKRSQQLDAYLQALRNQLN 232
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1459 aTERQRggaegelqalraRAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQ-------AEAEAAREKQRAL 1531
Cdd:PRK10929 233 -SQRQR------------EAERALESTELLAEQSGDLPKSIVAQFKINRELSQALNQQAQrmdliasQQRQAASQTLQVR 299
|
330
....*....|.
gi 1920237962 1532 QALEELRLQAE 1542
Cdd:PRK10929 300 QALNTLREQSQ 310
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1521-2052 |
9.66e-06 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 51.99 E-value: 9.66e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1521 AEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQV--ALETAQRSAEAELQS------EHASFAEKTAQLERTL 1592
Cdd:PRK03918 168 GEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREinEISSELPELREELEKlekevkELEELKEEIEELEKEL 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1593 KEEHVAVVQLRE---EATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREA 1669
Cdd:PRK03918 248 ESLEGSKRKLEEkirELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIE 327
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1670 RRRGKAEEQAVRQRELAEQELEKQRQLAE--------GTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELAR--LQREA 1739
Cdd:PRK03918 328 ERIKELEEKEERLEELKKKLKELEKRLEEleerhelyEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKeeIEEEI 407
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1740 AAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGR-FRELAEEAARLRALAEEAKRQ-RQLAE 1817
Cdd:PRK03918 408 SKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAeLKRIEKELKEIEEKERKLRKElRELEK 487
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1818 EDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERlRRLAEDEAFQRRLLEEqaAQHKADIEARLAQLRKA 1897
Cdd:PRK03918 488 VLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLK-EKLIKLKGEIKSLKKE--LEKLEELKKKLAELEKK 564
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1898 SEsELERQKGLVEDTLRQR-----RQVEEEILALKGSFEK---AAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQR 1969
Cdd:PRK03918 565 LD-ELEEELAELLKELEELgfesvEELEERLKELEPFYNEyleLKDAEKELEREEKELKKLEEELDKAFEELAETEKRLE 643
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1970 QLAAEEERRrreaeervqKSLAAEEEAARQRKAALE---EVERLKAKVEEARRLRERAEQ--ESARQLQLAQEAAQKRLQ 2044
Cdd:PRK03918 644 ELRKELEEL---------EKKYSEEEYEELREEYLElsrELAGLRAELEELEKRREEIKKtlEKLKEELEEREKAKKELE 714
|
....*...
gi 1920237962 2045 AEEKAHAF 2052
Cdd:PRK03918 715 KLEKALER 722
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3924-3962 |
9.77e-06 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 45.01 E-value: 9.77e-06
10 20 30
....*....|....*....|....*....|....*....
gi 1920237962 3924 FLEGTSSIAGVLVDATKERLSVYQAMKKGIIRPGTAFEL 3962
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| PRK10246 |
PRK10246 |
exonuclease subunit SbcC; Provisional |
1350-2054 |
1.04e-05 |
|
exonuclease subunit SbcC; Provisional
Pssm-ID: 182330 [Multi-domain] Cd Length: 1047 Bit Score: 52.11 E-value: 1.04e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1350 QRAEERERLAEVEAALEKQRQLAEAHAQAKAQAER-EAQGlqrrmqeevarrEEVAVEAQEQKRSIQEELQHLRQsseae 1428
Cdd:PRK10246 168 ERAELLEELTGTEIYGQISAMVFEQHKSARTELEKlQAQA------------SGVALLTPEQVQSLTASLQVLTD----- 230
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1429 iqakarqveaaersrlriEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAerlrrqvQDETQRKRQ 1508
Cdd:PRK10246 231 ------------------EEKQLLTAQQQQQQSLNWLTRLDELQQEASRRQQALQQALAAEEKA-------QPQLAALSL 285
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1509 AEAELALRVQAEaeaarEKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQrsaeaELQSEHASFAEKTAql 1588
Cdd:PRK10246 286 AQPARQLRPHWE-----RIQEQSAALAHTRQQIEEVNTRLQSTMALRARIRHHAAKQSA-----ELQAQQQSLNTWLA-- 353
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1589 ertlkeEHVAVVQLREE-----ATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLtqaeaekqke 1663
Cdd:PRK10246 354 ------EHDRFRQWNNElagwrAQFSQQTSDREQLRQWQQQLTHAEQKLNALPAITLTLTADEVAAALAQ---------- 417
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1664 eaerearrrgKAEEQAVRQR--ELAEQELEKQRQLAEgTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLqREAAA 1741
Cdd:PRK10246 418 ----------HAEQRPLRQRlvALHGQIVPQQKRLAQ-LQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADV-KTICE 485
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1742 ATQKRRELEAELAKVRAEMEVLL--ASKARAEEESRS-TSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEE 1818
Cdd:PRK10246 486 QEARIKDLEAQRAQLQAGQPCPLcgSTSHPAVEAYQAlEPGVNQSRLDALEKEVKKLGEEGAALRGQLDALTKQLQRDES 565
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1819 DAvRQRAEAERVLAEKLAAISEA--TRLKTEAEIA--LKEKEAENERLRRLAEDEAFQRRLLE--EQAAQHKADIEARLA 1892
Cdd:PRK10246 566 EA-QSLRQEEQALTQQWQAVCASlnITLQPQDDIQpwLDAQEEHERQLRLLSQRHELQGQIAAhnQQIIQYQQQIEQRQQ 644
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1893 QLR----------------------KASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGT 1950
Cdd:PRK10246 645 QLLtalagyaltlpqedeeaswlatRQQEAQSWQQRQNELTALQNRIQQLTPLLETLPQSDDLPHSEETVALDNWRQVHE 724
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1951 AEDTLRSK-----EQAEQEAARQRQLAAEEERRRREAEERVQKSLAAE--EEAARQRKAALEevERLKAKVEEARRLRER 2023
Cdd:PRK10246 725 QCLSLHSQlqtlqQQDVLEAQRLQKAQAQFDTALQASVFDDQQAFLAAllDEETLTQLEQLK--QNLENQRQQAQTLVTQ 802
|
730 740 750
....*....|....*....|....*....|.
gi 1920237962 2024 AEQESARQLQLAQEAAQKRLQAEEKAHAFAV 2054
Cdd:PRK10246 803 TAQALAQHQQHRPDGLDLTVTVEQIQQELAQ 833
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
2270-2486 |
1.07e-05 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 51.00 E-value: 1.07e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2270 KQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQM 2349
Cdd:TIGR02794 46 GAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAK 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2350 AQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKvmlvQT 2429
Cdd:TIGR02794 126 AKQAAEAKAKAEAEAERKAKEEAAKQAEEEAKAKAAAEAKKKAEEAKKKAEAEAKAKAEAEAKAKAEEAKAKAE----AA 201
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237962 2430 LETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQA 2486
Cdd:TIGR02794 202 KAKAAAEAAAKAEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEKQGGARGAAAG 258
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1736-2057 |
1.23e-05 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 51.66 E-value: 1.23e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1736 QREAAAATQKRRELEAELAKVRAEMEvllaSKARAEEESRSTSEKSKQRlEAEAGRFRELAEEAARLralaeEAKRQRQL 1815
Cdd:pfam17380 281 QKAVSERQQQEKFEKMEQERLRQEKE----EKAREVERRRKLEEAEKAR-QAEMDRQAAIYAEQERM-----AMEREREL 350
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1816 AEEDAVRQRAEAERVLAEKLAAisEATRLKtEAEIALKEKEAENERLRRLAEdEAFQRRLLEEQAAQHKADIEARLAQLR 1895
Cdd:pfam17380 351 ERIRQEERKRELERIRQEEIAM--EISRMR-ELERLQMERQQKNERVRQELE-AARKVKILEEERQRKIQQQKVEMEQIR 426
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1896 KASESELERQ-KGLVEDTLRQRRQVEEEilalkgsfekaaagKAELELELGRIRGTAEDTLRSKEQAEQEaaRQRQLAAE 1974
Cdd:pfam17380 427 AEQEEARQREvRRLEEERAREMERVRLE--------------EQERQQQVERLRQQEEERKRKKLELEKE--KRDRKRAE 490
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1975 EERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARrlRERAEQESARQLQLAQE---AAQKRLQAEEKAHA 2051
Cdd:pfam17380 491 EQRRKILEKELEERKQAMIEEERKRKLLEKEMEERQKAIYEEER--RREAEEERRKQQEMEERrriQEQMRKATEERSRL 568
|
....*.
gi 1920237962 2052 FAVQQK 2057
Cdd:pfam17380 569 EAMERE 574
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
2321-2607 |
1.45e-05 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 51.88 E-value: 1.45e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2321 LKAEAELLQQQKELAQEQARRlqedkEQMAQQLAQETQGfQKTLETERQrqlemsAEAERLRLRVAEMsraqaRAEEDAR 2400
Cdd:COG3096 288 LELRRELFGARRQLAEEQYRL-----VEMARELEELSAR-ESDLEQDYQ------AASDHLNLVQTAL-----RQQEKIE 350
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2401 RFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEA----QLLQLKsEEMQTVR 2476
Cdd:COG3096 351 RYQEDLEELTERLEEQEEVVEEAAEQLAEAEARLEAAEEEVDSLKSQLADYQQALDVQQTRAiqyqQAVQAL-EKARALC 429
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2477 Q---------EQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQ------DEVAKAQA-------LREEQQRQQ 2534
Cdd:COG3096 430 GlpdltpenaEDYLAAFRAKEQQATEEVLELEQKLSVADAARRQFEKAYElvckiaGEVERSQAwqtarelLRRYRSQQA 509
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237962 2535 QQMQQEKqqLAASMEEARRRQHEAEEgVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALAR 2607
Cdd:COG3096 510 LAQRLQQ--LRAQLAELEQRLRQQQN-AERLLEEFCQRIGQQLDAAEELEELLAELEAQLEELEEQAAEAVEQ 579
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
2433-2622 |
1.45e-05 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 50.92 E-value: 1.45e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2433 QRQQSDRDAERLREAIAELEHEKDKLKQEAQLL--QLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKA 2510
Cdd:COG4942 21 AAAEAEAELEQLQQEIAELEKELAALKKEEKALlkQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELE 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2511 KLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASME------EARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLA 2584
Cdd:COG4942 101 AQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQylkylaPARREQAEELRADLAELAALRAELEAERAELEALL 180
|
170 180 190
....*....|....*....|....*....|....*...
gi 1920237962 2585 EENQRLRERLQHLEEERRAALARSEEIAPSRAAAARAL 2622
Cdd:COG4942 181 AELEEERAALEALKAERQKLLARLEKELAELAAELAEL 218
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
1815-2515 |
1.53e-05 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 51.59 E-value: 1.53e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1815 LAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKAdiEARLAQL 1894
Cdd:TIGR00606 165 LSEGKALKQKFDEIFSATRYIKALETLRQVRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLES--SREIVKS 242
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1895 RKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQlaae 1974
Cdd:TIGR00606 243 YENELDPLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLNDLYHNHQRTVREKE---- 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1975 eerrrrEAEERVQKSLAAEEEAAR---QRKAALE-EVERLKAKVE---EARRLRERAEQESARQLQLAQEAAQKRLQAEE 2047
Cdd:TIGR00606 319 ------RELVDCQRELEKLNKERRllnQEKTELLvEQGRLQLQADrhqEHIRARDSLIQSLATRLELDGFERGPFSERQI 392
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2048 K-AHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQ 2126
Cdd:TIGR00606 393 KnFHTLVIERQEDEAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDR 472
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2127 AQAAAEKLRKEAE---------------QEAARRAQAEQAALRQKQAADAEME----------------KHKQFAEQALR 2175
Cdd:TIGR00606 473 ILELDQELRKAERelskaeknsltetlkKEVKSLQNEKADLDRKLRKLDQEMEqlnhhtttrtqmemltKDKMDKDEQIR 552
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2176 Q-KAQVEQELTAL------RLQLEETDHQKS----ILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGklkariE 2244
Cdd:TIGR00606 553 KiKSRHSDELTSLlgyfpnKKQLEDWLHSKSkeinQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYE------D 626
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2245 AENRALVLRDKDSAQRLLQEEAEKM-KQVAEEAARLSVAAQeaaRLRQLAEED-----LAQQRALAEKMLKEKMQAVQEA 2318
Cdd:TIGR00606 627 KLFDVCGSQDEESDLERLKEEIEKSsKQRAMLAGATAVYSQ---FITQLTDENqsccpVCQRVFQTEAELQEFISDLQSK 703
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2319 TRLkAEAELLQQQKELAQEQARRlqEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSR--AQARAE 2396
Cdd:TIGR00606 704 LRL-APDKLKSTESELKKKEKRR--DEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETllGTIMPE 780
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2397 EDARRFRKQAEDIGERLYRtELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVR 2476
Cdd:TIGR00606 781 EESAKVCLTDVTIMERFQM-ELKDVERKIAQQAAKLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQI 859
|
730 740 750
....*....|....*....|....*....|....*....
gi 1920237962 2477 QeQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQL 2515
Cdd:TIGR00606 860 Q-HLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEV 897
|
|
| ATAD3_N |
pfam12037 |
ATPase family AAA domain-containing protein 3, N-terminal; This is the conserved N-terminal ... |
1316-1457 |
1.54e-05 |
|
ATPase family AAA domain-containing protein 3, N-terminal; This is the conserved N-terminal domain of ATPase family AAA domain-containing protein 3 (ATAD3) which is involved in dimerization and interacts with the inner surface of the outer mitochondrial membrane. This domain is found associated with the AAA ATPase domain (pfam00004). ATAD3 is essential for mitochondrial network organization, mitochondrial metabolism and cell growth at organizm and cellular level. It may also play an important role in mitochondrial protein synthesis.
Pssm-ID: 463442 [Multi-domain] Cd Length: 264 Bit Score: 49.98 E-value: 1.54e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1316 RTRYSELSTLTSQYIRFI----SETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRqlaeahaqakAQAEREAQglqR 1391
Cdd:pfam12037 56 QTRQAELQAKIKEYEAAQeqlkIERQRVEYEERRKTLQEETKQKQQRAQYQDELARKR----------YQDQLEAQ---R 122
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 1392 RMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQL 1457
Cdd:pfam12037 123 RRNEELLRKQEESVAKQEAMRIQAQRRQTEEHEAELRRETERAKAEAEAEARAKEERENEDLNLEQ 188
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
2171-2610 |
1.58e-05 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 51.22 E-value: 1.58e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2171 EQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAenral 2250
Cdd:PRK03918 161 ENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEE----- 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2251 vLRDKDSAQRLLQEEAEKMKQVAEEaarlsvaaqeaaRLRQLAEedlaqQRALAEKMLKEKMQAVQEATRLKAEAELLQQ 2330
Cdd:PRK03918 236 -LKEEIEELEKELESLEGSKRKLEE------------KIRELEE-----RIEELKKEIEELEEKVKELKELKEKAEEYIK 297
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2331 QKELAQEQARRLQEDKEQMA--QQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMsRAQARAEEDARRFRKQAED 2408
Cdd:PRK03918 298 LSEFYEEYLDELREIEKRLSrlEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEEL-EERHELYEEAKAKKEELER 376
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2409 IGERLyrTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKqeAQLLQLKS---------EEMQTVRQEQ 2479
Cdd:PRK03918 377 LKKRL--TGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELK--KAIEELKKakgkcpvcgRELTEEHRKE 452
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2480 LLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEvAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAE 2559
Cdd:PRK03918 453 LLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKE-SELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLK 531
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|.
gi 1920237962 2560 EGVRRQQEELQRLAqQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEE 2610
Cdd:PRK03918 532 EKLIKLKGEIKSLK-KELEKLEELKKKLAELEKKLDELEEELAELLKELEE 581
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
1472-1736 |
1.73e-05 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 51.17 E-value: 1.73e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1472 QALRARAEEAEAQKRQAQEEAERLRRQVqDETQRKRQA--EAELALRVQAEAEAAREKQRALQA-LEELRLQAEEAERRL 1548
Cdd:COG3206 164 QNLELRREEARKALEFLEEQLPELRKEL-EEAEAALEEfrQKNGLVDLSEEAKLLLQQLSELESqLAEARAELAEAEARL 242
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1549 RQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEAtrraqqqaeaeraraeaerele 1628
Cdd:COG3206 243 AALRAQLGSGPDALPELLQSPVIQQLRAQLAELEAELAELSARYTPNHPDVIALRAQI---------------------- 300
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1629 rwqlkanEALRLRLQAEEVAQQKSLtqaeaekqkeeaerearrrgKAEEQAVRQRelaEQELEKQRQLAEGTAQQRLAAE 1708
Cdd:COG3206 301 -------AALRAQLQQEAQRILASL--------------------EAELEALQAR---EASLQAQLAQLEARLAELPELE 350
|
250 260
....*....|....*....|....*...
gi 1920237962 1709 QELIRLRAETeqgEQQRQLLEEELARLQ 1736
Cdd:COG3206 351 AELRRLEREV---EVARELYESLLQRLE 375
|
|
| hsdR |
PRK11448 |
type I restriction enzyme EcoKI subunit R; Provisional |
1706-1801 |
1.80e-05 |
|
type I restriction enzyme EcoKI subunit R; Provisional
Pssm-ID: 236912 [Multi-domain] Cd Length: 1123 Bit Score: 51.49 E-value: 1.80e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1706 AAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLlasKARAEEESRSTSEKSKQRL 1785
Cdd:PRK11448 146 ALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQL---QEKAAETSQERKQKRKEIT 222
|
90
....*....|....*.
gi 1920237962 1786 EAEAGRFrELAEEAAR 1801
Cdd:PRK11448 223 DQAAKRL-ELSEEETR 237
|
|
| CH_FIMB_rpt1 |
cd21294 |
first calponin homology (CH) domain found in Saccharomyces cerevisiae fimbrin and similar ... |
38-153 |
1.83e-05 |
|
first calponin homology (CH) domain found in Saccharomyces cerevisiae fimbrin and similar proteins; Fimbrin binds to actin, and functionally associates with actin structures involved in the development and maintenance of cell polarity. Members of this family contain four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409143 Cd Length: 125 Bit Score: 47.06 E-value: 1.83e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 38 DERDRVQkktFTKWVNK---------HLIKAQRHISDLYEDLRDGHNLISLLEvlsgDSLPRERDVIRSSRLPRE-KGRM 107
Cdd:cd21294 4 NEDERRE---FTKHINAvlagdpdvgSRLPFPTDTFQLFDECKDGLVLSKLIN----DSVPDTIDERVLNKPPRKnKPLN 76
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 1920237962 108 RFHKLQNVQIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTII 153
Cdd:cd21294 77 NFQMIENNNIVINSAKAIGCSVVNIGAGDIIEGREHLILGLIWQII 122
|
|
| Caldesmon |
pfam02029 |
Caldesmon; |
1336-1552 |
1.84e-05 |
|
Caldesmon;
Pssm-ID: 460421 [Multi-domain] Cd Length: 495 Bit Score: 50.64 E-value: 1.84e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1336 TLRRMEEEERLAEQQRAEERERLAEVEAALEkqrqlaEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQ 1415
Cdd:pfam02029 134 EIREKEYQENKWSTEVRQAEEEGEEEEDKSE------EAEEVPTENFAKEEVKDEKIKKEKKVKYESKVFLDQKRGHPEV 207
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1416 EELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRV-VRLQLEATERQRGGAEG-ELQALRARAEEAEAQKRQAQEEAE 1493
Cdd:pfam02029 208 KSQNGEEEVTKLKVTTKRRQGGLSQSQEREEEAEVFLeAEQKLEELRRRRQEKESeEFEKLRQKQQEAELELEELKKKRE 287
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 1494 RLRRQVQDETQRKRQAEAELALRVQaeaeaaREKQRALQALEelRLQAEEAERRLRQAE 1552
Cdd:pfam02029 288 ERRKLLEEEEQRRKQEEAERKLREE------EEKRRMKEEIE--RRRAEAAEKRQKLPE 338
|
|
| growth_prot_Scy |
NF041483 |
polarized growth protein Scy; |
1839-2592 |
1.91e-05 |
|
polarized growth protein Scy;
Pssm-ID: 469371 [Multi-domain] Cd Length: 1293 Bit Score: 51.37 E-value: 1.91e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1839 SEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQ 1918
Cdd:NF041483 22 AEMDRLKTEREKAVQHAEDLGYQVEVLRAKLHEARRSLASRPAYDGADIGYQAEQLLRNAQIQADQLRADAERELRDARA 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1919 VEEEIlaLKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQ----------AEQEAARQRQLAAEEERRRREAEERVQK 1988
Cdd:NF041483 102 QTQRI--LQEHAEHQARLQAELHTEAVQRRQQLDQELAERRQtveshvnenvAWAEQLRARTESQARRLLDESRAEAEQA 179
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1989 SLAAEEEAARqrkAALEEVERLKAKVEEARRLRE----RAEQESARQLQLAQEAAQKRLQAEEKAHAF-------AVQQK 2057
Cdd:NF041483 180 LAAARAEAER---LAEEARQRLGSEAESARAEAEailrRARKDAERLLNAASTQAQEATDHAEQLRSStaaesdqARRQA 256
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2058 EQELQQTLQQEQSVLERLR-----SEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAE 2132
Cdd:NF041483 257 AELSRAAEQRMQEAEEALRearaeAEKVVAEAKEAAAKQLASAESANEQRTRTAKEEIARLVGEATKEAEALKAEAEQAL 336
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2133 KLRKEAEQEAARRAQAEQAALRQKQAAdAEMEKHKQFAEQALRQKAQVEQELTalRLQLEETDHQKSILDEELQRLKAEV 2212
Cdd:NF041483 337 ADARAEAEKLVAEAAEKARTVAAEDTA-AQLAKAARTAEEVLTKASEDAKATT--RAAAEEAERIRREAEAEADRLRGEA 413
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2213 TEAARQ-RGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAAR-----LSVAAQEA 2286
Cdd:NF041483 414 ADQAEQlKGAAKDDTKEYRAKTVELQEEARRLRGEAEQLRAEAVAEGERIRGEARREAVQQIEEAARtaeelLTKAKADA 493
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2287 ARLRQLAEEDLAQQRALA-EKMLKEKMQAVQEATRLKAEAELLQQQkelAQEQARRLQEDKEQMAQQLAQETQGFQKTLE 2365
Cdd:NF041483 494 DELRSTATAESERVRTEAiERATTLRRQAEETLERTRAEAERLRAE---AEEQAEEVRAAAERAARELREETERAIAARQ 570
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2366 TERQRQLE-MSAEAERlRLRVAEMSRAQARAEedARRFRKQAEDIGERLyRTELAtqekvmlvQTLETQRQQSDRDAERL 2444
Cdd:NF041483 571 AEAAEELTrLHTEAEE-RLTAAEEALADARAE--AERIRREAAEETERL-RTEAA--------ERIRTLQAQAEQEAERL 638
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2445 R-EAIAELEHEkdKLKQEAQLLQLKSEEMQtvRQEQLLQETQALQQSFLSEKDSLLQRercIEQEKAKleqlfqdevaka 2523
Cdd:NF041483 639 RtEAAADASAA--RAEGENVAVRLRSEAAA--EAERLKSEAQESADRVRAEAAAAAER---VGTEAAE------------ 699
|
730 740 750 760 770 780 790
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2524 qalreeqqrqqqqmqqekqQLAASMEEARRRQHEAEEGVRRQQEEL-QRLAQQQQQQEKLLAEENQRLRE 2592
Cdd:NF041483 700 -------------------ALAAAQEEAARRRREAEETLGSARAEAdQERERAREQSEELLASARKRVEE 750
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1348-1502 |
1.96e-05 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 49.15 E-value: 1.96e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1348 EQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREaqglQRRMQEEVARREEVAVEAQEQKRSIqeelqhlrqSSEA 1427
Cdd:COG1579 23 EHRLKELPAELAELEDELAALEARLEAAKTELEDLEKE----IKRLELEIEEVEARIKKYEEQLGNV---------RNNK 89
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237962 1428 EIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDE 1502
Cdd:COG1579 90 EYEALQKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAE 164
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
1386-2022 |
1.97e-05 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 50.88 E-value: 1.97e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1386 AQGLQR---RMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATER 1462
Cdd:pfam05483 73 SEGLSRlysKLYKEAEKIKKWKVSIEAELKQKENKLQENRKIIEAQRKAIQELQFENEKVSLKLEEEIQENKDLIKENNA 152
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1463 QRGGAEgELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAElaLRVQAEaeaarekqralQALEELRLQAE 1542
Cdd:pfam05483 153 TRHLCN-LLKETCARSAEKTKKYEYEREETRQVYMDLNNNIEKMILAFEE--LRVQAE-----------NARLEMHFKLK 218
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1543 EAERRLRQAEAERARQV-----QVALETAQrSAEAElqsehasfaEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAE 1617
Cdd:pfam05483 219 EDHEKIQHLEEEYKKEIndkekQVSLLLIQ-ITEKE---------NKMKDLTFLLEESRDKANQLEEKTKLQDENLKELI 288
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1618 RARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQla 1697
Cdd:pfam05483 289 EKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLR-- 366
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1698 egTAQQRLaaeqelirlraetEQGEQQRQLLEEELARLQREAAAATQ----KRRELEaELAKVRAEMEVLLASKARAEEE 1773
Cdd:pfam05483 367 --TEQQRL-------------EKNEDQLKIITMELQKKSSELEEMTKfknnKEVELE-ELKKILAEDEKLLDEKKQFEKI 430
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1774 SRSTSEKSKQRLEAEAGRFRELAEEAARLRALaeEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLkteaeialk 1853
Cdd:pfam05483 431 AEELKGKEQELIFLLQAREKEIHDLEIQLTAI--KTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLL--------- 499
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1854 ekeaENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGL--VEDTLRQRR--------QVEEEI 1923
Cdd:pfam05483 500 ----ENKELTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDELesVREEFIQKGdevkckldKSEENA 575
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1924 LALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKsLAAEEEAARQR--- 2000
Cdd:pfam05483 576 RSIEYEVLKKEKQMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNK-LELELASAKQKfee 654
|
650 660 670
....*....|....*....|....*....|....*...
gi 1920237962 2001 ----------------KAALEEVERLKAKVEEARRLRE 2022
Cdd:pfam05483 655 iidnyqkeiedkkiseEKLLEEVEKAKAIADEAVKLQK 692
|
|
| COG3903 |
COG3903 |
Predicted ATPase [General function prediction only]; |
1533-1971 |
2.10e-05 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443109 [Multi-domain] Cd Length: 933 Bit Score: 51.17 E-value: 2.10e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1533 ALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQ---------LR 1603
Cdd:COG3903 477 AAERLAEAGERAAARRRHADYYLALAERAAAELRGPDQLAWLARLDAEHDNLRAALRWALAHGDAELALrlaaalapfWF 556
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1604 EEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQR 1683
Cdd:COG3903 557 LRGLLREGRRWLERALAAAGEAAAALAAAAALAAAAAAARAAAAAAAAAAAAAAAAAAAAAAAAAALLLLAALAAAAAAA 636
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1684 ELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVL 1763
Cdd:COG3903 637 AAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAALAAAAAALAAAAAAAALAAAAAAALAAAAA 716
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1764 LASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATR 1843
Cdd:COG3903 717 AAAAAAAAAALLAAAAAAALAAAAAAAALALAAAAAAAAAAAAAAALAAAAAAAALAALLLALAAAAAALAAAAAAAAAA 796
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1844 LKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEI 1923
Cdd:COG3903 797 AAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAALAAALAAAAAAAAAAAAAAAAAAALAAALAAAAAAAAAAALAAAAAA 876
|
410 420 430 440
....*....|....*....|....*....|....*....|....*...
gi 1920237962 1924 LALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQL 1971
Cdd:COG3903 877 AAAAAAALLAAAAAAAAAAAAAAAAAAALAAAAAAAAAAALAAAAAAA 924
|
|
| CHASE3 |
COG5278 |
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms]; |
1347-1775 |
2.21e-05 |
|
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
Pssm-ID: 444089 [Multi-domain] Cd Length: 530 Bit Score: 50.68 E-value: 2.21e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1347 AEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSE 1426
Cdd:COG5278 84 ARAEIDELLAELRSLTADNPEQQARLDELEALIDQWLAELEQVIALRRAGGLEAALALVRSGEGKALMDEIRARLLLLAL 163
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1427 AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRK 1506
Cdd:COG5278 164 ALAALLLAAAALLLLLLALAALLALAELLLLALARALAALLLLLLLEAELAAAAALLAAAAALAALAALELLAALALALA 243
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1507 RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTA 1586
Cdd:COG5278 244 LLLAALLLALLAALALAALLAAALLALAALLLALAAAAALAAAAALELAAAEALALAELELELLLAAAAAAAAAAAAAAA 323
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1587 QLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAE 1666
Cdd:COG5278 324 ALAALLALALATALAAAAAALALLAALLAEAAAAAAEEAEAAAEAAAAALAGLAEVEAEGAAEAVELEVLAIAAAAAAAA 403
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1667 REARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKR 1746
Cdd:COG5278 404 AEAAAAAAAAAAASAAEALELAEALAEALALAEEEALALAAASSELAEAGAALALAAAEALAEELAAVAALAALAAAAAA 483
|
410 420
....*....|....*....|....*....
gi 1920237962 1747 RELEAELAKVRAEMEVLLASKARAEEESR 1775
Cdd:COG5278 484 LAEAEAAAALAAAAALSLALALAALLLAA 512
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1372-1587 |
2.28e-05 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 50.21 E-value: 2.28e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1372 AEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLrqssEAEIQAKARQVEAAERsrlRIEEEIR 1451
Cdd:COG3883 14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEAL----QAEIDKLQAEIAEAEA---EIEERRE 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1452 VVRLQLEATERQrGGAEGELQAL--------------------RARAEEAEAQKrQAQEEAERLRRQVQDETQRKRQAEA 1511
Cdd:COG3883 87 ELGERARALYRS-GGSVSYLDVLlgsesfsdfldrlsalskiaDADADLLEELK-ADKAELEAKKAELEAKLAELEALKA 164
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 1512 ELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQ 1587
Cdd:COG3883 165 ELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 240
|
|
| MARTX_Nterm |
NF012221 |
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model ... |
2153-2358 |
2.30e-05 |
|
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model describes the N-terminal 1900 amino acids of MARTX family multifunctional-autoprocessing repeats-in-toxin holotoxins, which contain both repeat regions that facilitate their entry into eukaryotic target cells, and multiple effector domains.
Pssm-ID: 467957 [Multi-domain] Cd Length: 1848 Bit Score: 50.99 E-value: 2.30e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2153 LRQKQAADA-----EMEKHKQFAEQALRQKaqveqeltalrlQLEETDH---------QKSILDEELQRLKAEVTEAARQ 2218
Cdd:NF012221 1561 LADKERAEAdrqrlEQEKQQQLAAISGSQS------------QLESTDQnaletngqaQRDAILEESRAVTKELTTLAQG 1628
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2219 RGQVEEElfslRVQMEELGKlKARIEAENRAL--VLRDKDSAQRLLQEEAEKMKQ--------VAEEAARLSVAAQEAAR 2288
Cdd:NF012221 1629 LDALDSQ----ATYAGESGD-QWRNPFAGGLLdrVQEQLDDAKKISGKQLADAKQrhvdnqqkVKDAVAKSEAGVAQGEQ 1703
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2289 LRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQ 2358
Cdd:NF012221 1704 NQANAEQDIDDAKADAEKRKDDALAKQNEAQQAESDANAAANDAQSRGEQDASAAENKANQAQADAKGAK 1773
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
2153-2600 |
2.38e-05 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 50.59 E-value: 2.38e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2153 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTalrlQLEETDHQKSILDEELQRLKAEVTEAARQRGQVE-----EELF 2227
Cdd:pfam10174 58 LKEQYRVTQEENQHLQLTIQALQDELRAQRDLN----QLLQQDFTTSPVDGEDKFSTPELTEENFRRLQSEherqaKELF 133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2228 SLRVQMEELgklKARIEAENRALVLRDkDSAQRLL---QEEAEKMKQVAEEAARLSVAAQEAARLRQLaeEDLAQQRALA 2304
Cdd:pfam10174 134 LLRKTLEEM---ELRIETQKQTLGARD-ESIKKLLemlQSKGLPKKSGEEDWERTRRIAEAEMQLGHL--EVLLDQKEKE 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2305 EKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLqedkEQMAQQLAQETQgfqkTLETErqrqLEMSAEAERLRLR 2384
Cdd:pfam10174 208 NIHLREELHRRNQLQPDPAKTKALQTVIEMKDTKISSL----ERNIRDLEDEVQ----MLKTN----GLLHTEDREEEIK 275
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2385 VAEMSRAQARaeedarrFRK-QAEDIGERLYRTE---LATQEKVmlvQTLETQRQQSDRDAERLREAIAELEHEKDKLKQ 2460
Cdd:pfam10174 276 QMEVYKSHSK-------FMKnKIDQLKQELSKKEselLALQTKL---ETLTNQNSDCKQHIEVLKESLTAKEQRAAILQT 345
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2461 EAQLLQLKSEEMQTV-----RQEQLLQETQALQQSFLSE-KDSLLQRERCIEQEKAKLEQL---FQDEVAKAQALREEQQ 2531
Cdd:pfam10174 346 EVDALRLRLEEKESFlnkktKQLQDLTEEKSTLAGEIRDlKDMLDVKERKINVLQKKIENLqeqLRDKDKQLAGLKERVK 425
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 2532 RQQQQMQQEKQQLAaSMEEARRrqhEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEE 2600
Cdd:pfam10174 426 SLQTDSSNTDTALT-TLEEALS---EKERIIERLKEQREREDRERLEELESLKKENKDLKEKVSALQPE 490
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3793-3829 |
2.50e-05 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 44.01 E-value: 2.50e-05
10 20 30
....*....|....*....|....*....|....*..
gi 1920237962 3793 LRLLDAQLATGGIVDPRLGFHLPLDVAYQRGYLDKDT 3829
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
|
|
| CHASE3 |
COG5278 |
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms]; |
1631-2049 |
2.59e-05 |
|
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
Pssm-ID: 444089 [Multi-domain] Cd Length: 530 Bit Score: 50.29 E-value: 2.59e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1631 QLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQE 1710
Cdd:COG5278 105 QQARLDELEALIDQWLAELEQVIALRRAGGLEAALALVRSGEGKALMDEIRARLLLLALALAALLLAAAALLLLLLALAA 184
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1711 LIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAG 1790
Cdd:COG5278 185 LLALAELLLLALARALAALLLLLLLEAELAAAAALLAAAAALAALAALELLAALALALALLLAALLLALLAALALAALLA 264
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1791 RFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEA 1870
Cdd:COG5278 265 AALLALAALLLALAAAAALAAAAALELAAAEALALAELELELLLAAAAAAAAAAAAAAAALAALLALALATALAAAAAAL 344
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1871 FQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGT 1950
Cdd:COG5278 345 ALLAALLAEAAAAAAEEAEAAAEAAAAALAGLAEVEAEGAAEAVELEVLAIAAAAAAAAAEAAAAAAAAAAASAAEALEL 424
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1951 AEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESAR 2030
Cdd:COG5278 425 AEALAEALALAEEEALALAAASSELAEAGAALALAAAEALAEELAAVAALAALAAAAAALAEAEAAAALAAAAALSLALA 504
|
410
....*....|....*....
gi 1920237962 2031 QLQLAQEAAQKRLQAEEKA 2049
Cdd:COG5278 505 LAALLLAAAEAALAAALAA 523
|
|
| PspC_subgroup_1 |
NF033838 |
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... |
1265-1596 |
2.82e-05 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 50.40 E-value: 2.82e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1265 QRFAKQYINAIKDYelqlvtYKAQLEPVASPAKKPKvQSGSESIIQEYVDLRTRY-SELSTLTSQyirfiSETLRRMEEE 1343
Cdd:NF033838 53 NESQKEHAKEVESH------LEKILSEIQKSLDKRK-HTQNVALNKKLSDIKTEYlYELNVLKEK-----SEAELTSKTK 120
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1344 ERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQ----RRMQEEVArreEVAVEAQEQKRSIQEELQ 1419
Cdd:NF033838 121 KELDAAFEQFKKDTLEPGKKVAEATKKVEEAEKKAKDQKEEDRRNYPtntyKTLELEIA---ESDVEVKKAELELVKEEA 197
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1420 HLRQSSEAEIQAKAR-QVEAAERSRLrieEEIRVVRLQLEATERQRGGAE-GELQALRARAEEAEAQKRQA--------- 1488
Cdd:NF033838 198 KEPRDEEKIKQAKAKvESKKAEATRL---EKIKTDREKAEEEAKRRADAKlKEAVEKNVATSEQDKPKRRAkrgvlgepa 274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1489 -----QEEAERLRRQVQDET-------QRKRQAEAE-LALRVQAEAEAAREKQR---ALQALEELRLQAEEAERRLRQAE 1552
Cdd:NF033838 275 tpdkkENDAKSSDSSVGEETlpspslkPEKKVAEAEkKVEEAKKKAKDQKEEDRrnyPTNTYKTLELEIAESDVKVKEAE 354
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 1920237962 1553 A----ERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEH 1596
Cdd:NF033838 355 LelvkEEAKEPRNEEKIKQAKAKVESKKAEATRLEKIKTDRKKAEEEA 402
|
|
| COG5281 |
COG5281 |
Phage-related minor tail protein [Mobilome: prophages, transposons]; |
2232-2612 |
2.88e-05 |
|
Phage-related minor tail protein [Mobilome: prophages, transposons];
Pssm-ID: 444092 [Multi-domain] Cd Length: 603 Bit Score: 50.38 E-value: 2.88e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2232 QMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEK 2311
Cdd:COG5281 1 AAALAAAAALAAAAAAAAASAAAAAAAAALAAAAAAAAAAAGLAAAAAAAAAASLAAAAAAAALAAAAAAAAAAAAADAL 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2312 MQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRA 2391
Cdd:COG5281 81 AAALAEDAAAAAAAAEAALAALAAAALALAAAALAEAALAAAAAAAAAAAAAAAAAAAAAAAAAEAAKAAAAAAAAAALA 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2392 QARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEE 2471
Cdd:COG5281 161 AAAAAAAAAAAAAAAAAALAAASAAAAAAAAKAAAEAAAEAAAAAEAAAAAAAAAAEAAAAEAQALAAAALAEQAALAAA 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2472 MQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEA 2551
Cdd:COG5281 241 SAAAQALAALAAAAAAAALALAAAAELALTAQAEAAAAAAAAAAAAAQAAEAAAAAAEAQALAAAAAAAAAQLAAAAAAA 320
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 2552 R-RRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEEIA 2612
Cdd:COG5281 321 AqALRAAAQALAALAQRALAAAALAAAAQEAALAAAAAALQAALEAAAAAAAAELAAAGDWA 382
|
|
| Crescentin |
pfam19220 |
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament ... |
2153-2455 |
3.12e-05 |
|
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament proteins, named crescentin, whose cytoskeletal function is required for the vibrioid and helical shapes of Caulobacter crescentus. Without crescentin, the cells adopt a straight-rod morphology. Crescentin has characteriztic features of IF proteins including the ability to assemble into filaments in vitro without energy or cofactor requirements. In vivo, crescentin forms a helical structure that colocalizes with the inner cell curvatures beneath the cytoplasmic membrane.
Pssm-ID: 437057 [Multi-domain] Cd Length: 401 Bit Score: 49.68 E-value: 3.12e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2153 LRQKQAADAEMEKHkqfAEQALRQKAQVEQELTALRLQLEETDHQksildeeLQRLKAEVTEAARQRGQVEEELFSLRVQ 2232
Cdd:pfam19220 113 LRDKTAQAEALERQ---LAAETEQNRALEEENKALREEAQAAEKA-------LQRAEGELATARERLALLEQENRRLQAL 182
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2233 MEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEkmkqvaeeaarlsVAAQEAARLRQLAEEDLAQQRALAEKM-LKEK 2311
Cdd:pfam19220 183 SEEQAAELAELTRRLAELETQLDATRARLRALEGQ-------------LAAEQAERERAEAQLEEAVEAHRAERAsLRMK 249
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2312 MQAVQeaTRLKAEAELLQQQKELAQEQARRLQEdKEQMAQQLAQETQGFQKTLEterqrqlEMSAEAERLRLRVAEMSRA 2391
Cdd:pfam19220 250 LEALT--ARAAATEQLLAEARNQLRDRDEAIRA-AERRLKEASIERDTLERRLA-------GLEADLERRTQQFQEMQRA 319
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237962 2392 QARAEEDARRFRKQAEDIGERLYRTE---LATQEKV-MLVQTLETQRQQSDRDAERLREaiaELEHEK 2455
Cdd:pfam19220 320 RAELEERAEMLTKALAAKDAALERAEeriASLSDRIaELTKRFEVERAALEQANRRLKE---ELQRER 384
|
|
| PRK11637 |
PRK11637 |
AmiB activator; Provisional |
2169-2400 |
3.18e-05 |
|
AmiB activator; Provisional
Pssm-ID: 236942 [Multi-domain] Cd Length: 428 Bit Score: 49.69 E-value: 3.18e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2169 FAEQALRQKAQ---VEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQrgqveeelfsLRVQMEELGKLKARIEA 2245
Cdd:PRK11637 38 FSAHASDNRDQlksIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAISQASRK----------LRETQNTLNQLNKQIDE 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2246 ENRalvlrdkdSAQRLLQEEAEKMKqvaeeaarlSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQA----VQEAtRL 2321
Cdd:PRK11637 108 LNA--------SIAKLEQQQAAQER---------LLAAQLDAAFRQGEHTGLQLILSGEESQRGERILAyfgyLNQA-RQ 169
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2322 KAEAELLQQQKELAQEqaRRLQEDKEQMAQQLAQETQGFQKTLETER-----------------QRQL-EMSAEAERLRL 2383
Cdd:PRK11637 170 ETIAELKQTREELAAQ--KAELEEKQSQQKTLLYEQQAQQQKLEQARnerkktltglesslqkdQQQLsELRANESRLRD 247
|
250
....*....|....*...
gi 1920237962 2384 RVAEMSR-AQARAEEDAR 2400
Cdd:PRK11637 248 SIARAEReAKARAEREAR 265
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1406-1543 |
3.28e-05 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 48.38 E-value: 3.28e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1406 EAQEQKRSIQEELQHLRQ---SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGA---------EGELQA 1473
Cdd:COG1579 21 RLEHRLKELPAELAELEDelaALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVrnnkeyealQKEIES 100
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1474 LRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEE 1543
Cdd:COG1579 101 LKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAEREELAA 170
|
|
| PspA |
COG1842 |
Phage shock protein A [Transcription, Signal transduction mechanisms]; |
1674-1896 |
3.46e-05 |
|
Phage shock protein A [Transcription, Signal transduction mechanisms];
Pssm-ID: 441447 [Multi-domain] Cd Length: 217 Bit Score: 48.28 E-value: 3.46e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1674 KAEEQAVRQRELAEQELEK-QRQLAEgtaqqrlaAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRE-LEA 1751
Cdd:COG1842 16 ALLDKAEDPEKMLDQAIRDmEEDLVE--------ARQALAQVIANQKRLERQLEELEAEAEKWEEKARLALEKGREdLAR 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1752 ELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVL 1831
Cdd:COG1842 88 EALERKAELEAQAEALEAQLAQLEEQVEKLKEALRQLESKLEELKAKKDTLKARAKAAKAQEKVNEALSGIDSDDATSAL 167
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237962 1832 AEklaaiseatrlkteAEIALKEKEAENERLRRLAEDEAFQRRLLEeqaAQHKADIEARLAQLRK 1896
Cdd:COG1842 168 ER--------------MEEKIEEMEARAEAAAELAAGDSLDDELAE---LEADSEVEDELAALKA 215
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
858-1508 |
3.71e-05 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 50.35 E-value: 3.71e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 858 NQEAQEAIARlEAQHQALVALWhQLHTEMKSLLAWQSLGRDMQlirsWSLATFRTLKPEE------------QRQALRsL 925
Cdd:TIGR00618 223 VLEKELKHLR-EALQQTQQSHA-YLTQKREAQEEQLKKQQLLK----QLRARIEELRAQEavleetqerinrARKAAP-L 295
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 926 ELHYQAFLRDSQDAGGFGPEDRLQaEREYGSCSRHYQQLLQSLEQGEQEESRCQRCISELKDIRLQLEACETRTVHRLRL 1005
Cdd:TIGR00618 296 AAHIKAVTQIEQQAQRIHTELQSK-MRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQ 374
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1006 PLDKEPARECAQRIT----EQQKAQAEVDGLGKGVARLSAEAEKVLALPEPSPAAptlRSELELTLGKLEQVRSLSAIYL 1081
Cdd:TIGR00618 375 HTLTQHIHTLQQQKTtltqKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHA---KKQQELQQRYAELCAAAITCTA 451
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1082 EKLKtisLVIRSTQEAEEVLRAHEEQLKEAQAVpaTLPELEATKAALKKLRAQAEAQQPVFDALR---------DELRGA 1152
Cdd:TIGR00618 452 QCEK---LEKIHLQESAQSLKEREQQLQTKEQI--HLQETRKKAVVLARLLELQEEPCPLCGSCIhpnparqdiDNPGPL 526
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1153 QEVGERLQQRHGERDVEVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLgawlrdakqRQEQIQAV 1232
Cdd:TIGR00618 527 TRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNL---------QNITVRLQ 597
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1233 PLansqaVREQLRQEKALLEDIERHGEKVEECQRFAK--QYINAIKDYELQLVTYKAQLEpvASPAKKPKVQSGSESIIQ 1310
Cdd:TIGR00618 598 DL-----TEKLSEAEDMLACEQHALLRKLQPEQDLQDvrLHLQQCSQELALKLTALHALQ--LTLTQERVREHALSIRVL 670
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1311 EYVDLRTRYSELSTLTSQY--IRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQG 1388
Cdd:TIGR00618 671 PKELLASRQLALQKMQSEKeqLTYWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELMH 750
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1389 LQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAE 1468
Cdd:TIGR00618 751 QARTVLKARTEAHFNNNEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEE 830
|
650 660 670 680
....*....|....*....|....*....|....*....|
gi 1920237962 1469 GELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQ 1508
Cdd:TIGR00618 831 EQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQAK 870
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
1095-1594 |
3.84e-05 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 50.34 E-value: 3.84e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1095 QEAEEVLRAHEEQlkeaQAVPATLPELEATKAALKK-LRAQAEAQQpvfdaLRDELRGAQEVG-------ERLQQRHGER 1166
Cdd:PRK04863 496 DVARELLRRLREQ----RHLAEQLQQLRMRLSELEQrLRQQQRAER-----LLAEFCKRLGKNlddedelEQLQEELEAR 566
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1167 ----DVEVERWRERVTLLlerwQAVLAQTDVRQRELEQLGRQLRYYRESADPL----GAWLRDAKQRQEQIQ--AVPLAN 1236
Cdd:PRK04863 567 leslSESVSEARERRMAL----RQQLEQLQARIQRLAARAPAWLAAQDALARLreqsGEEFEDSQDVTEYMQqlLERERE 642
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1237 SQAVREQLRQEK-ALLEDIER----HGEKVEECQRFAKQ-------------------YINA-------------IKDYE 1279
Cdd:PRK04863 643 LTVERDELAARKqALDEEIERlsqpGGSEDPRLNALAERfggvllseiyddvsledapYFSAlygparhaivvpdLSDAA 722
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1280 LQLVT--------YKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSelstltsqyiRFISETL-RRMEEEERLAEQQ 1350
Cdd:PRK04863 723 EQLAGledcpedlYLIEGDPDSFDDSVFSVEELEKAVVVKIADRQWRYS----------RFPEVPLfGRAAREKRIEQLR 792
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1351 RaeERERLAEveaalekqrqlaeahaqAKAQAEREAQGLQRRMQE---------EVARREEVAVEAQEQKRSIQEELQHL 1421
Cdd:PRK04863 793 A--EREELAE-----------------RYATLSFDVQKLQRLHQAfsrfigshlAVAFEADPEAELRQLNRRRVELERAL 853
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1422 RQSSEAEIQAKARQVEAAERSRL--------------RIEEEIRVVRLQLEATE------RQRGGA----EGELQALRAR 1477
Cdd:PRK04863 854 ADHESQEQQQRSQLEQAKEGLSAlnrllprlnlladeTLADRVEEIREQLDEAEeakrfvQQHGNAlaqlEPIVSVLQSD 933
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1478 AEEAEAQKR---QAQEEAERLRRQVQDET---QRKRQAEAELALRVQAEAEAAREKQRalQALEELRLQAEEAERRLRQA 1551
Cdd:PRK04863 934 PEQFEQLKQdyqQAQQTQRDAKQQAFALTevvQRRAHFSYEDAAEMLAKNSDLNEKLR--QRLEQAEQERTRAREQLRQA 1011
|
570 580 590 600
....*....|....*....|....*....|....*....|...
gi 1920237962 1552 EAERARQVQValetaqrsaEAELQSEHASFAEKTAQLERTLKE 1594
Cdd:PRK04863 1012 QAQLAQYNQV---------LASLKSSYDAKRQMLQELKQELQD 1045
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1399-1752 |
3.87e-05 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 49.52 E-value: 3.87e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1399 RREEVAVEAQEQKRSIQEELQHLRQsseaEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARA 1478
Cdd:COG4372 21 KTGILIAALSEQLRKALFELDKLQE----ELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAEL 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1479 EEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQ 1558
Cdd:COG4372 97 AQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQAL 176
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1559 VQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEAL 1638
Cdd:COG4372 177 SEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVI 256
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1639 RLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAET 1718
Cdd:COG4372 257 LKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILL 336
|
330 340 350
....*....|....*....|....*....|....
gi 1920237962 1719 EQGEQQRQLLEEELARLQREAAAATQKRRELEAE 1752
Cdd:COG4372 337 AELADLLQLLLVGLLDNDVLELLSKGAEAGVADG 370
|
|
| PTZ00491 |
PTZ00491 |
major vault protein; Provisional |
1383-1577 |
3.88e-05 |
|
major vault protein; Provisional
Pssm-ID: 240439 [Multi-domain] Cd Length: 850 Bit Score: 50.02 E-value: 3.88e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1383 EREAQGLQRRMQEEVarreEVAVEAQEQKRSIQEELQhlrqsseaEIQAKARqveaAERSRL--RIE-EEIRVVRLQLEA 1459
Cdd:PTZ00491 643 ERTRDSLQKSVQLAI----EITTKSQEAAARHQAELL--------EQEARGR----LERQKMhdKAKaEEQRTKLLELQA 706
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1460 TerqrgGAEGELQAlRARAE-EAEAQKRQAQEEAErlrrqVQDETQRKRqaeaelALRVQAEAEAAREKQRALQALEELR 1538
Cdd:PTZ00491 707 E-----SAAVESSG-QSRAEaLAEAEARLIEAEAE-----VEQAELRAK------ALRIEAEAELEKLRKRQELELEYEQ 769
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 1920237962 1539 LQAE---EAERRLRQAEAERARQVQVAL--ETAQRSAEA--ELQSE 1577
Cdd:PTZ00491 770 AQNEleiAKAKELADIEATKFERIVEALgrETLIAIARAgpELQAK 815
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
946-1516 |
3.99e-05 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 50.12 E-value: 3.99e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 946 DRLQAEREYGSCSRHYQQLLQSLEQGEQEESrcqrcISELKDIRL-QLEACETRTVHRLRlpldkepaRECAQRITEQQK 1024
Cdd:pfam15921 364 ERDQFSQESGNLDDQLQKLLADLHKREKELS-----LEKEQNKRLwDRDTGNSITIDHLR--------RELDDRNMEVQR 430
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1025 AQAEVDGL-----GKGVARLSAEAEKVLALPEPSPAAPTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEE 1099
Cdd:pfam15921 431 LEALLKAMksecqGQMERQMAAIQGKNESLEKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKER 510
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1100 VlraheeqlkeaqavpatlpeLEATKAALKKLRAQAEAQQPVFDALRDE---LRGAQEVGERLQQRHGERDVEVERWRER 1176
Cdd:pfam15921 511 A--------------------IEATNAEITKLRSRVDLKLQELQHLKNEgdhLRNVQTECEALKLQMAEKDKVIEILRQQ 570
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1177 VTLLLE-------RWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLR--DAKQRQEQIQAVPLANSQAvrEQLRQE 1247
Cdd:pfam15921 571 IENMTQlvgqhgrTAGAMQVEKAQLEKEINDRRLELQEFKILKDKKDAKIRelEARVSDLELEKVKLVNAGS--ERLRAV 648
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1248 KALLEDIERHGEKVEECQrfaKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQsgsesiiqeyvdLRTRYSELSTLTS 1327
Cdd:pfam15921 649 KDIKQERDQLLNEVKTSR---NELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQ------------LKSAQSELEQTRN 713
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1328 qyirfiseTLRRMEEEERLAeqqraeererlaeVEAALEKQRQLAEAHAQAKAqaereaqglqrrMQEEVARREEVAVEA 1407
Cdd:pfam15921 714 --------TLKSMEGSDGHA-------------MKVAMGMQKQITAKRGQIDA------------LQSKIQFLEEAMTNA 760
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1408 QEQKRSIQEELQHLRQsseaeiqakarqveaaersrlrieeeirvvRLQLEATERQRggAEGELQALRA---RAEEAEAQ 1484
Cdd:pfam15921 761 NKEKHFLKEEKNKLSQ------------------------------ELSTVATEKNK--MAGELEVLRSqerRLKEKVAN 808
|
570 580 590
....*....|....*....|....*....|..
gi 1920237962 1485 KRQAQEEAERLRRQVQDETQRKRQAEAELALR 1516
Cdd:pfam15921 809 MEVALDKASLQFAECQDIIQRQEQESVRLKLQ 840
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
949-1387 |
4.13e-05 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 49.77 E-value: 4.13e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 949 QAEREYGSCSRHYQQLLQSLEQGEQEESRCQRCISELKDIRLQLEACE-----TRTVHRLRLPLDKEPAR--ECAQRITE 1021
Cdd:COG4717 78 EELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLqllplYQELEALEAELAELPERleELEERLEE 157
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1022 QQKAQAEVDGLGKGVARLSAEAEKVLALPEPSpaaptLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVL 1101
Cdd:COG4717 158 LRELEEELEELEAELAELQEELEELLEQLSLA-----TEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQL 232
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1102 R------AHEEQLKEAQAVPATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRE 1175
Cdd:COG4717 233 EneleaaALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGKEAEELQALPA 312
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1176 RVTLLLERWQAVLAQTDVRQREleqlgrQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEkALLEDIE 1255
Cdd:COG4717 313 LEELEEEELEELLAALGLPPDL------SPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAE-AGVEDEE 385
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1256 RHGEKVEECQRFaKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELstltSQYIRFISE 1335
Cdd:COG4717 386 ELRAALEQAEEY-QELKEELEELEEQLEELLGELEELLEALDEEELEEELEELEEELEELEEELEEL----REELAELEA 460
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....
gi 1920237962 1336 TLRRMEEEERLAE--QQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQ 1387
Cdd:COG4717 461 ELEQLEEDGELAEllQELEELKAELRELAEEWAALKLALELLEEAREEYREERL 514
|
|
| CH_PLS2_rpt3 |
cd21330 |
third calponin homology (CH) domain found in plastin-2; Plastin-2, also called L-plastin, or ... |
39-159 |
4.36e-05 |
|
third calponin homology (CH) domain found in plastin-2; Plastin-2, also called L-plastin, or LC64P, or lymphocyte cytosolic protein 1 (LCP-1), is an actin-binding protein that plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-2 contains four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409179 Cd Length: 125 Bit Score: 46.14 E-value: 4.36e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 39 ERDRVQKKTFTKWVNKhlIKAQRHISDLYEDLRDGHNLISLLEVLSgdsLPRERDVIRSSRLPREKGRMRfhKLQNVQIA 118
Cdd:cd21330 9 EGETREERTFRNWMNS--LGVNPRVNHLYSDLSDALVIFQLYEKIK---VPVDWNRVNKPPYPKLGENMK--KLENCNYA 81
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 1920237962 119 LDYLRHR-QVKLVNIRNDDIADGNPKLTLGLIWTIILHFQIS 159
Cdd:cd21330 82 VELGKNKaKFSLVGIAGQDLNEGNRTLTLALIWQLMRRYTLN 123
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3346-3384 |
4.47e-05 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 43.09 E-value: 4.47e-05
10 20 30
....*....|....*....|....*....|....*....
gi 1920237962 3346 LLQGSGCLAGIYLEDSKEKVTIYEAMRRGLLRPSTATLL 3384
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3346-3381 |
4.73e-05 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 43.24 E-value: 4.73e-05
10 20 30
....*....|....*....|....*....|....*.
gi 1920237962 3346 LLQGSGCLAGIYLEDSKEKVTIYEAMRRGLLRPSTA 3381
Cdd:smart00250 3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| Caldesmon |
pfam02029 |
Caldesmon; |
1333-1557 |
4.87e-05 |
|
Caldesmon;
Pssm-ID: 460421 [Multi-domain] Cd Length: 495 Bit Score: 49.48 E-value: 4.87e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1333 ISETLRRMEEEERLAEQQRAEERERLAEVEAAlEKQRQLAEAHAQAKAQAEREAQglqRRMQEEVARREEVavEAQEQKR 1412
Cdd:pfam02029 100 VAERKENNEEEENSSWEKEEKRDSRLGRYKEE-ETEIREKEYQENKWSTEVRQAE---EEGEEEEDKSEEA--EEVPTEN 173
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1413 SIQEELQHLRQSSEAEIQAKARQVEaaERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKrQAQEEA 1492
Cdd:pfam02029 174 FAKEEVKDEKIKKEKKVKYESKVFL--DQKRGHPEVKSQNGEEEVTKLKVTTKRRQGGLSQSQEREEEAEVFL-EAEQKL 250
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 1493 ERLRRQVQD------ETQRKRQAEAELALrvqAEAEAAREKQRAL--------QALEELRLQAEEAERRLRQAEAERAR 1557
Cdd:pfam02029 251 EELRRRRQEkeseefEKLRQKQQEAELEL---EELKKKREERRKLleeeeqrrKQEEAERKLREEEEKRRMKEEIERRR 326
|
|
| TolA |
COG3064 |
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis]; |
1347-1744 |
5.00e-05 |
|
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442298 [Multi-domain] Cd Length: 485 Bit Score: 49.27 E-value: 5.00e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1347 AEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEElqhlrqsse 1426
Cdd:COG3064 4 ALEEKAAEAAAQERLEQAEAEKRAAAEAEQKAKEEAEEERLAELEAKRQAEEEAREAKAEAEQRAAELAAE--------- 74
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1427 aeiqaKARQVEAAERSRLRIEEEIRvvrlqlEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRK 1506
Cdd:COG3064 75 -----AAKKLAEAEKAAAEAEKKAA------AEKAKAAKEAEAAAAAEKAAAAAEKEKAEEAKRKAEEEAKRKAEEERKA 143
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1507 RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTA 1586
Cdd:COG3064 144 AEAEAAAKAEAEAARAAAAAAAAAAAAAARAAAGAAAALVAAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAA 223
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1587 QLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAE 1666
Cdd:COG3064 224 RAAAASREAALAAVEATEEAALGGAEEAADLAAVGVLGAALAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAAL 303
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237962 1667 REARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQ 1744
Cdd:COG3064 304 AAELLGAVAAEEAVLAAAAAAGALVVRGGGAASLEAALSLLAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLA 381
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1823-2213 |
5.18e-05 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 49.74 E-value: 5.18e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1823 QRAEAERVLAEKLAAIsEATRLKTEAEialkEKEAENERLRRLAEDEAFQRRLLEEQAAqhkadIEARLAQLRKASESEL 1902
Cdd:pfam17380 281 QKAVSERQQQEKFEKM-EQERLRQEKE----EKAREVERRRKLEEAEKARQAEMDRQAA-----IYAEQERMAMEREREL 350
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1903 ERQKglVEDTLRQRRQVEEEilalkgsfekaaagkaELELELGRIRGTaeDTLRSKEQAEQEAARQRqlaaeeerrrrea 1982
Cdd:pfam17380 351 ERIR--QEERKRELERIRQE----------------EIAMEISRMREL--ERLQMERQQKNERVRQE------------- 397
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1983 EERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLR-ERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKeqel 2061
Cdd:pfam17380 398 LEAARKVKILEEERQRKIQQQKVEMEQIRAEQEEARQREvRRLEEERAREMERVRLEEQERQQQVERLRQQEEERK---- 473
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2062 qqtlqqeqsvleRLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQE 2141
Cdd:pfam17380 474 ------------RKKLELEKEKRDRKRAEEQRRKILEKELEERKQAMIEEERKRKLLEKEMEERQKAIYEEERRREAEEE 541
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 2142 aarraqaeqaalRQKQaadAEMEKHKQFAEQALRqkaqVEQELTALRLQLEETDHQKSILDEELQRLKAEVT 2213
Cdd:pfam17380 542 ------------RRKQ---QEMEERRRIQEQMRK----ATEERSRLEAMEREREMMRQIVESEKARAEYEAT 594
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1513-1782 |
5.35e-05 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 48.99 E-value: 5.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1513 LALRVQAEAEAAREKQRALQALEElRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTL 1592
Cdd:COG4942 11 LALAAAAQADAAAEAEAELEQLQQ-EIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELE 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1593 KEEHVAVVQLREeatrraqqqaeaeRARAEAERELERWQLKANEALRLRLQAEEVAQqksltqaeaekqKEEAEREARRR 1672
Cdd:COG4942 90 KEIAELRAELEA-------------QKEELAELLRALYRLGRQPPLALLLSPEDFLD------------AVRRLQYLKYL 144
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1673 GKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAE 1752
Cdd:COG4942 145 APARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEE 224
|
250 260 270
....*....|....*....|....*....|
gi 1920237962 1753 LAKVRAEMEVLLASKARAEEESRSTSEKSK 1782
Cdd:COG4942 225 LEALIARLEAEAAAAAERTPAAGFAALKGK 254
|
|
| PLEC |
smart00250 |
Plectin repeat; |
4132-4160 |
5.38e-05 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 42.85 E-value: 5.38e-05
10 20
....*....|....*....|....*....
gi 1920237962 4132 VRKRRVVIVDPETGKEMSVYEAYRKGLID 4160
Cdd:smart00250 6 AQSAIGGIIDPETGQKLSVEEALRRGLID 34
|
|
| MAD |
pfam05557 |
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ... |
1081-1549 |
5.40e-05 |
|
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
Pssm-ID: 461677 [Multi-domain] Cd Length: 660 Bit Score: 49.35 E-value: 5.40e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1081 LEKLKTISLVIRSTQEAEEVLRAHEEQLKEAQAVPATLPELEATKAALkklraQAEAQQpVFDALRDELRGAQEVGERLQ 1160
Cdd:pfam05557 51 QELQKRIRLLEKREAEAEEALREQAELNRLKKKYLEALNKKLNEKESQ-----LADARE-VISCLKNELSELRRQIQRAE 124
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1161 QRHGERDVEVERWRERVTLLLERWQAV---LAQTDVRQREL---EQLGRQLRYYRESADPLGAWLRDAKQRQEQIqavpl 1234
Cdd:pfam05557 125 LELQSTNSELEELQERLDLLKAKASEAeqlRQNLEKQQSSLaeaEQRIKELEFEIQSQEQDSEIVKNSKSELARI----- 199
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1235 ANSQAVREQLRQEKALLEDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVASPAKK------------PKVQ 1302
Cdd:pfam05557 200 PELEKELERLREHNKHLNENIENKLLLKEEVEDLKRKLEREEKYREEAATLELEKEKLEQELQSwvklaqdtglnlRSPE 279
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1303 SGSESIIQEYVDLRTRYSELSTLTSQyIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEK-QRQL---------- 1371
Cdd:pfam05557 280 DLSRRIEQLQQREIVLKEENSSLTSS-ARQLEKARRELEQELAQYLKKIEDLNKKLKRHKALVRRlQRRVllltkerdgy 358
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1372 ------------AEAHAQAKAQAEREAQGLQRRMQ---EEVARREEVAVEA----QEQKRSIQEELQHLRQ--------S 1424
Cdd:pfam05557 359 railesydkeltMSNYSPQLLERIEEAEDMTQKMQahnEEMEAQLSVAEEElggyKQQAQTLERELQALRQqesladpsY 438
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1425 SEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQ--DE 1502
Cdd:pfam05557 439 SKEEVDSLRRKLETLELERQRLREQKNELEMELERRCLQGDYDPKKTKVLHLSMNPAAEAYQQRKNQLEKLQAEIErlKR 518
|
490 500 510 520
....*....|....*....|....*....|....*....|....*..
gi 1920237962 1503 TQRKRQAEAELALRVQAEAEAAREKQralqaLEELRLQAEEAERRLR 1549
Cdd:pfam05557 519 LLKKLEDDLEQVLRLPETTSTMNFKE-----VLDLRKELESAELKNQ 560
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1643-1819 |
5.62e-05 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 49.03 E-value: 5.62e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1643 QAEEVAQQKsltQAEAEKQKEEAEREArrrgKAEEQAVRQRELAEQELEKQRQLAEGTAQQ----RLAAEQELIRLRAET 1718
Cdd:PRK09510 88 QAEELQQKQ---AAEQERLKQLEKERL----AAQEQKKQAEEAAKQAALKQKQAEEAAAKAaaaaKAKAEAEAKRAAAAA 160
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1719 EQGEQQRQLLEEELArlQREAAAATQKRRELEA-----ELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfr 1793
Cdd:PRK09510 161 KKAAAEAKKKAEAEA--AKKAAAEAKKKAEAEAaakaaAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEA---- 234
|
170 180
....*....|....*....|....*.
gi 1920237962 1794 ELAEEAARLRALAEEAKRQRQLAEED 1819
Cdd:PRK09510 235 KAAAEKAAAAKAAEKAAAAKAAAEVD 260
|
|
| PspA |
COG1842 |
Phage shock protein A [Transcription, Signal transduction mechanisms]; |
1333-1538 |
5.85e-05 |
|
Phage shock protein A [Transcription, Signal transduction mechanisms];
Pssm-ID: 441447 [Multi-domain] Cd Length: 217 Bit Score: 47.51 E-value: 5.85e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1333 ISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVAR-REEVAVEAQEQK 1411
Cdd:COG1842 14 INALLDKAEDPEKMLDQAIRDMEEDLVEARQALAQVIANQKRLERQLEELEAEAEKWEEKARLALEKgREDLAREALERK 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1412 RSIQEELQHLRQsseaeiqakarQVEAAERSRLRIEEEIRVVRLQLEATERQRggaegelQALRARAEEAEAQKR----- 1486
Cdd:COG1842 94 AELEAQAEALEA-----------QLAQLEEQVEKLKEALRQLESKLEELKAKK-------DTLKARAKAAKAQEKvneal 155
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1487 ------QAQEEAERLRRQVqDETQRKRQAEAELALR--VQAEAEAAREKQRALQALEELR 1538
Cdd:COG1842 156 sgidsdDATSALERMEEKI-EEMEARAEAAAELAAGdsLDDELAELEADSEVEDELAALK 214
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1424-1637 |
6.01e-05 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 48.67 E-value: 6.01e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1424 SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQD-- 1501
Cdd:COG3883 13 FADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGEra 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1502 -ETQRKRQAEAELAL------------RVQAEAEAAREKQRALQALEELRLQAEEAerrlrQAEAERARQVQVALETAQR 1568
Cdd:COG3883 93 rALYRSGGSVSYLDVllgsesfsdfldRLSALSKIADADADLLEELKADKAELEAK-----KAELEAKLAELEALKAELE 167
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 1569 SAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEA 1637
Cdd:COG3883 168 AAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAA 236
|
|
| CH_jitterbug-like_rpt3 |
cd21185 |
third calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and ... |
193-270 |
6.07e-05 |
|
third calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and similar proteins; Protein jitterbug (Jbug) is an actin-meshwork organizing protein. It is required to maintain the shape and cell orientation of the Drosophila notum epithelium during flight muscle attachment to tendon cells. Jbug contains three copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409034 Cd Length: 98 Bit Score: 44.60 E-value: 6.07e-05
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237962 193 DNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLLDPEDVDVPQPDEKSIITYVSSL 270
Cdd:cd21185 20 NNFTTDWNDGRLLCGLVNALGGSVPGWPNLDPEESENNIQRGLEAGKS-LGVEPVLTAEEMADPEVEHLGIMAYAAQL 96
|
|
| COG3899 |
COG3899 |
Predicted ATPase [General function prediction only]; |
1697-2193 |
6.09e-05 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443106 [Multi-domain] Cd Length: 1244 Bit Score: 49.47 E-value: 6.09e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1697 AEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV----------RAEMEVLLAS 1766
Cdd:COG3899 737 PDPEEEYRLALLLELAEALYLAGRFEEAEALLERALAARALAALAALRHGNPPASARAYAnlgllllgdyEEAYEFGELA 816
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1767 KARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKT 1846
Cdd:COG3899 817 LALAERLGDRRLEARALFNLGFILHWLGPLREALELLREALEAGLETGDAALALLALAAAAAAAAAAAALAAAAAAAARL 896
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1847 EAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILAL 1926
Cdd:COG3899 897 LAAAAAALAAAAAAAALAAAELARLAAAAAAAAALALAAAAAAAAAAALAAAAAAAALAAALALAAAAAAAAAAALAAAA 976
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1927 KGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEE 2006
Cdd:COG3899 977 AAAAAAAAAAAAAALEAAAAALLALLAAAAAAAAAAAALAAALLAAALAALAAAAAAAALLAAAAALALLAALAAAAAAA 1056
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2007 VERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAA 2086
Cdd:COG3899 1057 AAAAALAAAAALLAAAAAAAAAAAAAAAAAALAAALAAAALAAAAAAALALAAALAALALAAALAALALAAAARAAAALL 1136
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2087 EEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKH 2166
Cdd:COG3899 1137 LLAAALALALAALLLLAALLLALALLLLALAALALAAALAALAAALLAAAAAAAAAAALLAALLALAARLAALLALALLA 1216
|
490 500
....*....|....*....|....*..
gi 1920237962 2167 KQFAEQALRQKAQVEQELTALRLQLEE 2193
Cdd:COG3899 1217 LEAAALLLLLLLAALALAAALLALRLL 1243
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
2153-2603 |
6.11e-05 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 49.45 E-value: 6.11e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2153 LRQKQAADAEMEKHKQFA--EQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQ-RGQVEEELFSL 2229
Cdd:pfam12128 227 IRDIQAIAGIMKIRPEFTklQQEFNTLESAELRLSHLHFGYKSDETLIASRQEERQETSAELNQLLRTlDDQWKEKRDEL 306
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2230 R----VQMEELGKLKARIEA-ENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAA---------RLRQLAEE 2295
Cdd:pfam12128 307 NgelsAADAAVAKDRSELEAlEDQHGAFLDADIETAAADQEQLPSWQSELENLEERLKALTGKhqdvtakynRRRSKIKE 386
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2296 DLA------QQRALAEKMLKEKMQAVQEATRLKAEAEL---LQQQKELAQEQARRLQEDKEQMAQQLAQETQGfQKTLET 2366
Cdd:pfam12128 387 QNNrdiagiKDKLAKIREARDRQLAVAEDDLQALESELreqLEAGKLEFNEEEYRLKSRLGELKLRLNQATAT-PELLLQ 465
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2367 ERQRQLEMSAEAERLRLRVAEMSRAQaraeEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQ-RQQSDRDAERLR 2445
Cdd:pfam12128 466 LENFDERIERAREEQEAANAEVERLQ----SELRQARKRRDQASEALRQASRRLEERQSALDELELQlFPQAGTLLHFLR 541
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2446 EAIAELEHEKDKLKQEAQL----LQLKSEEMQTVRQEQLLQETQALQQSflsEKDSLLQRERCIEQEKAKLEQLFQDEVA 2521
Cdd:pfam12128 542 KEAPDWEQSIGKVISPELLhrtdLDPEVWDGSVGGELNLYGVKLDLKRI---DVPEWAASEEELRERLDKAEEALQSARE 618
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2522 KAQALreeqqrqqqqmQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLA----QQQQQQEKLLAEENQRLRERLQHL 2597
Cdd:pfam12128 619 KQAAA-----------EEQLVQANGELEKASREETFARTALKNARLDLRRLFdekqSEKDKKNKALAERKDSANERLNSL 687
|
....*.
gi 1920237962 2598 EEERRA 2603
Cdd:pfam12128 688 EAQLKQ 693
|
|
| YqiK |
COG2268 |
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown]; |
2265-2400 |
6.95e-05 |
|
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
Pssm-ID: 441869 [Multi-domain] Cd Length: 439 Bit Score: 48.72 E-value: 6.95e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2265 EAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEAtrlKAEAELLQQQKELAQEQARRLQE 2344
Cdd:COG2268 196 EIIRDARIAEAEAERETEIAIAQANREAEEAELEQEREIETARIAEAEAELAKK---KAEERREAETARAEAEAAYEIAE 272
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 2345 DKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDAR 2400
Cdd:COG2268 273 ANAEREVQRQLEIAEREREIELQEKEAEREEAELEADVRKPAEAEKQAAEAEAEAE 328
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1820-2031 |
7.00e-05 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 48.61 E-value: 7.00e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1820 AVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIeARLAQLRKASE 1899
Cdd:COG4942 18 QADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAEL-AELEKEIAELR 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1900 SELERQKGLVEDTL----RQRRQVEEEILALKGSFEKAAAGKA-------ELELELGRIRGTAEDTLRSKEQAEQEAARQ 1968
Cdd:COG4942 97 AELEAQKEELAELLralyRLGRQPPLALLLSPEDFLDAVRRLQylkylapARREQAEELRADLAELAALRAELEAERAEL 176
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237962 1969 RQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQ 2031
Cdd:COG4942 177 EALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAA 239
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3127-3163 |
7.01e-05 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 42.85 E-value: 7.01e-05
10 20 30
....*....|....*....|....*....|....*..
gi 1920237962 3127 LRLLDAQLSTGGIVDPSKSHRVPLDVACARGYLDKET 3163
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
|
|
| Crescentin |
pfam19220 |
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament ... |
1304-1606 |
7.15e-05 |
|
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament proteins, named crescentin, whose cytoskeletal function is required for the vibrioid and helical shapes of Caulobacter crescentus. Without crescentin, the cells adopt a straight-rod morphology. Crescentin has characteriztic features of IF proteins including the ability to assemble into filaments in vitro without energy or cofactor requirements. In vivo, crescentin forms a helical structure that colocalizes with the inner cell curvatures beneath the cytoplasmic membrane.
Pssm-ID: 437057 [Multi-domain] Cd Length: 401 Bit Score: 48.53 E-value: 7.15e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1304 GSESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAaLEKQ--------RQLAEAH 1375
Cdd:pfam19220 63 AYGKLRRELAGLTRRLSAAEGELEELVARLAKLEAALREAEAAKEELRIELRDKTAQAEA-LERQlaaeteqnRALEEEN 141
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1376 AQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRsiqeeLQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRL 1455
Cdd:pfam19220 142 KALREEAQAAEKALQRAEGELATARERLALLEQENRR-----LQALSEEQAAELAELTRRLAELETQLDATRARLRALEG 216
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1456 QL--EATERQRGGAEGELQALRARAEEAEaqkrqaqeeaerLRRQVQDETQRKRQAE---AELALRVQAEAEAAREKQRa 1530
Cdd:pfam19220 217 QLaaEQAERERAEAQLEEAVEAHRAERAS------------LRMKLEALTARAAATEqllAEARNQLRDRDEAIRAAER- 283
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237962 1531 lqALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSE--HASFAEKTAQLERTlkEEHVAVVQLREEA 1606
Cdd:pfam19220 284 --RLKEASIERDTLERRLAGLEADLERRTQQFQEMQRARAELEERAEmlTKALAAKDAALERA--EERIASLSDRIAE 357
|
|
| HCR |
pfam07111 |
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ... |
1400-1560 |
7.74e-05 |
|
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.
Pssm-ID: 284517 [Multi-domain] Cd Length: 749 Bit Score: 48.98 E-value: 7.74e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1400 REEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAE 1479
Cdd:pfam07111 490 RNRLDAELQLSAHLIQQEVGRAREQGEAERQQLSEVAQQLEQELQRAQESLASVGQQLEVARQGQQESTEEAASLRQELT 569
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1480 EAEAQKRQAQEEA-----ERLRRQVQDETQ-----RKRQAEAELALRvQAEAEAAREKQRAlqalEELRLQAEEAerrlR 1549
Cdd:pfam07111 570 QQQEIYGQALQEKvaeveTRLREQLSDTKRrlneaRREQAKAVVSLR-QIQHRATQEKERN----QELRRLQDEA----R 640
|
170
....*....|..
gi 1920237962 1550 QAEAER-ARQVQ 1560
Cdd:pfam07111 641 KEEGQRlARRVQ 652
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
2172-2399 |
8.01e-05 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 48.29 E-value: 8.01e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2172 QALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKL--KARIEAENRA 2249
Cdd:COG3883 13 FADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEieERREELGERA 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2250 LVLRDKDSAQRLLQE--EAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAEL 2327
Cdd:COG3883 93 RALYRSGGSVSYLDVllGSESFSDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAE 172
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 2328 LQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDA 2399
Cdd:COG3883 173 LEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 244
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1995-2300 |
8.11e-05 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 48.97 E-value: 8.11e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1995 EAARQRKAALEEVERLKAKVEEARRLRER----AEQESARQLQLAQEAA----QKRLQAEEKAHAFAVQQKEQELQqtlq 2066
Cdd:pfam17380 286 ERQQQEKFEKMEQERLRQEKEEKAREVERrrklEEAEKARQAEMDRQAAiyaeQERMAMERERELERIRQEERKRE---- 361
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2067 qeqsvLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERlKQSAEEQAQAQAQAQAAAEKLRKEAEQEAArra 2146
Cdd:pfam17380 362 -----LERIRQEEIAMEISRMRELERLQMERQQKNERVRQELEAAR-KVKILEEERQRKIQQQKVEMEQIRAEQEEA--- 432
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2147 qaeqaalRQKQAADAEMEKHKQFaEQALRQKAQVEQELTALRLQLEETDHQKSILD---------EELQRLKAEVTEAAR 2217
Cdd:pfam17380 433 -------RQREVRRLEEERAREM-ERVRLEEQERQQQVERLRQQEEERKRKKLELEkekrdrkraEEQRRKILEKELEER 504
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2218 QRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEE--AEKMKQVAEEAARLSVAAQEAARLRQLAEE 2295
Cdd:pfam17380 505 KQAMIEEERKRKLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRriQEQMRKATEERSRLEAMEREREMMRQIVES 584
|
....*
gi 1920237962 2296 DLAQQ 2300
Cdd:pfam17380 585 EKARA 589
|
|
| Golgin_A5 |
pfam09787 |
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ... |
1468-1602 |
8.30e-05 |
|
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.
Pssm-ID: 462900 [Multi-domain] Cd Length: 305 Bit Score: 47.83 E-value: 8.30e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1468 EGELQALRARAEEAEAQkrqAQEEAERLRRQVQDetqRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERR 1547
Cdd:pfam09787 67 RGQIQQLRTELQELEAQ---QQEEAESSREQLQE---LEEQLATERSARREAEAELERLQEELRYLEEELRRSKATLQSR 140
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1548 LRQAEAERARQ-VQVALETAQRSAEAELQSE-HA---SFAEKTAQLERTLKEEHVAVVQL 1602
Cdd:pfam09787 141 IKDREAEIEKLrNQLTSKSQSSSSQSELENRlHQlteTLIQKQTMLEALSTEKNSLVLQL 200
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1818-2296 |
8.42e-05 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 49.00 E-value: 8.42e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1818 EDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLleEQAAQHKADIEARLAQLRKA 1897
Cdd:COG4717 77 EEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQEL--EALEAELAELPERLEELEER 154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1898 seselerqkglvedtLRQRRQVEEEILALKgsfEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEER 1977
Cdd:COG4717 155 ---------------LEELRELEEELEELE---AELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELE 216
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1978 RRREAEervqKSLAAEEEAARQRKAALEEVERLKakveearRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQK 2057
Cdd:COG4717 217 EAQEEL----EELEEELEQLENELEAAALEERLK-------EARLLLLIAAALLALLGLGGSLLSLILTIAGVLFLVLGL 285
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2058 EQELQQTLQQEQSVLERLRseaeaarraaeeaEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKE 2137
Cdd:COG4717 286 LALLFLLLAREKASLGKEA-------------EELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQEL 352
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2138 AEQEAARRAQAEQAALRQKQAADaeMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQ-KSILDEELQRLKAEVTEAA 2216
Cdd:COG4717 353 LREAEELEEELQLEELEQEIAAL--LAEAGVEDEEELRAALEQAEEYQELKEELEELEEQlEELLGELEELLEALDEEEL 430
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2217 RQR-GQVEEELFSLRVQMEELGKLKARIEAENRAlvLRDKDSAQRLLQEEAE---KMKQVAEEAARLSVAAQEAARLRQL 2292
Cdd:COG4717 431 EEElEELEEELEELEEELEELREELAELEAELEQ--LEEDGELAELLQELEElkaELRELAEEWAALKLALELLEEAREE 508
|
....
gi 1920237962 2293 AEED 2296
Cdd:COG4717 509 YREE 512
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
1683-2395 |
8.65e-05 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 48.96 E-value: 8.65e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1683 RELAEQELEKQRQLAEGTAQQRLAAE---QELIRLRAETEQGEQQRQLLEEELARLQ-REAAAATQK-RRELEAELA--- 1754
Cdd:pfam15921 158 KCLKEDMLEDSNTQIEQLRKMMLSHEgvlQEIRSILVDFEEASGKKIYEHDSMSTMHfRSLGSAISKiLRELDTEISylk 237
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1755 ----KVRAEMEVLLA-SKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAER 1829
Cdd:pfam15921 238 grifPVEDQLEALKSeSQNKIELLLQQHQDRIEQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQARNQNSMYMR 317
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1830 VLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQL-----RKASESELER 1904
Cdd:pfam15921 318 QLSDLESTVSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTERDQFSQESGNLDDQLQKLladlhKREKELSLEK 397
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1905 QK-----------GLVEDTLRQR---RQVEEEIL-ALKGSFEKAAAGkaELELELGRIRGtaedtlrSKEQAEQEAARQR 1969
Cdd:pfam15921 398 EQnkrlwdrdtgnSITIDHLRRElddRNMEVQRLeALLKAMKSECQG--QMERQMAAIQG-------KNESLEKVSSLTA 468
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1970 QLAAEEERRRREAEERVQKSLAAE--EEAARQRKAALEEVER-LKAKVEEARRLRERAEQesarQLQLAQEaaqkrLQAE 2046
Cdd:pfam15921 469 QLESTKEMLRKVVEELTAKKMTLEssERTVSDLTASLQEKERaIEATNAEITKLRSRVDL----KLQELQH-----LKNE 539
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2047 EKaHAFAVQQKEQELQQTLQQEQSVLERLRseaeaarraaeeaeaareraereaaqsrRQVEEAERLkqsaeeqaqaqaq 2126
Cdd:pfam15921 540 GD-HLRNVQTECEALKLQMAEKDKVIEILR----------------------------QQIENMTQL------------- 577
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2127 aqaaaeklrkeaeqeaarraqaeqaalrqkqaadaeMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQ 2206
Cdd:pfam15921 578 ------------------------------------VGQHGRTAGAMQVEKAQLEKEINDRRLELQEFKILKDKKDAKIR 621
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2207 RLKAEVTEAARQRGQV----EEELFSLRVQMEELGKLKARIEAENRAL--VLRDKDSAQRLLQEEAEKMKQVAEE-AARL 2279
Cdd:pfam15921 622 ELEARVSDLELEKVKLvnagSERLRAVKDIKQERDQLLNEVKTSRNELnsLSEDYEVLKRNFRNKSEEMETTTNKlKMQL 701
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2280 SVAAQEAARLR---QLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAqe 2356
Cdd:pfam15921 702 KSAQSELEQTRntlKSMEGSDGHAMKVAMGMQKQITAKRGQIDALQSKIQFLEEAMTNANKEKHFLKEEKNKLSQELS-- 779
|
730 740 750 760
....*....|....*....|....*....|....*....|...
gi 1920237962 2357 tqgfqkTLETERQR---QLE-MSAEAERLRLRVAEMSRAQARA 2395
Cdd:pfam15921 780 ------TVATEKNKmagELEvLRSQERRLKEKVANMEVALDKA 816
|
|
| PksD |
COG3321 |
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ... |
1870-2395 |
9.74e-05 |
|
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];
Pssm-ID: 442550 [Multi-domain] Cd Length: 1386 Bit Score: 49.10 E-value: 9.74e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1870 AFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEkAAAGKAELELELGRIRG 1949
Cdd:COG3321 867 PFQREDAAAALLAAALAAALAAAAALGALLLAALAAALAAALLALAAAAAAALALAAAALAA-LLALVALAAAAAALLAL 945
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1950 TAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESA 2029
Cdd:COG3321 946 AAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAALALLAAAALLLAAAAAAAALLALA 1025
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2030 RQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEE 2109
Cdd:COG3321 1026 ALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELALAAAALALAAALAAAALALALAALAAA 1105
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2110 AERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRL 2189
Cdd:COG3321 1106 LLLLALLAALALAAAAAALLALAALLAAAAAAAALAAAAAAAAALALAAAAAALAAALAAALLAAAALLLALALALAAAL 1185
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2190 QLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEElgklkARIEAENRALVLRDKDSAQRLLQEEAEKM 2269
Cdd:COG3321 1186 AAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAA-----ALALLALAAAAAAVAALAAAAAALLAALA 1260
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2270 KQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQM 2349
Cdd:COG3321 1261 ALALLAAAAGLAALAAAAAAAAAALALAAAAAAAAAALAALLAAAAAAAAAAAAAAAAAALAAALLAAALAALAAAVAAA 1340
|
490 500 510 520
....*....|....*....|....*....|....*....|....*.
gi 1920237962 2350 AQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARA 2395
Cdd:COG3321 1341 LALAAAAAAAAAAAAAAAAAAALAAAAGAAAAAAALALAALAAAVA 1386
|
|
| PRK05035 |
PRK05035 |
electron transport complex protein RnfC; Provisional |
1507-1791 |
1.07e-04 |
|
electron transport complex protein RnfC; Provisional
Pssm-ID: 235334 [Multi-domain] Cd Length: 695 Bit Score: 48.41 E-value: 1.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1507 RQAEAELALRVQAEAEAAREKQR--ALQAleelRLQAEEAER--RLRQAEAERARQVQVALETAQRSAEAELQSEHASFA 1582
Cdd:PRK05035 432 RQAKAEIRAIEQEKKKAEEAKARfeARQA----RLEREKAAReaRHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIV 507
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1583 EKTAQlertlKEEHVAVVQLREEATRRAQQQAEAERARAEAERelerwQLKANEALRLRLQAEEVAQQKSLTqaeAEKQK 1662
Cdd:PRK05035 508 IKAGA-----RPDNSAVIAAREARKAQARARQAEKQAAAAADP-----KKAAVAAAIARAKAKKAAQQAANA---EAEEE 574
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1663 EEAEREARRRGKAEEQAvrqRELAEQELEKQRQLAEGTAQQRLAAEQELIrLRAETEQGEQQRQLLEEElarlqreaaAA 1742
Cdd:PRK05035 575 VDPKKAAVAAAIARAKA---KKAAQQAASAEPEEQVAEVDPKKAAVAAAI-ARAKAKKAEQQANAEPEE---------PV 641
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 1920237962 1743 TQKRRELEAELAKVRAEmevlLASKARAEEESRSTSEKSKQRLEAEAGR 1791
Cdd:PRK05035 642 DPRKAAVAAAIARAKAR----KAAQQQANAEPEEAEDPKKAAVAAAIAR 686
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
1677-2387 |
1.14e-04 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 48.56 E-value: 1.14e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1677 EQAVRQRELAEQELEKQRQlAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELAR-----LQREAAAATQKRRELEA 1751
Cdd:pfam05483 98 EAELKQKENKLQENRKIIE-AQRKAIQELQFENEKVSLKLEEEIQENKDLIKENNATRhlcnlLKETCARSAEKTKKYEY 176
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1752 ELAKVR---AEMEVLLASKARAEEESRSTSEKSkqRLEAEAgrfrELAEEAARLRALAEEAKRQRQlaeedavrqraEAE 1828
Cdd:pfam05483 177 EREETRqvyMDLNNNIEKMILAFEELRVQAENA--RLEMHF----KLKEDHEKIQHLEEEYKKEIN-----------DKE 239
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1829 RVLAEKLAAISEATRLKTEAEIALKEKEaenERLRRLAEDEAFQRRLLEeQAAQHKADIEARLAQLRKASESELERQKGL 1908
Cdd:pfam05483 240 KQVSLLLIQITEKENKMKDLTFLLEESR---DKANQLEEKTKLQDENLK-ELIEKKDHLTKELEDIKMSLQRSMSTQKAL 315
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1909 VED---TLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEerrrreaeer 1985
Cdd:pfam05483 316 EEDlqiATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITME---------- 385
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1986 VQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQ--ESARQLQLAQEAAQKRLQAEEK-AHAFAVQqkeqeLQ 2062
Cdd:pfam05483 386 LQKKSSELEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQfeKIAEELKGKEQELIFLLQAREKeIHDLEIQ-----LT 460
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2063 QTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLkqsaeeqaqaqaqaqaaaeklrkeaeqea 2142
Cdd:pfam05483 461 AIKTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQEASDM----------------------------- 511
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2143 arraqaeQAALRQKQAadaEMEKHKQFAEQALRQKAQVEQELTALRLQLE--------ETDHQKSILD---EELQRLKAE 2211
Cdd:pfam05483 512 -------TLELKKHQE---DIINCKKQEERMLKQIENLEEKEMNLRDELEsvreefiqKGDEVKCKLDkseENARSIEYE 581
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2212 VTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEeaarLSVAAQEAARLRQ 2291
Cdd:pfam05483 582 VLKKEKQMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNKLELE----LASAKQKFEEIID 657
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2292 LAEEDLAQQRALAEKMLKEKMQA---VQEATRLKAEAELLQQQK--------ELAQEQARRLQEDKEQ---MAQQLAQET 2357
Cdd:pfam05483 658 NYQKEIEDKKISEEKLLEEVEKAkaiADEAVKLQKEIDKRCQHKiaemvalmEKHKHQYDKIIEERDSelgLYKNKEQEQ 737
|
730 740 750 760
....*....|....*....|....*....|....*....|.
gi 1920237962 2358 QGFQKTLETER----------QRQLEMS-AEAERLRLRVAE 2387
Cdd:pfam05483 738 SSAKAALEIELsnikaellslKKQLEIEkEEKEKLKMEAKE 778
|
|
| hsdR |
PRK11448 |
type I restriction enzyme EcoKI subunit R; Provisional |
1479-1558 |
1.21e-04 |
|
type I restriction enzyme EcoKI subunit R; Provisional
Pssm-ID: 236912 [Multi-domain] Cd Length: 1123 Bit Score: 48.41 E-value: 1.21e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1479 EEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAE------AEAAREKQRALQA-LEELRLQAEEAERRLRQA 1551
Cdd:PRK11448 138 EDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELvaleglAAELEEKQQELEAqLEQLQEKAAETSQERKQK 217
|
....*..
gi 1920237962 1552 EAERARQ 1558
Cdd:PRK11448 218 RKEITDQ 224
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1481-1606 |
1.24e-04 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 47.88 E-value: 1.24e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1481 AEAQKRQAQEEAERLRRQVQDETQRKRQAEaELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLR----QAEAERA 1556
Cdd:PRK09510 61 VEQYNRQQQQQKSAKRAEEQRKKKEQQQAE-ELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAAlkqkQAEEAAA 139
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1557 RQVQVA----------LETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEA 1606
Cdd:PRK09510 140 KAAAAAkakaeaeakrAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEA 199
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1994-2607 |
1.26e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 48.37 E-value: 1.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1994 EEAARQRKAALEEVERLKAKVEEARRlreraeqeSARQLQLAQEAAQKRLQAEEKAhafavqqkeqelqqtlqqeqSVLE 2073
Cdd:COG4913 224 FEAADALVEHFDDLERAHEALEDARE--------QIELLEPIRELAERYAAARERL--------------------AELE 275
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2074 RLRSEAEAARRAAEEAEAARERAEREAAQSRRQvEEAERLKQsaeeqaqaqaqaqaaaeKLRKEAEQEAARRAQAEQAAL 2153
Cdd:COG4913 276 YLRAALRLWFAQRRLELLEAELEELRAELARLE-AELERLEA-----------------RLDALREELDELEAQIRGNGG 337
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2154 RQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETdhqKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQM 2233
Cdd:COG4913 338 DRLEQLEREIERLERELEERERRRARLEALLAALGLPLPAS---AEEFAALRAEAAALLEALEEELEALEEALAEAEAAL 414
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2234 EELGKLKARIEAENRALVLRDKDSAQRLLQeeaekMKQVAEEAARLS-VAAQEAARLRQLAEEDLAQQRAlAEKML---- 2308
Cdd:COG4913 415 RDLRRELRELEAEIASLERRKSNIPARLLA-----LRDALAEALGLDeAELPFVGELIEVRPEEERWRGA-IERVLggfa 488
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2309 ------KEKMQAVQEAT-RLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETE--RQRQLEMSAEAE 2379
Cdd:COG4913 489 ltllvpPEHYAAALRWVnRLHLRGRLVYERVRTGLPDPERPRLDPDSLAGKLDFKPHPFRAWLEAElgRRFDYVCVDSPE 568
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2380 RLRLRVAEMSRA-QARAeeDARRFRKQAEDIGERLYRTELATQEKvmlVQTLETQRQQSDRDAERLREAIAELEHEKDKL 2458
Cdd:COG4913 569 ELRRHPRAITRAgQVKG--NGTRHEKDDRRRIRSRYVLGFDNRAK---LAALEAELAELEEELAEAEERLEALEAELDAL 643
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2459 KQEAQLLQ-LKSEEMQTVRQEQLLQETQALQQsflsekdsllQRERcIEQEKAKLEQLfQDEVAKAQALREEQQRQQQQM 2537
Cdd:COG4913 644 QERREALQrLAEYSWDEIDVASAEREIAELEA----------ELER-LDASSDDLAAL-EEQLEELEAELEELEEELDEL 711
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237962 2538 QQEKQQLAASMEEARRRQHEAEEGVRRqQEELQRLAQQQQQQEKLLAEENQRLRERL-QHLEEERRAALAR 2607
Cdd:COG4913 712 KGEIGRLEKELEQAEEELDELQDRLEA-AEDLARLELRALLEERFAAALGDAVERELrENLEERIDALRAR 781
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
2176-2419 |
1.33e-04 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 47.53 E-value: 1.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2176 QKAQVEQELTALRLQleetdhQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDK 2255
Cdd:TIGR02794 44 DPGAVAQQANRIQQQ------KKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEK 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2256 DSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARlrQLAEEDLAQQRALAEKMLKE-KMQAVQEA-----TRLKAEAELLQ 2329
Cdd:TIGR02794 118 QKQAEEAKAKQAAEAKAKAEAEAERKAKEEAAK--QAEEEAKAKAAAEAKKKAEEaKKKAEAEAkakaeAEAKAKAEEAK 195
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2330 QQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQaedI 2409
Cdd:TIGR02794 196 AKAEAAKAKAAAEAAAKAEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEKQGGARGAAAGSEVDKYAAIIQQA---I 272
|
250
....*....|
gi 1920237962 2410 GERLYRTELA 2419
Cdd:TIGR02794 273 QQNLYDDPSF 282
|
|
| PRK07735 |
PRK07735 |
NADH-quinone oxidoreductase subunit C; |
1343-1572 |
1.33e-04 |
|
NADH-quinone oxidoreductase subunit C;
Pssm-ID: 236081 [Multi-domain] Cd Length: 430 Bit Score: 48.05 E-value: 1.33e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1343 EERLAEQQRAEERERLAEVEAAlEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELqhlr 1422
Cdd:PRK07735 11 KKEAARRAKEEARKRLVAKHGA-EISKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAALAKQKREGTEEV---- 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1423 qsSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDE 1502
Cdd:PRK07735 86 --TEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKE 163
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1503 TQRKRQAEAELALrvqaeaEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEA 1572
Cdd:PRK07735 164 KAKAKAAAAAKAK------AAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGN 227
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
1153-1558 |
1.37e-04 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 48.50 E-value: 1.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1153 QEVGERLQQRHGERDVEVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAV 1232
Cdd:TIGR00606 694 QEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETL 773
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1233 planSQAVREQLRQEKALLED---IERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSE--- 1306
Cdd:TIGR00606 774 ----LGTIMPEEESAKVCLTDvtiMERFQMELKDVERKIAQQAAKLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIElnr 849
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1307 SIIQEYVD-LRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQR---------AEERERLAEVEAALEKQRQLAEAHA 1376
Cdd:TIGR00606 850 KLIQDQQEqIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTevqslireiKDAKEQDSPLETFLEKDQQEKEELI 929
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1377 QAKAQAEREAQGLQRRMQEEVarrEEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQ 1456
Cdd:TIGR00606 930 SSKETSNKKAQDKVNDIKEKV---KNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLEECEKHQEKINEDMRLMRQD 1006
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1457 LEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALrvqaeaeAAREKQRALQALEE 1536
Cdd:TIGR00606 1007 IDTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQMQVLQMKQEHQKLEENIDL-------IKRNHVLALGRQKG 1079
|
410 420
....*....|....*....|..
gi 1920237962 1537 LRLQAEEAERRLRQAEAERARQ 1558
Cdd:TIGR00606 1080 YEKEIKHFKKELREPQFRDAEE 1101
|
|
| KpsE |
COG3524 |
Capsule polysaccharide export protein KpsE/RkpR [Cell wall/membrane/envelope biogenesis]; |
2169-2335 |
1.45e-04 |
|
Capsule polysaccharide export protein KpsE/RkpR [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442746 [Multi-domain] Cd Length: 370 Bit Score: 47.54 E-value: 1.45e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2169 FAEQALrqkAQVEQELTALRLQLEETDHQKSILDEElqrlkAEVTEAARQRGQVEEELFSLRVQMEELgklkarieaenR 2248
Cdd:COG3524 181 FAEEEV---ERAEERLRDAREALLAFRNRNGILDPE-----ATAEALLQLIATLEGQLAELEAELAAL-----------R 241
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2249 AlVLRDKDSAQRLLQEEAEKM-KQVAEEAARLSVAAQEAARLRQLAE-EDLAQQRALAEKMLKEKMQAVQEAtrlKAEAE 2326
Cdd:COG3524 242 S-YLSPNSPQVRQLRRRIAALeKQIAAERARLTGASGGDSLASLLAEyERLELEREFAEKAYTSALAALEQA---RIEAA 317
|
....*....
gi 1920237962 2327 llQQQKELA 2335
Cdd:COG3524 318 --RQQRYLA 324
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3090-3126 |
1.45e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 41.70 E-value: 1.45e-04
10 20 30
....*....|....*....|....*....|....*..
gi 1920237962 3090 KLLSAEKAVTGYKDPYSGQSVSLFQALKKGLIPREQG 3126
Cdd:smart00250 2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
2170-2559 |
1.45e-04 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 48.23 E-value: 1.45e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2170 AEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVT--EAARQRGQVEEELFSLRVQMEELGKLKARIEAEN 2247
Cdd:COG4717 76 LEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEklEKLLQLLPLYQELEALEAELAELPERLEELEERL 155
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2248 RALVlRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKekmQAVQEATRLKAEAEL 2327
Cdd:COG4717 156 EELR-ELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELE---EAQEELEELEEELEQ 231
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2328 LQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLET----------------------ERQRQLEMSAEAERLRLRV 2385
Cdd:COG4717 232 LENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSliltiagvlflvlgllallfllLAREKASLGKEAEELQALP 311
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2386 AEMSRAQARAEEDARRFRKQAEDIGER---LYRTELATQEKVMLVQTLETQRQQSDRDAER---LREAIAELEHEKDKLK 2459
Cdd:COG4717 312 ALEELEEEELEELLAALGLPPDLSPEElleLLDRIEELQELLREAEELEEELQLEELEQEIaalLAEAGVEDEEELRAAL 391
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2460 QEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQ 2539
Cdd:COG4717 392 EQAEEYQELKEELEELEEQLEELLGELEELLEALDEEELEEELEELEEELEELEEELEELREELAELEAELEQLEEDGEL 471
|
410 420
....*....|....*....|
gi 1920237962 2540 EKQQLAASMEEARRRQHEAE 2559
Cdd:COG4717 472 AELLQELEELKAELRELAEE 491
|
|
| CH_NAV2 |
cd21285 |
calponin homology (CH) domain found in neuron navigator 2; Neuron navigator 2 (NAV2), also ... |
41-152 |
1.48e-04 |
|
calponin homology (CH) domain found in neuron navigator 2; Neuron navigator 2 (NAV2), also called helicase APC down-regulated 1 (HELAD1), pore membrane and/or filament-interacting-like protein 2 (POMFIL2), retinoic acid inducible in neuroblastoma 1 (RAINB1), Steerin-2 (STEERIN2), or Unc-53 homolog 2 (unc53H2), possesses 3' to 5' helicase activity and exonuclease activity. It is involved in neuronal development, specifically in the development of different sensory organs. NAV2 contains a single copy of the CH domain at the N-terminus. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409134 Cd Length: 121 Bit Score: 44.57 E-value: 1.48e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 41 DRVQKKTFTKWVNKHLIKA--QRHISDLYEDLRDGHNLISLLEVLSGDSLpreRDVirsSRLPREKGRMrfhkLQNVQIA 118
Cdd:cd21285 8 NGFDKQIYTDWANHYLAKSghKRLIKDLQQDVTDGVLLAEIIQVVANEKI---EDI---NGCPKNRSQM----IENIDAC 77
|
90 100 110
....*....|....*....|....*....|....
gi 1920237962 119 LDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTI 152
Cdd:cd21285 78 LSFLAAKGINIQGLSAEEIRNGNLKAILGLFFSL 111
|
|
| GBP_C |
cd16269 |
Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal ... |
2263-2361 |
1.49e-04 |
|
Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal domain. Guanylate-binding proteins (GBPs) are synthesized after activation of the cell by interferons. The biochemical properties of GBPs are clearly different from those of Ras-like and heterotrimeric GTP-binding proteins. They bind guanine nucleotides with low affinity (micromolar range), are stable in their absence, and have a high turnover GTPase. In addition to binding GDP/GTP, they have the unique ability to bind GMP with equal affinity and hydrolyze GTP not only to GDP, but also to GMP. This C-terminal domain has been shown to mediate inhibition of endothelial cell proliferation by inflammatory cytokines.
Pssm-ID: 293879 [Multi-domain] Cd Length: 291 Bit Score: 47.19 E-value: 1.49e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2263 QEEAEKMKQVAEEAARLSVAAQEAARL----RQLAEEDLAQQRALAE--KMLKEKMQavQEATRLKAEAELLQQQKElaQ 2336
Cdd:cd16269 191 QALTEKEKEIEAERAKAEAAEQERKLLeeqqRELEQKLEDQERSYEEhlRQLKEKME--EERENLLKEQERALESKL--K 266
|
90 100
....*....|....*....|....*
gi 1920237962 2337 EQARRLQEDKEQMAQQLAQETQGFQ 2361
Cdd:cd16269 267 EQEALLEEGFKEQAELLQEEIRSLK 291
|
|
| CH_PLS_rpt1 |
cd21292 |
first calponin homology (CH) domain found in the plastin family; The plastin family includes ... |
44-153 |
1.53e-04 |
|
first calponin homology (CH) domain found in the plastin family; The plastin family includes plastin-1, -2, and -3, which are all actin-bundling proteins. Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. Plastin-2, also called L-plastin, LC64P, or lymphocyte cytosolic protein 1 (LCP-1), plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-3, also called T-plastin, is found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Members of this family contain four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409141 Cd Length: 145 Bit Score: 44.96 E-value: 1.53e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 44 QKKTFTKWVN---------KHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPrERDVIRSSRLPrekgrmrFHKLQN 114
Cdd:cd21292 25 EKVAFVNWINknlgddpdcKHLLPMDPNTDDLFEKVKDGILLCKMINLSVPDTID-ERAINKKKLTV-------FTIHEN 96
|
90 100 110
....*....|....*....|....*....|....*....
gi 1920237962 115 VQIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTII 153
Cdd:cd21292 97 LTLALNSASAIGCNVVNIGAEDLKEGKPHLVLGLLWQII 135
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
1677-1872 |
1.54e-04 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 48.09 E-value: 1.54e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1677 EQAVRQRELAEQELEKQRQLAEgTAQQRLAA---EQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAEL 1753
Cdd:COG3206 171 EEARKALEFLEEQLPELRKELE-EAEAALEEfrqKNGLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQL 249
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1754 AKVRAEMEVLLASKA--------------RAEEESRSTSEKSK-QRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEE 1818
Cdd:COG3206 250 GSGPDALPELLQSPViqqlraqlaeleaeLAELSARYTPNHPDvIALRAQIAALRAQLQQEAQRILASLEAELEALQARE 329
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*
gi 1920237962 1819 DAVRQRAEAERVLAEKLAAIS-EATRLKTEAEIALKEKEAENERLRRLAEDEAFQ 1872
Cdd:COG3206 330 ASLQAQLAQLEARLAELPELEaELRRLEREVEVARELYESLLQRLEEARLAEALT 384
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
1416-1790 |
1.54e-04 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 47.97 E-value: 1.54e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1416 EELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEAT----ERQRGGAEGELQALRARAEEAEAQKRQAQEE 1491
Cdd:pfam07888 30 ELLQNRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWERQrrelESRVAELKEELRQSREKHEELEEKYKELSAS 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1492 AERL---RRQVQDETQRKRQAEAELALRVQAEAEAAREKQralQALEELRLQAEEAERRLRQAEAERaRQVQVALETAQ- 1567
Cdd:pfam07888 110 SEELseeKDALLAQRAAHEARIRELEEDIKTLTQRVLERE---TELERMKERAKKAGAQRKEEEAER-KQLQAKLQQTEe 185
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1568 --RSAEAELQSEHASFAEKTAQLERtLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKAnEALRLRLqaE 1645
Cdd:pfam07888 186 elRSLSKEFQELRNSLAQRDTQVLQ-LQDTITTLTQKLTTAHRKEAENEALLEELRSLQERLNASERKV-EGLGEEL--S 261
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1646 EVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQR 1725
Cdd:pfam07888 262 SMAAQRDRTQAELHQARLQAAQLTLQLADASLALREGRARWAQERETLQQSAEADKDRIEKLSAELQRLEERLQEERMER 341
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237962 1726 QLLEEELArlqREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSE---KSKQRLEAEAG 1790
Cdd:pfam07888 342 EKLEVELG---REKDCNRVQLSESRRELQELKASLRVAQKEKEQLQAEKQELLEyirQLEQRLETVAD 406
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
1306-1517 |
1.58e-04 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 48.09 E-value: 1.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1306 ESIIQEYVDLRTRYSelSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERL------AEVEAALEKQRQLAEAHAQAK 1379
Cdd:COG3206 155 NALAEAYLEQNLELR--REEARKALEFLEEQLPELRKELEEAEAALEEFRQKNglvdlsEEAKLLLQQLSELESQLAEAR 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1380 AQAeREAQGLQRRMQEEVARREEVAVEA-------------QEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRI 1446
Cdd:COG3206 233 AEL-AEAEARLAALRAQLGSGPDALPELlqspviqqlraqlAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQQEA 311
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237962 1447 EEEIRVVRLQLEATERQRGGAEGELQALRARAE---EAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRV 1517
Cdd:COG3206 312 QRILASLEAELEALQAREASLQAQLAQLEARLAelpELEAELRRLEREVEVARELYESLLQRLEEARLAEALTV 385
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3962-3996 |
1.65e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 41.70 E-value: 1.65e-04
10 20 30
....*....|....*....|....*....|....*
gi 1920237962 3962 LLEAQAATGYVIDPIKGLKLTVEEAVRMGIVGPEF 3996
Cdd:smart00250 3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
2170-2604 |
1.67e-04 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 48.02 E-value: 1.67e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2170 AEQALRQKAQVEQELTALRLQL-------EETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLK-A 2241
Cdd:COG3096 524 LEQRLRQQQNAERLLEEFCQRIgqqldaaEELEELLAELEAQLEELEEQAAEAVEQRSELRQQLEQLRARIKELAARApA 603
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2242 RIEAENRALVLRD------KDSA------QRLLQEE----------AEKMKQVAEEAARLSVAA-QEAARLRQLAE---- 2294
Cdd:COG3096 604 WLAAQDALERLREqsgealADSQevtaamQQLLEREreatverdelAARKQALESQIERLSQPGgAEDPRLLALAErlgg 683
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2295 -------EDLAQQRA-LAEKMLKEKMQA--VQEATRLKAEAE----------LLQQQKELAQEQARRLQEDKEQMAQQLA 2354
Cdd:COG3096 684 vllseiyDDVTLEDApYFSALYGPARHAivVPDLSAVKEQLAgledcpedlyLIEGDPDSFDDSVFDAEELEDAVVVKLS 763
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2355 QETQGFQKTLE------TERQRQLE-MSAEAERLRLRVAEMS---RAQARAEEDARRFrkqaedIGERLYRTELATQEKV 2424
Cdd:COG3096 764 DRQWRYSRFPEvplfgrAAREKRLEeLRAERDELAEQYAKASfdvQKLQRLHQAFSQF------VGGHLAVAFAPDPEAE 837
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2425 MlvQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQsflsEKDSLLQRERC 2504
Cdd:COG3096 838 L--AALRQRRSELERELAQHRAQEQQLRQQLDQLKEQLQLLNKLLPQANLLADETLADRLEELRE----ELDAAQEAQAF 911
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2505 IEQEKAKLEQLfqdeVAKAQALReeqqrqqqQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLA 2584
Cdd:COG3096 912 IQQHGKALAQL----EPLVAVLQ--------SDPEQFEQLQADYLQAKEQQRRLKQQIFALSEVVQRRPHFSYEDAVGLL 979
|
490 500
....*....|....*....|....
gi 1920237962 2585 EENQ----RLRERLQHLEEERRAA 2604
Cdd:COG3096 980 GENSdlneKLRARLEQAEEARREA 1003
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1708-1858 |
1.72e-04 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 46.46 E-value: 1.72e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1708 EQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRS-TSEKSKQRLE 1786
Cdd:COG1579 16 DSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNvRNNKEYEALQ 95
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 1787 AE----AGRFRELAEEAARLRALAEEAKRQRQLAEEdavrQRAEAERVLAEKLAAISEATRlKTEAEIALKEKEAE 1858
Cdd:COG1579 96 KEieslKRRISDLEDEILELMERIEELEEELAELEA----ELAELEAELEEKKAELDEELA-ELEAELEELEAERE 166
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
1790-2036 |
1.91e-04 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 47.58 E-value: 1.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1790 GRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDE 1869
Cdd:pfam07888 34 NRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWERQRRELESRVAELKEELRQSREKHEELEEKYKELSASSEEL 113
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1870 AFQRRLLEEQAAQHKADIE------ARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELE 1943
Cdd:pfam07888 114 SEEKDALLAQRAAHEARIReleediKTLTQRVLERETELERMKERAKKAGAQRKEEEAERKQLQAKLQQTEEELRSLSKE 193
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1944 LGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAeervqKSLAAEEEAARQRKAALEE-VERLKAKVEEARRLRE 2022
Cdd:pfam07888 194 FQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKEAEN-----EALLEELRSLQERLNASERkVEGLGEELSSMAAQRD 268
|
250
....*....|....*
gi 1920237962 2023 RAEQESAR-QLQLAQ 2036
Cdd:pfam07888 269 RTQAELHQaRLQAAQ 283
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
2174-2614 |
2.07e-04 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 47.59 E-value: 2.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2174 LRQKAQVEQ-ELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIeaenralvl 2252
Cdd:PRK01156 188 LEEKLKSSNlELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRYESEI--------- 258
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2253 RDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQK 2332
Cdd:PRK01156 259 KTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILSNIDAEINKYHAIIKKLSVLQKDY 338
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2333 ELAQEQARRLQEDKEQMAQQLAQET--QGFQKTLETERQRQLEMSAEAERLRlrvAEMSRAQARAEEDARRFRKQAEDIG 2410
Cdd:PRK01156 339 NDYIKKKSRYDDLNNQILELEGYEMdyNSYLKSIESLKKKIEEYSKNIERMS---AFISEILKIQEIDPDAIKKELNEIN 415
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2411 ERL--YRTELAT--QEKVMLVQTLETQRQQSD----------------------------RDAERLREAIAELEHEKDKL 2458
Cdd:PRK01156 416 VKLqdISSKVSSlnQRIRALRENLDELSRNMEmlngqsvcpvcgttlgeeksnhiinhynEKKSRLEEKIREIEIEVKDI 495
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2459 KQEAQLLQ-----LKSEEM-QTVRQEQLLQETQALQQSFLSE----KDSLLQRERCIEQEKA-KLEQLFQDEVAKAQALR 2527
Cdd:PRK01156 496 DEKIVDLKkrkeyLESEEInKSINEYNKIESARADLEDIKIKinelKDKHDKYEEIKNRYKSlKLEDLDSKRTSWLNALA 575
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2528 EEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEG-----------VRRQQEELQRLaqqqqQQEKLLAEENQRLRERLQH 2596
Cdd:PRK01156 576 VISLIDIETNRSRSNEIKKQLNDLESRLQEIEIGfpddksyidksIREIENEANNL-----NNKYNEIQENKILIEKLRG 650
|
490
....*....|....*...
gi 1920237962 2597 LEEERRAALARSEEIAPS 2614
Cdd:PRK01156 651 KIDNYKKQIAEIDSIIPD 668
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1306-1650 |
2.24e-04 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 47.45 E-value: 2.24e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1306 ESIIQEYVDLRTRYSELST---LTSQYIRFISETLRRMEEEERLAE-QQRAEE-RERLAEVEAALEKQRQLAEAHAQAKA 1380
Cdd:COG4717 98 EELEEELEELEAELEELREeleKLEKLLQLLPLYQELEALEAELAElPERLEElEERLEELRELEEELEELEAELAELQE 177
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1381 QAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQsseaEIQAKARQVEAAERSRLRIEEEIRVVRLQ---- 1456
Cdd:COG4717 178 ELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQE----ELEELEEELEQLENELEAAALEERLKEARllll 253
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1457 --------------LEATERQRGGA----EGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQ 1518
Cdd:COG4717 254 iaaallallglggsLLSLILTIAGVlflvLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPD 333
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1519 AEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQV----------------------QVALETAQRSAEAELQS 1576
Cdd:COG4717 334 LSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAAllaeagvedeeelraaleqaeeYQELKEELEELEEQLEE 413
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 1577 -----EHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKAnEALRLRLQAEEVAQQ 1650
Cdd:COG4717 414 llgelEELLEALDEEELEEELEELEEELEELEEELEELREELAELEAELEQLEEDGELAELLQ-ELEELKAELRELAEE 491
|
|
| PLEC |
smart00250 |
Plectin repeat; |
4271-4304 |
2.26e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 41.31 E-value: 2.26e-04
10 20 30
....*....|....*....|....*....|....
gi 1920237962 4271 EETGPVAGILDTETLEKVSITEAMHRNLVDNITG 4304
Cdd:smart00250 5 EAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| CHASE3 |
COG5278 |
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms]; |
2168-2614 |
2.30e-04 |
|
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
Pssm-ID: 444089 [Multi-domain] Cd Length: 530 Bit Score: 47.21 E-value: 2.30e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2168 QFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAEN 2247
Cdd:COG5278 76 SFLEPYEEARAEIDELLAELRSLTADNPEQQARLDELEALIDQWLAELEQVIALRRAGGLEAALALVRSGEGKALMDEIR 155
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2248 RALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAEL 2327
Cdd:COG5278 156 ARLLLLALALAALLLAAAALLLLLLALAALLALAELLLLALARALAALLLLLLLEAELAAAAALLAAAAALAALAALELL 235
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2328 LQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAE 2407
Cdd:COG5278 236 AALALALALLLAALLLALLAALALAALLAAALLALAALLLALAAAAALAAAAALELAAAEALALAELELELLLAAAAAAA 315
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2408 DIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQAL 2487
Cdd:COG5278 316 AAAAAAAAALAALLALALATALAAAAAALALLAALLAEAAAAAAEEAEAAAEAAAAALAGLAEVEAEGAAEAVELEVLAI 395
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2488 QQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQE 2567
Cdd:COG5278 396 AAAAAAAAAEAAAAAAAAAAASAAEALELAEALAEALALAEEEALALAAASSELAEAGAALALAAAEALAEELAAVAALA 475
|
410 420 430 440
....*....|....*....|....*....|....*....|....*..
gi 1920237962 2568 ELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEEIAPS 2614
Cdd:COG5278 476 ALAAAAAALAEAEAAAALAAAAALSLALALAALLLAAAEAALAAALA 522
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
2373-2607 |
2.50e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 47.60 E-value: 2.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2373 EMSAEAERLRLRVAEMSRAQARAEeDARRFRKQAEDIgERLYRTELATQEKVMLVQTLETQRQ--QSDRDAERLREAIAE 2450
Cdd:COG4913 222 DTFEAADALVEHFDDLERAHEALE-DAREQIELLEPI-RELAERYAAARERLAELEYLRAALRlwFAQRRLELLEAELEE 299
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2451 LEHEKDKLKQEAQLLQLKSEEMQtvrqEQLLQETQALQQSFLSEKDSLlqrERCIEQEKAKLEQLFQdevaKAQALREEQ 2530
Cdd:COG4913 300 LRAELARLEAELERLEARLDALR----EELDELEAQIRGNGGDRLEQL---EREIERLERELEERER----RRARLEALL 368
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237962 2531 QRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQqqqekllaeenQRLRERLQHLEEERRAALAR 2607
Cdd:COG4913 369 AALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAAL-----------RDLRRELRELEAEIASLERR 434
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1469-1691 |
2.58e-04 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 46.72 E-value: 2.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1469 GELQALRARAEEAEAQ-KRQAQEEAERLRRQVQDETQRKRQAEAElalRVQAEAEAAREKQRALQALEELRlQAEEAERR 1547
Cdd:PRK09510 65 NRQQQQQKSAKRAEEQrKKKEQQQAEELQQKQAAEQERLKQLEKE---RLAAQEQKKQAEEAAKQAALKQK-QAEEAAAK 140
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1548 LRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEAtrraqqqaeaeraraeaerel 1627
Cdd:PRK09510 141 AAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEA--------------------- 199
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237962 1628 erwQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELE 1691
Cdd:PRK09510 200 ---KKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVD 260
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
2375-2623 |
2.73e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 46.68 E-value: 2.73e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2375 SAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHE 2454
Cdd:COG4942 19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAE 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2455 KDKLKQEAQLLQLKSEEMQTVRQEQLLqetqaLQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQ 2534
Cdd:COG4942 99 LEAQKEELAELLRALYRLGRQPPLALL-----LSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAER 173
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2535 QqmqqEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEEIAPS 2614
Cdd:COG4942 174 A----ELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAAGFA 249
|
....*....
gi 1920237962 2615 RAAAARALP 2623
Cdd:COG4942 250 ALKGKLPWP 258
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
1368-1547 |
3.00e-04 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 47.08 E-value: 3.00e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1368 QRQLAEAHAQAK---AQAEREAQglqrrmqeevARREEVAVEAqeqkrsiQEELQHLRQSSEAEIQAKARQVEAAERsRL 1444
Cdd:PRK12704 30 EAKIKEAEEEAKrilEEAKKEAE----------AIKKEALLEA-------KEEIHKLRNEFEKELRERRNELQKLEK-RL 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1445 RIEEEIrvVRLQLEATERQRGGAEGELQALRARAEEAEAQkrqaQEEAERLRRQVQDETQR-----KRQAEAELALRVqa 1519
Cdd:PRK12704 92 LQKEEN--LDRKLELLEKREEELEKKEKELEQKQQELEKK----EEELEELIEEQLQELERisgltAEEAKEILLEKV-- 163
|
170 180
....*....|....*....|....*....
gi 1920237962 1520 EAEAAREKQRALQALEElrlQA-EEAERR 1547
Cdd:PRK12704 164 EEEARHEAAVLIKEIEE---EAkEEADKK 189
|
|
| MARTX_Nterm |
NF012221 |
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model ... |
1674-1886 |
3.02e-04 |
|
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model describes the N-terminal 1900 amino acids of MARTX family multifunctional-autoprocessing repeats-in-toxin holotoxins, which contain both repeat regions that facilitate their entry into eukaryotic target cells, and multiple effector domains.
Pssm-ID: 467957 [Multi-domain] Cd Length: 1848 Bit Score: 47.52 E-value: 3.02e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1674 KAEEQAVRQRELAEQ-----ELEKQRQLAEGTAQQrlaAEQELIRLRAETEQGEQQRQLLEEElarlqreaaaatqkRRE 1748
Cdd:NF012221 1555 DAAQNALADKERAEAdrqrlEQEKQQQLAAISGSQ---SQLESTDQNALETNGQAQRDAILEE--------------SRA 1617
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1749 LEAELAKVRAEMEVLLAS-------------------KARAEEESRSTSEKSKQRLEAEAGRF----RELAEEAARLRAL 1805
Cdd:NF012221 1618 VTKELTTLAQGLDALDSQatyagesgdqwrnpfagglLDRVQEQLDDAKKISGKQLADAKQRHvdnqQKVKDAVAKSEAG 1697
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1806 AEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRlLEEQAAQHKA 1885
Cdd:NF012221 1698 VAQGEQNQANAEQDIDDAKADAEKRKDDALAKQNEAQQAESDANAAANDAQSRGEQDASAAENKANQAQ-ADAKGAKQDE 1776
|
.
gi 1920237962 1886 D 1886
Cdd:NF012221 1777 S 1777
|
|
| TPH |
pfam13868 |
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ... |
1674-1867 |
3.09e-04 |
|
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.
Pssm-ID: 464007 [Multi-domain] Cd Length: 341 Bit Score: 46.45 E-value: 3.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1674 KAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAEL 1753
Cdd:pfam13868 149 EEREEDERILEYLKEKAEREEEREAEREEIEEEKEREIARLRAQQEKAQDEKAERDELRAKLYQEEQERKERQKEREEAE 228
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1754 AKVRAEMEVLLASKARAEEESRsTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAE 1833
Cdd:pfam13868 229 KKARQRQELQQAREEQIELKER-RLAEEAEREEEEFERMLRKQAEDEEIEQEEAEKRRMKRLEHRRELEKQIEEREEQRA 307
|
170 180 190
....*....|....*....|....*....|....
gi 1920237962 1834 KLAAISEATRLKTEAEIALKEKEAENERLRRLAE 1867
Cdd:pfam13868 308 AEREEELEEGERLREEEAERRERIEEERQKKLKE 341
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
945-1606 |
3.11e-04 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 47.27 E-value: 3.11e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 945 EDRLQAEREYGSCSRHYQQLLQSLEQGEQEESRCQRCISELKDIRLQLEACETRTVHRLRLPLDKEPARECAQRITEQQK 1024
Cdd:pfam02463 293 KEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLE 372
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1025 AQAEVDGLGKGVARLSAEAEKVLALPEPSPAAPTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAH 1104
Cdd:pfam02463 373 EELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEE 452
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1105 EEQLKEAQAVPATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGAQE-------VGERLQQRHGERDVEVERWRERV 1177
Cdd:pfam02463 453 LEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKEskarsglKVLLALIKDGVGGRIISAHGRLG 532
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1178 TLLLERWQAVLAQTDVRQRELEQLG-----RQLRYYRESADPLGAWLRDAKQRQEQIQAVPLA---NSQAVREQLRQEKA 1249
Cdd:pfam02463 533 DLGVAVENYKVAISTAVIVEVSATAdeveeRQKLVRALTELPLGARKLRLLIPKLKLPLKSIAvleIDPILNLAQLDKAT 612
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1250 LLEDIERHGEKV----EECQRFAKQYINAiKDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTL 1325
Cdd:pfam02463 613 LEADEDDKRAKVvegiLKDTELTKLKESA-KAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAK 691
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1326 TSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGL-QRRMQEEVARREEVA 1404
Cdd:pfam02463 692 EEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSrLKKEEKEEEKSELSL 771
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1405 VEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRvvrLQLEATERQRGGAEGELQALRARAEEAEAQ 1484
Cdd:pfam02463 772 KEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLE---EEQLLIEQEEKIKEEELEELALELKEEQKL 848
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1485 KRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALE 1564
Cdd:pfam02463 849 EKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAE 928
|
650 660 670 680
....*....|....*....|....*....|....*....|..
gi 1920237962 1565 TAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEA 1606
Cdd:pfam02463 929 ILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLLLAKE 970
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
1765-1962 |
3.28e-04 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 46.34 E-value: 3.28e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1765 ASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQrqlaEEDAVRQRAEAERVLAEKLAAISEATRL 1844
Cdd:PRK09510 72 KSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQ----AEEAAKQAALKQKQAEEAAAKAAAAAKA 147
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1845 KTEAEIALKEKEAENerlrrlAEDEAfQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEIL 1924
Cdd:PRK09510 148 KAEAEAKRAAAAAKK------AAAEA-KKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAA 220
|
170 180 190
....*....|....*....|....*....|....*...
gi 1920237962 1925 ALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAE 1962
Cdd:PRK09510 221 AEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAE 258
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
2153-2327 |
3.37e-04 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 46.34 E-value: 3.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2153 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEE---LFSL 2229
Cdd:PRK09510 80 QRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEakrAAAA 159
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2230 RVQMEELGKLKARIEAENRALVLRDK----DSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAE 2305
Cdd:PRK09510 160 AKKAAAEAKKKAEAEAAKKAAAEAKKkaeaEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAE 239
|
170 180
....*....|....*....|..
gi 1920237962 2306 KmlKEKMQAVQEATRLKAEAEL 2327
Cdd:PRK09510 240 K--AAAAKAAEKAAAAKAAAEV 259
|
|
| PRK10246 |
PRK10246 |
exonuclease subunit SbcC; Provisional |
1681-2397 |
3.55e-04 |
|
exonuclease subunit SbcC; Provisional
Pssm-ID: 182330 [Multi-domain] Cd Length: 1047 Bit Score: 47.10 E-value: 3.55e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1681 RQRELAEQEleKQRQLAEGTAQQRLAAEQ-ELIRLRAeTEQGEQQRQLLEeelaRLQREAAAATQKRR---ELEAELAKV 1756
Cdd:PRK10246 251 RLDELQQEA--SRRQQALQQALAAEEKAQpQLAALSL-AQPARQLRPHWE----RIQEQSAALAHTRQqieEVNTRLQST 323
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1757 RAEMEVLLASKARAEEESRSTSEKSKQRLeAEAGRFRELAEEAARLRALAEEAKRQRQlaEEDAVRQRAEAERvlaEKLA 1836
Cdd:PRK10246 324 MALRARIRHHAAKQSAELQAQQQSLNTWL-AEHDRFRQWNNELAGWRAQFSQQTSDRE--QLRQWQQQLTHAE---QKLN 397
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1837 AISEATRLKTEAEIAlkekeaenERLRRLAEDEAFQRRLLEEQaAQHkADIEARLAQLrKASESELERQKGLVEDTLRQR 1916
Cdd:PRK10246 398 ALPAITLTLTADEVA--------AALAQHAEQRPLRQRLVALH-GQI-VPQQKRLAQL-QVAIQNVTQEQTQRNAALNEM 466
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1917 RQVEEEilalkgsfekaaagKAElelELGRIRGTAEDTLRSKEQAEQEAarQRQLAAEEERRRREAEERVQKSLAAEEEA 1996
Cdd:PRK10246 467 RQRYKE--------------KTQ---QLADVKTICEQEARIKDLEAQRA--QLQAGQPCPLCGSTSHPAVEAYQALEPGV 527
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1997 ARQRKAALE-EVERLKakvEEARRLRERAEQeSARQLQLAQEAAQkRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERL 2075
Cdd:PRK10246 528 NQSRLDALEkEVKKLG---EEGAALRGQLDA-LTKQLQRDESEAQ-SLRQEEQALTQQWQAVCASLNITLQPQDDIQPWL 602
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2076 rseaeaarraaeeaeaareraereaaqsrrqvEEAERLKQsaeeqaqaqaqaqaaaeklrkeaeqeaarraqaEQAALRQ 2155
Cdd:PRK10246 603 --------------------------------DAQEEHER---------------------------------QLRLLSQ 617
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2156 KQAADAEMEKH----KQFAEQALRQKAQVEQELTALRLQLEETDHQKSIL---DEELQRLKAEVTEAARQRGQVE--EEL 2226
Cdd:PRK10246 618 RHELQGQIAAHnqqiIQYQQQIEQRQQQLLTALAGYALTLPQEDEEASWLatrQQEAQSWQQRQNELTALQNRIQqlTPL 697
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2227 FSLRVQMEELGKLKARIEAEN------RALVLRDKDSA--QRLLQEEAEKMKQVAEEAARL--SVAAQEAARLRQLAEED 2296
Cdd:PRK10246 698 LETLPQSDDLPHSEETVALDNwrqvheQCLSLHSQLQTlqQQDVLEAQRLQKAQAQFDTALqaSVFDDQQAFLAALLDEE 777
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2297 LAQQralAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQK--TLETERQRQLEM 2374
Cdd:PRK10246 778 TLTQ---LEQLKQNLENQRQQAQTLVTQTAQALAQHQQHRPDGLDLTVTVEQIQQELAQLAQQLREntTRQGEIRQQLKQ 854
|
730 740
....*....|....*....|....
gi 1920237962 2375 SAEA-ERLRLRVAEMSRAQARAEE 2397
Cdd:PRK10246 855 DADNrQQQQALMQQIAQATQQVED 878
|
|
| Crescentin |
pfam19220 |
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament ... |
1677-1924 |
3.75e-04 |
|
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament proteins, named crescentin, whose cytoskeletal function is required for the vibrioid and helical shapes of Caulobacter crescentus. Without crescentin, the cells adopt a straight-rod morphology. Crescentin has characteriztic features of IF proteins including the ability to assemble into filaments in vitro without energy or cofactor requirements. In vivo, crescentin forms a helical structure that colocalizes with the inner cell curvatures beneath the cytoplasmic membrane.
Pssm-ID: 437057 [Multi-domain] Cd Length: 401 Bit Score: 46.21 E-value: 3.75e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1677 EQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRaeteqgeQQRQLLEEELARLQR-------EAAAATQKRREL 1749
Cdd:pfam19220 128 AAETEQNRALEEENKALREEAQAAEKALQRAEGELATAR-------ERLALLEQENRRLQAlseeqaaELAELTRRLAEL 200
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1750 EAELAKVRAEMEVLLASKAraeeESRSTSEKSKQRLEAEAGRFR-ELAEEAARLRALAEEAKRQRQLAEE--DAVRQRAE 1826
Cdd:pfam19220 201 ETQLDATRARLRALEGQLA----AEQAERERAEAQLEEAVEAHRaERASLRMKLEALTARAAATEQLLAEarNQLRDRDE 276
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1827 AERVLAEKLaaiSEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQK 1906
Cdd:pfam19220 277 AIRAAERRL---KEASIERDTLERRLAGLEADLERRTQQFQEMQRARAELEERAEMLTKALAAKDAALERAEERIASLSD 353
|
250
....*....|....*...
gi 1920237962 1907 GLveDTLRQRRQVEEEIL 1924
Cdd:pfam19220 354 RI--AELTKRFEVERAAL 369
|
|
| EmrA |
COG1566 |
Multidrug resistance efflux pump EmrA [Defense mechanisms]; |
1406-1541 |
3.77e-04 |
|
Multidrug resistance efflux pump EmrA [Defense mechanisms];
Pssm-ID: 441174 [Multi-domain] Cd Length: 331 Bit Score: 46.19 E-value: 3.77e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1406 EAQEQKRSIQEELQHLRQSS--EAEIQAKARQVEAAERSRLRIEEEI-RVVRLQleateRQRGGAEGELQALRARAEEAE 1482
Cdd:COG1566 87 QAEAQLAAAEAQLARLEAELgaEAEIAAAEAQLAAAQAQLDLAQRELeRYQALY-----KKGAVSQQELDEARAALDAAQ 161
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 1483 AQKRQAQEEAERLRRQVQDETQrKRQAEAELAlrvQAEAEAAREKQRalqaLEELRLQA 1541
Cdd:COG1566 162 AQLEAAQAQLAQAQAGLREEEE-LAAAQAQVA---QAEAALAQAELN----LARTTIRA 212
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3013-3050 |
3.80e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 40.54 E-value: 3.80e-04
10 20 30
....*....|....*....|....*....|....*...
gi 1920237962 3013 RRALRGSGVIAGVWLEEAGQKLSIYEALRKDLLQPEAA 3050
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| PTZ00491 |
PTZ00491 |
major vault protein; Provisional |
1674-1845 |
3.86e-04 |
|
major vault protein; Provisional
Pssm-ID: 240439 [Multi-domain] Cd Length: 850 Bit Score: 46.93 E-value: 3.86e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1674 KAEEQAVRQR-ELAEQElekqrqlaegtAQQRLaaEQELIRLRAETEqgEQQRQLLeeelarlqreaaaatqkrrELEAE 1752
Cdd:PTZ00491 662 KSQEAAARHQaELLEQE-----------ARGRL--ERQKMHDKAKAE--EQRTKLL-------------------ELQAE 707
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1753 LAKVRAemevllASKARAEEESRSTSEKSKQRLEAEAGRFRelaEEAARLRALAE-EAKRQRQLAEEDAVRQRAEAERVL 1831
Cdd:PTZ00491 708 SAAVES------SGQSRAEALAEAEARLIEAEAEVEQAELR---AKALRIEAEAElEKLRKRQELELEYEQAQNELEIAK 778
|
170
....*....|....
gi 1920237962 1832 AEKLAAIsEATRLK 1845
Cdd:PTZ00491 779 AKELADI-EATKFE 791
|
|
| PRK11281 |
PRK11281 |
mechanosensitive channel MscK; |
2160-2356 |
3.87e-04 |
|
mechanosensitive channel MscK;
Pssm-ID: 236892 [Multi-domain] Cd Length: 1113 Bit Score: 46.83 E-value: 3.87e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2160 DAEMEKHKQFAEQALR---QKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQR------GQVEEELFSLR 2230
Cdd:PRK11281 55 EAEDKLVQQDLEQTLAlldKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDDNDEETRETlstlslRQLESRLAQTL 134
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2231 VQMEELGklKARIEAENRALVLRDK--------DSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAeedLAQQRA 2302
Cdd:PRK11281 135 DQLQNAQ--NDLAEYNSQLVSLQTQperaqaalYANSQRLQQIRNLLKGGKVGGKALRPSQRVLLQAEQAL---LNAQND 209
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2303 LAEKMLK--EKMQAVQEATR--LKAEAELLQQQKELAQE--QARRLQEDKEQMAQQLAQE 2356
Cdd:PRK11281 210 LQRKSLEgnTQLQDLLQKQRdyLTARIQRLEHQLQLLQEaiNSKRLTLSEKTVQEAQSQD 269
|
|
| COG2433 |
COG2433 |
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only]; |
1673-1854 |
4.06e-04 |
|
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain] Cd Length: 644 Bit Score: 46.77 E-value: 4.06e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1673 GKAEEQAVRQRELAEQELEKQRQLAEGTAQQR-LAAEQELIRLRAEteqgeqQRQLLEEELARLQREAAAATQKRRELEA 1751
Cdd:COG2433 375 GLSIEEALEELIEKELPEEEPEAEREKEHEEReLTEEEEEIRRLEE------QVERLEAEVEELEAELEEKDERIERLER 448
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1752 ELAKVRAEMEvllaSKARAEEESRstsekskqRLEAEAGRF-RELAEEAARLRALAEEAKRQRQLAEEDavrqrAEAERV 1830
Cdd:COG2433 449 ELSEARSEER----REIRKDREIS--------RLDREIERLeRELEEERERIEELKRKLERLKELWKLE-----HSGELV 511
|
170 180
....*....|....*....|....
gi 1920237962 1831 LAEKLAAISEATRLKTEAEIALKE 1854
Cdd:COG2433 512 PVKVVEKFTKEAIRRLEEEYGLKE 535
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3421-3457 |
4.32e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 40.54 E-value: 4.32e-04
10 20 30
....*....|....*....|....*....|....*..
gi 1920237962 3421 KLLSAEKAVTGYRDPYSGSTISLFQAMKKGLVLREHG 3457
Cdd:smart00250 2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| MAP7 |
pfam05672 |
MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is ... |
1366-1497 |
4.34e-04 |
|
MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is presumably controlled by tissue-specific microtubule-associated proteins (MAPs). The 115-kDa epithelial MAP (E-MAP-115/MAP7) has been identified as a microtubule-stabilising protein predominantly expressed in cell lines of epithelial origin. The binding of this microtubule associated protein is nucleotide independent.
Pssm-ID: 461709 [Multi-domain] Cd Length: 153 Bit Score: 43.88 E-value: 4.34e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1366 EKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHL--RQSSEAEIQAKARQVEAAERSR 1443
Cdd:pfam05672 11 EAARILAEKRRQAREQREREEQERLEKEEEERLRKEELRRRAEEERARREEEARRLeeERRREEEERQRKAEEEAEEREQ 90
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 1920237962 1444 LRIEEEIRVVRLQLEATERQRGGAEGELQalraraeeaEAQKRQAQEEAERLRR 1497
Cdd:pfam05672 91 REQEEQERLQKQKEEAEAKAREEAERQRQ---------EREKIMQQEEQERLER 135
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
1314-1594 |
4.34e-04 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 46.43 E-value: 4.34e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1314 DLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQrAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRM 1393
Cdd:pfam07888 77 ELESRVAELKEELRQSREKHEELEEKYKELSASSEEL-SEEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETELERM 155
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1394 QEEVARREEVAVEAQEQKRSIQEELQHLRQ---SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGE 1470
Cdd:pfam07888 156 KERAKKAGAQRKEEEAERKQLQAKLQQTEEelrSLSKEFQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKEAENEAL 235
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1471 LQALRARAEEAEAQKRQAQ------EEAERLRRQVQDETQRKRQAEAELALRVQAEAEAARE-KQRALQALEELRLQAE- 1542
Cdd:pfam07888 236 LEELRSLQERLNASERKVEglgeelSSMAAQRDRTQAELHQARLQAAQLTLQLADASLALREgRARWAQERETLQQSAEa 315
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 1920237962 1543 EAERRLRQAEAERARQVQVALETAQR-SAEAELQSEHASFAEKTAQLERTLKE 1594
Cdd:pfam07888 316 DKDRIEKLSAELQRLEERLQEERMEReKLEVELGREKDCNRVQLSESRRELQE 368
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1428-1595 |
4.44e-04 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 45.30 E-value: 4.44e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1428 EIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQvQDETQRKR 1507
Cdd:COG1579 11 DLQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQ-LGNVRNNK 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1508 QAEAelalrVQAEAEAAREKQRAL-QALEELRLQAEEAERRLRQAEAERARQVQ--VALETAQRSAEAELQSEHASFAEK 1584
Cdd:COG1579 90 EYEA-----LQKEIESLKRRISDLeDEILELMERIEELEEELAELEAELAELEAelEEKKAELDEELAELEAELEELEAE 164
|
170
....*....|.
gi 1920237962 1585 TAQLERTLKEE 1595
Cdd:COG1579 165 REELAAKIPPE 175
|
|
| tolA_full |
TIGR02794 |
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ... |
2253-2423 |
4.50e-04 |
|
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely conserved operon struction. //The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]
Pssm-ID: 274303 [Multi-domain] Cd Length: 346 Bit Score: 45.99 E-value: 4.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2253 RDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATR-LKAEAELLQQQ 2331
Cdd:TIGR02794 65 KEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAkAKAEAEAERKA 144
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2332 KELAQEQARrlqEDKEQMAQQLAQetqgfQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGE 2411
Cdd:TIGR02794 145 KEEAAKQAE---EEAKAKAAAEAK-----KKAEEAKKKAEAEAKAKAEAEAKAKAEEAKAKAEAAKAKAAAEAAAKAEAE 216
|
170
....*....|..
gi 1920237962 2412 RLYRTELATQEK 2423
Cdd:TIGR02794 217 AAAAAAAEAERK 228
|
|
| PRK05035 |
PRK05035 |
electron transport complex protein RnfC; Provisional |
1674-1903 |
4.61e-04 |
|
electron transport complex protein RnfC; Provisional
Pssm-ID: 235334 [Multi-domain] Cd Length: 695 Bit Score: 46.48 E-value: 4.61e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1674 KAEE--------QAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEEL--------ARLQR 1737
Cdd:PRK05035 447 KAEEakarfearQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARpdnsaviaAREAR 526
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1738 EAAAATQKRRELEAE-----LAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfrelaeeAARLRALAeeAKRQ 1812
Cdd:PRK05035 527 KAQARARQAEKQAAAaadpkKAAVAAAIARAKAKKAAQQAANAEAEEEVDPKKAAVA---------AAIARAKA--KKAA 595
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1813 RQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAedeafqrrlleeqAAQHKAdiEARLA 1892
Cdd:PRK05035 596 QQAASAEPEEQVAEVDPKKAAVAAAIARAKAKKAEQQANAEPEEPVDPRKAAVA-------------AAIARA--KARKA 660
|
250
....*....|.
gi 1920237962 1893 QLRKASESELE 1903
Cdd:PRK05035 661 AQQQANAEPEE 671
|
|
| COG3903 |
COG3903 |
Predicted ATPase [General function prediction only]; |
1598-2053 |
4.89e-04 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443109 [Multi-domain] Cd Length: 933 Bit Score: 46.55 E-value: 4.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1598 AVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEE 1677
Cdd:COG3903 477 AAERLAEAGERAAARRRHADYYLALAERAAAELRGPDQLAWLARLDAEHDNLRAALRWALAHGDAELALRLAAALAPFWF 556
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1678 QAVRQRElAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVR 1757
Cdd:COG3903 557 LRGLLRE-GRRWLERALAAAGEAAAALAAAAALAAAAAAARAAAAAAAAAAAAAAAAAAAAAAAAAALLLLAALAAAAAA 635
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1758 AEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAA 1837
Cdd:COG3903 636 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAALAAAAAALAAAAAAAALAAAAAAALAAAA 715
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1838 ISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRR 1917
Cdd:COG3903 716 AAAAAAAAAAALLAAAAAAALAAAAAAAALALAAAAAAAAAAAAAAALAAAAAAAALAALLLALAAAAAALAAAAAAAAA 795
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1918 QVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAA 1997
Cdd:COG3903 796 AAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAALAAALAAAAAAAAAAAAAAAAAAALAAALAAAAAAAAAAALAAAAA 875
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 1998 RQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFA 2053
Cdd:COG3903 876 AAAAAAAALLAAAAAAAAAAAAAAAAAAALAAAAAAAAAAALAAAAAAAAAAAAAA 931
|
|
| PRK00409 |
PRK00409 |
recombination and DNA strand exchange inhibitor protein; Reviewed |
2440-2570 |
5.00e-04 |
|
recombination and DNA strand exchange inhibitor protein; Reviewed
Pssm-ID: 234750 [Multi-domain] Cd Length: 782 Bit Score: 46.36 E-value: 5.00e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2440 DAERLREAIAELEHEKDKLKQEAQLLqlkseemqtvrqEQLLQETQALQQSFLSEKDSLLQRErciEQEKAKLEQLFQDE 2519
Cdd:PRK00409 514 DKEKLNELIASLEELERELEQKAEEA------------EALLKEAEKLKEELEEKKEKLQEEE---DKLLEEAEKEAQQA 578
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2520 V--AKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGV-------RRQQEELQ 2570
Cdd:PRK00409 579 IkeAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKekkkkkqKEKQEELK 638
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1415-1773 |
5.14e-04 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 45.66 E-value: 5.14e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1415 QEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAER 1494
Cdd:COG4372 12 RLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQA 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1495 LRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAEL 1574
Cdd:COG4372 92 AQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQ 171
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1575 QSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLT 1654
Cdd:COG4372 172 ELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEEL 251
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1655 QAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELAR 1734
Cdd:COG4372 252 LEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELA 331
|
330 340 350
....*....|....*....|....*....|....*....
gi 1920237962 1735 LQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEE 1773
Cdd:COG4372 332 LAILLAELADLLQLLLVGLLDNDVLELLSKGAEAGVADG 370
|
|
| rne |
PRK10811 |
ribonuclease E; Reviewed |
1341-1606 |
5.26e-04 |
|
ribonuclease E; Reviewed
Pssm-ID: 236766 [Multi-domain] Cd Length: 1068 Bit Score: 46.57 E-value: 5.26e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1341 EEEERLAEQQRAEERER----------LAEVEAALEKQR---QLAEAHAQAKAQAEREAQGLQRR----MQEEVARREEV 1403
Cdd:PRK10811 507 EEAMALPSEEEFAERKRpeqpalatfaMPDVPPAPTPAEpaaPVVAAAPKAAAATPPAQPGLLSRffgaLKALFSGGEET 586
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1404 AVEAQEQKRSIQEElqhlRQSSEaeiQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEA 1483
Cdd:PRK10811 587 KPQEQPAPKAEAKP----ERQQD---RRKPRQNNRRDRNERRDTRDNRTRREGRENREENRRNRRQAQQQTAETRESQQA 659
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1484 QKRQAQEEAERLRRQVQDETQRKRQAEAELAlrvQAEAEAarekqralQALEELRLQAEEAERRLRQAEAERAR------ 1557
Cdd:PRK10811 660 EVTEKARTQDEQQQAPRRERQRRRNDEKRQA---QQEAKA--------LNVEEQSVQETEQEERVQQVQPRRKQrqlnqk 728
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 1558 ---QVQVALETAQRSAEAELQSEHASfAEKTAQLERTLKEEHVAVVQLREEA 1606
Cdd:PRK10811 729 vriEQSVAEEAVAPVVEETVAAEPVV-QEVPAPRTELVKVPLPVVAQTAPEQ 779
|
|
| PRK12678 |
PRK12678 |
transcription termination factor Rho; Provisional |
1424-1558 |
5.29e-04 |
|
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain] Cd Length: 672 Bit Score: 46.44 E-value: 5.29e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1424 SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERlRRQVQDET 1503
Cdd:PRK12678 69 TPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARR-GAARKAGE 147
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 1920237962 1504 QRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQ 1558
Cdd:PRK12678 148 GGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDD 202
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3382-3417 |
5.63e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 40.16 E-value: 5.63e-04
10 20 30
....*....|....*....|....*....|....*.
gi 1920237962 3382 TLLLEAQAATGFLVDPVRNQRLYVHEAVKAGVVGPE 3417
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE 36
|
|
| MARTX_Nterm |
NF012221 |
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model ... |
2280-2558 |
5.77e-04 |
|
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model describes the N-terminal 1900 amino acids of MARTX family multifunctional-autoprocessing repeats-in-toxin holotoxins, which contain both repeat regions that facilitate their entry into eukaryotic target cells, and multiple effector domains.
Pssm-ID: 467957 [Multi-domain] Cd Length: 1848 Bit Score: 46.37 E-value: 5.77e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2280 SVAAQEAARLRQLAEEDLAQQRALAEKmlkekmqavqeatrlkaeaellqqqkELAQEQARRLQEDKeqmAQQLAqETQG 2359
Cdd:NF012221 1538 SESSQQADAVSKHAKQDDAAQNALADK--------------------------ERAEADRQRLEQEK---QQQLA-AISG 1587
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2360 FQKTLETERQRQLEMSAEAERlrlrvaemsraqARAEEDARRFRKQAEDIGERLyrtelatqekvmlvQTLETQRQQSDR 2439
Cdd:NF012221 1588 SQSQLESTDQNALETNGQAQR------------DAILEESRAVTKELTTLAQGL--------------DALDSQATYAGE 1641
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2440 DAERLREAIAE--LEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSllqrerciEQEKAKLEQLFQ 2517
Cdd:NF012221 1642 SGDQWRNPFAGglLDRVQEQLDDAKKISGKQLADAKQRHVDNQQKVKDAVAKSEAGVAQG--------EQNQANAEQDID 1713
|
250 260 270 280
....*....|....*....|....*....|....*....|..
gi 1920237962 2518 DEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRR-QHEA 2558
Cdd:NF012221 1714 DAKADAEKRKDDALAKQNEAQQAESDANAAANDAQSRgEQDA 1755
|
|
| YqiK |
COG2268 |
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown]; |
1664-1827 |
5.81e-04 |
|
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
Pssm-ID: 441869 [Multi-domain] Cd Length: 439 Bit Score: 46.02 E-value: 5.81e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1664 EAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEgtaQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREaaaat 1743
Cdd:COG2268 229 EQEREIETARIAEAEAELAKKKAEERREAETARAE---AEAAYEIAEANAEREVQRQLEIAEREREIELQEKEAE----- 300
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1744 QKRRELEAELaKVRAEmevllASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEedAVRQ 1823
Cdd:COG2268 301 REEAELEADV-RKPAE-----AEKQAAEAEAEAEAEAIRAKGLAEAEGKRALAEAWNKLGDAAILLMLIEKLPE--IAEA 372
|
....
gi 1920237962 1824 RAEA 1827
Cdd:COG2268 373 AAKP 376
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3053-3086 |
6.21e-04 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 40.16 E-value: 6.21e-04
10 20 30
....*....|....*....|....*....|....
gi 1920237962 3053 LLEAQAGTGHIIDPTTSARLTVDEAVRAGLVGPE 3086
Cdd:smart00250 3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE 36
|
|
| PRK11637 |
PRK11637 |
AmiB activator; Provisional |
1702-1970 |
6.40e-04 |
|
AmiB activator; Provisional
Pssm-ID: 236942 [Multi-domain] Cd Length: 428 Bit Score: 45.84 E-value: 6.40e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1702 QQRLAAEQELIRlraeteQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKS 1781
Cdd:PRK11637 53 QQDIAAKEKSVR------QQQQQRASLLAQLKKQEEAISQASRKLRETQNTLNQLNKQIDELNASIAKLEQQQAAQERLL 126
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1782 KQRLEAEagrFRELAEEAARLRALAEEAKRqrqlaeedavrqraeAERVLAeKLAAISEAtRLKTEAEialkekeaener 1861
Cdd:PRK11637 127 AAQLDAA---FRQGEHTGLQLILSGEESQR---------------GERILA-YFGYLNQA-RQETIAE------------ 174
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1862 LRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVedtlrqrrqveeeilALKGSFEKAAAGKAELE 1941
Cdd:PRK11637 175 LKQTREELAAQKAELEEKQSQQKTLLYEQQAQQQKLEQARNERKKTLT---------------GLESSLQKDQQQLSELR 239
|
250 260 270
....*....|....*....|....*....|....*..
gi 1920237962 1942 LELGRIRGT-AEDTLRSKEQAEQEA-------ARQRQ 1970
Cdd:PRK11637 240 ANESRLRDSiARAEREAKARAEREAreaarvrDKQKQ 276
|
|
| CH_PARVA_B_rpt2 |
cd21306 |
second calponin homology (CH) domain found in the alpha/beta parvin subfamily; The alpha/beta ... |
43-156 |
6.64e-04 |
|
second calponin homology (CH) domain found in the alpha/beta parvin subfamily; The alpha/beta parvin subfamily includes alpha-parvin and beta-parvin. Alpha-parvin, also called actopaxin, calponin-like integrin-linked kinase-binding protein (CH-ILKBP), or matrix-remodeling-associated protein 2, plays a role in sarcomere organization and in smooth muscle cell contraction. It is required for normal development of the embryonic cardiovascular system, and for normal septation of the heart outflow tract. Beta-parvin, also called affixin, is an adapter protein that plays a role in integrin signaling via ILK and in activation of the GTPases Cdc42 and Rac1 by guanine exchange factors, such as ARHGEF6. Both alpha-parvin and beta-parvin are involved in the reorganization of the actin cytoskeleton and the formation of lamellipodia, and both play roles in cell adhesion, cell spreading, establishment or maintenance of cell polarity, and cell migration. Members of this subfamily contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.
Pssm-ID: 409155 Cd Length: 121 Bit Score: 42.41 E-value: 6.64e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 43 VQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPrerdvIRSSRLPREKGRmrfHKLQNVQIALDYL 122
Cdd:cd21306 16 VVKKSLITFVNKHLNKLNLEVTDLDTQFHDGVYLVLLMGLLEGYFVP-----LHSFHLTPTSFE---QKVHNVQFAFELM 87
|
90 100 110
....*....|....*....|....*....|....
gi 1920237962 123 RHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHF 156
Cdd:cd21306 88 QDAGLPKPKARPEDIVNLDLKSTLRVLYNLFTKY 121
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1998-2487 |
7.09e-04 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 45.91 E-value: 7.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1998 RQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRS 2077
Cdd:COG4717 64 RKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAE 143
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2078 EAeaarraaeeaeaareraereaaqsrRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAArraqaeqaalRQKQ 2157
Cdd:COG4717 144 LP-------------------------ERLEELEERLEELRELEEELEELEAELAELQEELEELLE----------QLSL 188
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2158 AADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKA-EVTEAARQRGQVEEELFSLRVQMEEL 2236
Cdd:COG4717 189 ATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALeERLKEARLLLLIAAALLALLGLGGSL 268
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2237 GKLKARI------EAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKE 2310
Cdd:COG4717 269 LSLILTIagvlflVLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEE 348
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2311 KMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKtleTERQRQLEMSAEAERLRLRVAEMSR 2390
Cdd:COG4717 349 LQELLREAEELEEELQLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQEL---KEELEELEEQLEELLGELEELLEAL 425
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2391 AQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAER--LREAIAELEHEKDKLKQEAQLLQLK 2468
Cdd:COG4717 426 DEEELEEELEELEEELEELEEELEELREELAELEAELEQLEEDGELAELLQELeeLKAELRELAEEWAALKLALELLEEA 505
|
490
....*....|....*....
gi 1920237962 2469 SEEMQTVRQEQLLQETQAL 2487
Cdd:COG4717 506 REEYREERLPPVLERASEY 524
|
|
| CEP63 |
pfam17045 |
Centrosomal protein of 63 kDa; CEP63 is a family of eukaryotic proteins involved in centriole ... |
1673-1795 |
7.20e-04 |
|
Centrosomal protein of 63 kDa; CEP63 is a family of eukaryotic proteins involved in centriole activity.
Pssm-ID: 465338 [Multi-domain] Cd Length: 264 Bit Score: 44.81 E-value: 7.20e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1673 GKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQ-ELIRLRAETEQGEQQRQLLEEELARLQR-----EAAAATQKR 1746
Cdd:pfam17045 127 GKLEEFRQKSLEWEQQRLQYQQQVASLEAQRKALAEQsSLIQSAAYQVQLEGRKQCLEASQSEIQRlrsklERAQDSLCA 206
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 1920237962 1747 RELEAELAKVRAE-----MEVLLASKARAEEESRStSEKSKQRLEAEAGRFREL 1795
Cdd:pfam17045 207 QELELERLRMRVSelgdsNRKLLEEQQRLLEELRM-SQRQLQVLQNELMELKAT 259
|
|
| Filament |
pfam00038 |
Intermediate filament protein; |
1330-1594 |
7.55e-04 |
|
Intermediate filament protein;
Pssm-ID: 459643 [Multi-domain] Cd Length: 313 Bit Score: 44.91 E-value: 7.55e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1330 IRFISETLRRMEEEERLAEQQRAEERERLAEV-EAALEKQRQLAEAHAQAKAQAEREAQGLQrrmqeevarreevavEAQ 1408
Cdd:pfam00038 20 VRFLEQQNKLLETKISELRQKKGAEPSRLYSLyEKEIEDLRRQLDTLTVERARLQLELDNLR---------------LAA 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1409 EQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIrvvrlqleaterqrggaegelQALRaraEEAEAQKRQA 1488
Cdd:pfam00038 85 EDFRQKYEDELNLRTSAENDLVGLRKDLDEATLARVDLEAKI---------------------ESLK---EELAFLKKNH 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1489 QEEAERLRRQVQDETQ-------RKRQAEAELA-LRVQAEAEAAREKQRA----LQALEELRLQAEEAERRLRQAEAERA 1556
Cdd:pfam00038 141 EEEVRELQAQVSDTQVnvemdaaRKLDLTSALAeIRAQYEEIAAKNREEAeewyQSKLEELQQAAARNGDALRSAKEEIT 220
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....
gi 1920237962 1557 ---RQVQ-------------VALETAQRSAEAELQSEHASFAEKTAQLERTLKE 1594
Cdd:pfam00038 221 elrRTIQsleielqslkkqkASLERQLAETEERYELQLADYQELISELEAELQE 274
|
|
| TPH |
pfam13868 |
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ... |
2157-2480 |
7.65e-04 |
|
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.
Pssm-ID: 464007 [Multi-domain] Cd Length: 341 Bit Score: 45.29 E-value: 7.65e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2157 QAADAEMEKHKQFAEQALRQKAQVEQELtalRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEEL 2236
Cdd:pfam13868 16 LAAKCNKERDAQIAEKKRIKAEEKEEER---RLDEMMEEERERALEEEEEKEEERKEERKRYRQELEEQIEEREQKRQEE 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2237 GKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARlSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQ 2316
Cdd:pfam13868 93 YEEKLQEREQMDEIVERIQEEDQAEAEEKLEKQRQLREEIDE-FNEEQAEWKELEKEEEREEDERILEYLKEKAEREEER 171
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2317 EATRLKAEAELLQQQKELA--QEQARRLQEDKEQMAQQLAQETQGF---QKTLETERQRQLEMSAEAERLRLRVAEMSRA 2391
Cdd:pfam13868 172 EAEREEIEEEKEREIARLRaqQEKAQDEKAERDELRAKLYQEEQERkerQKEREEAEKKARQRQELQQAREEQIELKERR 251
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2392 QARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQtLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEE 2471
Cdd:pfam13868 252 LAEEAEREEEEFERMLRKQAEDEEIEQEEAEKRRMKR-LEHRRELEKQIEEREEQRAAEREEELEEGERLREEEAERRER 330
|
....*....
gi 1920237962 2472 MQTVRQEQL 2480
Cdd:pfam13868 331 IEEERQKKL 339
|
|
| tolA |
PRK09510 |
cell envelope integrity inner membrane protein TolA; Provisional |
2195-2356 |
8.00e-04 |
|
cell envelope integrity inner membrane protein TolA; Provisional
Pssm-ID: 236545 [Multi-domain] Cd Length: 387 Bit Score: 45.18 E-value: 8.00e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2195 DHQKSILDEELQRLKAEVTEAARQRGQVEEELfsLRVQMEELGKLKARieaenralvlRDKDSAQRLLQEEAEKMKQVAE 2274
Cdd:PRK09510 69 QQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQ--ERLKQLEKERLAAQ----------EQKKQAEEAAKQAALKQKQAEE 136
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2275 EAARLSVAAQEAARLRQLAEEDLAQQrALAEKMLKEKMQAVQEA---TRLKAEAELLQQQKELAQEQArrlQEDKEQMAQ 2351
Cdd:PRK09510 137 AAAKAAAAAKAKAEAEAKRAAAAAKK-AAAEAKKKAEAEAAKKAaaeAKKKAEAEAAAKAAAEAKKKA---EAEAKKKAA 212
|
....*
gi 1920237962 2352 QLAQE 2356
Cdd:PRK09510 213 AEAKK 217
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
1088-1743 |
8.01e-04 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 45.87 E-value: 8.01e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1088 SLVIRSTQEAEEVLRAHEEQLKEAQAVPATLPE-LEATKAALKKLRAQAEAQqpvfdalRDELR-GAQEVGERLQQRHGE 1165
Cdd:pfam05483 158 NLLKETCARSAEKTKKYEYEREETRQVYMDLNNnIEKMILAFEELRVQAENA-------RLEMHfKLKEDHEKIQHLEEE 230
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1166 RDVEVERWRERVTLLLerwqavlAQTDVRQRELEQLGRQLRYYRESADPLgawlrdakQRQEQIQAVPLANSQAVREQLR 1245
Cdd:pfam05483 231 YKKEINDKEKQVSLLL-------IQITEKENKMKDLTFLLEESRDKANQL--------EEKTKLQDENLKELIEKKDHLT 295
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1246 QEkalLEDIERHGEKVEECQRFAKQYINAIKDYELQLVTYK-AQLEPVASPAK---------KPKVQSGSESIIQEYVDL 1315
Cdd:pfam05483 296 KE---LEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKeAQMEELNKAKAahsfvvtefEATTCSLEELLRTEQQRL 372
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1316 RTRYSELSTLTSQYIRFISEtLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLaEAHAQAKAQAEREAQGLQRRMQE 1395
Cdd:pfam05483 373 EKNEDQLKIITMELQKKSSE-LEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQF-EKIAEELKGKEQELIFLLQAREK 450
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1396 EVARREEVAVEAQEQKRSIQEELQHLRQSSEAEiqaKARQVEAAERSRLRIEEEIRVVR------LQLEATERQRGGAEG 1469
Cdd:pfam05483 451 EIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKE---KLKNIELTAHCDKLLLENKELTQeasdmtLELKKHQEDIINCKK 527
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1470 ELQALRARAEEAEAQKRQAQEEAERLRR---QVQDETQRKRQAEAELALRVQAEAEAAREKQRALQ-ALEELRLQAEEAE 1545
Cdd:pfam05483 528 QEERMLKQIENLEEKEMNLRDELESVREefiQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILEnKCNNLKKQIENKN 607
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1546 RRLRQAEAE-RARQVQVALETAQRSA--------EAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRaqqqaea 1616
Cdd:pfam05483 608 KNIEELHQEnKALKKKGSAENKQLNAyeikvnklELELASAKQKFEEIIDNYQKEIEDKKISEEKLLEEVEKA------- 680
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1617 eraraeaerelerwQLKANEALRLRLQAEEVAQQKsltqaeaekqkEEAEREARRRGKAEEQAVRQRELAEQELEKQRQL 1696
Cdd:pfam05483 681 --------------KAIADEAVKLQKEIDKRCQHK-----------IAEMVALMEKHKHQYDKIIEERDSELGLYKNKEQ 735
|
650 660 670 680
....*....|....*....|....*....|....*....|....*..
gi 1920237962 1697 AEGTAqqRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAAT 1743
Cdd:pfam05483 736 EQSSA--KAALEIELSNIKAELLSLKKQLEIEKEEKEKLKMEAKENT 780
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
2176-2515 |
8.41e-04 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 45.78 E-value: 8.41e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2176 QKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVT-------EAARQRGQVEEELFSLRVQMEELGKLKARIEAENR 2248
Cdd:TIGR04523 118 QKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEklnnkynDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKLL 197
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2249 AL-----VLRDKDSAQRLLQEEAEKMK----QVAEEAARLSVAAQEAARLRQLAEE---DLAQQRALAEKMLKEKMQAVQ 2316
Cdd:TIGR04523 198 KLelllsNLKKKIQKNKSLESQISELKkqnnQLKDNIEKKQQEINEKTTEISNTQTqlnQLKDEQNKIKKQLSEKQKELE 277
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2317 EATR-----------LKAEAELLQQQKE---------LAQEQARRLQEDKEQMAQ------QLAQETQGFQKTLETERQR 2370
Cdd:TIGR04523 278 QNNKkikelekqlnqLKSEISDLNNQKEqdwnkelksELKNQEKKLEEIQNQISQnnkiisQLNEQISQLKKELTNSESE 357
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2371 QLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAE 2450
Cdd:TIGR04523 358 NSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIK 437
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237962 2451 LEHEKDKLKQEAQLLQLKSEEMQTVRqEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQL 2515
Cdd:TIGR04523 438 NNSEIKDLTNQDSVKELIIKNLDNTR-ESLETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKL 501
|
|
| TolA |
COG3064 |
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis]; |
1674-2055 |
8.62e-04 |
|
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442298 [Multi-domain] Cd Length: 485 Bit Score: 45.42 E-value: 8.62e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1674 KAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAEL 1753
Cdd:COG3064 20 QAEAEKRAAAEAEQKAKEEAEEERLAELEAKRQAEEEAREAKAEAEQRAAELAAEAAKKLAEAEKAAAEAEKKAAAEKAK 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1754 AKVRAEMEVLLASKARAEEESRSTSEKSK--QRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVL 1831
Cdd:COG3064 100 AAKEAEAAAAAEKAAAAAEKEKAEEAKRKaeEEAKRKAEEERKAAEAEAAAKAEAEAARAAAAAAAAAAAAAARAAAGAA 179
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1832 AEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVED 1911
Cdd:COG3064 180 AALVAAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAARAAAASREAALAAVEATEEAALGGAEEAADLAAVGV 259
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1912 TLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLA 1991
Cdd:COG3064 260 LGAALAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAALAAELLGAVAAEEAVLAAAAAAGALVVRGGGAASLEA 339
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237962 1992 AEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQ 2055
Cdd:COG3064 340 ALSLLAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLADVEEAAGAGILAAAGGGGLLGL 403
|
|
| COG4995 |
COG4995 |
Uncharacterized conserved protein, contains CHAT domain [Function unknown]; |
2016-2467 |
8.80e-04 |
|
Uncharacterized conserved protein, contains CHAT domain [Function unknown];
Pssm-ID: 444019 [Multi-domain] Cd Length: 711 Bit Score: 45.73 E-value: 8.80e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2016 EARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARER 2095
Cdd:COG4995 3 ALALLALLAALLAALALALLALALLLLLAALAAAALLLLALLALLLALAAAAAAALAAAALALALLAAAALALLLLALAL 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2096 AEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALR 2175
Cdd:COG4995 83 AALALALLAAALALALAAAALAALALLAALLALAAAAALLALLAALALLALLAALAAALAAAAAAALAAALAAAAAAAAA 162
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2176 QKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDK 2255
Cdd:COG4995 163 AALLALALALAAAALALLALLLAALAAALAAAAAALALLLALLLLAALAAALAAALAALLLALLALAAALLALLLLALLA 242
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2256 DSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELA 2335
Cdd:COG4995 243 LAAAAAALAAAAAALLALAAALLLLAALAALAAAAAAAALAALALAAALALAAAALALALLLAAAAAAALAALALLLLAA 322
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2336 QEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYR 2415
Cdd:COG4995 323 LLLLLAALALLALLLLLAAAALLAAALAAALALAAALALALLAALLLLLAALLALLLEALLLLLLALLAALLLLAAALLA 402
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 2416 TELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQL 2467
Cdd:COG4995 403 LAAAQLLRLLLAALALLLALAAYAAARLALLALIEYIILPDRLYAFVQLYQL 454
|
|
| PRK12678 |
PRK12678 |
transcription termination factor Rho; Provisional |
1338-1526 |
9.03e-04 |
|
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain] Cd Length: 672 Bit Score: 45.67 E-value: 9.03e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1338 RRMEEEERLAEQQRAEERERLAEVEAALEKQRQlAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEE 1417
Cdd:PRK12678 78 RRAARAAAAARQAEQPAAEAAAAKAEAAPAARA-AAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEA 156
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1418 LQHLRQSSEAEIQAKARQVEAAERSRLRieeeiRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRR 1497
Cdd:PRK12678 157 RADAAERTEEEERDERRRRGDREDRQAE-----AERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRR 231
|
170 180
....*....|....*....|....*....
gi 1920237962 1498 QVQDETQRKRQAEAELALRVQAEAEAARE 1526
Cdd:PRK12678 232 RRDRRDARGDDNREDRGDRDGDDGEGRGG 260
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
2156-2444 |
9.21e-04 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 45.71 E-value: 9.21e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2156 KQAADAEMEKHKQFaEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAaRQRGQVEEELfslRVQMEE 2235
Cdd:COG3096 402 QQALDVQQTRAIQY-QQAVQALEKARALCGLPDLTPENAEDYLAAFRAKEQQATEEVLEL-EQKLSVADAA---RRQFEK 476
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2236 LGKLKARIEAEnralVLRDK--DSAQRLLQEEAEKMKQvaeeAARLSVAAQEAARLRQLAEEdlaQQRA--LAEKMLKEK 2311
Cdd:COG3096 477 AYELVCKIAGE----VERSQawQTARELLRRYRSQQAL----AQRLQQLRAQLAELEQRLRQ---QQNAerLLEEFCQRI 545
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2312 MQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLaQETQGFQKTLETERQRQLEMSAEAERLRlrvaEMSRA 2391
Cdd:COG3096 546 GQQLDAAEELEELLAELEAQLEELEEQAAEAVEQRSELRQQL-EQLRARIKELAARAPAWLAAQDALERLR----EQSGE 620
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|...
gi 1920237962 2392 QARAEEDARRFRKQAedigerLYRTELATQEKvmlvQTLETQRQQSDRDAERL 2444
Cdd:COG3096 621 ALADSQEVTAAMQQL------LEREREATVER----DELAARKQALESQIERL 663
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1677-1958 |
9.42e-04 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 44.89 E-value: 9.42e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1677 EQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV 1756
Cdd:COG4372 69 EQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAER 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1757 RAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLA 1836
Cdd:COG4372 149 EEELKELEEQLESLQEELAALEQELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLE 228
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1837 AISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQR 1916
Cdd:COG4372 229 AKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALS 308
|
250 260 270 280
....*....|....*....|....*....|....*....|..
gi 1920237962 1917 RQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSK 1958
Cdd:COG4372 309 LIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVGL 350
|
|
| PRK00409 |
PRK00409 |
recombination and DNA strand exchange inhibitor protein; Reviewed |
1339-1455 |
9.51e-04 |
|
recombination and DNA strand exchange inhibitor protein; Reviewed
Pssm-ID: 234750 [Multi-domain] Cd Length: 782 Bit Score: 45.59 E-value: 9.51e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1339 RMEEEERLAEQQRAEERERLAEVEA---ALEKQ-RQLAEAHAQAKAQAEREAQGLQRRMQEEVAR-----REEVAVEAQE 1409
Cdd:PRK00409 524 SLEELERELEQKAEEAEALLKEAEKlkeELEEKkEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEiikelRQLQKGGYAS 603
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 1920237962 1410 QKRS-IQEELQHLRQSSEAEIQAKARQVEAAErsRLRIEEEIRVVRL 1455
Cdd:PRK00409 604 VKAHeLIEARKRLNKANEKKEKKKKKQKEKQE--ELKVGDEVKYLSL 648
|
|
| PRK12678 |
PRK12678 |
transcription termination factor Rho; Provisional |
1347-1508 |
1.04e-03 |
|
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain] Cd Length: 672 Bit Score: 45.28 E-value: 1.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1347 AEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSE 1426
Cdd:PRK12678 65 AAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARK 144
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1427 AEIQAKARQVEAAERSRLRIEEEirvvrlqlEATERQRGGAEGELQALRARAEEAEAQKRQAQEEaERLRRQVQDETQRK 1506
Cdd:PRK12678 145 AGEGGEQPATEARADAAERTEEE--------ERDERRRRGDREDRQAEAERGERGRREERGRDGD-DRDRRDRREQGDRR 215
|
..
gi 1920237962 1507 RQ 1508
Cdd:PRK12678 216 EE 217
|
|
| PLEC |
smart00250 |
Plectin repeat; |
4381-4418 |
1.10e-03 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 39.39 E-value: 1.10e-03
10 20 30
....*....|....*....|....*....|....*...
gi 1920237962 4381 QRFLEVQYLTGGLIEPDTPGRVALDEALQRGTVDARTA 4418
Cdd:smart00250 1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
2367-2612 |
1.13e-03 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 45.43 E-value: 1.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2367 ERQRQLEMSAEA-ERLRLRVAEMSRAQARAEEDARRFRKQAEdigerlYRTELATQEKVMLVQTLETQRQQsdrdAERLR 2445
Cdd:TIGR02168 176 ETERKLERTRENlDRLEDILNELERQLKSLERQAEKAERYKE------LKAELRELELALLVLRLEELREE----LEELQ 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2446 EAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEqlLQETQALQQSFLSEKDSLLQR-ERCIEQEKAKLEQLFQDEVAKAQ 2524
Cdd:TIGR02168 246 EELKEAEEELEELTAELQELEEKLEELRLEVSE--LEEEIEELQKELYALANEISRlEQQKQILRERLANLERQLEELEA 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2525 ALREeqqrqqqqmqqekqqLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAA 2604
Cdd:TIGR02168 324 QLEE---------------LESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKV 388
|
....*...
gi 1920237962 2605 LARSEEIA 2612
Cdd:TIGR02168 389 AQLELQIA 396
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1330-1573 |
1.14e-03 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 44.82 E-value: 1.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1330 IRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQE 1409
Cdd:COG3883 18 IQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARALYR 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1410 QKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALR-------AraeEAE 1482
Cdd:COG3883 98 SGGSVSYLDVLLGSESFSDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKaeleaakA---ELE 174
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1483 AQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVA 1562
Cdd:COG3883 175 AQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGAAGA 254
|
250
....*....|.
gi 1920237962 1563 LETAQRSAEAE 1573
Cdd:COG3883 255 AGAAAGSAGAA 265
|
|
| SPEC |
smart00150 |
Spectrin repeats; |
615-707 |
1.18e-03 |
|
Spectrin repeats;
Pssm-ID: 197544 [Multi-domain] Cd Length: 101 Bit Score: 41.16 E-value: 1.18e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 615 HGFVAAATKELMWLSDREEEEVGFDWSDRNTNMAAKKEGYSALMHELELKEKKIKEIQSTGDRLLREDHPARPTAESFQA 694
Cdd:smart00150 1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLIEEGHPDAEEIEERLE 80
|
90
....*....|...
gi 1920237962 695 ALQTQWSWMLQLC 707
Cdd:smart00150 81 ELNERWEELKELA 93
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1406-1759 |
1.19e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 44.51 E-value: 1.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1406 EAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQK 1485
Cdd:COG4372 10 KARLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQL 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1486 RQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERA---RQVQVA 1562
Cdd:COG4372 90 QAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLEslqEELAAL 169
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1563 LETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRL 1642
Cdd:COG4372 170 EQELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKE 249
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1643 QAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGE 1722
Cdd:COG4372 250 ELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLE 329
|
330 340 350
....*....|....*....|....*....|....*..
gi 1920237962 1723 QQRQLLEEELARLQREAAAATQKRRELEAELAKVRAE 1759
Cdd:COG4372 330 LALAILLAELADLLQLLLVGLLDNDVLELLSKGAEAG 366
|
|
| GBP_C |
cd16269 |
Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal ... |
1348-1475 |
1.21e-03 |
|
Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal domain. Guanylate-binding proteins (GBPs) are synthesized after activation of the cell by interferons. The biochemical properties of GBPs are clearly different from those of Ras-like and heterotrimeric GTP-binding proteins. They bind guanine nucleotides with low affinity (micromolar range), are stable in their absence, and have a high turnover GTPase. In addition to binding GDP/GTP, they have the unique ability to bind GMP with equal affinity and hydrolyze GTP not only to GDP, but also to GMP. This C-terminal domain has been shown to mediate inhibition of endothelial cell proliferation by inflammatory cytokines.
Pssm-ID: 293879 [Multi-domain] Cd Length: 291 Bit Score: 44.11 E-value: 1.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1348 EQQRAEERERLAEVEAALEKQRQLAEAHAQAKAqAEREaqglqRRMQEEVARREEVavEAQEQKRSIQEELQHLRQSSEA 1427
Cdd:cd16269 177 QSKEAEAEAILQADQALTEKEKEIEAERAKAEA-AEQE-----RKLLEEQQRELEQ--KLEDQERSYEEHLRQLKEKMEE 248
|
90 100 110 120
....*....|....*....|....*....|....*....|....*...
gi 1920237962 1428 EIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRggaegELQALR 1475
Cdd:cd16269 249 ERENLLKEQERALESKLKEQEALLEEGFKEQAELLQE-----EIRSLK 291
|
|
| AtpF |
COG0711 |
FoF1-type ATP synthase, membrane subunit b or b' [Energy production and conversion]; FoF1-type ... |
1471-1570 |
1.22e-03 |
|
FoF1-type ATP synthase, membrane subunit b or b' [Energy production and conversion]; FoF1-type ATP synthase, membrane subunit b or b' is part of the Pathway/BioSystem: FoF1-type ATP synthase
Pssm-ID: 440475 [Multi-domain] Cd Length: 152 Bit Score: 42.47 E-value: 1.22e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1471 LQALRARAEEAE---AQKRQAQEEAERLRRQVQDEtQRKRQAEAElALRVQAEAEAAREKQRALQALEelrlqaEEAERR 1547
Cdd:COG0711 26 LKALDERQEKIAdglAEAERAKEEAEAALAEYEEK-LAEARAEAA-EIIAEARKEAEAIAEEAKAEAE------AEAERI 97
|
90 100
....*....|....*....|...
gi 1920237962 1548 LRQAEAERARQVQVALETAQRSA 1570
Cdd:COG0711 98 IAQAEAEIEQERAKALAELRAEV 120
|
|
| PRK10929 |
PRK10929 |
putative mechanosensitive channel protein; Provisional |
1478-1862 |
1.24e-03 |
|
putative mechanosensitive channel protein; Provisional
Pssm-ID: 236798 [Multi-domain] Cd Length: 1109 Bit Score: 45.43 E-value: 1.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1478 AEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAR---EKQRALQALEELRLQAEEAERRLRQAEAE 1554
Cdd:PRK10929 104 TDALEQEILQVSSQLLEKSRQAQQEQDRAREISDSLSQLPQQQTEARRqlnEIERRLQTLGTPNTPLAQAQLTALQAESA 183
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1555 RARQVQVALETAQRSAE-----AELQSEhaSFAEKTAQLERTLkeehvavvqlreeatrraqqqaeaeraraeaereler 1629
Cdd:PRK10929 184 ALKALVDELELAQLSANnrqelARLRSE--LAKKRSQQLDAYL------------------------------------- 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1630 wqlkanEALRLRLQAEevaqqksltqaeaekqkeeaerearrrgkaeeqavRQRElAEQELEKQRQLAEGTAQQRLAAEQ 1709
Cdd:PRK10929 225 ------QALRNQLNSQ-----------------------------------RQRE-AERALESTELLAEQSGDLPKSIVA 262
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1710 ELIRLRAETEQGEQQRQLLeEELARLQREAAAATQKRREleaELAKVRAEMEVLLASKARAE----EESRSTSEKSKQRL 1785
Cdd:PRK10929 263 QFKINRELSQALNQQAQRM-DLIASQQRQAASQTLQVRQ---ALNTLREQSQWLGVSNALGEalraQVARLPEMPKPQQL 338
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1786 EAEAGRFRelaeeAARLR--ALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAA---------------ISEATRLK--- 1845
Cdd:PRK10929 339 DTEMAQLR-----VQRLRyeDLLNKQPQLRQIRQADGQPLTAEQNRILDAQLRTqrellnsllsggdtlILELTKLKvan 413
|
410
....*....|....*...
gi 1920237962 1846 TEAEIALKE-KEAENERL 1862
Cdd:PRK10929 414 SQLEDALKEvNEATHRYL 431
|
|
| MAD |
pfam05557 |
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ... |
2153-2610 |
1.25e-03 |
|
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
Pssm-ID: 461677 [Multi-domain] Cd Length: 660 Bit Score: 45.12 E-value: 1.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2153 LRQKQAADAEMEKHKQFAEQALRQKAQV----EQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFS 2228
Cdd:pfam05557 78 NRLKKKYLEALNKKLNEKESQLADAREVisclKNELSELRRQIQRAELELQSTNSELEELQERLDLLKAKASEAEQLRQN 157
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2229 LRVQMEELGKLKARIEAENRALVLRDKDSaqrllqEEAEKMKqvaEEAARLSVAAQEAARLRqlaeEDLAQQRALAEKML 2308
Cdd:pfam05557 158 LEKQQSSLAEAEQRIKELEFEIQSQEQDS------EIVKNSK---SELARIPELEKELERLR----EHNKHLNENIENKL 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2309 KEKMQAVQEATRLkaeaellqQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKT-------------LETERQRQLEMS 2375
Cdd:pfam05557 225 LLKEEVEDLKRKL--------EREEKYREEAATLELEKEKLEQELQSWVKLAQDTglnlrspedlsrrIEQLQQREIVLK 296
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2376 AEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTE-----------LATQEKVMLVQTLE------TQRQQSD 2438
Cdd:pfam05557 297 EENSSLTSSARQLEKARRELEQELAQYLKKIEDLNKKLKRHKalvrrlqrrvlLLTKERDGYRAILEsydkelTMSNYSP 376
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2439 RDAERLREA----------IAELEHEKDKLKQEA----QLLQLKSEEMQTVRQeqllQETQALQQSFLSEKDSLLQRERC 2504
Cdd:pfam05557 377 QLLERIEEAedmtqkmqahNEEMEAQLSVAEEELggykQQAQTLERELQALRQ----QESLADPSYSKEEVDSLRRKLET 452
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2505 IEQEKAKLEQlfQDEVAKAQALREEQQRQQQQMQQEKQQLaasmeearrRQHEAEEGVRRQQEELQRLAqqqqqqeklla 2584
Cdd:pfam05557 453 LELERQRLRE--QKNELEMELERRCLQGDYDPKKTKVLHL---------SMNPAAEAYQQRKNQLEKLQ----------- 510
|
490 500
....*....|....*....|....*.
gi 1920237962 2585 EENQRLRERLQHLEEERRAALARSEE 2610
Cdd:pfam05557 511 AEIERLKRLLKKLEDDLEQVLRLPET 536
|
|
| PspC_subgroup_1 |
NF033838 |
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... |
1673-2055 |
1.26e-03 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 45.00 E-value: 1.26e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1673 GKAEEQAVRQRELAEQELEK-----QRQLAEGTAQQRLAAEQELIRLRAE--TEQGEQQRQLLEEELARLQREAAAATQK 1745
Cdd:NF033838 50 SSGNESQKEHAKEVESHLEKilseiQKSLDKRKHTQNVALNKKLSDIKTEylYELNVLKEKSEAELTSKTKKELDAAFEQ 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1746 RRELEAELAKVRAEMEVLLA-----SKARAEEESRSTSEKSKQRLEAEAGRFrELAEEAARLRALAEEAKRQRqlaEEDA 1820
Cdd:NF033838 130 FKKDTLEPGKKVAEATKKVEeaekkAKDQKEEDRRNYPTNTYKTLELEIAES-DVEVKKAELELVKEEAKEPR---DEEK 205
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1821 VRQrAEAErVLAEKlaaiSEATRLKteaEIALKEKEAENERLRRLaedEAFQRRLLEEQAAQHKADIEARLAqlRKASES 1900
Cdd:NF033838 206 IKQ-AKAK-VESKK----AEATRLE---KIKTDREKAEEEAKRRA---DAKLKEAVEKNVATSEQDKPKRRA--KRGVLG 271
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1901 ELERQKGLVEDTLRQRRQVEEEIL---ALKGSFEKAAAGKAELELElGRIRGTAEDTLR-----SKEQAEQEAARqrqla 1972
Cdd:NF033838 272 EPATPDKKENDAKSSDSSVGEETLpspSLKPEKKVAEAEKKVEEAK-KKAKDQKEEDRRnyptnTYKTLELEIAE----- 345
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1973 aeEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKA---KVEEARRLRERAEQESARQLQLAQEAAQKrlQAEEKA 2049
Cdd:NF033838 346 --SDVKVKEAELELVKEEAKEPRNEEKIKQAKAKVESKKAeatRLEKIKTDRKKAEEEAKRKAAEEDKVKEK--PAEQPQ 421
|
....*.
gi 1920237962 2050 HAFAVQ 2055
Cdd:NF033838 422 PAPAPQ 427
|
|
| PRK05035 |
PRK05035 |
electron transport complex protein RnfC; Provisional |
1419-1707 |
1.30e-03 |
|
electron transport complex protein RnfC; Provisional
Pssm-ID: 235334 [Multi-domain] Cd Length: 695 Bit Score: 44.94 E-value: 1.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1419 QHLRQSsEAEIQAKARQVEAAERSRLRIEEeirvvrlqleateRQrggaegelqalrARAEeaeaqkRQAQEEAERLRRQ 1498
Cdd:PRK05035 429 QYYRQA-KAEIRAIEQEKKKAEEAKARFEA-------------RQ------------ARLE------REKAAREARHKKA 476
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1499 VQDETQRKRQAEAELALRVQAEAEAAREK----------QRALQALEELR-LQAEEAERRLRQAEAERARQVQVALETAQ 1567
Cdd:PRK05035 477 AEARAAKDKDAVAAALARVKAKKAAATQPivikagarpdNSAVIAAREARkAQARARQAEKQAAAAADPKKAAVAAAIAR 556
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1568 RSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERElerwqlkANEALRLRLQAEEV 1647
Cdd:PRK05035 557 AKAKKAAQQAANAEAEEEVDPKKAAVAAAIARAKAKKAAQQAASAEPEEQVAEVDPKKA-------AVAAAIARAKAKKA 629
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1648 AQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAA 1707
Cdd:PRK05035 630 EQQANAEPEEPVDPRKAAVAAAIARAKARKAAQQQANAEPEEAEDPKKAAVAAAIARAKA 689
|
|
| TPH |
pfam13868 |
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ... |
1355-1606 |
1.37e-03 |
|
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.
Pssm-ID: 464007 [Multi-domain] Cd Length: 341 Bit Score: 44.52 E-value: 1.37e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1355 RERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKAR 1434
Cdd:pfam13868 12 NSKLLAAKCNKERDAQIAEKKRIKAEEKEEERRLDEMMEEERERALEEEEEKEEERKEERKRYRQELEEQIEEREQKRQE 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1435 QVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELA 1514
Cdd:pfam13868 92 EYEEKLQEREQMDEIVERIQEEDQAEAEEKLEKQRQLREEIDEFNEEQAEWKELEKEEEREEDERILEYLKEKAEREEER 171
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1515 LRVQAEAEAAREKQRAlqaleELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELqsehasfAEKTAQLERTLKE 1594
Cdd:pfam13868 172 EAEREEIEEEKEREIA-----RLRAQQEKAQDEKAERDELRAKLYQEEQERKERQKEREE-------AEKKARQRQELQQ 239
|
250
....*....|..
gi 1920237962 1595 EHVAVVQLREEA 1606
Cdd:pfam13868 240 AREEQIELKERR 251
|
|
| PLEC |
smart00250 |
Plectin repeat; |
2762-2798 |
1.42e-03 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 39.00 E-value: 1.42e-03
10 20 30
....*....|....*....|....*....|....*..
gi 1920237962 2762 KLLSAERAVTGYKDPYTGEQISLFQAMKKDLIVREHG 2798
Cdd:smart00250 2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
1775-1923 |
1.46e-03 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 44.77 E-value: 1.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1775 RSTSEKSKQRLEAEAGRFRELAEEAArlralaEEAKRQRQL-AEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALK 1853
Cdd:PRK12704 26 KKIAEAKIKEAEEEAKRILEEAKKEA------EAIKKEALLeAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLD 99
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 1854 EKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKasesELERQKGLVEDTLRQR--RQVEEEI 1923
Cdd:PRK12704 100 RKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQ----ELERISGLTAEEAKEIllEKVEEEA 167
|
|
| SAC6 |
COG5069 |
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton]; |
46-270 |
1.48e-03 |
|
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton];
Pssm-ID: 227401 [Multi-domain] Cd Length: 612 Bit Score: 44.93 E-value: 1.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 46 KTFTKWVNKHLIKAQrhISDLYEDLRDGhnlISLLEVLSGDSLPRERDVIRSSRLPREKGRM-RFHKLQNVQIALDYLRH 124
Cdd:COG5069 382 RVFTFWLNSLDVSPE--ITNLFGDLRDQ---LILLQALSKKLMPMTVTHKLVKKQPASGIEEnRFKAFENENYAVDLGIT 456
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 125 RQVKLVNIRNDDIADGNpKLTLGLIW-------TIILHFQISDiqvsgqsEDMTAKEKLLLWSQRMV------EGCQGLR 191
Cdd:COG5069 457 EGFSLVGIKGLEILDGI-RLKLTLVWqvlrsntALFNHVLKKD-------GCGLSDSDLCAWLGSLGlkgdkeEGIRSFG 528
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 192 CDNFTTSWRDGRLFNAIIHrhkPTLIDMNKVyRQTNLENLDQA-------FSVAERDLGVTRLLDPEDVDVPQPdEKSII 264
Cdd:COG5069 529 DPAGSVSGVFYLDVLKGIH---SELVDYDLV-TRGFTEFDDIAdarslaiSSKILRSLGAIIKFLPEDINGVRP-RLDVL 603
|
....*.
gi 1920237962 265 TYVSSL 270
Cdd:COG5069 604 TFIESL 609
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
1343-1588 |
1.51e-03 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 44.50 E-value: 1.51e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1343 EERLAEQQRaEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQ----------EEVARREEVAVEAQEQKR 1412
Cdd:pfam07888 33 QNRLEECLQ-ERAELLQAQEAANRQREKEKERYKRDREQWERQRRELESRVAelkeelrqsrEKHEELEEKYKELSASSE 111
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1413 SIQEE---LQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQ 1489
Cdd:pfam07888 112 ELSEEkdaLLAQRAAHEARIRELEEDIKTLTQRVLERETELERMKERAKKAGAQRKEEEAERKQLQAKLQQTEEELRSLS 191
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1490 EEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVAleTAQRS 1569
Cdd:pfam07888 192 KEFQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKEAENEALLEELRSLQERLNASERKVEGLGEELSSMA--AQRDR 269
|
250
....*....|....*....
gi 1920237962 1570 AEAELQSEHASFAEKTAQL 1588
Cdd:pfam07888 270 TQAELHQARLQAAQLTLQL 288
|
|
| DUF4670 |
pfam15709 |
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ... |
2206-2400 |
1.75e-03 |
|
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.
Pssm-ID: 464815 [Multi-domain] Cd Length: 522 Bit Score: 44.56 E-value: 1.75e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2206 QRLKAEVTEAARQRGQVEEElfslrvqmeelgklkarieaenralvlrdkdsaQRLLQEEAEKMKQVAEEAARLSVAAQE 2285
Cdd:pfam15709 341 ERAEMRRLEVERKRREQEEQ---------------------------------RRLQQEQLERAEKMREELELEQQRRFE 387
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2286 AARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEA---ELLQQQKELAQEQARRLQEDKE---QMAQQLAQETQG 2359
Cdd:pfam15709 388 EIRLRKQRLEEERQRQEEEERKQRLQLQAAQERARQQQEEfrrKLQELQRKKQQEEAERAEAEKQrqkELEMQLAEEQKR 467
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 1920237962 2360 FQKTLETERQRQLEMSAEAERLRLRVAEMSRaqARAEEDAR 2400
Cdd:pfam15709 468 LMEMAEEERLEYQRQKQEAEEKARLEAEERR--QKEEEAAR 506
|
|
| PRK10920 |
PRK10920 |
putative uroporphyrinogen III C-methyltransferase; Provisional |
2316-2394 |
1.84e-03 |
|
putative uroporphyrinogen III C-methyltransferase; Provisional
Pssm-ID: 236795 Cd Length: 390 Bit Score: 43.93 E-value: 1.84e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 2316 QEATRLKAEAELLQQQKELAQEQArrlQEDKEQMAQQLAQETqgfqKTLETERQRQLEMSAEAERLRLRVAEMSRAQAR 2394
Cdd:PRK10920 60 QQAQNQTATNDALANQLTALQKAQ---ESQKQELEGILKQQA----KALDQANRQQAALAKQLDELQQKVATISGSDAK 131
|
|
| TolA |
COG3064 |
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis]; |
1456-1919 |
1.93e-03 |
|
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442298 [Multi-domain] Cd Length: 485 Bit Score: 44.26 E-value: 1.93e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1456 QLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALE 1535
Cdd:COG3064 17 RLEQAEAEKRAAAEAEQKAKEEAEEERLAELEAKRQAEEEAREAKAEAEQRAAELAAEAAKKLAEAEKAAAEAEKKAAAE 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1536 ELRLQAEEAERRLRQAEAERARQVQValETAQRSAE--AELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQ 1613
Cdd:COG3064 97 KAKAAKEAEAAAAAEKAAAAAEKEKA--EEAKRKAEeeAKRKAEEERKAAEAEAAAKAEAEAARAAAAAAAAAAAAAARA 174
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1614 AEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQ 1693
Cdd:COG3064 175 AAGAAAALVAAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAARAAAASREAALAAVEATEEAALGGAEEAADL 254
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1694 RQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEE 1773
Cdd:COG3064 255 AAVGVLGAALAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAALAAELLGAVAAEEAVLAAAAAAGALVVRGGGA 334
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1774 SRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALK 1853
Cdd:COG3064 335 ASLEAALSLLAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLADVEEAAGAGILAAAGGGGLLGLRLDLGAALLEA 414
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 1854 EKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQV 1919
Cdd:COG3064 415 ASAVELRVLLALAGAAGAVVALLVKLVADLAGGLVGIGKALTGDADALLGILKAVALDGGAVLADL 480
|
|
| FAM184 |
pfam15665 |
Family with sequence similarity 184, A and B; The function of FAM184 is not known. |
1415-1605 |
2.02e-03 |
|
Family with sequence similarity 184, A and B; The function of FAM184 is not known.
Pssm-ID: 464788 [Multi-domain] Cd Length: 211 Bit Score: 42.73 E-value: 2.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1415 QEELQHLRQSSEAEIQakarqveaaersrlRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAER 1494
Cdd:pfam15665 13 EAEIQALKEAHEEEIQ--------------QILAETREKILQYKSKIGEELDLKRRIQTLEESLEQHERMKRQALTEFEQ 78
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1495 LRRQVQDetqRKRQAEAELALRVQAEA----EAAREKQRALQALEELRLQAE-EAERRLRQAEAERARQVQVALETaqrs 1569
Cdd:pfam15665 79 YKRRVEE---RELKAEAEHRQRVVELSreveEAKRAFEEKLESFEQLQAQFEqEKRKALEELRAKHRQEIQELLTT---- 151
|
170 180 190
....*....|....*....|....*....|....*.
gi 1920237962 1570 aeaeLQSEHASFAEKTAQLERTLKEEHVAVVQLREE 1605
Cdd:pfam15665 152 ----QRAQSASSLAEQEKLEELHKAELESLRKEVED 183
|
|
| Caldesmon |
pfam02029 |
Caldesmon; |
1341-1605 |
2.09e-03 |
|
Caldesmon;
Pssm-ID: 460421 [Multi-domain] Cd Length: 495 Bit Score: 44.09 E-value: 2.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1341 EEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREA-----------QGL-----------QRRMQEEVA 1398
Cdd:pfam02029 5 EEAARERRRRAREERRRQKEEEEPSGQVTESVEPNEHNSYEEDSELkpsgqggldeeEAFldrtakreerrQKRLQEALE 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1399 RREEVAVEAQEQKRSIQEELQHlRQSSEAEIQAKARQVEaAERSRLRIEEEIRVVRL---QLEATERQRGGAEGELQALR 1475
Cdd:pfam02029 85 RQKEFDPTIADEKESVAERKEN-NEEEENSSWEKEEKRD-SRLGRYKEEETEIREKEyqeNKWSTEVRQAEEEGEEEEDK 162
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1476 ARAEEAEAQKRQAQEEAERLRRQVQDE---------------TQRKRQA--EAELALRVQAEAEAAREKQRALQALE-EL 1537
Cdd:pfam02029 163 SEEAEEVPTENFAKEEVKDEKIKKEKKvkyeskvfldqkrghPEVKSQNgeEEVTKLKVTTKRRQGGLSQSQEREEEaEV 242
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1538 RLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAE--KTAQLERTLKEEHVAVVQLREE 1605
Cdd:pfam02029 243 FLEAEQKLEELRRRRQEKESEEFEKLRQKQQEAELELEELKKKREErrKLLEEEEQRRKQEEAERKLREE 312
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
860-1817 |
2.09e-03 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 44.40 E-value: 2.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 860 EAQEAIARLEAQHQALVAlwhQLHTEMKSLLAWQSLGRDMQLIRSWSLATFRTLKpEEQRQALRSLELHYQAFLRDSQDA 939
Cdd:pfam01576 219 DLQEQIAELQAQIAELRA---QLAKKEEELQAALARLEEETAQKNNALKKIRELE-AQISELQEDLESERAARNKAEKQR 294
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 940 GGFGPE-DRLQAEREYGSCSRHYQQLLQSleQGEQEESRCQRCISElkdirlqleacETRtVHRLRLPLDKEPARECAQR 1018
Cdd:pfam01576 295 RDLGEElEALKTELEDTLDTTAAQQELRS--KREQEVTELKKALEE-----------ETR-SHEAQLQEMRQKHTQALEE 360
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1019 ITEQQKaQAEVDGLGKGVARLSAEAEkVLALPEPSPAAPTLRSELELTLGKLE-QVRSLSAIYLEKLKtislvirstQEA 1097
Cdd:pfam01576 361 LTEQLE-QAKRNKANLEKAKQALESE-NAELQAELRTLQQAKQDSEHKRKKLEgQLQELQARLSESER---------QRA 429
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1098 EEVLRAHEEQLkEAQAVPATLPELEATKAALKKLRAQAEAQ-QPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRER 1176
Cdd:pfam01576 430 ELAEKLSKLQS-ELESVSSLLNEAEGKNIKLSKDVSSLESQlQDTQELLQEETRQKLNLSTRLRQLEDERNSLQEQLEEE 508
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1177 VtlllerwqavlaqtdVRQRELEqlgRQLRyyresadPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLEDIEr 1256
Cdd:pfam01576 509 E---------------EAKRNVE---RQLS-------TLQAQLSDMKKKLEEDAGTLEALEEGKKRLQRELEALTQQLE- 562
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1257 hgEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVASPAKKPK----VQSGSESIIQEYVDLRTRYS----ELSTLTSQ 1328
Cdd:pfam01576 563 --EKAAAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSNLEKKQKkfdqMLAEEKAISARYAEERDRAEaearEKETRALS 640
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1329 YIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEreaqglqrRMQEEVARREEVAVEAQ 1408
Cdd:pfam01576 641 LARALEEALEAKEELERTNKQLRAEMEDLVSSKDDVGKNVHELERSKRALEQQVE--------EMKTQLEELEDELQATE 712
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1409 EQKRSIQEELQHLRQSSEAEIQAKArqvEAAERSRLRIEEEIRVVRLQLEATERQRGGA-------EGELQALRARAEEA 1481
Cdd:pfam01576 713 DAKLRLEVNMQALKAQFERDLQARD---EQGEEKRRQLVKQVRELEAELEDERKQRAQAvaakkklELDLKELEAQIDAA 789
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1482 EAQKRQAQEEAERLRRQVQDetqrkRQAEAELALRVQAEAEA-AREKQRALQALEELRLQAEE----AERRLRQAEAERA 1556
Cdd:pfam01576 790 NKGREEAVKQLKKLQAQMKD-----LQRELEEARASRDEILAqSKESEKKLKNLEAELLQLQEdlaaSERARRQAQQERD 864
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1557 R-QVQVALETAQRSAeaeLQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQqaeaeraraeaerelerwqlkaN 1635
Cdd:pfam01576 865 ElADEIASGASGKSA---LQDEKRRLEARIAQLEEELEEEQSNTELLNDRLRKSTLQ----------------------V 919
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1636 EALRLRLQAEEVAQQKSltqaeaekqkeeaerearrrgkaeEQAVRQRELAEQELEKQRQLAEGTAQQRLAAeqELIRLR 1715
Cdd:pfam01576 920 EQLTTELAAERSTSQKS------------------------ESARQQLERQNKELKAKLQEMEGTVKSKFKS--SIAALE 973
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1716 AETEQgeqqrqlLEEELARLQREAAAATQKRRELEAELAKVRAEMEvllaSKARAEEESRSTSEKSKQRLEAEAGRFREL 1795
Cdd:pfam01576 974 AKIAQ-------LEEQLEQESRERQAANKLVRRTEKKLKEVLLQVE----DERRHADQYKDQAEKGNSRMKQLKRQLEEA 1042
|
970 980
....*....|....*....|..
gi 1920237962 1796 AEEAArlRALAEEAKRQRQLAE 1817
Cdd:pfam01576 1043 EEEAS--RANAARRKLQRELDD 1062
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
2428-2611 |
2.13e-03 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 44.34 E-value: 2.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2428 QTLETQRQQSDR----DAERLREAIAELEHEKDKLKqeaqllqlKSEEMQTVRQEQLLQETQ--ALQQSFLSEKDSLLQR 2501
Cdd:pfam17380 281 QKAVSERQQQEKfekmEQERLRQEKEEKAREVERRR--------KLEEAEKARQAEMDRQAAiyAEQERMAMERERELER 352
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2502 ERcIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEK 2581
Cdd:pfam17380 353 IR-QEERKRELERIRQEEIAMEISRMRELERLQMERQQKNERVRQELEAARKVKILEEERQRKIQQQKVEMEQIRAEQEE 431
|
170 180 190
....*....|....*....|....*....|.
gi 1920237962 2582 LLAEENQRL-RERLQHLEEERRAALARSEEI 2611
Cdd:pfam17380 432 ARQREVRRLeEERAREMERVRLEEQERQQQV 462
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1347-1600 |
2.14e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.54 E-value: 2.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1347 AEQQRAEERERlaeVEAALEkqrqlaEAHAQAKAQAEREAQGL---------------------QRRMQEEVARREEVAV 1405
Cdd:PHA03247 1586 AKQQRAEATDR---VTAALR------EALAAHERRAQSEAESLanlktllrvaaipataaktldQARSVAEIVDQIELLL 1656
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1406 EAQEQKRSIQEE----LQHLRQSSEAEIQAKARQVEAAERSRLRieeeirvVRLQLEATERQRggaegeLQALRARAEEA 1481
Cdd:PHA03247 1657 EQTEKAAELDVAavdwLEHARRVFEAHPLTAARGGGPDPLARLH-------ARLDALGETRRR------TEALRRSLEAA 1723
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1482 EAQKRQAQEEAERLRRQvqdetqrkrqaeaelALRVQAEAEAAREKQRALQALEE--LRLQAEEAERRLrqaeAERARQV 1559
Cdd:PHA03247 1724 EAEWDEVWGRFGRVRGG---------------AWKSPEALRAAREQLRALQTATNtvLGLRADAHYERL----PAKYQGA 1784
|
250 260 270 280
....*....|....*....|....*....|....*....|.
gi 1920237962 1560 qVALETAQRSAEAElqsEHASFAEKTAQLERTLKEEHVAVV 1600
Cdd:PHA03247 1785 -LGAKSAERAGAVE---ELGAAVARHDGLLARLREEVVARV 1821
|
|
| COG4995 |
COG4995 |
Uncharacterized conserved protein, contains CHAT domain [Function unknown]; |
2032-2467 |
2.23e-03 |
|
Uncharacterized conserved protein, contains CHAT domain [Function unknown];
Pssm-ID: 444019 [Multi-domain] Cd Length: 711 Bit Score: 44.19 E-value: 2.23e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2032 LQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAE 2111
Cdd:COG4995 1 LLALALLALLAALLAALALALLALALLLLLAALAAAALLLLALLALLLALAAAAAAALAAAALALALLAAAALALLLLAL 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2112 RLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQL 2191
Cdd:COG4995 81 ALAALALALLAAALALALAAAALAALALLAALLALAAAAALLALLAALALLALLAALAAALAAAAAAALAAALAAAAAAA 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2192 EETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQ 2271
Cdd:COG4995 161 AAAALLALALALAAAALALLALLLAALAAALAAAAAALALLLALLLLAALAAALAAALAALLLALLALAAALLALLLLAL 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2272 VAEEAARLSvAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQ 2351
Cdd:COG4995 241 LALAAAAAA-LAAAAAALLALAAALLLLAALAALAAAAAAAALAALALAAALALAAAALALALLLAAAAAAALAALALLL 319
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2352 QLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEM-SRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTL 2430
Cdd:COG4995 320 LAALLLLLAALALLALLLLLAAAALLAAALAAALALAaALALALLAALLLLLAALLALLLEALLLLLLALLAALLLLAAA 399
|
410 420 430
....*....|....*....|....*....|....*..
gi 1920237962 2431 ETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQL 2467
Cdd:COG4995 400 LLALAAAQLLRLLLAALALLLALAAYAAARLALLALI 436
|
|
| PRK00409 |
PRK00409 |
recombination and DNA strand exchange inhibitor protein; Reviewed |
1471-1595 |
2.27e-03 |
|
recombination and DNA strand exchange inhibitor protein; Reviewed
Pssm-ID: 234750 [Multi-domain] Cd Length: 782 Bit Score: 44.43 E-value: 2.27e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1471 LQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEaarekqralQALEELRLQAEEAERRLRQ 1550
Cdd:PRK00409 525 LEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQ---------QAIKEAKKEADEIIKELRQ 595
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 1920237962 1551 AEAERARQVqvaletaqrsAEAELQSEHASFAEKTAQLERTLKEE 1595
Cdd:PRK00409 596 LQKGGYASV----------KAHELIEARKRLNKANEKKEKKKKKQ 630
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
1710-2057 |
2.30e-03 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 44.12 E-value: 2.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1710 ELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAElaKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEA 1789
Cdd:pfam07888 30 ELLQNRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWE--RQRRELESRVAELKEELRQSREKHEELEEKYKELS 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1790 GRFRELAEEAARLRAlAEEAKRQRQLAEEDAVrqRAEAERVLaeklaaiseatrlkteaeialkEKEAENERLRRLAEDE 1869
Cdd:pfam07888 108 ASSEELSEEKDALLA-QRAAHEARIRELEEDI--KTLTQRVL----------------------ERETELERMKERAKKA 162
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1870 AFQRRLLEEQAAQHKADIEARLAQLRKASeSELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRG 1949
Cdd:pfam07888 163 GAQRKEEEAERKQLQAKLQQTEEELRSLS-KEFQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKEAENEALLEELRS 241
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1950 TAE-------------------DTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALE----- 2005
Cdd:pfam07888 242 LQErlnaserkveglgeelssmAAQRDRTQAELHQARLQAAQLTLQLADASLALREGRARWAQERETLQQSAEADkdrie 321
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237962 2006 ----EVERLKAKVEEARRLRERAEQESARQ--LQLAQEAAQKRLQAEEKAhAFAVQQK 2057
Cdd:pfam07888 322 klsaELQRLEERLQEERMEREKLEVELGREkdCNRVQLSESRRELQELKA-SLRVAQK 378
|
|
| PRK09039 |
PRK09039 |
peptidoglycan -binding protein; |
1456-1571 |
2.35e-03 |
|
peptidoglycan -binding protein;
Pssm-ID: 181619 [Multi-domain] Cd Length: 343 Bit Score: 43.42 E-value: 2.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1456 QLEATERQR-GGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQAL 1534
Cdd:PRK09039 67 DLLSLERQGnQDLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQELDSEKQVSARALAQVELLNQQI 146
|
90 100 110 120
....*....|....*....|....*....|....*....|....*..
gi 1920237962 1535 EELR---------LQAEEAERRLRQAE-AERARQVQVALetAQRSAE 1571
Cdd:PRK09039 147 AALRrqlaaleaaLDASEKRDRESQAKiADLGRRLNVAL--AQRVQE 191
|
|
| PRK12705 |
PRK12705 |
hypothetical protein; Provisional |
1365-1571 |
2.37e-03 |
|
hypothetical protein; Provisional
Pssm-ID: 237178 [Multi-domain] Cd Length: 508 Bit Score: 43.93 E-value: 2.37e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1365 LEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEvaRREEVAVEAQEQKRSIQEELQHLrQSSEAEIQAKARQVEAaersrl 1444
Cdd:PRK12705 25 LKKRQRLAKEAERILQEAQKEAEEKLEAALLE--AKELLLRERNQQRQEARREREEL-QREEERLVQKEEQLDA------ 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1445 RIEEeirvvrlqLEATERQRGGAEgelQALRARAEEAEAQKRQAQEEAERLrrqvqdETQRKRQAEAELALRVQAEAEaa 1524
Cdd:PRK12705 96 RAEK--------LDNLENQLEERE---KALSARELELEELEKQLDNELYRV------AGLTPEQARKLLLKLLDAELE-- 156
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 1920237962 1525 REKQRALQALEelrlqaEEAerrlrQAEAERARQVQVAlETAQRSAE 1571
Cdd:PRK12705 157 EEKAQRVKKIE------EEA-----DLEAERKAQNILA-QAMQRIAS 191
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1359-1566 |
2.37e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.54 E-value: 2.37e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1359 AEVEAALEKQRQLAEAHAQAKAQAEREAQGLQ--RRMQEEVARREEVAVEAQEQKRSIQE-----------ELQHLRQSS 1425
Cdd:PHA03247 1150 STVDAAVRAHGVLADAVAALSPAVRDPACPLAflVALADSAAGYVKATRLALDARRAIARlgalgaaaadlAVAVRRENP 1229
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1426 EAE------IQAKARQVEAAERSRLRIEEEIRVVrLQLEATERQRGGAEGELQAL-------RARAEEAEAQkrqAQEEA 1492
Cdd:PHA03247 1230 QAEgdraalLEAAARAVTAAREGLAACEGEFGGL-LHAEGSAGDPSPSGRALQELgkvvgatRRRADELEAA---AADLA 1305
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 1493 ERLRRQVQDETQRKRQAEAELAL-RVQAEAEAAREKQRALQALE-ELRLQAEEAERRLRQAEAERARQVQVALETA 1566
Cdd:PHA03247 1306 EKMAARRARASRERWAADVEAALdRVENRAEFDAVELRRLQALAaTHGYNPRDFRKRAEQALAANAKTATLALEAA 1381
|
|
| HCR |
pfam07111 |
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ... |
1389-2054 |
2.53e-03 |
|
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.
Pssm-ID: 284517 [Multi-domain] Cd Length: 749 Bit Score: 43.97 E-value: 2.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1389 LQRRMQeevARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRL---RIEEEIRVVR-LQLEATERQR 1464
Cdd:pfam07111 21 LERRLD---TQRPTVTMWEQDVSGDGQGPGRRGRSLELEGSQALSQQAELISRQLQelrRLEEEVRLLReTSLQQKMRLE 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1465 GGA-EGELQALRARAEEAEAQK-RQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKqralqALEELRLQAE 1542
Cdd:pfam07111 98 AQAmELDALAVAEKAGQAEAEGlRAALAGAEMVRKNLEEGSQRELEEIQRLHQEQLSSLTQAHEE-----ALSSLTSKAE 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1543 EAERRLRQAEAERARQVQvALETAQRsaEAELQSEHASFAEKTAQLERTLKEEHVAVV--QLREEATRRAQQQAEAERAR 1620
Cdd:pfam07111 173 GLEKSLNSLETKRAGEAK-QLAEAQK--EAELLRKQLSKTQEELEAQVTLVESLRKYVgeQVPPEVHSQTWELERQELLD 249
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1621 AEAERELERWQLKAN-EALRLRLQaeevaqqkSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEG 1699
Cdd:pfam07111 250 TMQHLQEDRADLQATvELLQVRVQ--------SLTHMLALQEEELTRKIQPSDSLEPEFPKKCRSLLNRWREKVFALMVQ 321
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1700 TAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQReaaAATQKRRELEAELAKVRA-EMEVLLASKARAEEESRSTS 1778
Cdd:pfam07111 322 LKAQDLEHRDSVKQLRGQVAELQEQVTSQSQEQAILQR---ALQDKAAEVEVERMSAKGlQMELSRAQEARRRQQQQTAS 398
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1779 EKSKQRLEAEAGRFRELAEEAARLRaLAEEAKRQRQLAEE--DAVRQRAEAERVLAEKLAAIS---EATRLKTEAEIALK 1853
Cdd:pfam07111 399 AEEQLKFVVNAMSSTQIWLETTMTR-VEQAVARIPSLSNRlsYAVRKVHTIKGLMARKVALAQlrqESCPPPPPAPPVDA 477
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1854 EKEAENERLRRlaedeafQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKA 1933
Cdd:pfam07111 478 DLSLELEQLRE-------ERNRLDAELQLSAHLIQQEVGRAREQGEAERQQLSEVAQQLEQELQRAQESLASVGQQLEVA 550
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1934 AAGKAELELELGRIRGTAEDTLRSKEQAEQEA-----ARQRQLAAEEERRRREAEERVQK---SLAAEEEAARQRKAALE 2005
Cdd:pfam07111 551 RQGQQESTEEAASLRQELTQQQEIYGQALQEKvaeveTRLREQLSDTKRRLNEARREQAKavvSLRQIQHRATQEKERNQ 630
|
650 660 670 680 690
....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 2006 EVERLK--AKVEEARRLRERAEQ-ESARQLQLAQEAAQKRLQAEEKAHAFAV 2054
Cdd:pfam07111 631 ELRRLQdeARKEEGQRLARRVQElERDKNLMLATLQQEGLLSRYKQQRLLAV 682
|
|
| Rabaptin |
pfam03528 |
Rabaptin; |
1335-1607 |
2.65e-03 |
|
Rabaptin;
Pssm-ID: 367545 [Multi-domain] Cd Length: 486 Bit Score: 43.94 E-value: 2.65e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1335 ETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEaHAQAKAQAEREAQGLQRRMQEevARREEVAVEAQEQKRSI 1414
Cdd:pfam03528 96 DEVKSQWQEEVASLQAIMKETVREYEVQFHRRLEQERAQ-WNQYRESAEREIADLRRRLSE--GQEEENLEDEMKKAQED 172
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1415 QEELQHLRQSSEAEIQA-KARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALraraeeaEAQKRQAQEEAE 1493
Cdd:pfam03528 173 AEKLRSVVMPMEKEIAAlKAKLTEAEDKIKELEASKMKELNHYLEAEKSCRTDLEMYVAVL-------NTQKSVLQEDAE 245
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1494 RLRRQVQDETQRkrqaeaeLALRVQAEAEAAREKQRAL-QALEELRLQAEEAERRLRQAEAERARQVqvalETAQRSAEA 1572
Cdd:pfam03528 246 KLRKELHEVCHL-------LEQERQQHNQLKHTWQKANdQFLESQRLLMRDMQRMESVLTSEQLRQV----EEIKKKDQE 314
|
250 260 270
....*....|....*....|....*....|....*
gi 1920237962 1573 ELQSEHASFAEKTAQLERTLKEEHVAVVQLREEAT 1607
Cdd:pfam03528 315 EHKRARTHKEKETLKSDREHTVSIHAVFSPAGVET 349
|
|
| Borrelia_P83 |
pfam05262 |
Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins. |
1677-1821 |
2.66e-03 |
|
Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins.
Pssm-ID: 114011 [Multi-domain] Cd Length: 489 Bit Score: 43.84 E-value: 2.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1677 EQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV 1756
Cdd:pfam05262 209 QEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQI 288
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237962 1757 RAEM---EVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAV 1821
Cdd:pfam05262 289 EIKKndeEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPTSLNEDAI 356
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
2153-2317 |
2.67e-03 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 42.60 E-value: 2.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2153 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQ 2232
Cdd:COG1579 9 LLDLQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNN 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2233 mEELGKLKARIEAENRALVLRDkDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKM 2312
Cdd:COG1579 89 -KEYEALQKEIESLKRRISDLE-DEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAERE 166
|
....*
gi 1920237962 2313 QAVQE 2317
Cdd:COG1579 167 ELAAK 171
|
|
| SPEC |
smart00150 |
Spectrin repeats; |
518-612 |
2.72e-03 |
|
Spectrin repeats;
Pssm-ID: 197544 [Multi-domain] Cd Length: 101 Bit Score: 40.01 E-value: 2.72e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 518 LRYLQDLLAWVEENQRRLDSAEWGVDLPSVEAQLGSHRGLHQSVEEFRTKIERARTDEGQL---SPATRGAYRDCLGRLD 594
Cdd:smart00150 4 LRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLieeGHPDAEEIEERLEELN 83
|
90
....*....|....*...
gi 1920237962 595 LQYAKLLSSSKARLRSLE 612
Cdd:smart00150 84 ERWEELKELAEERRQKLE 101
|
|
| Rabaptin |
pfam03528 |
Rabaptin; |
2272-2515 |
2.96e-03 |
|
Rabaptin;
Pssm-ID: 367545 [Multi-domain] Cd Length: 486 Bit Score: 43.56 E-value: 2.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2272 VAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKML--KEKMQAVQEATRLKAEAELLQQQKELAQEQAR--------- 2340
Cdd:pfam03528 6 LQQRVAELEKENAEFYRLKQQLEAEFNQKRAKFKELYlaKEEDLKRQNAVLQEAQVELDALQNQLALARAEmenikavat 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2341 ----RLQEDKEQMAQQLAQETQGFQKTL-ETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRfrkqaedigerlyR 2415
Cdd:pfam03528 86 vsenTKQEAIDEVKSQWQEEVASLQAIMkETVREYEVQFHRRLEQERAQWNQYRESAEREIADLRR-------------R 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2416 TELATQEkvmlvQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQA--------- 2486
Cdd:pfam03528 153 LSEGQEE-----ENLEDEMKKAQEDAEKLRSVVMPMEKEIAALKAKLTEAEDKIKELEASKMKELNHYLEAekscrtdle 227
|
250 260 270 280
....*....|....*....|....*....|....*....|.
gi 1920237962 2487 -------LQQSFLSEKDSLLQRE-----RCIEQEKAKLEQL 2515
Cdd:pfam03528 228 myvavlnTQKSVLQEDAEKLRKElhevcHLLEQERQQHNQL 268
|
|
| TolA |
COG3064 |
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis]; |
1419-1908 |
3.00e-03 |
|
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442298 [Multi-domain] Cd Length: 485 Bit Score: 43.49 E-value: 3.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1419 QHLRQSSEAEIQAKARqVEAAERSRLRIEEEIRVVRLQLEATERQRGGA--EGELQALRARAEEAEAQKRQAQEEAERLR 1496
Cdd:COG3064 2 QEALEEKAAEAAAQER-LEQAEAEKRAAAEAEQKAKEEAEEERLAELEAkrQAEEEAREAKAEAEQRAAELAAEAAKKLA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1497 RQVQDETQRKRQAEAELAlRVQAEAEAAREKQRALQALEELRLQAEEaerrlRQAEAERARQVQVALETAQRSAEAELQS 1576
Cdd:COG3064 81 EAEKAAAEAEKKAAAEKA-KAAKEAEAAAAAEKAAAAAEKEKAEEAK-----RKAEEEAKRKAEEERKAAEAEAAAKAEA 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1577 EHASFAEKTAQLERTLKEEhVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQA 1656
Cdd:COG3064 155 EAARAAAAAAAAAAAAAAR-AAAGAAAALVAAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAARAAAASREAA 233
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1657 EAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQ 1736
Cdd:COG3064 234 LAAVEATEEAALGGAEEAADLAAVGVLGAALAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAALAAELLGAVAA 313
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1737 REAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLA 1816
Cdd:COG3064 314 EEAVLAAAAAAGALVVRGGGAASLEAALSLLAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLADVEEAAGAGILA 393
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1817 EEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRK 1896
Cdd:COG3064 394 AAGGGGLLGLRLDLGAALLEAASAVELRVLLALAGAAGAVVALLVKLVADLAGGLVGIGKALTGDADALLGILKAVALDG 473
|
490
....*....|..
gi 1920237962 1897 ASESELERQKGL 1908
Cdd:COG3064 474 GAVLADLLLLGG 485
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
1344-1605 |
3.06e-03 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 43.66 E-value: 3.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1344 ERLAEQQRAEERERLAEVEaALEKQRQLAEAHAQAkaqaereaqgLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRq 1423
Cdd:pfam10174 453 ERLKEQREREDRERLEELE-SLKKENKDLKEKVSA----------LQPELTEKESSLIDLKEHASSLASSGLKKDSKLK- 520
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1424 SSEAEIQA-------------KARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQK----- 1485
Cdd:pfam10174 521 SLEIAVEQkkeecsklenqlkKAHNAEEAVRTNPEINDRIRLLEQEVARYKEESGKAQAEVERLLGILREVENEKndkdk 600
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1486 ----------RQAQEEAERLR--RQVQDETQRKRQAEAELALRVQ-AEAEAAREKQRA--LQALEELRLQAEEAERRLRQ 1550
Cdd:pfam10174 601 kiaelesltlRQMKEQNKKVAniKHGQQEMKKKGAQLLEEARRREdNLADNSQQLQLEelMGALEKTRQELDATKARLSS 680
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237962 1551 AE--------------AERARQVQVALETAQRSAEAELQSEHASFA--EKTAQLERTLKEEhvaVVQLREE 1605
Cdd:pfam10174 681 TQqslaekdghltnlrAERRKQLEEILEMKQEALLAAISEKDANIAllELSSSKKKKTQEE---VMALKRE 748
|
|
| CusB_dom_1 |
pfam00529 |
Cation efflux system protein CusB domain 1; The cation efflux system protein CusB from E. coli ... |
1422-1575 |
3.41e-03 |
|
Cation efflux system protein CusB domain 1; The cation efflux system protein CusB from E. coli can be divided into four different domains, the first three domains of the protein are mostly beta-strands and the fourth forms an all alpha-helical domain. This entry represents the first beta-domain (domain 1) of CusB and it is formed by the N and C-terminal ends of the polypeptide (residues 89-102 and 324-385). CusB is part of the copper-transporting efflux system CusCFBA. This domain can also be found in other membrane-fusion proteins, such as HlyD, MdtN, MdtE and AaeA. HlyD is a component of the prototypical alpha-haemolysin (HlyA) bacterial type I secretion system, along with the other components HlyB and TolC. HlyD is anchored in the cytoplasmic membrane by a single transmembrane domain and has a large periplasmic domain within the carboxy-terminal 100 amino acids, HlyB and HlyD form a stable complex that binds the recombinant protein bearing a C-terminal HlyA signal sequence and ATP in the cytoplasm. HlyD, HlyB and TolC combine to form the three-component ABC transporter complex that forms a trans-membrane channel or pore through which HlyA can be transferred directly to the extracellular medium. Cutinase has been shown to be transported effectively through this pore.
Pssm-ID: 425733 [Multi-domain] Cd Length: 322 Bit Score: 43.18 E-value: 3.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1422 RQSSEAEIQAKARQVEA-AERSRLRIE-EEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERlRRQV 1499
Cdd:pfam00529 54 TDYQAALDSAEAQLAKAqAQVARLQAElDRLQALESELAISRQDYDGATAQLRAAQAAVKAAQAQLAQAQIDLAR-RRVL 132
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 1500 QDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQ 1575
Cdd:pfam00529 133 APIGGISRESLVTAGALVAQAQANLLATVAQLDQIYVQITQSAAENQAEVRSELSGAQLQIAEAEAELKLAKLDLE 208
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
2278-2396 |
3.50e-03 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 43.61 E-value: 3.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2278 RLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEA---TRLKAEAELLQQQKELaQEQARRLQEDKEQMAQQLA 2354
Cdd:PRK12704 25 RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEihkLRNEFEKELRERRNEL-QKLEKRLLQKEENLDRKLE 103
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 1920237962 2355 ------QETQGFQKTLETERQRQLEMSAEAERLR-------LRVAEMSRAQARAE 2396
Cdd:PRK12704 104 llekreEELEKKEKELEQKQQELEKKEEELEELIeeqlqelERISGLTAEEAKEI 158
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3756-3787 |
3.61e-03 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 37.85 E-value: 3.61e-03
10 20 30
....*....|....*....|....*....|..
gi 1920237962 3756 RLLSAERAVTGYRDPYTEQTISLFQAMKKDLI 3787
Cdd:smart00250 2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLI 33
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1062-1278 |
3.64e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 43.75 E-value: 3.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1062 ELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAHEEQLKEAQAVPATLPELEATKAALkklraqaEAQQPV 1141
Cdd:COG4913 614 ALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAEREIAELEAELERL-------DASSDD 686
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1142 FDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTLLLERWQAVLAQTDvrqrELEQLGRQLRYYResadpLGAWLRD 1221
Cdd:COG4913 687 LAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLE----AAEDLARLELRAL-----LEERFAA 757
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 1222 AKQRqeqiqavplANSQAVREQLRQE-KALLEDIERHGEKVEEC-QRFAKQYINAIKDY 1278
Cdd:COG4913 758 ALGD---------AVERELRENLEERiDALRARLNRAEEELERAmRAFNREWPAETADL 807
|
|
| PLEC |
smart00250 |
Plectin repeat; |
2725-2758 |
3.65e-03 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 37.85 E-value: 3.65e-03
10 20 30
....*....|....*....|....*....|....
gi 1920237962 2725 LLEAQAASGFLLDPVRNRRLAVNEAVKEGIVGPE 2758
Cdd:smart00250 3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE 36
|
|
| PTZ00491 |
PTZ00491 |
major vault protein; Provisional |
2258-2391 |
3.65e-03 |
|
major vault protein; Provisional
Pssm-ID: 240439 [Multi-domain] Cd Length: 850 Bit Score: 43.47 E-value: 3.65e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2258 AQRLLQE-----EAEKMK-QVAEEAAR---LSVAAQEAARlrQLAEEDLAQQRALAEKMLKEkMQAVQEATRLKAEAELL 2328
Cdd:PTZ00491 672 AELLEQEargrlERQKMHdKAKAEEQRtklLELQAESAAV--ESSGQSRAEALAEAEARLIE-AEAEVEQAELRAKALRI 748
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237962 2329 QQQKELAQEQARRLQEdkeqmaqqLAQETQgfQKTLETERQRQLeMSAEAERL--------RLRVAEMSRA 2391
Cdd:PTZ00491 749 EAEAELEKLRKRQELE--------LEYEQA--QNELEIAKAKEL-ADIEATKFerivealgRETLIAIARA 808
|
|
| PRK11281 |
PRK11281 |
mechanosensitive channel MscK; |
1181-1454 |
3.66e-03 |
|
mechanosensitive channel MscK;
Pssm-ID: 236892 [Multi-domain] Cd Length: 1113 Bit Score: 43.75 E-value: 3.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1181 LERWQAVLAQTDVRQRELEQLGRQLryyrESADplgawlRDAKQRQEQIQAVPLANSQAVREQLrqEKALLEDIERhgeK 1260
Cdd:PRK11281 65 LEQTLALLDKIDRQKEETEQLKQQL----AQAP------AKLRQAQAELEALKDDNDEETRETL--STLSLRQLES---R 129
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1261 VEECQRFAKQYINAIKDYELQLVTYKAQLEpvaspakkpKVQSGSESIIQEYVDLRTRYSelSTLTSQyiRFISETLR-R 1339
Cdd:PRK11281 130 LAQTLDQLQNAQNDLAEYNSQLVSLQTQPE---------RAQAALYANSQRLQQIRNLLK--GGKVGG--KALRPSQRvL 196
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1340 MEEEERLAEQQRAEERERLA---EVEAALEKQRQLAEAHAQakaQAEREAQGLQRRM-QEEVARREEVAVEAQEQKRS-- 1413
Cdd:PRK11281 197 LQAEQALLNAQNDLQRKSLEgntQLQDLLQKQRDYLTARIQ---RLEHQLQLLQEAInSKRLTLSEKTVQEAQSQDEAar 273
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1414 ------IQEELQHLRQSSEAEIQAKAR-------------QVEAAERSRLRIEEEIRVVR 1454
Cdd:PRK11281 274 iqanplVAQELEINLQLSQRLLKATEKlntltqqnlrvknWLDRLTQSERNIKEQISVLK 333
|
|
| RecN |
COG0497 |
DNA repair ATPase RecN [Replication, recombination and repair]; |
1470-1604 |
3.71e-03 |
|
DNA repair ATPase RecN [Replication, recombination and repair];
Pssm-ID: 440263 [Multi-domain] Cd Length: 555 Bit Score: 43.53 E-value: 3.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1470 ELQALRARAEEAEAQKRQAQEEAERLRRQVQdetqrkrqaE-AELALRVQAEAEAAREKQRaLQALEELRLQAEEAERRL 1548
Cdd:COG0497 166 AWRALKKELEELRADEAERARELDLLRFQLE---------ElEAAALQPGEEEELEEERRR-LSNAEKLREALQEALEAL 235
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 1549 RQAEAerarQVQVALETAQRSAEaELQSEHASFAEKTAQLERtlkeehvAVVQLRE 1604
Cdd:COG0497 236 SGGEG----GALDLLGQALRALE-RLAEYDPSLAELAERLES-------ALIELEE 279
|
|
| Tektin |
pfam03148 |
Tektin family; Tektins are cytoskeletal proteins. They have been demonstrated in such cellular ... |
1691-1964 |
3.78e-03 |
|
Tektin family; Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers of tektin heterodimers with axial periodicity matching tubulin. Tektin polypeptides consist of several alpha-helical regions that are predicted to form coiled coils. Indeed, tektins share considerable structural similarities with intermediate filament proteins. Possible functional roles for tektins are: stabilization of tubulin protofilaments; attachment of A and B-tubules in ciliary/flagellar microtubule doublets and C-tubules in centrioles; binding of axonemal components.
Pssm-ID: 460827 [Multi-domain] Cd Length: 383 Bit Score: 42.92 E-value: 3.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1691 EKQRQLAEgtaQQRLAAE---QELIRLRAETEQGEQQRQllEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASK 1767
Cdd:pfam03148 6 QELYREAE---AQRNDAErlrQESRRLRNETDAKTKWDQ--YDSNRRLGERIQDITFWKSELEKELEELDEEIELLLEEK 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1768 ARAEEESRSTSEK---SKQRLEAEAGRF----------RELAEEA-------ARLRALAEEAkrQRQLAEEDAVRQRAEA 1827
Cdd:pfam03148 81 RRLEKALEALEEPlhiAQECLTLREKRQgidlvhdeveKELLKEVeliegiqELLQRTLEQA--WEQLRLLRAARHKLEK 158
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1828 ErvLAEKLAAI---SEATRLK-TEAEIALKEKEAENERLRRLAED-EAFQRRLLE--EQAAQHKADIEARLAQLRKASES 1900
Cdd:pfam03148 159 D--LSDKKEALeidEKCLSLNnTSPNISYKPGPTRIPPNSSTPEEwEKFTQDNIEraEKERAASAQLRELIDSILEQTAN 236
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237962 1901 ELERQKGLVEDTLRQRrqVEEeilalkgsFEKAaagKAELELELGRIRgtaedtlrsKEQAEQE 1964
Cdd:pfam03148 237 DLRAQADAVNFALRKR--IEE--------TEDA---KNKLEWQLKKTL---------QEIAELE 278
|
|
| PspA |
COG1842 |
Phage shock protein A [Transcription, Signal transduction mechanisms]; |
1445-1575 |
3.91e-03 |
|
Phage shock protein A [Transcription, Signal transduction mechanisms];
Pssm-ID: 441447 [Multi-domain] Cd Length: 217 Bit Score: 42.12 E-value: 3.91e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1445 RIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAA 1524
Cdd:COG1842 20 KAEDPEKMLDQAIRDMEEDLVEARQALAQVIANQKRLERQLEELEAEAEKWEEKARLALEKGREDLAREALERKAELEAQ 99
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 1525 REKQRAL-----QALEELRLQAEEAERRLRQAEAERArqvqvALETAQRSAEAELQ 1575
Cdd:COG1842 100 AEALEAQlaqleEQVEKLKEALRQLESKLEELKAKKD-----TLKARAKAAKAQEK 150
|
|
| PRK07735 |
PRK07735 |
NADH-quinone oxidoreductase subunit C; |
1807-2056 |
3.92e-03 |
|
NADH-quinone oxidoreductase subunit C;
Pssm-ID: 236081 [Multi-domain] Cd Length: 430 Bit Score: 43.05 E-value: 3.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1807 EEAKRQRQLAEEDAVRQRAEAERVLAEKLAA-ISEATRLKTEAEIALKEKEAENerlrrlaedeafqrrlLEEQAAQHKA 1885
Cdd:PRK07735 2 DPEKDLEDLKKEAARRAKEEARKRLVAKHGAeISKLEEENREKEKALPKNDDMT----------------IEEAKRRAAA 65
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1886 DIEARLAQLRKASESELERQkglvedtlrqrrqVEEEILALKGSFEKAAAGKAElELELGRIRGTAEDTLRSKEQAEQEA 1965
Cdd:PRK07735 66 AAKAKAAALAKQKREGTEEV-------------TEEEKAKAKAKAAAAAKAKAA-ALAKQKREGTEEVTEEEKAAAKAKA 131
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1966 ARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQA 2045
Cdd:PRK07735 132 AAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAA 211
|
250
....*....|.
gi 1920237962 2046 EEKAHAFAVQQ 2056
Cdd:PRK07735 212 KAKAAALAKQK 222
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
2270-2611 |
3.97e-03 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 43.79 E-value: 3.97e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2270 KQVAEEAARLSVAA--QEAARLRQLAEEDLAQQRALAEKmlkekmqavqEATRLKAEAELLQQQKELAQEQARR--LQED 2345
Cdd:PRK04863 260 KHLITESTNYVAADymRHANERRVHLEEALELRRELYTS----------RRQLAAEQYRLVEMARELAELNEAEsdLEQD 329
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2346 KEQMA--QQLAQETQGFQKTLE------TERQRQLEMSAEAERLRLRVAEMSRAQA-RAEEDARRFRKQAEDIGERL--- 2413
Cdd:PRK04863 330 YQAASdhLNLVQTALRQQEKIEryqadlEELEERLEEQNEVVEEADEQQEENEARAeAAEEEVDELKSQLADYQQALdvq 409
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2414 YRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEaqLLQLksEEMQTVRQEQLLQETQALQ--QSF 2491
Cdd:PRK04863 410 QTRAIQYQQAVQALERAKQLCGLPDLTADNAEDWLEEFQAKEQEATEE--LLSL--EQKLSVAQAAHSQFEQAYQlvRKI 485
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2492 LSEKDsllqRERCIEQEKAKLEQL--FQDEVAKAQALReeqqrqqqqmqqekqqlaASMEEARRRQHEAEEGVRRQQEEL 2569
Cdd:PRK04863 486 AGEVS----RSEAWDVARELLRRLreQRHLAEQLQQLR------------------MRLSELEQRLRQQQRAERLLAEFC 543
|
330 340 350 360
....*....|....*....|....*....|....*....|..
gi 1920237962 2570 QRLAQQQQQQEkLLAEENQRLRERLQHLEEERRAALARSEEI 2611
Cdd:PRK04863 544 KRLGKNLDDED-ELEQLQEELEARLESLSESVSEARERRMAL 584
|
|
| KpsE |
COG3524 |
Capsule polysaccharide export protein KpsE/RkpR [Cell wall/membrane/envelope biogenesis]; |
1371-1558 |
4.13e-03 |
|
Capsule polysaccharide export protein KpsE/RkpR [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442746 [Multi-domain] Cd Length: 370 Bit Score: 42.92 E-value: 4.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1371 LAEAHAQAKAQAEREAQGLQRRMQEEVARreevaveAQEQKRSIQEELQHLRQsseaeiqakarqveaaERSRLRIEEEI 1450
Cdd:COG3524 160 LAESEELVNQLSERAREDAVRFAEEEVER-------AEERLRDAREALLAFRN----------------RNGILDPEATA 216
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1451 RVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAL-RVQAEaeaarekqr 1529
Cdd:COG3524 217 EALLQLIATLEGQLAELEAELAALRSYLSPNSPQVRQLRRRIAALEKQIAAERARLTGASGGDSLaSLLAE--------- 287
|
170 180 190
....*....|....*....|....*....|.
gi 1920237962 1530 alqaLEELRLQAEEAERRLRQAEA--ERARQ 1558
Cdd:COG3524 288 ----YERLELEREFAEKAYTSALAalEQARI 314
|
|
| Caldesmon |
pfam02029 |
Caldesmon; |
2313-2610 |
4.14e-03 |
|
Caldesmon;
Pssm-ID: 460421 [Multi-domain] Cd Length: 495 Bit Score: 43.32 E-value: 4.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2313 QAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRvaemsrAQ 2392
Cdd:pfam02029 6 EAARERRRRAREERRRQKEEEEPSGQVTESVEPNEHNSYEEDSELKPSGQGGLDEEEAFLDRTAKREERRQK------RL 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2393 ARAEEDARRFRKQAEDIGErlyrtELATQEKVMLVQTLETQRQQSDRDAERLREAIAELE------------------HE 2454
Cdd:pfam02029 80 QEALERQKEFDPTIADEKE-----SVAERKENNEEEENSSWEKEEKRDSRLGRYKEEETEirekeyqenkwstevrqaEE 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2455 KDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQ 2534
Cdd:pfam02029 155 EGEEEEDKSEEAEEVPTENFAKEEVKDEKIKKEKKVKYESKVFLDQKRGHPEVKSQNGEEEVTKLKVTTKRRQGGLSQSQ 234
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2535 QQMQQEKQQLAA--SMEEARRRQHEAEEgvrrqqEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRA-------AL 2605
Cdd:pfam02029 235 EREEEAEVFLEAeqKLEELRRRRQEKES------EEFEKLRQKQQEAELELEELKKKREERRKLLEEEEQRrkqeeaeRK 308
|
....*
gi 1920237962 2606 ARSEE 2610
Cdd:pfam02029 309 LREEE 313
|
|
| ClpA |
COG0542 |
ATP-dependent Clp protease, ATP-binding subunit ClpA [Posttranslational modification, protein ... |
1437-1554 |
4.28e-03 |
|
ATP-dependent Clp protease, ATP-binding subunit ClpA [Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 440308 [Multi-domain] Cd Length: 836 Bit Score: 43.53 E-value: 4.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1437 EAAerSRLRIE-----EEIRVVRLQLEATERqrggaegELQALRaraeeaEAQKRQAQEEAERLRRQVQDETQRKRQAEA 1511
Cdd:COG0542 397 EAA--ARVRMEidskpEELDELERRLEQLEI-------EKEALK------KEQDEASFERLAELRDELAELEEELEALKA 461
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 1920237962 1512 elalRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAE 1554
Cdd:COG0542 462 ----RWEAEKELIEEIQELKEELEQRYGKIPELEKELAELEEE 500
|
|
| TolA |
COG3064 |
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis]; |
1483-1955 |
4.48e-03 |
|
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442298 [Multi-domain] Cd Length: 485 Bit Score: 43.10 E-value: 4.48e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1483 AQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQR--ALQALEELRLQAEEAERRLRQAEAERARQvQ 1560
Cdd:COG3064 1 AQEALEEKAAEAAAQERLEQAEAEKRAAAEAEQKAKEEAEEERLAELeaKRQAEEEAREAKAEAEQRAAELAAEAAKK-L 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1561 VALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVA----VVQLREEATRRAQQQAEAERARAEAERELERWQLKANE 1636
Cdd:COG3064 80 AEAEKAAAEAEKKAAAEKAKAAKEAEAAAAAEKAAAAAekekAEEAKRKAEEEAKRKAEEERKAAEAEAAAKAEAEAARA 159
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1637 ALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRA 1716
Cdd:COG3064 160 AAAAAAAAAAAAARAAAGAAAALVAAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAARAAAASREAALAAVEA 239
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1717 ETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELA 1796
Cdd:COG3064 240 TEEAALGGAEEAADLAAVGVLGAALAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAALAAELLGAVAAEEAVLA 319
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1797 EEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLL 1876
Cdd:COG3064 320 AAAAAGALVVRGGGAASLEAALSLLAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLADVEEAAGAGILAAAGGGG 399
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237962 1877 EEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTL 1955
Cdd:COG3064 400 LLGLRLDLGAALLEAASAVELRVLLALAGAAGAVVALLVKLVADLAGGLVGIGKALTGDADALLGILKAVALDGGAVLA 478
|
|
| PRK07735 |
PRK07735 |
NADH-quinone oxidoreductase subunit C; |
1717-1939 |
4.58e-03 |
|
NADH-quinone oxidoreductase subunit C;
Pssm-ID: 236081 [Multi-domain] Cd Length: 430 Bit Score: 43.05 E-value: 4.58e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1717 ETEQGEQQRQLLEEELARLQ-----------REAAAATQKRRELEAELAKVRAEMEV-----LLASKARAEEESRSTSEK 1780
Cdd:PRK07735 11 KKEAARRAKEEARKRLVAKHgaeiskleeenREKEKALPKNDDMTIEEAKRRAAAAAkakaaALAKQKREGTEEVTEEEK 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1781 SKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEED--AVRQRAEAERVLAEKLAAISEATRLKTEAEIAL---KEK 1855
Cdd:PRK07735 91 AKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAkaAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKakaKAA 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1856 EAENERLRRLAEDEAFQRRLLEEQAAQH-KADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEE-ILALKGSFEKA 1933
Cdd:PRK07735 171 AAAKAKAAALAKQKAAEAGEGTEEVTEEeKAKAKAKAAAAAKAKAAALAKQKASQGNGDSGDEDAKAKaIAAAKAKAAAA 250
|
....*.
gi 1920237962 1934 AAGKAE 1939
Cdd:PRK07735 251 ARAKTK 256
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
2173-2606 |
4.63e-03 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 43.27 E-value: 4.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2173 ALRQKAQVEQ-ELTALRLQLEEtdhQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEaeNRALV 2251
Cdd:pfam10174 335 AKEQRAAILQtEVDALRLRLEE---KESFLNKKTKQLQDLTEEKSTLAGEIRDLKDMLDVKERKINVLQKKIE--NLQEQ 409
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2252 LRDKDsaqRLLQEEAEKMKQVAEEAARLSVA---AQEAARLRQLAEEDLAQQRALAEKMLKEKMQAV-QEATRLKAEAEL 2327
Cdd:pfam10174 410 LRDKD---KQLAGLKERVKSLQTDSSNTDTAlttLEEALSEKERIIERLKEQREREDRERLEELESLkKENKDLKEKVSA 486
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2328 LQQQKelaQEQARRLQEDKEQmAQQLAQ---ETQGFQKTLETERQRQLEmsaEAERLrlrVAEMSRAQaRAEEDARrfrk 2404
Cdd:pfam10174 487 LQPEL---TEKESSLIDLKEH-ASSLASsglKKDSKLKSLEIAVEQKKE---ECSKL---ENQLKKAH-NAEEAVR---- 551
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2405 QAEDIGERLYRTELATQEKVMlvqtlETQRQQSDrdAERLREAIAELEHEK-DKLKQEAQLLQLKSEEMQTvrQEQLLQE 2483
Cdd:pfam10174 552 TNPEINDRIRLLEQEVARYKE-----ESGKAQAE--VERLLGILREVENEKnDKDKKIAELESLTLRQMKE--QNKKVAN 622
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2484 TQALQQsflsekdsllqrercieQEKAKLEQLFQDevakaqALREEQQRQQQQMQQEKQQLAASMEEARRrqhEAEEGVR 2563
Cdd:pfam10174 623 IKHGQQ-----------------EMKKKGAQLLEE------ARRREDNLADNSQQLQLEELMGALEKTRQ---ELDATKA 676
|
410 420 430 440
....*....|....*....|....*....|....*....|...
gi 1920237962 2564 RQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALA 2606
Cdd:pfam10174 677 RLSSTQQSLAEKDGHLTNLRAERRKQLEEILEMKQEALLAAIS 719
|
|
| DUF4670 |
pfam15709 |
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ... |
2102-2288 |
4.63e-03 |
|
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.
Pssm-ID: 464815 [Multi-domain] Cd Length: 522 Bit Score: 43.02 E-value: 4.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2102 QSRRQVEEAERLKQsAEEQAQAQAQAQAAAEKLRKeaeqeaarraqaeqAALRQKQAADAEMEKHKQFAEQALRQKAQVE 2181
Cdd:pfam15709 360 QRRLQQEQLERAEK-MREELELEQQRRFEEIRLRK--------------QRLEEERQRQEEEERKQRLQLQAAQERARQQ 424
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2182 QEltALRLQLEETDHQKsildeelQRLKAEVTEAARQRGQVEEElfslrvQMEELGKLKARIEAENRALVLRDKDSAQRL 2261
Cdd:pfam15709 425 QE--EFRRKLQELQRKK-------QQEEAERAEAEKQRQKELEM------QLAEEQKRLMEMAEEERLEYQRQKQEAEEK 489
|
170 180 190
....*....|....*....|....*....|..
gi 1920237962 2262 LQEEAEKMKQVAEEAARLSVA-----AQEAAR 2288
Cdd:pfam15709 490 ARLEAEERRQKEEEAARLALEeamkqAQEQAR 521
|
|
| PRK12678 |
PRK12678 |
transcription termination factor Rho; Provisional |
1372-1560 |
4.67e-03 |
|
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain] Cd Length: 672 Bit Score: 43.35 E-value: 4.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1372 AEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEir 1451
Cdd:PRK12678 67 AATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARK-- 144
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1452 vvRLQLEATERQRGGAEGELQALRARAEEAEA--QKRQAQEEAERLRRQVQDETQRKRQaEAELALRVQAEAEAAREKQR 1529
Cdd:PRK12678 145 --AGEGGEQPATEARADAAERTEEEERDERRRrgDREDRQAEAERGERGRREERGRDGD-DRDRRDRREQGDRREERGRR 221
|
170 180 190
....*....|....*....|....*....|.
gi 1920237962 1530 ALQALEELRLQAEEAERRLRQAEAERARQVQ 1560
Cdd:PRK12678 222 DGGDRRGRRRRRDRRDARGDDNREDRGDRDG 252
|
|
| GBP_C |
pfam02841 |
Guanylate-binding protein, C-terminal domain; Transcription of the anti-viral ... |
2267-2361 |
4.70e-03 |
|
Guanylate-binding protein, C-terminal domain; Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP.
Pssm-ID: 460721 [Multi-domain] Cd Length: 297 Bit Score: 42.27 E-value: 4.70e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2267 EKMKQVAEEAARLSVAAQEAARLRQLAEEDL----AQQRALAE--KMLKEKMQAVQEatRLKAEAELLQQQKElaQEQAR 2340
Cdd:pfam02841 201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEqmmeAQERSYQEhvKQLIEKMEAERE--QLLAEQERMLEHKL--QEQEE 276
|
90 100
....*....|....*....|.
gi 1920237962 2341 RLQEDKEQMAQQLAQETQGFQ 2361
Cdd:pfam02841 277 LLKEGFKTEAESLQKEIQDLK 297
|
|
| hsdR |
PRK11448 |
type I restriction enzyme EcoKI subunit R; Provisional |
1675-1775 |
4.71e-03 |
|
type I restriction enzyme EcoKI subunit R; Provisional
Pssm-ID: 236912 [Multi-domain] Cd Length: 1123 Bit Score: 43.40 E-value: 4.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1675 AEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELA 1754
Cdd:PRK11448 143 LLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEKAAETSQERKQKRKEIT 222
|
90 100
....*....|....*....|.
gi 1920237962 1755 KvRAEMEVLLaskarAEEESR 1775
Cdd:PRK11448 223 D-QAAKRLEL-----SEEETR 237
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
2314-2568 |
4.74e-03 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 42.89 E-value: 4.74e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2314 AVQEATRLKAEAELLQQQKEL--AQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRA 2391
Cdd:COG3883 5 ALAAPTPAFADPQIQAKQKELseLQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEER 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2392 QARAEEDARRFRKQ------------AEDIGE---RLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKD 2456
Cdd:COG3883 85 REELGERARALYRSggsvsyldvllgSESFSDfldRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKA 164
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2457 KLKQEAQLLQLKSEEmqtvrQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQ 2536
Cdd:COG3883 165 ELEAAKAELEAQQAE-----QEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 239
|
250 260 270
....*....|....*....|....*....|..
gi 1920237962 2537 MQQEKQQLAASMEEARRRQHEAEEGVRRQQEE 2568
Cdd:COG3883 240 AAAAASAAGAGAAGAAGAAAGSAGAAGAAAGA 271
|
|
| PRK09039 |
PRK09039 |
peptidoglycan -binding protein; |
2234-2362 |
4.80e-03 |
|
peptidoglycan -binding protein;
Pssm-ID: 181619 [Multi-domain] Cd Length: 343 Bit Score: 42.65 E-value: 4.80e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2234 EELGKLKARIEAENRALVLRDKDSAQrlLQEeaekmkQVAEEAARLSVAAQEAARLRQLAEEdLAQQRALAEKMLKEKMQ 2313
Cdd:PRK09039 53 SALDRLNSQIAELADLLSLERQGNQD--LQD------SVANLRASLSAAEAERSRLQALLAE-LAGAGAAAEGRAGELAQ 123
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237962 2314 AVQEATRLKAEA----ELLQQQKELAQEQARRLQ--------EDKEQMAQ----------QLAQETQGFQK 2362
Cdd:PRK09039 124 ELDSEKQVSARAlaqvELLNQQIAALRRQLAALEaaldasekRDRESQAKiadlgrrlnvALAQRVQELNR 194
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1880-2056 |
4.82e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 42.83 E-value: 4.82e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1880 AAQHKADIEARLAQLRKasesELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKE 1959
Cdd:COG4942 18 QADAAAEAEAELEQLQQ----EIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIA 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1960 QAEQEAARQRQLAAEEERRRREAEER--------------VQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRErAE 2025
Cdd:COG4942 94 ELRAELEAQKEELAELLRALYRLGRQpplalllspedfldAVRRLQYLKYLAPARREQAEELRADLAELAALRAELE-AE 172
|
170 180 190
....*....|....*....|....*....|.
gi 1920237962 2026 QESARQLQLAQEAAQKRLQAEEKAHAFAVQQ 2056
Cdd:COG4942 173 RAELEALLAELEEERAALEALKAERQKLLAR 203
|
|
| YqiK |
COG2268 |
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown]; |
1794-1922 |
4.87e-03 |
|
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
Pssm-ID: 441869 [Multi-domain] Cd Length: 439 Bit Score: 42.94 E-value: 4.87e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1794 ELAEEAARLRALAE-EAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQ 1872
Cdd:COG2268 196 EIIRDARIAEAEAErETEIAIAQANREAEEAELEQEREIETARIAEAEAELAKKKAEERREAETARAEAEAAYEIAEANA 275
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|.
gi 1920237962 1873 RRLLEEQAAQHKADIEARLAQLRKA-SESELERQKGLVEDTLRQRRQVEEE 1922
Cdd:COG2268 276 EREVQRQLEIAEREREIELQEKEAErEEAELEADVRKPAEAEKQAAEAEAE 326
|
|
| TolC |
COG1538 |
Outer membrane protein TolC [Cell wall/membrane/envelope biogenesis]; |
1314-1575 |
4.92e-03 |
|
Outer membrane protein TolC [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 441147 [Multi-domain] Cd Length: 367 Bit Score: 42.72 E-value: 4.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1314 DLRTRYSELSTLTSQyIRFISETLRRMEEEERLAEQQRAEERERLAEVEAAlekQRQLAEAHAQaKAQAEREAQGLQRRM 1393
Cdd:COG1538 77 EVAQAYFDLLAAQEQ-LALAEENLALAEELLELARARYEAGLASRLDVLQA---EAQLAQARAQ-LAQAEAQLAQARNAL 151
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1394 QEEVARREEVAVEAQEQKRSIQEELQHLRQSSEA------EIQAKARQVEAAERsRLRIEEEIRVVRLQLEATERQRGGA 1467
Cdd:COG1538 152 ALLLGLPPPAPLDLPDPLPPLPPLPPSLPGLPSEalerrpDLRAAEAQLEAAEA-EIGVARAAFLPSLSLSASYGYSSSD 230
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1468 EGELQ-------------------ALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQ 1528
Cdd:COG1538 231 DLFSGgsdtwsvglslslplfdggRNRARVRAAKAQLEQAEAQYEQTVLQALQEVEDALAALRAAREQLEALEEALEAAE 310
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....
gi 1920237962 1529 RALQALEEL-------RLQAEEAERRLRQAEAERarqvqVALETAQRSAEAELQ 1575
Cdd:COG1538 311 EALELARARyraglasLLDVLDAQRELLQAQLNL-----IQARYDYLLALVQLY 359
|
|
| Crescentin |
pfam19220 |
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament ... |
1116-1526 |
4.98e-03 |
|
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament proteins, named crescentin, whose cytoskeletal function is required for the vibrioid and helical shapes of Caulobacter crescentus. Without crescentin, the cells adopt a straight-rod morphology. Crescentin has characteriztic features of IF proteins including the ability to assemble into filaments in vitro without energy or cofactor requirements. In vivo, crescentin forms a helical structure that colocalizes with the inner cell curvatures beneath the cytoplasmic membrane.
Pssm-ID: 437057 [Multi-domain] Cd Length: 401 Bit Score: 42.75 E-value: 4.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1116 ATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGaqevgerLQQRHGERDVEVERWRERvtllLERWQAVLAQTDVRQ 1195
Cdd:pfam19220 38 AILRELPQAKSRLLELEALLAQERAAYGKLRRELAG-------LTRRLSAAEGELEELVAR----LAKLEAALREAEAAK 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1196 RELEQLGRQLRYYRESADplgawlRDAKQRQEQIQAVPLANsQAVREQLRQEKALLEDIERHGEKVEECQRFAKQyinai 1275
Cdd:pfam19220 107 EELRIELRDKTAQAEALE------RQLAAETEQNRALEEEN-KALREEAQAAEKALQRAEGELATARERLALLEQ----- 174
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1276 kdyelqlvtykaqlepvaspaKKPKVQSGSESIIQEYVDLRTRYSELSTL---TSQYIRFISETLRRMEEEERLAEQQRA 1352
Cdd:pfam19220 175 ---------------------ENRRLQALSEEQAAELAELTRRLAELETQldaTRARLRALEGQLAAEQAERERAEAQLE 233
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1353 EERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLrqssEAEIQAK 1432
Cdd:pfam19220 234 EAVEAHRAERASLRMKLEALTARAAATEQLLAEARNQLRDRDEAIRAAERRLKEASIERDTLERRLAGL----EADLERR 309
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1433 ARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRqaeAE 1512
Cdd:pfam19220 310 TQQFQEMQRARAELEERAEMLTKALAAKDAALERAEERIASLSDRIAELTKRFEVERAALEQANRRLKEELQRER---AE 386
|
410
....*....|....
gi 1920237962 1513 LALrVQAEAEAARE 1526
Cdd:pfam19220 387 RAL-AQGALEIARE 399
|
|
| PRK12705 |
PRK12705 |
hypothetical protein; Provisional |
1465-1605 |
5.23e-03 |
|
hypothetical protein; Provisional
Pssm-ID: 237178 [Multi-domain] Cd Length: 508 Bit Score: 42.77 E-value: 5.23e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1465 GGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQrkrqaEAELALRVQAEAEAAREKQralqaleelRLQAEEa 1544
Cdd:PRK12705 19 GVLVVLLKKRQRLAKEAERILQEAQKEAEEKLEAALLEAK-----ELLLRERNQQRQEARRERE---------ELQREE- 83
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 1545 ERRLRQAEAERARQVQVALETAQRS-AEAELQSEHASFAEKTAQLERTLKEehvaVVQLREE 1605
Cdd:PRK12705 84 ERLVQKEEQLDARAEKLDNLENQLEeREKALSARELELEELEKQLDNELYR----VAGLTPE 141
|
|
| 2A1904 |
TIGR00927 |
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying ... |
1341-1528 |
5.40e-03 |
|
K+-dependent Na+/Ca+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273344 [Multi-domain] Cd Length: 1096 Bit Score: 43.06 E-value: 5.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1341 EEEERLAEQQRAEERERLAEVEaalEKQRQLAEAHAQAKAQAER--------EAQGLQRRMQ-----EEVARREEVAVEA 1407
Cdd:TIGR00927 649 GERPTEAEGENGEESGGEAEQE---GETETKGENESEGEIPAERkgeqegegEIEAKEADHKgeteaEEVEHEGETEAEG 725
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1408 QEQKRSIQ--EELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEirvvrlqlEATERQRGGAEGELQA---LRARAEEAE 1482
Cdd:TIGR00927 726 TEDEGEIEtgEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGE--------TEAEGKEDEDEGEIQAgedGEMKGDEGA 797
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 1920237962 1483 AQKRQAQEEAERlRRQVQDETQRKRQAEAELALRVQAEAEAAREKQ 1528
Cdd:TIGR00927 798 EGKVEHEGETEA-GEKDEHEGQSETQADDTEVKDETGEQELNAENQ 842
|
|
| Borrelia_P83 |
pfam05262 |
Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins. |
1339-1526 |
5.41e-03 |
|
Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins.
Pssm-ID: 114011 [Multi-domain] Cd Length: 489 Bit Score: 42.68 E-value: 5.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1339 RMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQglqrrmQEEVARREEVAVEAQEQKRSIQEEL 1418
Cdd:pfam05262 206 RESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQ------QEAKNLPKPADTSSPKEDKQVAENQ 279
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1419 QHLRQSSEAEIQAKARQVeaaersrlrieeeirvvrlqLEATERQRGGAEGElqalrARAEEAEAQKRqaQEEAERLRRQ 1498
Cdd:pfam05262 280 KREIEKAQIEIKKNDEEA--------------------LKAKDHKAFDLKQE-----SKASEKEAEDK--ELEAQKKREP 332
|
170 180
....*....|....*....|....*....
gi 1920237962 1499 VQDETQR-KRQAEAElalrVQAEAEAARE 1526
Cdd:pfam05262 333 VAEDLQKtKPQVEAQ----PTSLNEDAID 357
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
1467-1576 |
5.55e-03 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 42.84 E-value: 5.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1467 AEGELQALRARAE-EAEAQKR----QAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALE-ELRLQ 1540
Cdd:PRK12704 36 AEEEAKRILEEAKkEAEAIKKeallEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREeELEKK 115
|
90 100 110
....*....|....*....|....*....|....*.
gi 1920237962 1541 AEEAERRLRQAEAERARqvqvaLETAQRSAEAELQS 1576
Cdd:PRK12704 116 EKELEQKQQELEKKEEE-----LEELIEEQLQELER 146
|
|
| hsdR |
PRK11448 |
type I restriction enzyme EcoKI subunit R; Provisional |
1788-1897 |
5.62e-03 |
|
type I restriction enzyme EcoKI subunit R; Provisional
Pssm-ID: 236912 [Multi-domain] Cd Length: 1123 Bit Score: 43.02 E-value: 5.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1788 EAGRFRELAEEAARLRALAEEAKRQRQLAEE--------DAVRQRAEAERVLAEKLAAISEATRLKTEAEIA-LKEKEAE 1858
Cdd:PRK11448 130 KPGPFVPPEDPENLLHALQQEVLTLKQQLELqarekaqsQALAEAQQQELVALEGLAAELEEKQQELEAQLEqLQEKAAE 209
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 1920237962 1859 NErlrrlAEDEAFQRRLLEEQAAQHKAD-IEARL---AQLRKA 1897
Cdd:PRK11448 210 TS-----QERKQKRKEITDQAAKRLELSeEETRIlidQQLRKA 247
|
|
| SPFH_like_u3 |
cd03406 |
Uncharacterized family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily; This ... |
1333-1441 |
6.03e-03 |
|
Uncharacterized family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily; This model summarizes an uncharacterized family of proteins similar to stomatin, prohibitin, flotillin, HflK/C (SPFH) and podocin. The conserved domain common to the SPFH superfamily has also been referred to as the Band 7 domain. Many superfamily members are associated with lipid rafts. Individual proteins of the SPFH superfamily may cluster to form membrane microdomains which may in turn recruit multiprotein complexes. Microdomains formed from flotillin proteins may in addition be dynamic units with their own regulatory functions. Flotillins have been implicated in signal transduction, vesicle trafficking, cytoskeleton rearrangement and are known to interact with a variety of proteins. Stomatin interacts with and regulates members of the degenerin/epithelia Na+ channel family in mechanosensory cells of Caenorhabditis elegans and vertebrate neurons and participates in trafficking of Glut1 glucose transporters. Prohibitin may act as a chaperone for the stabilization of mitochondrial proteins. Prokaryotic HflK/C plays a role in the decision between lysogenic and lytic cycle growth during lambda phage infection. Flotillins have been implicated in the progression of prion disease, in the pathogenesis of neurodegenerative diseases such as Parkinson's and Alzheimer's disease and, in cancer invasion and metastasis. Mutations in the podocin gene give rise to autosomal recessive steroid resistant nephritic syndrome.
Pssm-ID: 259804 [Multi-domain] Cd Length: 293 Bit Score: 41.90 E-value: 6.03e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1333 ISETLRR----MEEEE-RLAeqqRAEERERLAEVEAALEKQRQLAEahaqakaqAEREAQGLQRRMQEEVARReevavEA 1407
Cdd:cd03406 160 IPEAIRRnyeaMEAEKtKLL---IAEQHQKVVEKEAETERKRAVIE--------AEKDAEVAKIQMQQKIMEK-----EA 223
|
90 100 110
....*....|....*....|....*....|....*.
gi 1920237962 1408 QEQKRSIQEELQHLRQSS--EAEIQAKARQVEAAER 1441
Cdd:cd03406 224 EKKISEIEDEMHLAREKAraDAEYYRALREAEANKL 259
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1725-2037 |
6.10e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 42.58 E-value: 6.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1725 RQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESrstsEKSKQRLEAEAGRFRELAEEAARLRA 1804
Cdd:COG4372 19 RPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEEL----EQARSELEQLEEELEELNEQLQAAQA 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1805 LAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHK 1884
Cdd:COG4372 95 ELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQ 174
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1885 ADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQE 1964
Cdd:COG4372 175 ALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEE 254
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237962 1965 AARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQE 2037
Cdd:COG4372 255 VILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKK 327
|
|
| TolA |
COG3064 |
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis]; |
1338-1745 |
6.26e-03 |
|
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442298 [Multi-domain] Cd Length: 485 Bit Score: 42.72 E-value: 6.26e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1338 RRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAK-AQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQE 1416
Cdd:COG3064 24 EKRAAAEAEQKAKEEAEEERLAELEAKRQAEEEAREAKAEAEqRAAELAAEAAKKLAEAEKAAAEAEKKAAAEKAKAAKE 103
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1417 ELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLR 1496
Cdd:COG3064 104 AEAAAAAEKAAAAAEKEKAEEAKRKAEEEAKRKAEEERKAAEAEAAAKAEAEAARAAAAAAAAAAAAAARAAAGAAAALV 183
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1497 RQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQS 1576
Cdd:COG3064 184 AAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAARAAAASREAALAAVEATEEAALGGAEEAADLAAVGVLGAA 263
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1577 EHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQA 1656
Cdd:COG3064 264 LAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAALAAELLGAVAAEEAVLAAAAAAGALVVRGGGAASLEAALSL 343
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1657 EAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQ 1736
Cdd:COG3064 344 LAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLADVEEAAGAGILAAAGGGGLLGLRLDLGAALLEAASAVELRVL 423
|
....*....
gi 1920237962 1737 REAAAATQK 1745
Cdd:COG3064 424 LALAGAAGA 432
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
2171-2612 |
6.28e-03 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 43.13 E-value: 6.28e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2171 EQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQ-RGQVEEELFSLRVQMEELGKLKARIEAENRA 2249
Cdd:TIGR02169 233 EALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKiKDLGEEEQLRVKEKIGELEAEIASLERSIAE 312
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2250 LVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEE------DLAQQRALAEKMLKEKMQAVQEATRLKA 2323
Cdd:TIGR02169 313 KERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEyaelkeELEDLRAELEEVDKEFAETRDELKDYRE 392
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2324 EAELLQQQKELAQEQARRLQEDKEQMAQQLAQetqgFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFR 2403
Cdd:TIGR02169 393 KLEKLKREINELKRELDRLQEELQRLSEELAD----LNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYE 468
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2404 KQAEDIGERLYRTELATQEKVMLVQTLETQRQQSdRDAERLREAIAELEHEKDK--LKQEAQLLQLKSE----------- 2470
Cdd:TIGR02169 469 QELYDLKEEYDRVEKELSKLQRELAEAEAQARAS-EERVRGGRAVEEVLKASIQgvHGTVAQLGSVGERyataievaagn 547
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2471 ---------EMQTVRQEQLLQETQALQQSFL-------SEKD-SLLQRERCI----------EQEKAKLEQLFQDEV--- 2520
Cdd:TIGR02169 548 rlnnvvvedDAVAKEAIELLKRRKAGRATFLplnkmrdERRDlSILSEDGVIgfavdlvefdPKYEPAFKYVFGDTLvve 627
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2521 ----------------------------------------------AKAQALREEQQRQQQQMQQEKQQLA---ASMEEA 2551
Cdd:TIGR02169 628 dieaarrlmgkyrmvtlegelfeksgamtggsraprggilfsrsepAELQRLRERLEGLKRELSSLQSELRrieNRLDEL 707
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237962 2552 RRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEenqrLRERLQHLEEERRAALARSEEIA 2612
Cdd:TIGR02169 708 SQELSDASRKIGEIEKEIEQLEQEEEKLKERLEE----LEEDLSSLEQEIENVKSELKELE 764
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1931-2413 |
6.33e-03 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 42.83 E-value: 6.33e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1931 EKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERvQKSLAAEEEAARQRKAALEEVERL 2010
Cdd:COG4717 49 ERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEE-LEELEAELEELREELEKLEKLLQL 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2011 KAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRseaeaARRAAEEAE 2090
Cdd:COG4717 128 LPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEE-----LQDLAEELE 202
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2091 AARERAEREAAQSRRQVEEAERLKQsAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFA 2170
Cdd:COG4717 203 ELQQRLAELEEELEEAQEELEELEE-ELEQLENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGVLFL 281
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2171 EQAL---------RQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAArqrgqveEELFSLRVQMEELGKLKA 2241
Cdd:COG4717 282 VLGLlallflllaREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSP-------EELLELLDRIEELQELLR 354
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2242 RIE-AENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLkekmqAVQEATR 2320
Cdd:COG4717 355 EAEeLEEELQLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQELKEELEELEEQLEELLGELEELL-----EALDEEE 429
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2321 LKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQ-ETQGFQKTLETERQRQLEMSAEAER--LRLRVAE--MSRAQARA 2395
Cdd:COG4717 430 LEEELEELEEELEELEEELEELREELAELEAELEQlEEDGELAELLQELEELKAELRELAEewAALKLALelLEEAREEY 509
|
490
....*....|....*....
gi 1920237962 2396 EEDAR-RFRKQAEDIGERL 2413
Cdd:COG4717 510 REERLpPVLERASEYFSRL 528
|
|
| PLN02939 |
PLN02939 |
transferase, transferring glycosyl groups |
2170-2423 |
6.50e-03 |
|
transferase, transferring glycosyl groups
Pssm-ID: 215507 [Multi-domain] Cd Length: 977 Bit Score: 42.97 E-value: 6.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2170 AEQALRQKAQVEQELTALRLQLEETDHQK----------SILDEELQRLKAEVTEAARQRGQVEEELF-SLRVQMEELGK 2238
Cdd:PLN02939 158 LEKILTEKEALQGKINILEMRLSETDARIklaaqekihvEILEEQLEKLRNELLIRGATEGLCVHSLSkELDVLKEENML 237
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2239 LKARIEAEnRALVLRDKDSAQRLLQEEAEKM---KQVAEEAARLSVAAQEAARLRQLAEED------------------- 2296
Cdd:PLN02939 238 LKDDIQFL-KAELIEVAETEERVFKLEKERSlldASLRELESKFIVAQEDVSKLSPLQYDCwwekvenlqdlldratnqv 316
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2297 ------LAQQRALAEK--MLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTL---- 2364
Cdd:PLN02939 317 ekaalvLDQNQDLRDKvdKLEASLKEANVSKFSSYKVELLQQKLKLLEERLQASDHEIHSYIQLYQESIKEFQDTLsklk 396
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237962 2365 ETERQRQLEMSAEA------ERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEK 2423
Cdd:PLN02939 397 EESKKRSLEHPADDmpsefwSRILLLIDGWLLEKKISNNDAKLLREMVWKRDGRIREAYLSCKGK 461
|
|
| PRK10246 |
PRK10246 |
exonuclease subunit SbcC; Provisional |
1010-1607 |
6.55e-03 |
|
exonuclease subunit SbcC; Provisional
Pssm-ID: 182330 [Multi-domain] Cd Length: 1047 Bit Score: 42.87 E-value: 6.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1010 EPARECAQRITEQQKAQAEVDGLGKGVARLSaeaekvLALPepspaAPTLRSELEltlGKLEQVRSLSAIYlEKLKTISL 1089
Cdd:PRK10246 254 ELQQEASRRQQALQQALAAEEKAQPQLAALS------LAQP-----ARQLRPHWE---RIQEQSAALAHTR-QQIEEVNT 318
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1090 VIRSTQEAEEVLRAHeeQLKEAQAVPATLPELeatkaalkklrAQAEAQQPVFDALRDELRG-----AQEVGERLQQRhg 1164
Cdd:PRK10246 319 RLQSTMALRARIRHH--AAKQSAELQAQQQSL-----------NTWLAEHDRFRQWNNELAGwraqfSQQTSDREQLR-- 383
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1165 erdveveRWRERVTLLLERWQAVLAQT-----DVRQRELEQLGRQlRYYRESADPLGAWLRDAKQRQEQIQAvplANSQA 1239
Cdd:PRK10246 384 -------QWQQQLTHAEQKLNALPAITltltaDEVAAALAQHAEQ-RPLRQRLVALHGQIVPQQKRLAQLQV---AIQNV 452
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1240 VREQLRQEKALLEDIERHGEKVEE-------CQRFAKqyinaIKDYELQ--------------------LVTYKAqLEPV 1292
Cdd:PRK10246 453 TQEQTQRNAALNEMRQRYKEKTQQladvktiCEQEAR-----IKDLEAQraqlqagqpcplcgstshpaVEAYQA-LEPG 526
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1293 ASPAKKPKVQSGSESIIQEYVDLRtrySELSTLTSQYIRFISETLRRMEEEERLAEQQRaeererlaEVEAALEKQRQLA 1372
Cdd:PRK10246 527 VNQSRLDALEKEVKKLGEEGAALR---GQLDALTKQLQRDESEAQSLRQEEQALTQQWQ--------AVCASLNITLQPQ 595
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1373 EAHAQ-AKAQAEREAQGLQRRMQEEVarrEEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQV--EAAERSRL--RIE 1447
Cdd:PRK10246 596 DDIQPwLDAQEEHERQLRLLSQRHEL---QGQIAAHNQQIIQYQQQIEQRQQQLLTALAGYALTLpqEDEEASWLatRQQ 672
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1448 EEIRVVRLQLEATERQRGGAE--------GELQALRARAEEAEAQK-RQAQEEAERLRRQVQ-------DETQRKRQAEA 1511
Cdd:PRK10246 673 EAQSWQQRQNELTALQNRIQQltplletlPQSDDLPHSEETVALDNwRQVHEQCLSLHSQLQtlqqqdvLEAQRLQKAQA 752
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1512 ELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLrqaeaERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERT 1591
Cdd:PRK10246 753 QFDTALQASVFDDQQAFLAALLDEETLTQLEQLKQNL-----ENQRQQAQTLVTQTAQALAQHQQHRPDGLDLTVTVEQI 827
|
650
....*....|....*.
gi 1920237962 1592 LKEEHVAVVQLREEAT 1607
Cdd:PRK10246 828 QQELAQLAQQLRENTT 843
|
|
| CHASE3 |
COG5278 |
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms]; |
1116-1559 |
6.62e-03 |
|
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
Pssm-ID: 444089 [Multi-domain] Cd Length: 530 Bit Score: 42.59 E-value: 6.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1116 ATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTLLLERWQAVLAQTDVRQ 1195
Cdd:COG5278 83 EARAEIDELLAELRSLTADNPEQQARLDELEALIDQWLAELEQVIALRRAGGLEAALALVRSGEGKALMDEIRARLLLLA 162
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1196 RELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLEDIERHGEKVEECQRFAKQYINAI 1275
Cdd:COG5278 163 LALAALLLAAAALLLLLLALAALLALAELLLLALARALAALLLLLLLEAELAAAAALLAAAAALAALAALELLAALALAL 242
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1276 KDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEER 1355
Cdd:COG5278 243 ALLLAALLLALLAALALAALLAAALLALAALLLALAAAAALAAAAALELAAAEALALAELELELLLAAAAAAAAAAAAAA 322
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1356 ERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQ 1435
Cdd:COG5278 323 AALAALLALALATALAAAAAALALLAALLAEAAAAAAEEAEAAAEAAAAALAGLAEVEAEGAAEAVELEVLAIAAAAAAA 402
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1436 VEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAL 1515
Cdd:COG5278 403 AAEAAAAAAAAAAASAAEALELAEALAEALALAEEEALALAAASSELAEAGAALALAAAEALAEELAAVAALAALAAAAA 482
|
410 420 430 440
....*....|....*....|....*....|....*....|....
gi 1920237962 1516 RVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQV 1559
Cdd:COG5278 483 ALAEAEAAAALAAAAALSLALALAALLLAAAEAALAAALAAALA 526
|
|
| Plectin |
pfam00681 |
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ... |
3681-3719 |
6.66e-03 |
|
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
Pssm-ID: 459901 Cd Length: 39 Bit Score: 36.92 E-value: 6.66e-03
10 20 30
....*....|....*....|....*....|....*....
gi 1920237962 3681 YLYGTGCVAGIYRPGSRQTLTIYQALKKGQLSAEVARQL 3719
Cdd:pfam00681 1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
|
|
| PRK12678 |
PRK12678 |
transcription termination factor Rho; Provisional |
1427-1580 |
6.90e-03 |
|
transcription termination factor Rho; Provisional
Pssm-ID: 237171 [Multi-domain] Cd Length: 672 Bit Score: 42.58 E-value: 6.90e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1427 AEIQAKARQVEAAERSRLRIEEEIrvvrlqlEATERQRGGAEGElqALRARAEEAEAQKRQAQEEAERLRRQVQDETQRK 1506
Cdd:PRK12678 29 PELRALAKQLGIKGTSGMRKGELI-------AAIKEARGGGAAA--AAATPAAPAAAARRAARAAAAARQAEQPAAEAAA 99
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237962 1507 RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHAS 1580
Cdd:PRK12678 100 AKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERR 173
|
|
| CHASE3 |
COG5278 |
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms]; |
1702-2113 |
6.96e-03 |
|
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
Pssm-ID: 444089 [Multi-domain] Cd Length: 530 Bit Score: 42.59 E-value: 6.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1702 QQRLAAEQELIRLRAETEQGEQQRQLL---EEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTS 1778
Cdd:COG5278 83 EARAEIDELLAELRSLTADNPEQQARLdelEALIDQWLAELEQVIALRRAGGLEAALALVRSGEGKALMDEIRARLLLLA 162
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1779 EKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAE 1858
Cdd:COG5278 163 LALAALLLAAAALLLLLLALAALLALAELLLLALARALAALLLLLLLEAELAAAAALLAAAAALAALAALELLAALALAL 242
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1859 NERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKA 1938
Cdd:COG5278 243 ALLLAALLLALLAALALAALLAAALLALAALLLALAAAAALAAAAALELAAAEALALAELELELLLAAAAAAAAAAAAAA 322
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1939 ELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEAR 2018
Cdd:COG5278 323 AALAALLALALATALAAAAAALALLAALLAEAAAAAAEEAEAAAEAAAAALAGLAEVEAEGAAEAVELEVLAIAAAAAAA 402
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2019 RLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAER 2098
Cdd:COG5278 403 AAEAAAAAAAAAAASAAEALELAEALAEALALAEEEALALAAASSELAEAGAALALAAAEALAEELAAVAALAALAAAAA 482
|
410
....*....|....*
gi 1920237962 2099 EAAQSRRQVEEAERL 2113
Cdd:COG5278 483 ALAEAEAAAALAAAA 497
|
|
| hsdR |
PRK11448 |
type I restriction enzyme EcoKI subunit R; Provisional |
2201-2296 |
7.24e-03 |
|
type I restriction enzyme EcoKI subunit R; Provisional
Pssm-ID: 236912 [Multi-domain] Cd Length: 1123 Bit Score: 42.63 E-value: 7.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2201 LDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALvlrdkdsAQRLLQEEAEKMKQVAEEAARLS 2280
Cdd:PRK11448 147 LQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQEL-------EAQLEQLQEKAAETSQERKQKRK 219
|
90
....*....|....*.
gi 1920237962 2281 VAAQEAARLRQLAEED 2296
Cdd:PRK11448 220 EITDQAAKRLELSEEE 235
|
|
| TPH |
pfam13868 |
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ... |
1328-1522 |
7.41e-03 |
|
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of eukaryotic proteins. Trichoplein or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.
Pssm-ID: 464007 [Multi-domain] Cd Length: 341 Bit Score: 41.83 E-value: 7.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1328 QYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQlaeahaqakAQAEREAQgLQRRMQEEVARREEVAVEA 1407
Cdd:pfam13868 156 RILEYLKEKAEREEEREAEREEIEEEKEREIARLRAQQEKAQD---------EKAERDEL-RAKLYQEEQERKERQKERE 225
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1408 QEQKRsiQEELQHLRQSSEAEIQAKARQvEAAERSRLRIEEEiRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQ 1487
Cdd:pfam13868 226 EAEKK--ARQRQELQQAREEQIELKERR-LAEEAEREEEEFE-RMLRKQAEDEEIEQEEAEKRRMKRLEHRRELEKQIEE 301
|
170 180 190
....*....|....*....|....*....|....*
gi 1920237962 1488 AQEEAERLRRQVQDETQRKRQAEAELALRVQAEAE 1522
Cdd:pfam13868 302 REEQRAAEREEELEEGERLREEEAERRERIEEERQ 336
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1174-1445 |
7.55e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 42.06 E-value: 7.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1174 RERVTLLLERWQAVLAQTDVR---QRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKAL 1250
Cdd:COG4942 2 RKLLLLALLLALAAAAQADAAaeaEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAAL 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1251 LEDIERHGEKVEECQRFAKQYINAIKdyELQLVTYKAQLEPvaspakKPKVQSGSESIIQEYvdlrtRYSELSTLTSQYI 1330
Cdd:COG4942 82 EAELAELEKEIAELRAELEAQKEELA--ELLRALYRLGRQP------PLALLLSPEDFLDAV-----RRLQYLKYLAPAR 148
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1331 RFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRqlaeahaQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQ 1410
Cdd:COG4942 149 REQAEELRADLAELAALRAELEAERAELEALLAELEEER-------AALEALKAERQKLLARLEKELAELAAELAELQQE 221
|
250 260 270
....*....|....*....|....*....|....*
gi 1920237962 1411 KRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLR 1445
Cdd:COG4942 222 AEELEALIARLEAEAAAAAERTPAAGFAALKGKLP 256
|
|
| CCCAP |
pfam15964 |
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in ... |
2231-2526 |
7.77e-03 |
|
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in eukaryotes. CCCAP is also known as SDCCAG8, serologically defined colon cancer antigen 8. It is associated with the centrosome.
Pssm-ID: 435040 [Multi-domain] Cd Length: 703 Bit Score: 42.59 E-value: 7.77e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2231 VQMEE---LGKLKARIEAEN-RALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLA-EEDLAQQRALAE 2305
Cdd:pfam15964 341 VQMTEeanFEKTKALIQCEQlKSELERQKERLEKELASQQEKRAQEKEALRKEMKKEREELGATMLAlSQNVAQLEAQVE 420
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2306 KMLKEKMQAVQEATrlKAEAELLQQQKELAQEQAR-RLQEDKEQMAQQLAQETQgfqKTLETERQRQLEMS-AEAERLRL 2383
Cdd:pfam15964 421 KVTREKNSLVSQLE--EAQKQLASQEMDVTKVCGEmRYQLNQTKMKKDEAEKEH---REYRTKTGRQLEIKdQEIEKLGL 495
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2384 RVAEMSRAQARAEEDARRFRKQAEDIGERLYRTE----LATQEKVMLVQTL----ETQRQQSDRDAERLREAIAELE--H 2453
Cdd:pfam15964 496 ELSESKQRLEQAQQDAARAREECLKLTELLGESEhqlhLTRLEKESIQQSFsneaKAQALQAQQREQELTQKMQQMEaqH 575
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2454 EKD----------------KLKQEAQLLQLKSEEM-QTVRQE--QLLQETQALQQSFLSEK-------DSLLQRERCIEQ 2507
Cdd:pfam15964 576 DKTvneqyslltsqntfiaKLKEECCTLAKKLEEItQKSRSEveQLSQEKEYLQDRLEKLQkrneeleEQCVQHGRMHER 655
|
330
....*....|....*....
gi 1920237962 2508 EKAKLEQLFQDEVAKAQAL 2526
Cdd:pfam15964 656 MKQRLRQLDKHCQATAQQL 674
|
|
| Nop14 |
pfam04147 |
Nop14-like family; Emg1 and Nop14 are novel proteins whose interaction is required for the ... |
1748-1869 |
7.78e-03 |
|
Nop14-like family; Emg1 and Nop14 are novel proteins whose interaction is required for the maturation of the 18S rRNA and for 40S ribosome production.
Pssm-ID: 461196 Cd Length: 835 Bit Score: 42.61 E-value: 7.78e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1748 ELEAELAKVRAEM--EVLLASKARAEEesrstseKSKQRLEAEAGRFR---ELAEEAARLRALAEEAKRQRQLAEEDAVR 1822
Cdd:pfam04147 155 EEEPERKKSKKEVmeEVIAKSKLHKYE-------RQKAKEEDEELREEldkELKDLRSLLSGSKRPKPEQAKKPEEKPDR 227
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237962 1823 QRAEAE-----RVLA-EKLAAISEatRLKTEAEIALKEKE----AENERLRRLAEDE 1869
Cdd:pfam04147 228 KKPDDDydklvRELAfDKRAKPSD--RTKTEEELAEEEKErlekLEEERLRRMRGEE 282
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
2105-2486 |
7.79e-03 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 42.19 E-value: 7.79e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2105 RQVEEAERlkqsaeeqaqaqaqaqaaaeKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAeQALRQKAQVEQEL 2184
Cdd:pfam07888 73 RQRRELES--------------------RVAELKEELRQSREKHEELEEKYKELSASSEELSEEKD-ALLAQRAAHEARI 131
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2185 TALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRAL--VLRDKDSAQRLL 2262
Cdd:pfam07888 132 RELEEDIKTLTQRVLERETELERMKERAKKAGAQRKEEEAERKQLQAKLQQTEEELRSLSKEFQELrnSLAQRDTQVLQL 211
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2263 QEEAEKMKQVAEEAARlSVAAQEAAR--LRQLAEEDLAQQRALAekMLKEKMQAVQeATRLKAEAELLQQQKELAQ---- 2336
Cdd:pfam07888 212 QDTITTLTQKLTTAHR-KEAENEALLeeLRSLQERLNASERKVE--GLGEELSSMA-AQRDRTQAELHQARLQAAQltlq 287
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2337 --EQARRLQEDKEQMaqqlAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEmsraqaraeedARRFRKQAEdigerly 2414
Cdd:pfam07888 288 laDASLALREGRARW----AQERETLQQSAEADKDRIEKLSAELQRLEERLQE-----------ERMEREKLE------- 345
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 2415 rTELATQEKVMLVQTLETQRQQSDrdaerLREAIAELEHEKDKLKQEAQllqlksEEMQTVRQEQLLQETQA 2486
Cdd:pfam07888 346 -VELGREKDCNRVQLSESRRELQE-----LKASLRVAQKEKEQLQAEKQ------ELLEYIRQLEQRLETVA 405
|
|
| ATAD3_N |
pfam12037 |
ATPase family AAA domain-containing protein 3, N-terminal; This is the conserved N-terminal ... |
1674-1814 |
7.87e-03 |
|
ATPase family AAA domain-containing protein 3, N-terminal; This is the conserved N-terminal domain of ATPase family AAA domain-containing protein 3 (ATAD3) which is involved in dimerization and interacts with the inner surface of the outer mitochondrial membrane. This domain is found associated with the AAA ATPase domain (pfam00004). ATAD3 is essential for mitochondrial network organization, mitochondrial metabolism and cell growth at organizm and cellular level. It may also play an important role in mitochondrial protein synthesis.
Pssm-ID: 463442 [Multi-domain] Cd Length: 264 Bit Score: 41.51 E-value: 7.87e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1674 KAEEQaVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEA-- 1751
Cdd:pfam12037 52 KKQEQ-TRQAELQAKIKEYEAAQEQLKIERQRVEYEERRKTLQEETKQKQQRAQYQDELARKRYQDQLEAQRRRNEELlr 130
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237962 1752 ---ELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEA-GRFRELAE-EAARLRALAEEAKRQRQ 1814
Cdd:pfam12037 131 kqeESVAKQEAMRIQAQRRQTEEHEAELRRETERAKAEAEAeARAKEEREnEDLNLEQLREKANEERE 198
|
|
| ERM_helical |
pfam20492 |
Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related ... |
1682-1775 |
7.90e-03 |
|
Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related proteins, ezrin, radixin and moesin. Ezrin was first identified as a constituent of microvilli, radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions, and moesin as a heparin binding protein. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain, an extended alpha-helical domain and a charged C-terminal domain (pfam00769). Ezrin, radixin and merlin also contain a polyproline linker region between the helical and C-terminal domains. The N-terminal domain is highly conserved and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily, designated the FERM domain. ERM proteins crosslink actin filaments with plasma membranes. They co-localize with CD44 at actin filament plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains. This is the alpha-helical domain, which is involved in intramolecular masking of protein-protein interaction sites, regulating the activity of this proteins.
Pssm-ID: 466641 [Multi-domain] Cd Length: 120 Bit Score: 39.52 E-value: 7.90e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1682 QRELAEQELEKQRQLAEGTAQQRLAAEQELIRLraeteqgEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEME 1761
Cdd:pfam20492 21 ETKKAQEELEESEETAEELEEERRQAEEEAERL-------EQKRQEAEEEKERLEESAEMEAEEKEQLEAELAEAQEEIA 93
|
90
....*....|....
gi 1920237962 1762 VLLASKARAEEESR 1775
Cdd:pfam20492 94 RLEEEVERKEEEAR 107
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
2183-2526 |
7.98e-03 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 42.31 E-value: 7.98e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2183 ELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELG----KLKARIEA-ENralvlrDKDS 2257
Cdd:TIGR04523 104 DLSKINSEIKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNnkynDLKKQKEElEN------ELNL 177
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2258 AQRLLQEEAEKMKQVAEEAAR----LSVAAQEAARLRQLAEE--DLAQQRALAEKMLKEKMQAVQEatrLKAEAELLQQQ 2331
Cdd:TIGR04523 178 LEKEKLNIQKNIDKIKNKLLKlellLSNLKKKIQKNKSLESQisELKKQNNQLKDNIEKKQQEINE---KTTEISNTQTQ 254
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2332 -KELAQEQarrlQEDKEQMAQQLAQETQGFQKTLETERQRQlEMSAEAERLRlrvaemsraqaraeedarrfRKQAEDIG 2410
Cdd:TIGR04523 255 lNQLKDEQ----NKIKKQLSEKQKELEQNNKKIKELEKQLN-QLKSEISDLN--------------------NQKEQDWN 309
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2411 ERLyRTELATQEKVmlVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQlkseemqtvrqEQLLQETQALQQs 2490
Cdd:TIGR04523 310 KEL-KSELKNQEKK--LEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQ-----------RELEEKQNEIEK- 374
|
330 340 350
....*....|....*....|....*....|....*.
gi 1920237962 2491 FLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQAL 2526
Cdd:TIGR04523 375 LKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQK 410
|
|
| PLEC |
smart00250 |
Plectin repeat; |
3680-3716 |
8.67e-03 |
|
Plectin repeat;
Pssm-ID: 197605 Cd Length: 38 Bit Score: 36.69 E-value: 8.67e-03
10 20 30
....*....|....*....|....*....|....*..
gi 1920237962 3680 RYLYGTGCVAGIYRPGSRQTLTIYQALKKGQLSAEVA 3716
Cdd:smart00250 2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
2181-2611 |
8.73e-03 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 42.50 E-value: 8.73e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2181 EQELTALRLQLEETDHQKSILDEELQRLKAEVTeAARQRGQV-EEELFSLRVQMEElgklKARIEAEnRALVLRDKDSAQ 2259
Cdd:pfam10174 302 ESELLALQTKLETLTNQNSDCKQHIEVLKESLT-AKEQRAAIlQTEVDALRLRLEE----KESFLNK-KTKQLQDLTEEK 375
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2260 RLLQEEAEKMKQVAEEAARLSVAAQEaaRLRQLAEEDLAQQRALAEkmLKEKMQAVQEATRLKAEAelLQQQKELAQEQA 2339
Cdd:pfam10174 376 STLAGEIRDLKDMLDVKERKINVLQK--KIENLQEQLRDKDKQLAG--LKERVKSLQTDSSNTDTA--LTTLEEALSEKE 449
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2340 R---RLQEDKEQMAQQLAQEtqgfqktLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRT 2416
Cdd:pfam10174 450 RiieRLKEQREREDRERLEE-------LESLKKENKDLKEKVSALQPELTEKESSLIDLKEHASSLASSGLKKDSKLKSL 522
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2417 ELATQEKVMLVQTLETQRQ--QSDRDAERLREAIAelehekDKLKQEAQLLQLKSEEMQTVRqeqllqetqalqqsflSE 2494
Cdd:pfam10174 523 EIAVEQKKEECSKLENQLKkaHNAEEAVRTNPEIN------DRIRLLEQEVARYKEESGKAQ----------------AE 580
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2495 KDSLLQRERCIEQEK-------AKLEQLFQDEVaKAQALREEQQRQQQQMQQEKQqlAASMEEARRRQHEAEEGVRRQQ- 2566
Cdd:pfam10174 581 VERLLGILREVENEKndkdkkiAELESLTLRQM-KEQNKKVANIKHGQQEMKKKG--AQLLEEARRREDNLADNSQQLQl 657
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|..
gi 1920237962 2567 ----EELQRLAQQQQQQEKLLAEENQRLRERLQHLEE---ERRAALarsEEI 2611
Cdd:pfam10174 658 eelmGALEKTRQELDATKARLSSTQQSLAEKDGHLTNlraERRKQL---EEI 706
|
|
| COG3899 |
COG3899 |
Predicted ATPase [General function prediction only]; |
1933-2451 |
8.87e-03 |
|
Predicted ATPase [General function prediction only];
Pssm-ID: 443106 [Multi-domain] Cd Length: 1244 Bit Score: 42.54 E-value: 8.87e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1933 AAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKA 2012
Cdd:COG3899 736 PPDPEEEYRLALLLELAEALYLAGRFEEAEALLERALAARALAALAALRHGNPPASARAYANLGLLLLGDYEEAYEFGEL 815
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2013 KVEEARRLRERAEQESAR----QLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEE 2088
Cdd:COG3899 816 ALALAERLGDRRLEARALfnlgFILHWLGPLREALELLREALEAGLETGDAALALLALAAAAAAAAAAAALAAAAAAAAR 895
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2089 AEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQ 2168
Cdd:COG3899 896 LLAAAAAALAAAAAAAALAAAELARLAAAAAAAAALALAAAAAAAAAAALAAAAAAAALAAALALAAAAAAAAAAALAAA 975
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2169 FAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENR 2248
Cdd:COG3899 976 AAAAAAAAAAAAAAALEAAAAALLALLAAAAAAAAAAAALAAALLAAALAALAAAAAAAALLAAAAALALLAALAAAAAA 1055
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2249 ALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAekmlkekmqavqeatRLKAEAELL 2328
Cdd:COG3899 1056 AAAAAALAAAAALLAAAAAAAAAAAAAAAAAALAAALAAAALAAAAAAALALAAAL---------------AALALAAAL 1120
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 2329 QQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAED 2408
Cdd:COG3899 1121 AALALAAAARAAAALLLLAAALALALAALLLLAALLLALALLLLALAALALAAALAALAAALLAAAAAAAAAAALLAALL 1200
|
490 500 510 520
....*....|....*....|....*....|....*....|...
gi 1920237962 2409 IGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAEL 2451
Cdd:COG3899 1201 ALAARLAALLALALLALEAAALLLLLLLAALALAAALLALRLL 1243
|
|
| MAP7 |
pfam05672 |
MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is ... |
1335-1448 |
9.02e-03 |
|
MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is presumably controlled by tissue-specific microtubule-associated proteins (MAPs). The 115-kDa epithelial MAP (E-MAP-115/MAP7) has been identified as a microtubule-stabilising protein predominantly expressed in cell lines of epithelial origin. The binding of this microtubule associated protein is nucleotide independent.
Pssm-ID: 461709 [Multi-domain] Cd Length: 153 Bit Score: 40.02 E-value: 9.02e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1335 ETLRRMEEEERLAEQQRA-EERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGL-QRRMQEEVARREEVAVEAQEQKR 1412
Cdd:pfam05672 11 EAARILAEKRRQAREQRErEEQERLEKEEEERLRKEELRRRAEEERARREEEARRLeEERRREEEERQRKAEEEAEEREQ 90
|
90 100 110
....*....|....*....|....*....|....*.
gi 1920237962 1413 SIQEELQHLRQSSEAeiqAKARQVEAAERSRLRIEE 1448
Cdd:pfam05672 91 REQEEQERLQKQKEE---AEAKAREEAERQRQEREK 123
|
|
| PRK07353 |
PRK07353 |
F0F1 ATP synthase subunit B'; Validated |
1338-1433 |
9.04e-03 |
|
F0F1 ATP synthase subunit B'; Validated
Pssm-ID: 235999 [Multi-domain] Cd Length: 140 Bit Score: 39.60 E-value: 9.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1338 RRMEEEERLAEQQRAEERERLAEVEA-ALEKQRQLAEAHAQAK---AQAEREAQGLqrrmqeevaRREEVAV---EAQEQ 1410
Cdd:PRK07353 32 KVVEEREDYIRTNRAEAKERLAEAEKlEAQYEQQLASARKQAQaviAEAEAEADKL---------AAEALAEaqaEAQAS 102
|
90 100
....*....|....*....|...
gi 1920237962 1411 KRSIQEELQHLRQSSEAEIQAKA 1433
Cdd:PRK07353 103 KEKARREIEQQKQAALAQLEQQV 125
|
|
| PRK11637 |
PRK11637 |
AmiB activator; Provisional |
1214-1414 |
9.09e-03 |
|
AmiB activator; Provisional
Pssm-ID: 236942 [Multi-domain] Cd Length: 428 Bit Score: 41.99 E-value: 9.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1214 PLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLEDIERHGEKVEECQRfakqyinAIKDYELQLVTYKAQL-EPV 1292
Cdd:PRK11637 37 AFSAHASDNRDQLKSIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAISQASR-------KLRETQNTLNQLNKQIdELN 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1293 ASPAKKPKVQSGSESIIQEYVDLRTRYSE-------LSTLTSQ-------YIRFISE----------------TLRRMEE 1342
Cdd:PRK11637 110 ASIAKLEQQQAAQERLLAAQLDAAFRQGEhtglqliLSGEESQrgerilaYFGYLNQarqetiaelkqtreelAAQKAEL 189
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1343 EERLAEQQ----------------RAEERERLAEVEAALEK-QRQLAE--------------AHAQAKAQAEREAQGLQR 1391
Cdd:PRK11637 190 EEKQSQQKtllyeqqaqqqkleqaRNERKKTLTGLESSLQKdQQQLSElranesrlrdsiarAEREAKARAEREAREAAR 269
|
250 260
....*....|....*....|....
gi 1920237962 1392 -RMQEEVARREEVAVEAQEQKRSI 1414
Cdd:PRK11637 270 vRDKQKQAKRKGSTYKPTESERSL 293
|
|
| PspA_IM30 |
pfam04012 |
PspA/IM30 family; This family includes PspA a protein that suppresses sigma54-dependent ... |
1332-1548 |
9.27e-03 |
|
PspA/IM30 family; This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homolog in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma(54) containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator.
Pssm-ID: 461130 [Multi-domain] Cd Length: 215 Bit Score: 40.82 E-value: 9.27e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1332 FISETLRRMEEEERLAEQQRAEERERLAEVEaalekqRQLAEAHAQAKaQAEREAqglqRRMQEEVARREEVAVEAQEQK 1411
Cdd:pfam04012 12 NIHEGLDKAEDPEKMLEQAIRDMQSELVKAR------QALAQTIARQK-QLERRL----EQQTEQAKKLEEKAQAALTKG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1412 RsiqeelQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEE 1491
Cdd:pfam04012 81 N------EELAREALAEKKSLEKQAEALETQLAQQRSAVEQLRKQLAALETKIQQLKAKKNLLKARLKAAKAQEAVQTSL 154
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237962 1492 AERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQaLEELRLQAEEAERRL 1548
Cdd:pfam04012 155 GSLSTSSATDSFERIEEKIEEREARADAAAELASAVDLDAK-LEQAGIQMEVSEDVL 210
|
|
| PLN02316 |
PLN02316 |
synthase/transferase |
1479-1542 |
9.34e-03 |
|
synthase/transferase
Pssm-ID: 215180 [Multi-domain] Cd Length: 1036 Bit Score: 42.16 E-value: 9.34e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237962 1479 EEAEAQKRQAQEEAERLRrqvQDETQRKRQAE--AELALRVQAEAEAAREKQRALQALEELRLQAE 1542
Cdd:PLN02316 253 EKRRELEKLAKEEAERER---QAEEQRRREEEkaAMEADRAQAKAEVEKRREKLQNLLKKASRSAD 315
|
|
| COG1340 |
COG1340 |
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; |
1702-1964 |
9.44e-03 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 41.43 E-value: 9.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1702 QQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRstseKS 1781
Cdd:COG1340 1 SKTDELSSSLEELEEKIEELREEIEELKEKRDELNEELKELAEKRDELNAQVKELREEAQELREKRDELNEKVK----EL 76
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1782 KQRLEAEAGRFRELAEEAARLRALAEEAKRQRQlaEEDAVRQR--------------AEAERVLAEKLAaiseatRLKTE 1847
Cdd:COG1340 77 KEERDELNEKLNELREELDELRKELAELNKAGG--SIDKLRKEierlewrqqtevlsPEEEKELVEKIK------ELEKE 148
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1848 AEIALKEKEAENERLRRLAEDEAFQrrlleEQAAQHKADIEARLAQLRKASES------ELERQKGLVEDTLRQRRQVEE 1921
Cdd:COG1340 149 LEKAKKALEKNEKLKELRAELKELR-----KEAEEIHKKIKELAEEAQELHEEmielykEADELRKEADELHKEIVEAQE 223
|
250 260 270 280
....*....|....*....|....*....|....*....|...
gi 1920237962 1922 EILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQE 1964
Cdd:COG1340 224 KADELHEEIIELQKELRELRKELKKLRKKQRALKREKEKEELE 266
|
|
| Apolipoprotein |
pfam01442 |
Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a ... |
1356-1527 |
9.50e-03 |
|
Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E.
Pssm-ID: 460211 [Multi-domain] Cd Length: 175 Bit Score: 40.32 E-value: 9.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1356 ERLAEVEAALEK-QRQLAEAHAQAKAQAEREAQGLQRRMQEEV-ARREEVAVEAQEQKRSIQEELQHLRQsseaeiQAKA 1433
Cdd:pfam01442 4 DSLDELSTYAEElQEQLGPVAQELVDRLEKETEALRERLQKDLeEVRAKLEPYLEELQAKLGQNVEELRQ------RLEP 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237962 1434 RQVEAAERSRLRIEEEIRVVRlqlEATERQRGGAEGELQALRARAEE-AEAQKRQAQEEAERLRRQVQDETQ----RKRQ 1508
Cdd:pfam01442 78 YTEELRKRLNADAEELQEKLA---PYGEELRERLEQNVDALRARLAPyAEELRQKLAERLEELKESLAPYAEevqaQLSQ 154
|
170
....*....|....*....
gi 1920237962 1509 AEAELALRVQAEAEAAREK 1527
Cdd:pfam01442 155 RLQELREKLEPQAEDLREK 173
|
|
|