Conserved domains on  [gi|1920237946|ref|XP_036685836|]

plectin isoform X4 [Balaenoptera musculus]

Protein Classification

List of domain hits

Name Accession Description Interval E-value
CH_PLEC-like_rpt1 cd21188
first calponin homology (CH) domain found in the plectin/dystonin/MACF1 family; This family ...
183-287 2.52e-76

first calponin homology (CH) domain found in the plectin/dystonin/MACF1 family; This family includes plectin, dystonin and microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 (MACF1). Plectin, also called PCN, PLTN, hemidesmosomal protein 1 (HD1), or plectin-1, is a structural component of muscle. It interlinks intermediate filaments with microtubules and microfilaments, and anchors intermediate filaments to desmosomes or hemidesmosomes. It can also bind muscle proteins such as actin to membrane complexes in muscle. Dystonin, also called 230 kDa bullous pemphigoid antigen, 230/240 kDa bullous pemphigoid antigen, bullous pemphigoid antigen 1 (BPA or BPAG1), dystonia musculorum protein, or hemidesmosomal plaque protein, is a cytoskeletal linker protein that acts as an integrator of intermediate filaments, actin, and microtubule cytoskeleton networks. It is required for anchoring either intermediate filaments to the actin cytoskeleton in neural and muscle cells, or keratin-containing intermediate filaments to hemidesmosomes in epithelial cells. MACF1, also called 620 kDa actin-binding protein (ABP620), actin cross-linking family protein 7 (ACF7), macrophin-1, or trabeculin-alpha, is a large protein containing numerous spectrin and leucine-rich repeat (LRR) domains. It facilitates actin-microtubule interactions at the cell periphery and couples the microtubule network to cellular junctions. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.



Pssm-ID: 409037  Cd Length: 105  Bit Score: 248.86  E-value: 2.52e-76
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  183 DRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRHRQVKLVNI 262
Cdd:cd21188      1 DAVQKKTFTKWVNKHLIKARRRVVDLFEDLRDGHNLISLLEVLSGESLPRERGRMRFHRLQNVQTALDFLKYRKIKLVNI 80
                           90       100
                   ....*....|....*....|....*
gi 1920237946  263 RNDDIADGNPKLTLGLIWTIILHFQ 287
Cdd:cd21188     81 RAEDIVDGNPKLTLGLIWTIILHFQ 105
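Each hit in this listing carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value) and a query interval. The following is a minimal sketch, not part of the NCBI output, of how those two fields could be read programmatically. The names parse_stats and interval_length are hypothetical, and the regular expression assumes the exact field labels shown in this listing.

```python
import re

# Assumed layout, taken from the summary lines in this listing, e.g.
# "Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 158.18  E-value: 3.65e-38"
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stats(line):
    """Parse one hit summary line into a dict (hypothetical helper)."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def interval_length(interval):
    """Number of query residues covered by an inclusive 'start-end' interval."""
    start, end = (int(x) for x in interval.split("-"))
    return end - start + 1


stats = parse_stats("Pssm-ID: 409037  Cd Length: 105  Bit Score: 248.86  E-value: 2.52e-76")
print(stats["cd_length"], interval_length("183-287"))
```

For the first hit the interval 183-287 spans 105 residues, the same as the reported Cd Length, which is consistent with the query covering the full domain model.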
CH_PLEC_rpt2 cd21238
second calponin homology (CH) domain found in plectin and similar proteins; Plectin, also ...
300-405 1.17e-69

second calponin homology (CH) domain found in plectin and similar proteins; Plectin, also called PCN, PLTN, hemidesmosomal protein 1 (HD1), or plectin-1, is a structural component of muscle. It interlinks intermediate filaments with microtubules and microfilaments and anchors intermediate filaments to desmosomes or hemidesmosomes. It can also bind muscle proteins such as actin to membrane complexes in muscle. Plectin contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.



Pssm-ID: 409087  Cd Length: 106  Bit Score: 229.91  E-value: 1.17e-69
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  300 MTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 379
Cdd:cd21238      1 MTAKEKLLLWSQRMVEGYQGLRCDNFTSSWRDGRLFNAIIHRHKPMLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 80
                           90       100
                   ....*....|....*....|....*.
gi 1920237946  380 PEDVDVPQPDEKSIITYVSSLYDAMP 405
Cdd:cd21238     81 PEDVDVPQPDEKSIITYVSSLYDAMP 106
S10_plectin pfam03501
Plectin/S10 domain; This presumed domain is found at the N-terminus of some isoforms of the ...
7-99 1.03e-44

Plectin/S10 domain; This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding.



Pssm-ID: 427337  Cd Length: 92  Bit Score: 158.07  E-value: 1.03e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946    7 MPLDQLRTIYEVLFREGVMVAKKDRRPrSLHPHVpGVTNLQVTRAMASLRARGLVRETFAWRHFYWYLTNEGIAHLRQYL 86
Cdd:pfam03501    1 IPKENRKAIYEYLFKEGVLVAKKDFNL-PKHPEL-NVPNLQVIKAMQSLKSRGYVKEQFAWRHYYWYLTNEGIEYLREYL 78
                           90
                   ....*....|...
gi 1920237946   87 HLPPEIVPASLQR 99
Cdd:pfam03501   79 HLPAEIVPATLKR 91
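The alignment blocks pair a query row ("gi ...") with a domain-model row ("Cdd:..."). The sketch below shows one way to compute percent identity between two such rows. It is an illustration under stated assumptions rather than NCBI's own scoring: parse_row and percent_identity are hypothetical names, "-" is treated as a gap, and lowercase letters are assumed to mark unaligned residues and are skipped.

```python
def parse_row(line):
    """Split one alignment row into (start, sequence, end).

    The last three whitespace-separated fields are start, sequence and end;
    anything before them is the row label ("gi 1920237946" or "Cdd:pfam03501").
    """
    start, seq, end = line.split()[-3:]
    return int(start), seq, int(end)


def percent_identity(query, subject):
    """Percent of aligned columns in which both rows carry the same residue.

    Columns with a gap ("-") or a lowercase (assumed unaligned) residue in
    either row are excluded from the denominator.
    """
    if len(query) != len(subject):
        raise ValueError("rows must have the same number of columns")
    aligned = identical = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    if aligned == 0:
        raise ValueError("no aligned columns")
    return 100.0 * identical / aligned


_, q_seq, _ = parse_row("gi 1920237946   87 HLPPEIVPASLQR 99")
_, s_seq, _ = parse_row("Cdd:pfam03501   79 HLPAEIVPATLKR 91")
print(round(percent_identity(q_seq, s_seq), 1))
```

To score a whole hit, the sequence fields of successive blocks would be concatenated per row before calling percent_identity.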
Smc super family cl34174
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1444-2019 3.65e-38

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


The actual alignment was detected with superfamily member COG1196:

Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 158.18  E-value: 3.65e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1444 DLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEErERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRM 1523
Cdd:COG1196    226 EAELLLLKLRELEAELEELEAELEELEAELEELEAELAELE-AELEELRLELEELELELEEAQAEEYELLAELARLEQDI 304
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1524 QEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLrIEEEIRVVRLQLEATERQRGGAEGELQA 1603
Cdd:COG1196    305 ARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEE-AEAELAEAEEALLEAEAELAEAEEELEE 383
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1604 LRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEA 1683
Cdd:COG1196    384 LAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLE 463
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1684 ERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEAtrraqqqaEAERARAEAERELERWQLK 1763
Cdd:COG1196    464 LLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLL--------AGLRGLAGAVAVLIGVEAA 535
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1764 ANEALRLRLQAeevAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIR 1843
Cdd:COG1196    536 YEAALEAALAA---ALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREAD 612
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1844 LRAETEQGEqqrqLLEEELARLQREAAAATqkRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfr 1923
Cdd:COG1196    613 ARYYVLGDT----LLGRTLVAARLEAALRR--AVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELE---- 682
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1924 ELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQR 2003
Cdd:COG1196    683 ELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDL 762
                          570
                   ....*....|....*.
gi 1920237946 2004 RLLEEQAAQHKADIEA 2019
Cdd:COG1196    763 EELERELERLEREIEA 778
Smc super family cl34174
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1807-2375 9.02e-35

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


The actual alignment was detected with superfamily member COG1196:

Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 147.01  E-value: 9.02e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1807 EQAVRQRELAEQELEKQRQLAegtAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV 1886
Cdd:COG1196    210 EKAERYRELKEELKELEAELL---LLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEA 286
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1887 RAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQlAEEDAVRQRAEAERVLAEKLA 1966
Cdd:COG1196    287 QAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEE-ELEEAEEELEEAEAELAEAEE 365
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1967 AISEATRLKTEAEIALKEKEAENERLRRlaedEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQR 2046
Cdd:COG1196    366 ALLEAEAELAEAEEELEELAEELLEALR----AAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEE 441
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2047 RQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEA 2126
Cdd:COG1196    442 EALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRG 521
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2127 ARQRKAALEEVERLKAKVEEAR---RLRERAEQESARQLQLAQEAAQKRL-----QAEEKAHAFAVQQKEQELQQTLQQE 2198
Cdd:COG1196    522 LAGAVAVLIGVEAAYEAALEAAlaaALQNIVVEDDEVAAAAIEYLKAAKAgratfLPLDKIRARAALAAALARGAIGAAV 601
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2199 QSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQA 2278
Cdd:COG1196    602 DLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAEL 681
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2279 EQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFS 2358
Cdd:COG1196    682 EELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPD 761
                          570
                   ....*....|....*..
gi 1920237946 2359 LRVQMEELGKLKARIEA 2375
Cdd:COG1196    762 LEELERELERLEREIEA 778
Spectrin_like pfam18373
Spectrin like domain; Desmoplakin (DP) is an integral part of desmosomes, where it links ...
1020-1097 2.57e-30

Spectrin like domain; Desmoplakin (DP) is an integral part of desmosomes, where it links desmosomal cadherins to the intermediate filaments. The N-terminal region of DP contains a plakin domain common to members of the plakin family. Plakin domains contain multiple copies of spectrin repeats (SRs; pfam00435). SRs consist of three alpha-helices (A, B, and C) that form an antiparallel triple-helical bundle. This entry describes SR6, which has a divergent structure relative to the other SRs: its helices A and B are significantly shorter than in other repeats. Structural comparison revealed that SR6 is more similar to other three-helix-bundle proteins, including target of Myb1 and the syntaxin Habc domain, than to other SR proteins. Because of these differences from other spectrin repeats, this region is termed a spectrin-like repeat.



Pssm-ID: 465730  Cd Length: 78  Bit Score: 116.16  E-value: 2.57e-30
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237946 1020 LAWQSLGRDMQLIRSWSLATFRTLKPEEQRQALRSLELHYQAFLRDSQDAGGFGPEDRLQAEREYGSCSRHYQQLLQS 1097
Cdd:pfam18373    1 VSWQYLLKDIQRINSWTISMLKTMRPEEYRQVLKNLETHYQDFLRDSQESEMFGAEDRRQLEREVNSAQQHYQTLLVS 78
PTZ00121 super family cl31754
MAEBL; Provisional
2014-2740 3.40e-26

MAEBL; Provisional


The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 119.86  E-value: 3.40e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2014 KADIEARLAQLRKASESELERQKGLVEDTlRQRRQVEEEilalKGSFE---KAAAGKAELELELGRIRGTAEDTLRSKE- 2089
Cdd:PTZ00121  1059 KAEAKAHVGQDEGLKPSYKDFDFDAKEDN-RADEATEEA----FGKAEeakKTETGKAEEARKAEEAKKKAEDARKAEEa 1133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2090 -QAE-----QEAARQRQLAAEEERRRREAEERVQKSLAAEE----EAARQ----RKAA----LEEVERLKA--KVEEARR 2149
Cdd:PTZ00121  1134 rKAEdarkaEEARKAEDAKRVEIARKAEDARKAEEARKAEDakkaEAARKaeevRKAEelrkAEDARKAEAarKAEEERK 1213
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2150 LRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAERE 2229
Cdd:PTZ00121  1214 AEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKAD 1293
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2230 AAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADA--EMEKHKQFAEQALRQK 2307
Cdd:PTZ00121  1294 EAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAadEAEAAEEKAEAAEKKK 1373
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2308 AQVEQELTALRLQLEETDHQksildEELQRLKAEVTEAARQRGQVEEElfslrvqmeelgKLKARiEAENRALVLRDKDS 2387
Cdd:PTZ00121  1374 EEAKKKADAAKKKAEEKKKA-----DEAKKKAEEDKKKADELKKAAAA------------KKKAD-EAKKKAEEKKKADE 1435
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2388 AQRLLQE--EAEKMKQVAEEAARLSVAAQEAARLRQlAEEdlAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELA 2465
Cdd:PTZ00121  1436 AKKKAEEakKADEAKKKAEEAKKAEEAKKKAEEAKK-ADE--AKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKA 1512
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2466 QEqARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRlRVAEMSRAQA--RAEEDARRFRKQAEDigerL 2543
Cdd:PTZ00121  1513 DE-AKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELK-KAEEKKKAEEakKAEEDKNMALRKAEE----A 1586
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2544 YRTELATQEKVM-LVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFL 2622
Cdd:PTZ00121  1587 KKAEEARIEEVMkLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEE 1666
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2623 SEKDsllqrerciEQEKAKLEQLFQDEVAKAQAlreeqqrqqqqmqqeKQQLAASMEEARRrqheAEEgVRRQQEELQRL 2702
Cdd:PTZ00121  1667 AKKA---------EEDKKKAEEAKKAEEDEKKA---------------AEALKKEAEEAKK----AEE-LKKKEAEEKKK 1717
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|
gi 1920237946 2703 AQQQQQQEkllaEENQRLRERLQHLEEE--RRAALARSEE 2740
Cdd:PTZ00121  1718 AEELKKAE----EENKIKAEEAKKEAEEdkKKAEEAKKDE 1753
Smc super family cl34174
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1142-1706 2.70e-22

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


The actual alignment was detected with superfamily member COG1196:

Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 106.56  E-value: 2.70e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1142 ARECAQRITEQQKAQAEVDGLGKGVARLSAEAEKvlalpepspaaptLRSELELTLGKLEQVRSLSAIYLEKLKTISLVI 1221
Cdd:COG1196    252 EAELEELEAELAELEAELEELRLELEELELELEE-------------AQAEEYELLAELARLEQDIARLEERRRELEERL 318
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1222 RSTQEAEEVLRAHEEQLKEAQAvpATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVE 1301
Cdd:COG1196    319 EELEEELAELEEELEELEEELE--ELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAA 396
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1302 RWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALL 1381
Cdd:COG1196    397 ELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLE 476
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1382 EDIERhgekveecqrfAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLtsqyir 1461
Cdd:COG1196    477 AALAE-----------LLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAA------ 539
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1462 fisetlrrMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEA----HAQAKAQAEREAQGLQRRMQEEVARREEVAVEA 1537
Cdd:COG1196    540 --------LEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRAtflpLDKIRARAALAAALARGAIGAAVDLVASDLREA 611
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1538 QEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQ 1617
Cdd:COG1196    612 DARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEE 691
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1618 AQEEAERLRRQvqdETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVA-LETA 1696
Cdd:COG1196    692 ELELEEALLAE---EEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEeLERE 768
                          570
                   ....*....|
gi 1920237946 1697 QRSAEAELQS 1706
Cdd:COG1196    769 LERLEREIEA 778
SH3_10 super family cl39368
SH3 domain; This entry represents an SH3 domain.
919-985 3.49e-18

SH3 domain; This entry represents an SH3 domain.


The actual alignment was detected with superfamily member pfam17902:

Pssm-ID: 407754  Cd Length: 65  Bit Score: 81.15  E-value: 3.49e-18
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237946  919 QLKPRSpaHPMRGRVPLLAVCDYKQVEVTVHKGDECQMVGPAQPFYWKVLGSSCSEAAMPSVCFLVP 985
Cdd:pfam17902    1 PLKQRR--SPVTRPIPVKALCDYKQGEVTVEKGEECTLLDNSDREKWKVQTSSGVEKLVPSVCFLIP 65
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
2931-2969 2.27e-15

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.



Pssm-ID: 459901  Cd Length: 39  Bit Score: 72.36  E-value: 2.27e-15
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 2931 LLEAQIATGGIIDPVHSHRLPVDVAYQRGYFDEEMNRVL 2969
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
4168-4206 5.20e-15

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.



Pssm-ID: 459901  Cd Length: 39  Bit Score: 71.59  E-value: 5.20e-15
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 4168 LLEAQIATGGIIDPEESHRLPVDVAYQRGLFDEEMNEIL 4206
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3590-3628 5.20e-15

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.



Pssm-ID: 459901  Cd Length: 39  Bit Score: 71.59  E-value: 5.20e-15
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3590 LLEAQIATGGIIDPVHSHRVPVDVAYQRGYFDEEMNRVL 3628
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3849-3887 1.06e-13

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.



Pssm-ID: 459901  Cd Length: 39  Bit Score: 67.74  E-value: 1.06e-13
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3849 LLEAQAATGFLLDPVKGERLAVDEAVRKGLVGPELHDRL 3887
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3183-3221 7.46e-13

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.



Pssm-ID: 459901  Cd Length: 39  Bit Score: 65.43  E-value: 7.46e-13
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3183 LLEAQAGTGHIIDPTTSARLTVDEAVRAGLVGPELHEKL 3221
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3925-3963 2.93e-12

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.



Pssm-ID: 459901  Cd Length: 39  Bit Score: 63.50  E-value: 2.93e-12
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3925 LLDAQLATGGIVDPRLGFHLPLDVAYQRGYLDKDTHDQL 3963
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3514-3552 4.41e-12

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.



Pssm-ID: 459901  Cd Length: 39  Bit Score: 63.12  E-value: 4.41e-12
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3514 LLEAQAATGFLVDPVRNQRLYVHEAVKAGVVGPELHEKL 3552
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
4437-4475 4.59e-12

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.



Pssm-ID: 459901  Cd Length: 39  Bit Score: 63.12  E-value: 4.59e-12
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 4437 LLEAQACTGGIIDPSTGERFPVTDAVNKGLVDKIMVDRI 4475
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
SPEC cd00176
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ...
744-922 8.48e-12

Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin. The spectrin repeat forms a three-helix bundle, with the second helix interrupted by proline in some sequences. The repeats are independent folding units. Tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers. The repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C, and are separated by a linker of 5 residues. Two copies of the repeat are present here.



Pssm-ID: 238103 [Multi-domain]  Cd Length: 213  Bit Score: 67.86  E-value: 8.48e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  744 LHGFVAAATKELMWLSDREEEEVGFDWSDRNTNMAAKKEGYSALMHELELKEKKIKEIQSTGDRLLREDHPARPTAESFQ 823
Cdd:cd00176      2 LQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAEEIQERL 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  824 AALQTQWSWMLQLCCCIEAHLKENTAYFQFFSDVREAEEQLRKLQETLRRKYTCDrsiTATRLEDLLQDAQDEKEQLSEY 903
Cdd:cd00176     82 EELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALASEDLGK---DLESVEELLKKHKELEEELEAH 158
                          170
                   ....*....|....*....
gi 1920237946  904 RGHLSGLAKRAKAIVQLKP 922
Cdd:cd00176    159 EPRLKSLNELAEELLEEGH 177
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
4092-4130 1.05e-11

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.



Pssm-ID: 459901  Cd Length: 39  Bit Score: 61.96  E-value: 1.05e-11
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 4092 LLEAQAATGYVIDPIKGLKLTVEEAVRMGIVGPEFKDKL 4130
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3259-3297 1.19e-11

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.



Pssm-ID: 459901  Cd Length: 39  Bit Score: 61.96  E-value: 1.19e-11
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3259 LLDAQLSTGGIVDPSKSHRVPLDVACARGYLDKETSAAL 3297
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
2855-2893 3.14e-11

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.



Pssm-ID: 459901  Cd Length: 39  Bit Score: 60.80  E-value: 3.14e-11
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 2855 LLEAQAASGFLLDPVRNRRLAVNEAVKEGIVGPELHHKL 2893
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
4513-4551 4.42e-09

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.



Pssm-ID: 459901  Cd Length: 39  Bit Score: 54.64  E-value: 4.42e-09
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 4513 FLEVQYLTGGLIEPDTPGRVALDEALQRGTVDARTAQKL 4551
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
4269-4297 1.07e-06

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.



Pssm-ID: 459901  Cd Length: 39  Bit Score: 47.71  E-value: 1.07e-06
                           10        20
                   ....*....|....*....|....*....
gi 1920237946 4269 IVDPETGKEMSVYEAYRKGLIDHQTYLEL 4297
Cdd:pfam00681   11 IIDPVTGERLSVEEAVKRGLIDPETAQKL 39
PLEC smart00250
Plectin repeat;
4052-4089 5.10e-06

Plectin repeat;



Pssm-ID: 197605  Cd Length: 38  Bit Score: 45.94  E-value: 5.10e-06
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1920237946  4052 QRFLEGTSSIAGVLVDATKERLSVYQAMKKGIIRPGTA 4089
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
PLEC smart00250
Plectin repeat;
3476-3511 4.24e-05

Plectin repeat;



Pssm-ID: 197605  Cd Length: 38  Bit Score: 43.24  E-value: 4.24e-05
                            10        20        30
                    ....*....|....*....|....*....|....*.
gi 1920237946  3476 LLQGSGCLAGIYLEDSKEKVTIYEAMRRGLLRPSTA 3511
Cdd:smart00250    3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
PLEC smart00250
Plectin repeat;
3220-3256 1.31e-04

Plectin repeat;



Pssm-ID: 197605  Cd Length: 38  Bit Score: 42.08  E-value: 1.31e-04
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  3220 KLLSAEKAVTGYKDPYSGQSVSLFQALKKGLIPREQG 3256
Cdd:smart00250    2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
PLEC smart00250
Plectin repeat;
4401-4434 1.96e-04

Plectin repeat;



Pssm-ID: 197605  Cd Length: 38  Bit Score: 41.31  E-value: 1.96e-04
                            10        20        30
                    ....*....|....*....|....*....|....
gi 1920237946  4401 EETGPVAGILDTETLEKVSITEAMHRNLVDNITG 4434
Cdd:smart00250    5 EAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
PLEC smart00250
Plectin repeat;
3143-3180 3.58e-04

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 40.54  E-value: 3.58e-04
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1920237946  3143 RRALRGSGVIAGVWLEEAGQKLSIYEALRKDLLQPEAA 3180
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
PLEC smart00250
Plectin repeat;
3551-3587 3.91e-04

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 40.54  E-value: 3.91e-04
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  3551 KLLSAEKAVTGYRDPYSGSTISLFQAMKKGLVLREHG 3587
Cdd:smart00250    2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
PLEC smart00250
Plectin repeat;
2892-2928 1.32e-03

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 39.00  E-value: 1.32e-03
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  2892 KLLSAERAVTGYKDPYTGEQISLFQAMKKDLIVREHG 2928
Cdd:smart00250    2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
SPEC smart00150
Spectrin repeats;
648-742 1.69e-03

Spectrin repeats;


Pssm-ID: 197544 [Multi-domain]  Cd Length: 101  Bit Score: 40.78  E-value: 1.69e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946   648 LRYLQDLLAWVEENQRRLDSAEWGVDLPSVEAQLGSHRGLHQSVEEFRTKIERARTDEGQL---SPATRGAYRDCLGRLD 724
Cdd:smart00150    4 LRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLieeGHPDAEEIEERLEELN 83
                            90
                    ....*....|....*...
gi 1920237946   725 LQYAKLLSSSKARLRSLE 742
Cdd:smart00150   84 ERWEELKELAEERRQKLE 101
PLEC smart00250
Plectin repeat;
3886-3917 3.30e-03

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 37.85  E-value: 3.30e-03
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1920237946  3886 RLLSAERAVTGYRDPYTEQTISLFQAMKKDLI 3917
Cdd:smart00250    2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLI 33
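
The paired "gi" and "Cdd" rows in each record are a gapped pairwise alignment between the query protein and the domain model's representative sequence. A rough sketch of two quantities a reader might derive from such a pair follows: the fraction of identical residues over ungapped columns, and the fraction of the model covered by the aligned interval. Both function names are hypothetical, and the identity definition used here (matches divided by columns with no gap in either row) is one common convention, not necessarily the one CD-Search uses internally.

```python
def alignment_identity(query_row, model_row):
    """Fraction of identical residues over columns with no gap in either row.

    Rows are compared case-insensitively, since some residues on this page
    are printed in lowercase.
    """
    if len(query_row) != len(model_row):
        raise ValueError("aligned rows must have the same length")
    columns = [
        (q, s)
        for q, s in zip(query_row.upper(), model_row.upper())
        if q != "-" and s != "-"
    ]
    if not columns:
        return 0.0
    matches = sum(1 for q, s in columns if q == s)
    return matches / len(columns)


def model_coverage(model_start, model_end, cd_length):
    """Fraction of the model spanned by the aligned model interval (1-based, inclusive)."""
    return (model_end - model_start + 1) / cd_length


if __name__ == "__main__":
    # Rows copied from the smart00250 hit at query positions 3886-3917.
    query = "RLLSAERAVTGYRDPYTEQTISLFQAMKKDLI"
    model = "RLLEAQSAIGGIIDPETGQKLSVEEALRRGLI"
    print(f"identity: {alignment_identity(query, model):.3f}")
    print(f"coverage: {model_coverage(2, 33, 38):.3f}")
```

Coverage is worth reading alongside the E-value: a hit that spans only part of a short model, as in this record, is a partial match to the repeat rather than a complete copy.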
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3811-3849 6.52e-03

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 37.31  E-value: 6.52e-03
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3811 YLYGTGCVAGIYRPGSRQTLTIYQALKKGQLSAEVARQL 3849
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
 
Name Accession Description Interval E-value
CH_PLEC-like_rpt1 cd21188
first calponin homology (CH) domain found in the plectin/dystonin/MACF1 family; This family ...
183-287 2.52e-76

first calponin homology (CH) domain found in the plectin/dystonin/MACF1 family; This family includes plectin, dystonin and microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 (MACF1). Plectin, also called PCN, PLTN, hemidesmosomal protein 1 (HD1), or plectin-1, is a structural component of muscle. It interlinks intermediate filaments with microtubules and microfilaments, and anchors intermediate filaments to desmosomes or hemidesmosomes. It could also bind muscle proteins such as actin to membrane complexes in muscle. Dystonin, also called 230 kDa bullous pemphigoid antigen, 230/240 kDa bullous pemphigoid antigen, bullous pemphigoid antigen 1 (BPA or BPAG1), dystonia musculorum protein, or hemidesmosomal plaque protein, is a cytoskeletal linker protein that acts as an integrator of intermediate filaments, actin, and microtubule cytoskeleton networks. It is required for anchoring either intermediate filaments to the actin cytoskeleton in neural and muscle cells, or keratin-containing intermediate filaments to hemidesmosomes in epithelial cells. MACF1, also called 620 kDa actin-binding protein (ABP620), actin cross-linking family protein 7 (ACF7), macrophin-1, or trabeculin-alpha, is a large protein containing numerous spectrin and leucine-rich repeat (LRR) domains. It facilitates actin-microtubule interactions at the cell periphery and couples the microtubule network to cellular junctions. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409037  Cd Length: 105  Bit Score: 248.86  E-value: 2.52e-76
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  183 DRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRHRQVKLVNI 262
Cdd:cd21188      1 DAVQKKTFTKWVNKHLIKARRRVVDLFEDLRDGHNLISLLEVLSGESLPRERGRMRFHRLQNVQTALDFLKYRKIKLVNI 80
                           90       100
                   ....*....|....*....|....*
gi 1920237946  263 RNDDIADGNPKLTLGLIWTIILHFQ 287
Cdd:cd21188     81 RAEDIVDGNPKLTLGLIWTIILHFQ 105
CH_PLEC_rpt2 cd21238
second calponin homology (CH) domain found in plectin and similar proteins; Plectin, also ...
300-405 1.17e-69

second calponin homology (CH) domain found in plectin and similar proteins; Plectin, also called PCN, PLTN, hemidesmosomal protein 1 (HD1), or plectin-1, is a structural component of muscle. It interlinks intermediate filaments with microtubules and microfilaments and anchors intermediate filaments to desmosomes or hemidesmosomes. It can also bind muscle proteins such as actin to membrane complexes in muscle. Plectin contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409087  Cd Length: 106  Bit Score: 229.91  E-value: 1.17e-69
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  300 MTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 379
Cdd:cd21238      1 MTAKEKLLLWSQRMVEGYQGLRCDNFTSSWRDGRLFNAIIHRHKPMLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 80
                           90       100
                   ....*....|....*....|....*.
gi 1920237946  380 PEDVDVPQPDEKSIITYVSSLYDAMP 405
Cdd:cd21238     81 PEDVDVPQPDEKSIITYVSSLYDAMP 106
SAC6 COG5069
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton];
179-512 2.47e-45

Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton];


Pssm-ID: 227401 [Multi-domain]  Cd Length: 612  Bit Score: 176.28  E-value: 2.47e-45
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  179 ADERDRVQKKTFTKWVNKHLIKA-QRHISDLYEDLRDGHNLISLLEVLSGDSLPR--EKGRMRFHKLQNVQIALDYLRHR 255
Cdd:COG5069      3 AKKWQKVQKKTFTKWTNEKLISGgQKEFGDLDTDLKDGVKLAQLLEALQKDNAGEynETPETRIHVMENVSGRLEFIKGK 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  256 QVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQvsgQSEDMTAKEKLLLWSQRMVEGCQ-GLRCDNFTTSWRDGRL 334
Cdd:COG5069     83 GVKLFNIGPQDIVDGNPKLILGLIWSLISRLTIATIN---EEGELTKHINLLLWCDEDTGGYKpEVDTFDFFRSWRDGLA 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  335 FNAIIHRHKPTLIDMNKVYRQTNLE--NLDQAFSVAERDLGVTRLLDPEDV-DVPQPDEKSIITYVS------SLYD--- 402
Cdd:COG5069    160 FSALIHDSRPDTLDPNVLDLQKKNKalNNFQAFENANKVIGIARLIGVEDIvNVSIPDERSIMTYVSwyiirfGLLEkid 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  403 -AMPRVPDVQDGVKANElQLRwQEYRELVLLLLQWIRAHTAGFEERRFPSSFEEIEILWCQFLKFKETE--LPAKEAD-K 478
Cdd:COG5069    240 iALHRVYRLLEADETLI-QLR-LPYEIILLRLLNLIHLKQANWKVVNFSKDVSDGENYTDLLNQLNALCsrAPLETTDlH 317
                          330       340       350
                   ....*....|....*....|....*....|....
gi 1920237946  479 NRSKGIYQSLEgAVQAGQLKVPPGYHPLDVEKEW 512
Cdd:COG5069    318 SLAGQILQNAE-KYDCRKYLPPAGNPKLDLAFVA 350
S10_plectin pfam03501
Plectin/S10 domain; This presumed domain is found at the N-terminus of some isoforms of the ...
7-99 1.03e-44

Plectin/S10 domain; This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding.


Pssm-ID: 427337  Cd Length: 92  Bit Score: 158.07  E-value: 1.03e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946    7 MPLDQLRTIYEVLFREGVMVAKKDRRPrSLHPHVpGVTNLQVTRAMASLRARGLVRETFAWRHFYWYLTNEGIAHLRQYL 86
Cdd:pfam03501    1 IPKENRKAIYEYLFKEGVLVAKKDFNL-PKHPEL-NVPNLQVIKAMQSLKSRGYVKEQFAWRHYYWYLTNEGIEYLREYL 78
                           90
                   ....*....|...
gi 1920237946   87 HLPPEIVPASLQR 99
Cdd:pfam03501   79 HLPAEIVPATLKR 91
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1444-2019 3.65e-38

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 158.18  E-value: 3.65e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1444 DLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEErERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRM 1523
Cdd:COG1196    226 EAELLLLKLRELEAELEELEAELEELEAELEELEAELAELE-AELEELRLELEELELELEEAQAEEYELLAELARLEQDI 304
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1524 QEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLrIEEEIRVVRLQLEATERQRGGAEGELQA 1603
Cdd:COG1196    305 ARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEE-AEAELAEAEEALLEAEAELAEAEEELEE 383
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1604 LRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEA 1683
Cdd:COG1196    384 LAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLE 463
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1684 ERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEAtrraqqqaEAERARAEAERELERWQLK 1763
Cdd:COG1196    464 LLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLL--------AGLRGLAGAVAVLIGVEAA 535
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1764 ANEALRLRLQAeevAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIR 1843
Cdd:COG1196    536 YEAALEAALAA---ALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREAD 612
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1844 LRAETEQGEqqrqLLEEELARLQREAAAATqkRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfr 1923
Cdd:COG1196    613 ARYYVLGDT----LLGRTLVAARLEAALRR--AVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELE---- 682
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1924 ELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQR 2003
Cdd:COG1196    683 ELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDL 762
                          570
                   ....*....|....*.
gi 1920237946 2004 RLLEEQAAQHKADIEA 2019
Cdd:COG1196    763 EELERELERLEREIEA 778
PTZ00121 PTZ00121
MAEBL; Provisional
1472-2177 2.44e-37

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 156.84  E-value: 2.44e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1472 EEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSiqEELQHL 1551
Cdd:PTZ00121  1101 EEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKA--EDAKKA 1178
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1552 RQSSEAEIQAKARQVEAAERSRlRIEEEIRVVRLQlEATERQRGGAEGELQALRaRAEEAeaqkRQAQEEAERLRRQVQD 1631
Cdd:PTZ00121  1179 EAARKAEEVRKAEELRKAEDAR-KAEAARKAEEER-KAEEARKAEDAKKAEAVK-KAEEA----KKDAEEAKKAEEERNN 1251
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1632 ETQRKRQAEAELALRVQAEAEAAREKQRAlqalEELRlQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASF 1711
Cdd:PTZ00121  1252 EEIRKFEEARMAHFARRQAAIKAEEARKA----DELK-KAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEE 1326
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1712 AEKTAQ-LERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEAlrlRLQAEEVAQQKSLTQaeaek 1790
Cdd:PTZ00121  1327 AKKKADaAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAA---KKKAEEKKKADEAKK----- 1398
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1791 qkeeaerearrrgKAEEQAVRQRELAEQELEKQR-QLAEGTAQQRLAAEqELIRLRAETEQGEQQRQLLE-----EELAR 1864
Cdd:PTZ00121  1399 -------------KAEEDKKKADELKKAAAAKKKaDEAKKKAEEKKKAD-EAKKKAEEAKKADEAKKKAEeakkaEEAKK 1464
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1865 LQREAAAATQKRRELE----AELAKVRAEmevllASKARAEEESRSTSEKSKqrleAEAGRFRELAEEAARLRAlAEEAK 1940
Cdd:PTZ00121  1465 KAEEAKKADEAKKKAEeakkADEAKKKAE-----EAKKKADEAKKAAEAKKK----ADEAKKAEEAKKADEAKK-AEEAK 1534
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1941 RQRQLAEEDAVRQRAEAERvlAEKLAAISEatrlKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEAR 2020
Cdd:PTZ00121  1535 KADEAKKAEEKKKADELKK--AEELKKAEE----KKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMK 1608
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2021 LAQLRKASESELERQKGLVEDTLRQRrqveEEILALKGSFEKAAAGKAELELELGRIRGT-----AEDTLRSKEQA--EQ 2093
Cdd:PTZ00121  1609 AEEAKKAEEAKIKAEELKKAEEEKKK----VEQLKKKEAEEKKKAEELKKAEEENKIKAAeeakkAEEDKKKAEEAkkAE 1684
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2094 EAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEE----VERLKAKVEEARRLRERAEQESARQLQLAQEAA 2169
Cdd:PTZ00121  1685 EDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEEnkikAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKK 1764

                   ....*...
gi 1920237946 2170 QKRLQAEE 2177
Cdd:PTZ00121  1765 EEEKKAEE 1772
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1807-2375 9.02e-35

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 147.01  E-value: 9.02e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1807 EQAVRQRELAEQELEKQRQLAegtAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV 1886
Cdd:COG1196    210 EKAERYRELKEELKELEAELL---LLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEA 286
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1887 RAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQlAEEDAVRQRAEAERVLAEKLA 1966
Cdd:COG1196    287 QAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEE-ELEEAEEELEEAEAELAEAEE 365
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1967 AISEATRLKTEAEIALKEKEAENERLRRlaedEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQR 2046
Cdd:COG1196    366 ALLEAEAELAEAEEELEELAEELLEALR----AAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEE 441
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2047 RQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEA 2126
Cdd:COG1196    442 EALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRG 521
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2127 ARQRKAALEEVERLKAKVEEAR---RLRERAEQESARQLQLAQEAAQKRL-----QAEEKAHAFAVQQKEQELQQTLQQE 2198
Cdd:COG1196    522 LAGAVAVLIGVEAAYEAALEAAlaaALQNIVVEDDEVAAAAIEYLKAAKAgratfLPLDKIRARAALAAALARGAIGAAV 601
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2199 QSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQA 2278
Cdd:COG1196    602 DLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAEL 681
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2279 EQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFS 2358
Cdd:COG1196    682 EELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPD 761
                          570
                   ....*....|....*..
gi 1920237946 2359 LRVQMEELGKLKARIEA 2375
Cdd:COG1196    762 LEELERELERLEREIEA 778
PTZ00034 PTZ00034
40S ribosomal protein S10; Provisional
5-114 1.72e-31

40S ribosomal protein S10; Provisional


Pssm-ID: 173331  Cd Length: 124  Bit Score: 121.28  E-value: 1.72e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946    5 MLMPLDQLRTIYEVLFREGVMVAKKDRrPRSLHPHVpGVTNLQVTRAMASLRARGLVRETFAWRHFYWYLTNEGIAHLRQ 84
Cdd:PTZ00034     2 VYVPKANRKAIYRYLFKEGVIVCKKDP-KGPWHPEL-NVPNLHVMMLMRSLKSRGLVKEQFAWQHYYYYLTDEGIEYLRT 79
                           90       100       110
                   ....*....|....*....|....*....|
gi 1920237946   85 YLHLPPEIVPASLQRVRRPVAMVMPARRTP 114
Cdd:PTZ00034    80 YLHLPPDVFPATHKKKSVNFERKTEEEGSR 109
Spectrin_like pfam18373
Spectrin like domain; Desmoplakin (DP) is an integral part of desmosomes, where it links ...
1020-1097 2.57e-30

Spectrin like domain; Desmoplakin (DP) is an integral part of desmosomes, where it links desmosomal cadherins to the intermediate filaments. The N-terminal region of DP contains a plakin domain common to members of the plakin family. Plakin domains contain multiple copies of spectrin repeats (SRs; pfam00435), each consisting of three alpha-helices (A, B, and C) that form an antiparallel triple-helical bundle. This entry describes SR6, which has a divergent structure relative to the other SRs: its helices A and B are significantly shorter than in other repeats. Structural comparison revealed that SR6 is more similar to other three-helix-bundle proteins, including target of Myb1 and the syntaxin Habc domain, than to other SR proteins. Because of these differences from other spectrin repeats, this region is termed a spectrin-like repeat.


Pssm-ID: 465730  Cd Length: 78  Bit Score: 116.16  E-value: 2.57e-30
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237946 1020 LAWQSLGRDMQLIRSWSLATFRTLKPEEQRQALRSLELHYQAFLRDSQDAGGFGPEDRLQAEREYGSCSRHYQQLLQS 1097
Cdd:pfam18373    1 VSWQYLLKDIQRINSWTISMLKTMRPEEYRQVLKNLETHYQDFLRDSQESEMFGAEDRRQLEREVNSAQQHYQTLLVS 78
PTZ00121 PTZ00121
MAEBL; Provisional
1848-2738 2.06e-27

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 124.10  E-value: 2.06e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1848 TEQGEQQRQLLEEELARLQREAAAATqkRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFREL-- 1925
Cdd:PTZ00121  1033 TEYGNNDDVLKEKDIIDEDIDGNHEG--KAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETgk 1110
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1926 AEEAARlralAEEAKRQrqlAEEdaVRQRAEAERvlAEKLAAISEATRLKTE--AEIALKEKEAENERLRRLAEDeafQR 2003
Cdd:PTZ00121  1111 AEEARK----AEEAKKK---AED--ARKAEEARK--AEDARKAEEARKAEDAkrVEIARKAEDARKAEEARKAED---AK 1176
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2004 RLLEEQAAqhkadIEARLA-QLRKASESelerqkglvedtlrqrRQVEEeilalkgsfekaaAGKAELELELGRIRgTAE 2082
Cdd:PTZ00121  1177 KAEAARKA-----EEVRKAeELRKAEDA----------------RKAEA-------------ARKAEEERKAEEAR-KAE 1221
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2083 DTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERlkaKVEEARRLRERAEQESARQL 2162
Cdd:PTZ00121  1222 DAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEAR---KADELKKAEEKKKADEAKKA 1298
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2163 QLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERlrseaeaarraaeeaeaareraereaaqsRRQVEEAER 2242
Cdd:PTZ00121  1299 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEE-----------------------------AKKAAEAAK 1349
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2243 LKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQaLRQKAQVEQELTALRLQLE 2322
Cdd:PTZ00121  1350 AEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE-LKKAAAAKKKADEAKKKAE 1428
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2323 ETDHQksildEELQRLKAEVTEAARQRGQVEEelfslRVQMEELGKlkaRIEAENRALVLRDKDSAQRLLQE---EAEKM 2399
Cdd:PTZ00121  1429 EKKKA-----DEAKKKAEEAKKADEAKKKAEE-----AKKAEEAKK---KAEEAKKADEAKKKAEEAKKADEakkKAEEA 1495
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2400 KQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLK----EKMQAVQEATRLKAEAELLQQQKELAQEQARRLQED 2475
Cdd:PTZ00121  1496 KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKadeaKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEED 1575
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2476 KeQMAQQLAQETQgfqktlETERQRQLEMSAEAERLRLRVAEmsraQARAEEDARrfrKQAEDIGErlyrtelaTQEKVM 2555
Cdd:PTZ00121  1576 K-NMALRKAEEAK------KAEEARIEEVMKLYEEEKKMKAE----EAKKAEEAK---IKAEELKK--------AEEEKK 1633
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2556 LVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQalqqsflsekdslLQRErci 2635
Cdd:PTZ00121  1634 KVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA-------------LKKE--- 1697
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2636 EQEKAKLEQL---FQDEVAKAQALREEQQRQQQqmqqekqqlaaSMEEARRRQHE----AEEgVRRQQEELQRLAQQQQQ 2708
Cdd:PTZ00121  1698 AEEAKKAEELkkkEAEEKKKAEELKKAEEENKI-----------KAEEAKKEAEEdkkkAEE-AKKDEEEKKKIAHLKKE 1765
                          890       900       910
                   ....*....|....*....|....*....|....
gi 1920237946 2709 QEKLLAEENQR----LRERLQHLEEERRAALARS 2738
Cdd:PTZ00121  1766 EEKKAEEIRKEkeavIEEELDEEDEKRRMEVDKK 1799
PTZ00121 PTZ00121
MAEBL; Provisional
2014-2740 3.40e-26

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 119.86  E-value: 3.40e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2014 KADIEARLAQLRKASESELERQKGLVEDTlRQRRQVEEEilalKGSFE---KAAAGKAELELELGRIRGTAEDTLRSKE- 2089
Cdd:PTZ00121  1059 KAEAKAHVGQDEGLKPSYKDFDFDAKEDN-RADEATEEA----FGKAEeakKTETGKAEEARKAEEAKKKAEDARKAEEa 1133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2090 -QAE-----QEAARQRQLAAEEERRRREAEERVQKSLAAEE----EAARQ----RKAA----LEEVERLKA--KVEEARR 2149
Cdd:PTZ00121  1134 rKAEdarkaEEARKAEDAKRVEIARKAEDARKAEEARKAEDakkaEAARKaeevRKAEelrkAEDARKAEAarKAEEERK 1213
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2150 LRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAERE 2229
Cdd:PTZ00121  1214 AEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKAD 1293
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2230 AAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADA--EMEKHKQFAEQALRQK 2307
Cdd:PTZ00121  1294 EAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAadEAEAAEEKAEAAEKKK 1373
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2308 AQVEQELTALRLQLEETDHQksildEELQRLKAEVTEAARQRGQVEEElfslrvqmeelgKLKARiEAENRALVLRDKDS 2387
Cdd:PTZ00121  1374 EEAKKKADAAKKKAEEKKKA-----DEAKKKAEEDKKKADELKKAAAA------------KKKAD-EAKKKAEEKKKADE 1435
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2388 AQRLLQE--EAEKMKQVAEEAARLSVAAQEAARLRQlAEEdlAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELA 2465
Cdd:PTZ00121  1436 AKKKAEEakKADEAKKKAEEAKKAEEAKKKAEEAKK-ADE--AKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKA 1512
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2466 QEqARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRlRVAEMSRAQA--RAEEDARRFRKQAEDigerL 2543
Cdd:PTZ00121  1513 DE-AKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELK-KAEEKKKAEEakKAEEDKNMALRKAEE----A 1586
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2544 YRTELATQEKVM-LVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFL 2622
Cdd:PTZ00121  1587 KKAEEARIEEVMkLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEE 1666
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2623 SEKDsllqrerciEQEKAKLEQLFQDEVAKAQAlreeqqrqqqqmqqeKQQLAASMEEARRrqheAEEgVRRQQEELQRL 2702
Cdd:PTZ00121  1667 AKKA---------EEDKKKAEEAKKAEEDEKKA---------------AEALKKEAEEAKK----AEE-LKKKEAEEKKK 1717
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|
gi 1920237946 2703 AQQQQQQEkllaEENQRLRERLQHLEEE--RRAALARSEE 2740
Cdd:PTZ00121  1718 AEELKKAE----EENKIKAEEAKKEAEEdkKKAEEAKKDE 1753
CH smart00033
Calponin homology domain; Actin binding domains present in duplicate at the N-termini of ...
188-285 2.25e-25

Calponin homology domain; Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeast Cdc24p.


Pssm-ID: 214479 [Multi-domain]  Cd Length: 101  Bit Score: 103.16  E-value: 2.25e-25
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946   188 KTFTKWVNKHLIKA-QRHISDLYEDLRDGHNLISLLEVLSGDSLPREK---GRMRFHKLQNVQIALDYLRHRQVKLVNIR 263
Cdd:smart00033    1 KTLLRWVNSLLAEYdKPPVTNFSSDLKDGVALCALLNSLSPGLVDKKKvaaSLSRFKKIENINLALSFAEKLGGKVVLFE 80
                            90       100
                    ....*....|....*....|..
gi 1920237946   264 NDDIADGnPKLTLGLIWTIILH 285
Cdd:smart00033   81 PEDLVEG-PKLILGVIWTLISL 101
growth_prot_Scy NF041483
polarized growth protein Scy;
1477-2688 2.54e-25

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 116.85  E-value: 2.54e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1477 AEQQRAEERERLAEVEAalEKQRQLAE-AHAQAKAQAEREAQGLQRRMQ--EEVARREEvAVEAQEQKRSIQEELQHLRQ 1553
Cdd:NF041483    85 ADQLRADAERELRDARA--QTQRILQEhAEHQARLQAELHTEAVQRRQQldQELAERRQ-TVESHVNENVAWAEQLRART 161
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1554 SSEA-----EIQAKARQVEAAERSRlrieeeirVVRLQLEAteRQRGGAEGElqalRARAEeAEAQKRQAQEEAERLRRQ 1628
Cdd:NF041483   162 ESQArrlldESRAEAEQALAAARAE--------AERLAEEA--RQRLGSEAE----SARAE-AEAILRRARKDAERLLNA 226
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1629 VQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQaeEAERRLRQAEAERARQVQVALETA-QRSAEAELQSE 1707
Cdd:NF041483   227 ASTQAQEATDHAEQLRSSTAAESDQARRQAAELSRAAEQRMQ--EAEEALREARAEAEKVVAEAKEAAaKQLASAESANE 304
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1708 hasfaektaQLERTLKEEhvaVVQLREEATRRAQQQAEAERARAEAERELERWQL-KANEALRLRLQAEEVAQQKSLTQA 1786
Cdd:NF041483   305 ---------QRTRTAKEE---IARLVGEATKEAEALKAEAEQALADARAEAEKLVaEAAEKARTVAAEDTAAQLAKAART 372
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1787 EAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQ-RLAAEQELIRLRAETEQgeqqrqlLEEELARL 1865
Cdd:NF041483   373 AEEVLTKASEDAKATTRAAAEEAERIRREAEAEADRLRGEAADQAEQlKGAAKDDTKEYRAKTVE-------LQEEARRL 445
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1866 QREAAaatQKRRELEAELAKVRAEmevllaskARAEeesrstsekSKQRLEAEAGRFRELAEEAarlRALAEEAKRQrql 1945
Cdd:NF041483   446 RGEAE---QLRAEAVAEGERIRGE--------ARRE---------AVQQIEEAARTAEELLTKA---KADADELRST--- 499
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1946 aeedavrQRAEAERVLAEklaAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAA-QHKADIEARLAQL 2024
Cdd:NF041483   500 -------ATAESERVRTE---AIERATTLRRQAEETLERTRAEAERLRAEAEEQAEEVRAAAERAArELREETERAIAAR 569
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2025 RKASESELERQKGLVEdtlrQRRQVEEEilALKGSFEKAAAGKAELELELGRIRGTAEDTLRS-KEQAEQEAARQRqlaa 2103
Cdd:NF041483   570 QAEAAEELTRLHTEAE----ERLTAAEE--ALADARAEAERIRREAAEETERLRTEAAERIRTlQAQAEQEAERLR---- 639
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2104 eeerrRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEE-ARRLR-------ERAEQESARQLQLAQ-EAAQKRLQ 2174
Cdd:NF041483   640 -----TEAAADASAARAEGENVAVRLRSEAAAEAERLKSEAQEsADRVRaeaaaaaERVGTEAAEALAAAQeEAARRRRE 714
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2175 AEE---KAHAFAVQQKEQELQQTLQQEQSVLERLrseaeaarraaeeaeaareraEREAAQSRRQVEEAERlkqsaeeqa 2251
Cdd:NF041483   715 AEEtlgSARAEADQERERAREQSEELLASARKRV---------------------EEAQAEAQRLVEEADR--------- 764
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2252 qaqaqaqaaaeklrkeaeqeaarraqaeqaalRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEET-DHQKSI 2330
Cdd:NF041483   765 --------------------------------RATELVSAAEQTAQQVRDSVAGLQEQAEEEIAGLRSAAEHAaERTRTE 812
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2331 LDEELQRLKAEvteAARQRGQVEEELFSLRVQ-MEELGKLKARIEAEnralVLRDKDSAQRLLQEEAEKMKQVAEEAA-R 2408
Cdd:NF041483   813 AQEEADRVRSD---AYAERERASEDANRLRREaQEETEAAKALAERT----VSEAIAEAERLRSDASEYAQRVRTEASdT 885
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2409 LSVAAQEAARLRQLAEEDLAQQRALA---EKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAq 2485
Cdd:NF041483   886 LASAEQDAARTRADAREDANRIRSDAaaqADRLIGEATSEAERLTAEARAEAERLRDEARAEAERVRADAAAQAEQLIA- 964
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2486 etqgfqktleterqrqlEMSAEAERLRLRVAE-MSRAQARAE---EDARRFRKQAEDIGERLyRTELATQEKVMLVQTLE 2561
Cdd:NF041483   965 -----------------EATGEAERLRAEAAEtVGSAQQHAErirTEAERVKAEAAAEAERL-RTEAREEADRTLDEARK 1026
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2562 TQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLL----QETQALQQSFLSEKDSLLQRERcieq 2637
Cdd:NF041483  1027 DANKRRSEAAEQADTLITEAAAEADQLTAKAQEEALRTTTEAEAQADTMVgaarKEAERIVAEATVEGNSLVEKAR---- 1102
                         1210      1220      1230      1240      1250
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1920237946 2638 ekAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLaasMEEARRRQHEA 2688
Cdd:NF041483  1103 --TDADELLVGARRDATAIRERAEELRDRITGEIEEL---HERARRESAEQ 1148
CH pfam00307
Calponin homology (CH) domain; The CH domain is found in both cytoskeletal proteins and signal ...
185-288 2.30e-24

Calponin homology (CH) domain; The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However, in calponins there is evidence that the CH domain is not involved in actin binding. Most member proteins have two to four copies of the CH domain; however, some proteins, such as calponin, have only a single copy.


Pssm-ID: 425596 [Multi-domain]  Cd Length: 109  Bit Score: 100.44  E-value: 2.30e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  185 VQKKTFTKWVNKHLIKAQRH--ISDLYEDLRDGHNLISLLEVLSGDSLP-REKGRMRFHKLQNVQIALDYLRHRQ-VKLV 260
Cdd:pfam00307    2 ELEKELLRWINSHLAEYGPGvrVTNFTTDLRDGLALCALLNKLAPGLVDkKKLNKSEFDKLENINLALDVAEKKLgVPKV 81
                           90       100
                   ....*....|....*....|....*...
gi 1920237946  261 NIRNDDIADGNPKLTLGLIWTIILHFQI 288
Cdd:pfam00307   82 LIEPEDLVEGDNKSVLTYLASLFRRFQA 109
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1514-2450 2.47e-24

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 113.23  E-value: 2.47e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1514 REAQGLQRRMQEEVARREEVAVEAQEQKRSIQ------EELQHLR-QSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQ 1586
Cdd:TIGR02168  175 KETERKLERTRENLDRLEDILNELERQLKSLErqaekaERYKELKaELRELELALLVLRLEELREELEELQEELKEAEEE 254
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1587 LEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEE 1666
Cdd:TIGR02168  255 LEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDE 334
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1667 LRLQAEEAERRLrqaeaerarqvqvaletaqrsaeAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATrraqqqaea 1746
Cdd:TIGR02168  335 LAEELAELEEKL-----------------------EELKEELESLEAELEELEAELEELESRLEELEEQLE--------- 382
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1747 eraraeaerelerwQLKANEALRLRLQAEEVAQQKSLTQAEaekqkeeaerearrrgkaeEQAVRQRELAEQELEKQRQl 1826
Cdd:TIGR02168  383 --------------TLRSKVAQLELQIASLNNEIERLEARL-------------------ERLEDRRERLQQEIEELLK- 428
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1827 aEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRS 1906
Cdd:TIGR02168  429 -KLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEG 507
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1907 TSE--KSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQlaeEDAVRQRAEAERVLAEKLAAiSEATRLKTEAEIALKE 1984
Cdd:TIGR02168  508 VKAllKNQSGLSGILGVLSELISVDEGYEAAIEAALGGRL---QAVVVENLNAAKKAIAFLKQ-NELGRVTFLPLDSIKG 583
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1985 KEAENERLRRLAEDEAFQRRL--LEEQAAQHKADIEARLAQLRKASEselerqkglVEDTLRQRRQVEEEIL-------- 2054
Cdd:TIGR02168  584 TEIQGNDREILKNIEGFLGVAkdLVKFDPKLRKALSYLLGGVLVVDD---------LDNALELAKKLRPGYRivtldgdl 654
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2055 -----ALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQlaaeeerRRREAEERVQKSLAAEEEAARQ 2129
Cdd:TIGR02168  655 vrpggVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRK-------ELEELEEELEQLRKELEELSRQ 727
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2130 RKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAfaVQQKEQELQQTLQQEQSVLERLRSEA 2209
Cdd:TIGR02168  728 ISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAE--AEAEIEELEAQIEQLKEELKALREAL 805
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2210 EAARRAAEEAEAARERAEREAAQSRRQVEEAERLkqsaeeqaqaqaqaqaaaekLRKEAEQEAARRAQAEQAALRQKQAA 2289
Cdd:TIGR02168  806 DELRAELTLLNEEAANLRERLESLERRIAATERR--------------------LEDLEEQIEELSEDIESLAAEIEELE 865
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2290 DAEMEKHKQFaEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEEL-GK 2368
Cdd:TIGR02168  866 ELIEELESEL-EALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLqER 944
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2369 L--KARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAarLRQLAE--EDLAQQRALAEKMLKEKMQA 2444
Cdd:TIGR02168  945 LseEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIKELGPVNLAAIEE--YEELKEryDFLTAQKEDLTEAKETLEEA 1022

                   ....*.
gi 1920237946 2445 VQEATR 2450
Cdd:TIGR02168 1023 IEEIDR 1028
CH pfam00307
Calponin homology (CH) domain; The CH domain is found in both cytoskeletal proteins and signal ...
300-406 6.37e-24

Calponin homology (CH) domain; The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However, in calponins there is evidence that the CH domain is not involved in actin binding. Most member proteins have two to four copies of the CH domain; however, some proteins, such as calponin, have only a single copy.


Pssm-ID: 425596 [Multi-domain]  Cd Length: 109  Bit Score: 99.28  E-value: 6.37e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  300 MTAKEKLLLWSQRMVEGC-QGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVY--RQTNLENLDQAFSVAERDLGVTR 376
Cdd:pfam00307    1 LELEKELLRWINSHLAEYgPGVRVTNFTTDLRDGLALCALLNKLAPGLVDKKKLNksEFDKLENINLALDVAEKKLGVPK 80
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1920237946  377 -LLDPEDVDvpQPDEKSIITYVSSLYDAMPR 406
Cdd:pfam00307   81 vLIEPEDLV--EGDNKSVLTYLASLFRRFQA 109
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1832-2624 1.98e-23

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 110.53  E-value: 1.98e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1832 QQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAAtQKRRELEAELAKVRAEmevLLASKARAEEESRSTSEKS 1911
Cdd:TIGR02168  172 ERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKA-ERYKELKAELRELELA---LLVLRLEELREELEELQEE 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1912 KQRLEAEagrFRELAEEAARLRALAEEAKRQRQLAEEDAvrqrAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENER 1991
Cdd:TIGR02168  248 LKEAEEE---LEELTAELQELEEKLEELRLEVSELEEEI----EELQKELYALANEISRLEQQKQILRERLANLERQLEE 320
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1992 LRRLAEDEAFQRRLLEEQAAQHKADIEaRLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFekaaagkAELE 2071
Cdd:TIGR02168  321 LEAQLEELESKLDELAEELAELEEKLE-ELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKV-------AQLE 392
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2072 LELGRIRGTAEDTLRSKEQAEQEAARQRQ-LAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRL 2150
Cdd:TIGR02168  393 LQIASLNNEIERLEARLERLEDRRERLQQeIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEE 472
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2151 RERAEQESARQLQLAQE--AAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAER 2228
Cdd:TIGR02168  473 AEQALDAAERELAQLQArlDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSELISVDEGYEAAIEAALGGRLQAVVV 552
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2229 EAAQSRRQVEEAerLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAaDAEMEKHKQFAeqaLRQKA 2308
Cdd:TIGR02168  553 ENLNAAKKAIAF--LKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKF-DPKLRKALSYL---LGGVL 626
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2309 QVEQELTALRLQlEETDHQKSI--LDEELQRLKAEVTEAARQRGQVeeeLFSLRVQMEELGKLKARIEAENRAL--VLRD 2384
Cdd:TIGR02168  627 VVDDLDNALELA-KKLRPGYRIvtLDGDLVRPGGVITGGSAKTNSS---ILERRREIEELEEKIEELEEKIAELekALAE 702
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2385 KDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKM-----------QAVQEATRLKA 2453
Cdd:TIGR02168  703 LRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEaeieeleerleEAEEELAEAEA 782
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2454 EAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRlrvaEMSRAQARAEEDARRFR 2533
Cdd:TIGR02168  783 EIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLE----DLEEQIEELSEDIESLA 858
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2534 KQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTvRQEQLLQE 2613
Cdd:TIGR02168  859 AEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL-RLEGLEVR 937
                          810
                   ....*....|.
gi 1920237946 2614 TQALQQSFLSE 2624
Cdd:TIGR02168  938 IDNLQERLSEE 948
growth_prot_Scy NF041483
polarized growth protein Scy;
1468-2179 2.45e-23

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 110.30  E-value: 2.45e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1468 RRMEEEERLAEQQRAEERERLAEVEAALEKQRQLA-EAHAQAKAQAEREAQGLQRRMQEEVARreeVAVEAQEQKRSIQE 1546
Cdd:NF041483   254 RQAAELSRAAEQRMQEAEEALREARAEAEKVVAEAkEAAAKQLASAESANEQRTRTAKEEIAR---LVGEATKEAEALKA 330
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1547 ELQHLRQSSEAEiqAKARQVEAAERSRLRIEEEirvvrlqlEATERQRGGAEGElQALRARAEEAEAQKRQAQEEAERLR 1626
Cdd:NF041483   331 EAEQALADARAE--AEKLVAEAAEKARTVAAED--------TAAQLAKAARTAE-EVLTKASEDAKATTRAAAEEAERIR 399
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1627 RQVQDETQRKRQAEAELALRVQAEAEAAREKQRAlqalEELRLQaEEAeRRLRqAEAERARqvqvaletaqrsaeaelqs 1706
Cdd:NF041483   400 REAEAEADRLRGEAADQAEQLKGAAKDDTKEYRA----KTVELQ-EEA-RRLR-GEAEQLR------------------- 453
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1707 ehasfAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELErwqlkANEALRLRLQAEEVAqqKSLTQA 1786
Cdd:NF041483   454 -----AEAVAEGERIRGEARREAVQQIEEAARTAEELLTKAKADADELRSTA-----TAESERVRTEAIERA--TTLRRQ 521
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1787 EAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLA-AEQELIRLRAETEQ----GEQQRQLLEEE 1861
Cdd:NF041483   522 AEETLERTRAEAERLRAEAEEQAEEVRAAAERAARELREETERAIAARQAeAAEELTRLHTEAEErltaAEEALADARAE 601
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1862 LARLQREAAAATQKRRELEAE-----LAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfrelAEEAARLRALA 1936
Cdd:NF041483   602 AERIRREAAEETERLRTEAAErirtlQAQAEQEAERLRTEAAADASAARAEGENVAVRLRSEA------AAEAERLKSEA 675
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1937 EE-AKRQRQLAEEDAVRQRAEAERVLAeklAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAfqrrllEEQAAQHKA 2015
Cdd:NF041483   676 QEsADRVRAEAAAAAERVGTEAAEALA---AAQEEAARRRREAEETLGSARAEADQERERAREQS------EELLASARK 746
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2016 DIEARLAQLRKASESELERQKGLV---EDTLRQRR--------QVEEEILALKGSFEKAAA-GKAELELELGRIRGtaeD 2083
Cdd:NF041483   747 RVEEAQAEAQRLVEEADRRATELVsaaEQTAQQVRdsvaglqeQAEEEIAGLRSAAEHAAErTRTEAQEEADRVRS---D 823
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2084 TLRSKEQAEQEAARQRQlaaeeerrrreaeERVQKSLAAEEEAARQRKAALEEVERLKAKVEE-ARRLRERAeqeSARQL 2162
Cdd:NF041483   824 AYAERERASEDANRLRR-------------EAQEETEAAKALAERTVSEAIAEAERLRSDASEyAQRVRTEA---SDTLA 887
                          730
                   ....*....|....*..
gi 1920237946 2163 QLAQEAAQKRLQAEEKA 2179
Cdd:NF041483   888 SAEQDAARTRADAREDA 904
growth_prot_Scy NF041483
polarized growth protein Scy;
1464-2180 3.97e-23

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 109.53  E-value: 3.97e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1464 SETLRRMEEEERL---AEQQRAE---ERERLaEVEAALEKQRQLAEAHAQAK---AQAEREAQGLQRRMQEEVAR-REEV 1533
Cdd:NF041483   433 AKTVELQEEARRLrgeAEQLRAEavaEGERI-RGEARREAVQQIEEAARTAEellTKAKADADELRSTATAESERvRTEA 511
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1534 AVEAQEQKRSIQEELQHLRqsSEAEiQAKARQVEAAERSRLRIEEEIRVVRLQLE-ATERQRGGAEGELQALRARAEE-- 1610
Cdd:NF041483   512 IERATTLRRQAEETLERTR--AEAE-RLRAEAEEQAEEVRAAAERAARELREETErAIAARQAEAAEELTRLHTEAEErl 588
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1611 --AEAQKRQAQEEAERLRRQVQDETQRKRQAEAE--LALRVQAEAEAAREKQRALQALEELRLQAEEAERRLR-QAEAER 1685
Cdd:NF041483   589 taAEEALADARAEAERIRREAAEETERLRTEAAEriRTLQAQAEQEAERLRTEAAADASAARAEGENVAVRLRsEAAAEA 668
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1686 ARQVQVALETAQRsaeaeLQSEHASFAEKTAQlertlkeehvavvqlreeatrraqqqaeaeraraeaerelerwqlKAN 1765
Cdd:NF041483   669 ERLKSEAQESADR-----VRAEAAAAAERVGT---------------------------------------------EAA 698
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1766 EALrlrlqaeevaqqksltqaeaekqkeeaerearrrGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELI--- 1842
Cdd:NF041483   699 EAL----------------------------------AAAQEEAARRRREAEETLGSARAEADQERERAREQSEELLasa 744
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1843 RLRAETEQGEQQRqLLEEELARLQREAAAATQKRRELEAELAKV--RAEMEV--LLASKARAEEESRSTSEKSKQRLEAE 1918
Cdd:NF041483   745 RKRVEEAQAEAQR-LVEEADRRATELVSAAEQTAQQVRDSVAGLqeQAEEEIagLRSAAEHAAERTRTEAQEEADRVRSD 823
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1919 AGRFRELA-EEAARLRALA-EEAKRQRQLAEEDAVRQRAEAERVLAEklaAISEATRLKTEAEIALKEKEAENERLRRLA 1996
Cdd:NF041483   824 AYAERERAsEDANRLRREAqEETEAAKALAERTVSEAIAEAERLRSD---ASEYAQRVRTEASDTLASAEQDAARTRADA 900
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1997 EDEAFQRRllEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAElelelgR 2076
Cdd:NF041483   901 REDANRIR--SDAAAQADRLIGEATSEAERLTAEARAEAERLRDEARAEAERVRADAAAQAEQLIAEATGEAE------R 972
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2077 IRGTAEDTLRSkeqAEQEAARQRQLAAEEERRrreaeervqkslaAEEEAARQRKAALEEVERL--KAKVEEARRLRERA 2154
Cdd:NF041483   973 LRAEAAETVGS---AQQHAERIRTEAERVKAE-------------AAAEAERLRTEAREEADRTldEARKDANKRRSEAA 1036
                          730       740
                   ....*....|....*....|....*.
gi 1920237946 2155 EQESARQLQLAQEAAQKRLQAEEKAH 2180
Cdd:NF041483  1037 EQADTLITEAAAEADQLTAKAQEEAL 1062
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1142-1706 2.70e-22

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 106.56  E-value: 2.70e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1142 ARECAQRITEQQKAQAEVDGLGKGVARLSAEAEKvlalpepspaaptLRSELELTLGKLEQVRSLSAIYLEKLKTISLVI 1221
Cdd:COG1196    252 EAELEELEAELAELEAELEELRLELEELELELEE-------------AQAEEYELLAELARLEQDIARLEERRRELEERL 318
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1222 RSTQEAEEVLRAHEEQLKEAQAvpATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVE 1301
Cdd:COG1196    319 EELEEELAELEEELEELEEELE--ELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAA 396
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1302 RWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALL 1381
Cdd:COG1196    397 ELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLE 476
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1382 EDIERhgekveecqrfAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLtsqyir 1461
Cdd:COG1196    477 AALAE-----------LLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAA------ 539
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1462 fisetlrrMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEA----HAQAKAQAEREAQGLQRRMQEEVARREEVAVEA 1537
Cdd:COG1196    540 --------LEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRAtflpLDKIRARAALAAALARGAIGAAVDLVASDLREA 611
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1538 QEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQ 1617
Cdd:COG1196    612 DARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEE 691
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1618 AQEEAERLRRQvqdETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVA-LETA 1696
Cdd:COG1196    692 ELELEEALLAE---EEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEeLERE 768
                          570
                   ....*....|
gi 1920237946 1697 QRSAEAELQS 1706
Cdd:COG1196    769 LERLEREIEA 778
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
2118-2733 3.57e-22

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 106.17  E-value: 3.57e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2118 KSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQ 2197
Cdd:COG1196    203 EPLERQAEKAERYRELKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELE 282
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2198 EQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQ 2277
Cdd:COG1196    283 LEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAE 362
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2278 AEQAALRQKQAADAEMEKHKQFAEQALRQkaqvEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELF 2357
Cdd:COG1196    363 AEEALLEAEAELAEAEEELEELAEELLEA----LRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEE 438
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2358 SLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKM 2437
Cdd:COG1196    439 EEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAG 518
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2438 LKEKMQAVQEATRLKAEAELLQQQKELA--QEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRV 2515
Cdd:COG1196    519 LRGLAGAVAVLIGVEAAYEAALEAALAAalQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIG 598
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2516 AEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLL 2595
Cdd:COG1196    599 AAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAE 678
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2596 QLKSEEMQTVRQEQLLQETQALQQsflsekdsLLQRERCIEQEKAKLEQlfqdevakaqalreeqqrqqqqmqqekqqlA 2675
Cdd:COG1196    679 AELEELAERLAEEELELEEALLAE--------EEEERELAEAEEERLEE------------------------------E 720
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237946 2676 ASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRA 2733
Cdd:COG1196    721 LEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLEREIEA 778
COG5045 COG5045
Ribosomal protein S10E [Translation, ribosomal structure and biogenesis];
5-112 3.73e-22

Ribosomal protein S10E [Translation, ribosomal structure and biogenesis];


Pssm-ID: 227378  Cd Length: 105  Bit Score: 94.22  E-value: 3.73e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946    5 MLMPLDQLRTIYEVLFREGVMVAKKDRRpRSLHPHVpGVTNLQVTRAMASLRARGLVRETFAWRHFYWYLTNEGIAHLRQ 84
Cdd:COG5045      1 MLVPKENRYKIHQRLFQKGVAVAKKDFN-LGKHREL-EIPNLHVIKAMQSLISYGYVKTIHVWRHSYYTLTPEGVEYLRE 78
                           90       100
                   ....*....|....*....|....*...
gi 1920237946   85 YLHLPPEIVPASLQRVRRPVAmvMPARR 112
Cdd:COG5045     79 YLVLPDEGVPSTEAPAVSPTQ--RPQRR 104
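
Each hit in this listing carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", optionally flagged "[Multi-domain]". The following is a minimal Python sketch for pulling those fields out of such a line. It assumes the layout seen in the hits above holds throughout; the function name `parse_hit_summary` and the returned dictionary keys are hypothetical and not part of any NCBI tool.

```python
import re
from typing import Dict, Optional, Union

# Pattern modelled on the summary lines shown in this listing (an assumption,
# not an official format specification).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line: str) -> Optional[Dict[str, Union[int, float, bool]]]:
    """Return the numeric fields of one summary line, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }
```

Lines that do not match the pattern yield `None`, so the function can be applied to every line of the listing and the results filtered afterwards.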
growth_prot_Scy NF041483
polarized growth protein Scy;
1471-2207 3.88e-20

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 99.52  E-value: 3.88e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1471 EEEERLAEQQRAEERERLAEVEAALEKQRQLAEahaQAKAQAEREAQGLQRRMQEEVARreeVAVEAQEQKRSIQEELQh 1550
Cdd:NF041483   560 EETERAIAARQAEAAEELTRLHTEAEERLTAAE---EALADARAEAERIRREAAEETER---LRTEAAERIRTLQAQAE- 632
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1551 lrqsSEAEiqaKARQVEAAERSRLRIEEEIRVVRLQLEA-TERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQV 1629
Cdd:NF041483   633 ----QEAE---RLRTEAAADASAARAEGENVAVRLRSEAaAEAERLKSEAQESADRVRAEAAAAAERVGTEAAEALAAAQ 705
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1630 QDETQRKRQAEAELAlrvQAEAEAAREKQRALQALEELrlqAEEAERRLRQAEAERARQVqvalETAQRSAeaelqSEHA 1709
Cdd:NF041483   706 EEAARRRREAEETLG---SARAEADQERERAREQSEEL---LASARKRVEEAQAEAQRLV----EEADRRA-----TELV 770
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1710 SFAEKTAQLERTlkeehvAVVQLREEATRraqqqaeaeraraeaerelerwqlkanEALRLRLQAEEVAQQksLTQAEAE 1789
Cdd:NF041483   771 SAAEQTAQQVRD------SVAGLQEQAEE---------------------------EIAGLRSAAEHAAER--TRTEAQE 815
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1790 KQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQrlaAEQELIRLRAETEQGEQQ--------------- 1854
Cdd:NF041483   816 EADRVRSDAYAERERASEDANRLRREAQEETEAAKALAERTVSE---AIAEAERLRSDASEYAQRvrteasdtlasaeqd 892
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1855 ----RQLLEEELARLQREAAA-----ATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfrel 1925
Cdd:NF041483   893 aartRADAREDANRIRSDAAAqadrlIGEATSEAERLTAEARAEAERLRDEARAEAERVRADAAAQAEQLIAEA------ 966
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1926 AEEAARLRALAEEAKRQrqlAEEDAVRQRAEAERVLAEklaAISEATRLKTEAEialkekeAENERLRRLAEDEAFQRR- 2004
Cdd:NF041483   967 TGEAERLRAEAAETVGS---AQQHAERIRTEAERVKAE---AAAEAERLRTEAR-------EEADRTLDEARKDANKRRs 1033
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2005 ---------LLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELG 2075
Cdd:NF041483  1034 eaaeqadtlITEAAAEADQLTAKAQEEALRTTTEAEAQADTMVGAARKEAERIVAEATVEGNSLVEKARTDADELLVGAR 1113
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2076 R----IRGTAEDtLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLK--------AK 2143
Cdd:NF041483  1114 RdataIRERAEE-LRDRITGEIEELHERARRESAEQMKSAGERCDALVKAAEEQLAEAEAKAKELVSDANseaskvriAA 1192
                          730       740       750       760       770       780       790
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2144 VEEARRLRERAEQESARQLQLAQ------EAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRS 2207
Cdd:NF041483  1193 VKKAEGLLKEAEQKKAELVREAEkikaeaEAEAKRTVEEGKRELDVLVRRREDINAEISRVQDVLEALES 1262
SH3_10 pfam17902
SH3 domain; This entry represents an SH3 domain.
919-985 3.49e-18

SH3 domain; This entry represents an SH3 domain.


Pssm-ID: 407754  Cd Length: 65  Bit Score: 81.15  E-value: 3.49e-18
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237946  919 QLKPRSpaHPMRGRVPLLAVCDYKQVEVTVHKGDECQMVGPAQPFYWKVLGSSCSEAAMPSVCFLVP 985
Cdd:pfam17902    1 PLKQRR--SPVTRPIPVKALCDYKQGEVTVEKGEECTLLDNSDREKWKVQTSSGVEKLVPSVCFLIP 65
CH smart00033
Calponin homology domain; Actin binding domains present in duplicate at the N-termini of ...
304-400 4.76e-18

Calponin homology domain; Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeast Cdc24p.


Pssm-ID: 214479 [Multi-domain]  Cd Length: 101  Bit Score: 82.36  E-value: 4.76e-18
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946   304 EKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTN----LENLDQAFSVAERDLGVTRLLD 379
Cdd:smart00033    1 KTLLRWVNSLLAEYDKPPVTNFSSDLKDGVALCALLNSLSPGLVDKKKVAASLSrfkkIENINLALSFAEKLGGKVVLFE 80
                            90       100
                    ....*....|....*....|.
gi 1920237946   380 PEDVDVPQPDEKSIITYVSSL 400
Cdd:smart00033   81 PEDLVEGPKLILGVIWTLISL 101
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1523-2428 6.95e-18

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher-order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 92.34  E-value: 6.95e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1523 MQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERsrlriEEEIRVVRLQLEATERQRGGAEGELQ 1602
Cdd:pfam02463  158 IEEEAAGSRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQA-----KKALEYYQLKEKLELEEEYLLYLDYL 232
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1603 ALRARAEEAEAQKRQAQEEAERLRRQVQDetqrKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAE 1682
Cdd:pfam02463  233 KLNEERIDLLQELLRDEQEEIESSKQEIE----KEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERR 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1683 AERARQVQVALETAQRSAEAELQSEHASFAEKtaqleRTLKEEHVAVVQLREEatrraqqqaeaeraraeaerelerwql 1762
Cdd:pfam02463  309 KVDDEEKLKESEKEKKKAEKELKKEKEEIEEL-----EKELKELEIKREAEEE--------------------------- 356
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1763 kANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELI 1842
Cdd:pfam02463  357 -EEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEE 435
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1843 RLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEA----- 1917
Cdd:pfam02463  436 EESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKvllal 515
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1918 -EAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLA 1996
Cdd:pfam02463  516 iKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIA 595
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1997 EDEAFQRRLLeeqAAQHKADIEARLAQ---------LRKASESELERQKGLVEDTLRQRRQVEEEILALKG---SFEKAA 2064
Cdd:pfam02463  596 VLEIDPILNL---AQLDKATLEADEDDkrakvvegiLKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEvkaSLSELT 672
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2065 AGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKV 2144
Cdd:pfam02463  673 KELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEE 752
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2145 EEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAhafAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARE 2224
Cdd:pfam02463  753 EKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLK---VEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEK 829
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2225 RAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQaadAEMEKHKQFAEQAL 2304
Cdd:pfam02463  830 IKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKE---EKEKEEKKELEEES 906
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2305 RQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEvteaarQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRD 2384
Cdd:pfam02463  907 QKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLE------EADEKEKEENNKEEEEERNKRLLLAKEELGKVNLMAI 980
                          890       900       910       920
                   ....*....|....*....|....*....|....*....|....
gi 1920237946 2385 KDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLA 2428
Cdd:pfam02463  981 EEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQRLKEFLE 1024
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1047-1735 2.09e-16

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 87.42  E-value: 2.09e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1047 EQRQALRSLELHYQA----FLRDSQDAGgfgPEDRLQAEREYGSCSRHYQQLLQSLEQGEQE----ESRCQRCISELKDI 1118
Cdd:TIGR02168  217 ELKAELRELELALLVlrleELREELEEL---QEELKEAEEELEELTAELQELEEKLEELRLEvselEEEIEELQKELYAL 293
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1119 RLQLEACETRTVH---RLRlPLDKEPARECAQRITEQQK---AQAEVDGLGKGVARLSAEAEkvlALPEPSPAAPTLRSE 1192
Cdd:TIGR02168  294 ANEISRLEQQKQIlreRLA-NLERQLEELEAQLEELESKldeLAEELAELEEKLEELKEELE---SLEAELEELEAELEE 369
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1193 LELTLGKL-EQVRSLSAIYLEKLKTISLvIRSTQEaeeVLRAHEEQLKEAQAVPATlpelEATKAALKKLRAQAEAQQPV 1271
Cdd:TIGR02168  370 LESRLEELeEQLETLRSKVAQLELQIAS-LNNEIE---RLEARLERLEDRRERLQQ----EIEELLKKLEEAELKELQAE 441
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1272 FDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTLL---LERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAW 1348
Cdd:TIGR02168  442 LEELEEELEELQEELERLEEALEELREELEEAEQALDAAereLAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGI 521
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1349 LRDAKQR---------------QEQIQAVPLANSQAVR---EQLRQEK----ALLEDIERHGEKVEECQRFAKQYINAIK 1406
Cdd:TIGR02168  522 LGVLSELisvdegyeaaieaalGGRLQAVVVENLNAAKkaiAFLKQNElgrvTFLPLDSIKGTEIQGNDREILKNIEGFL 601
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1407 DYELQLVTYKAQLEPVASP---------------AKKPKVQSGSESIIQEYVDLRTRYS--------ELSTL-TSQYIRF 1462
Cdd:TIGR02168  602 GVAKDLVKFDPKLRKALSYllggvlvvddldnalELAKKLRPGYRIVTLDGDLVRPGGVitggsaktNSSILeRRREIEE 681
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1463 ISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKR 1542
Cdd:TIGR02168  682 LEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEA 761
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1543 SIQEELQHLRQSSEAEIQAKARQVEAaersrlriEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEA 1622
Cdd:TIGR02168  762 EIEELEERLEEAEEELAEAEAEIEEL--------EAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRI 833
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1623 ERLRRQVQDETQRKRQAEAELAlrvqaeaeaarekqRALQALEELRLQAEEAERRLRQAEAERArQVQVALETAqRSAEA 1702
Cdd:TIGR02168  834 AATERRLEDLEEQIEELSEDIE--------------SLAAEIEELEELIEELESELEALLNERA-SLEEALALL-RSELE 897
                          730       740       750
                   ....*....|....*....|....*....|....*
gi 1920237946 1703 ELQSEHASFAEKTAQLERTLKE--EHVAVVQLREE 1735
Cdd:TIGR02168  898 ELSEELRELESKRSELRRELEElrEKLAQLELRLE 932
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
2931-2969 2.27e-15

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 72.36  E-value: 2.27e-15
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 2931 LLEAQIATGGIIDPVHSHRLPVDVAYQRGYFDEEMNRVL 2969
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
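
The paired rows in each alignment block (the query line starting "gi" and the model line starting "Cdd:") can be compared column by column. The sketch below estimates the fraction of identical residues over aligned columns only. It assumes that lowercase letters mark unaligned residues and that "-" marks a gap, as the blocks in this listing suggest; the function name `aligned_identity` is hypothetical, and the caller is expected to pass only the sequence portion of each row, without the identifier and coordinates.

```python
def aligned_identity(query_row: str, subject_row: str) -> float:
    """Fraction of identical residues over columns where both rows are aligned.

    Columns containing a gap ('-') or a lowercase (assumed unaligned) residue
    in either row are skipped.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    aligned = [
        (q, s)
        for q, s in zip(query_row, subject_row)
        if q.isupper() and s.isupper()
    ]
    if not aligned:
        return 0.0
    matches = sum(1 for q, s in aligned if q == s)
    return matches / len(aligned)
```

For the Plectin repeat block above, the two 39-residue rows could be passed in directly, since that block contains neither gaps nor lowercase residues.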
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
4168-4206 5.20e-15

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 71.59  E-value: 5.20e-15
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 4168 LLEAQIATGGIIDPEESHRLPVDVAYQRGLFDEEMNEIL 4206
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3590-3628 5.20e-15

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 71.59  E-value: 5.20e-15
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3590 LLEAQIATGGIIDPVHSHRVPVDVAYQRGYFDEEMNRVL 3628
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3849-3887 1.06e-13

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 67.74  E-value: 1.06e-13
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3849 LLEAQAATGFLLDPVKGERLAVDEAVRKGLVGPELHDRL 3887
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
2283-2742 1.32e-13

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 78.17  E-value: 1.32e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2283 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQ 2362
Cdd:TIGR02168  231 VLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRER 310
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2363 MEELGKLKARIEAEnRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKM 2442
Cdd:TIGR02168  311 LANLERQLEELEAQ-LEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVA 389
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2443 QAVQEATRLKAEAELLQQQKE-LAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRA 2521
Cdd:TIGR02168  390 QLELQIASLNNEIERLEARLErLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREE 469
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2522 QARAEEDARRFRKQAEDIGERLYRTE-----------------------------LATQEKV------------------ 2554
Cdd:TIGR02168  470 LEEAEQALDAAERELAQLQARLDSLErlqenlegfsegvkallknqsglsgilgvLSELISVdegyeaaieaalggrlqa 549
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2555 MLVQTLETQRQ--QSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEE-----------------------MQTVRQEQ 2609
Cdd:TIGR02168  550 VVVENLNAAKKaiAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEgflgvakdlvkfdpklrkalsylLGGVLVVD 629
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2610 LLQETQALQ------------------------QSFLSEKDSLLQRERCIEQEKAKLEQLfQDEVAKAQALREEQQRQQQ 2665
Cdd:TIGR02168  630 DLDNALELAkklrpgyrivtldgdlvrpggvitGGSAKTNSSILERRREIEELEEKIEEL-EEKIAELEKALAELRKELE 708
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237946 2666 QMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEEIA 2742
Cdd:TIGR02168  709 ELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIE 785
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1469-2593 1.63e-13

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains; it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone of the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 77.91  E-value: 1.63e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1469 RMEEEERLAEQQRAEERERLAEVEAAL----EKQRQLAEAHAQAKAQAEREAqglqrrmqEEVARREEVAVEAQEQKRSI 1544
Cdd:pfam01576    2 RQEEEMQAKEEELQKVKERQQKAESELkeleKKHQQLCEEKNALQEQLQAET--------ELCAEAEEMRARLAARKQEL 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1545 QEELQHLrqssEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRggaegelQALRARAEEAEAQKRQAQEEAER 1624
Cdd:pfam01576   74 EEILHEL----ESRLEEEEERSQQLQNEKKKMQQHIQDLEEQLDEEEAAR-------QKLQLEKVTTEAKIKKLEEDILL 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1625 LRRQVQDETQRKRQAE---AELALRVQAEAEAA------REKQRALQALEELRLQAEEAERRlrqaEAERARQVQVALET 1695
Cdd:pfam01576  143 LEDQNSKLSKERKLLEeriSEFTSNLAEEEEKAkslsklKNKHEAMISDLEERLKKEEKGRQ----ELEKAKRKLEGEST 218
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1696 AQRSAEAELQsehASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAeaeraraeaerelerwQLKANEALRLRLQaE 1775
Cdd:pfam01576  219 DLQEQIAELQ---AQIAELRAQLAKKEEELQAALARLEEETAQKNNALK----------------KIRELEAQISELQ-E 278
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1776 EVAQQKSltqaeaekqkeeaerearrrgkAEEQAVRQRELAEQELEKQRQLAEGT-----AQQRLAA--EQELIRLR--- 1845
Cdd:pfam01576  279 DLESERA----------------------ARNKAEKQRRDLGEELEALKTELEDTldttaAQQELRSkrEQEVTELKkal 336
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1846 -AETEQGEQQRQ-----------LLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEEsRSTSEKSKQ 1913
Cdd:pfam01576  337 eEETRSHEAQLQemrqkhtqaleELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQAKQDSEHK-RKKLEGQLQ 415
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1914 RLEA---EAGRFR-ELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERV---LAEKLAAISEATRLKTEAEIALKEKE 1986
Cdd:pfam01576  416 ELQArlsESERQRaELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLesqLQDTQELLQEETRQKLNLSTRLRQLE 495
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1987 AENERLRRLAEDEAFQRRLLEEQAAQHkadiEARLAQLRKASESELERQKGLVEDtlrqRRQVEEEILALKGSFEKAAAG 2066
Cdd:pfam01576  496 DERNSLQEQLEEEEEAKRNVERQLSTL----QAQLSDMKKKLEEDAGTLEALEEG----KKRLQRELEALTQQLEEKAAA 567
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2067 KAELELELGRIRGTAEDTLRSKEQaeqeaarQRQLAAEEERRRREAeervqKSLAAEEEAARQRKAALEEVERLKAKVEE 2146
Cdd:pfam01576  568 YDKLEKTKNRLQQELDDLLVDLDH-------QRQLVSNLEKKQKKF-----DQMLAEEKAISARYAEERDRAEAEAREKE 635
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2147 ARRLRERAEQESARQLQLAQEAAQKRLQAEekahafavqqkeqelqqtlqqeqsvLERLRSEAEAARRAAEEAEAARERA 2226
Cdd:pfam01576  636 TRALSLARALEEALEAKEELERTNKQLRAE-------------------------MEDLVSSKDDVGKNVHELERSKRAL 690
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2227 EREAAQSRRQVEEAERLKQSaeeqaqaqaqaqAAAEKLRKEAEQEAARRAQAeqaalRQKQAADAEMEKHKQfaeQALRQ 2306
Cdd:pfam01576  691 EQQVEEMKTQLEELEDELQA------------TEDAKLRLEVNMQALKAQFE-----RDLQARDEQGEEKRR---QLVKQ 750
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2307 KAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKD 2386
Cdd:pfam01576  751 VRELEAELEDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAVKQLKKLQAQMKDLQRELEEARASRDEILAQSKE 830
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2387 SAQRLLQEEAEKMkQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQ 2466
Cdd:pfam01576  831 SEKKLKNLEAELL-QLQEDLAASERARRQAQQERDELADEIASGASGKSALQDEKRRLEARIAQLEEELEEEQSNTELLN 909
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2467 EQARRLQEDKEQMAQQLAQETQGFQKtLETERQrqlEMSAEAERLRLRVAEMSRAQ---------------ARAEEDARR 2531
Cdd:pfam01576  910 DRLRKSTLQVEQLTTELAAERSTSQK-SESARQ---QLERQNKELKAKLQEMEGTVkskfkssiaaleakiAQLEEQLEQ 985
                         1130      1140      1150      1160      1170      1180
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2532 FRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQ 2593
Cdd:pfam01576  986 ESRERQAANKLVRRTEKKLKEVLLQVEDERRHADQYKDQAEKGNSRMKQLKRQLEEAEEEAS 1047
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3183-3221 7.46e-13

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 65.43  E-value: 7.46e-13
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3183 LLEAQAGTGHIIDPTTSARLTVDEAVRAGLVGPELHEKL 3221
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
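
Note on reading the alignment rows: each pair of rows gives a label, a start coordinate, the aligned residues, and an end coordinate, first for the query (gi 1920237946) and then for the domain model (Cdd:...). The Python sketch below is only an illustration of how the two rows of the hit above could be cross-checked against its interval (3183-3221) and Cd Length (39). The helper names `parse_row` and `count_identities` are hypothetical and not part of any NCBI tool, and the sketch assumes a single ungapped block like this one.

```python
def parse_row(line):
    """Split one alignment row into (label, start, residues, end).

    Assumes the last three whitespace-separated fields are the start
    coordinate, the aligned residues, and the end coordinate.
    """
    parts = line.split()
    start, residues, end = parts[-3:]
    label = " ".join(parts[:-3])  # e.g. "gi 1920237946" or "Cdd:pfam00681"
    return label, int(start), residues, int(end)


def count_identities(query, model):
    """Count aligned columns where both rows carry the same residue."""
    return sum(1 for q, m in zip(query, model) if q == m and q != "-")


# Rows copied from the Plectin repeat hit at 3183-3221.
query_row = "gi 1920237946 3183 LLEAQAGTGHIIDPTTSARLTVDEAVRAGLVGPELHEKL 3221"
model_row = "Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39"

q_label, q_start, q_seq, q_end = parse_row(query_row)
m_label, m_start, m_seq, m_end = parse_row(model_row)

span = q_end - q_start + 1          # residues covered on the query
identical = count_identities(q_seq, m_seq)
print(f"{q_label}: {span} residues, {identical} identical to {m_label}")
```

For gapped blocks elsewhere in this listing, dashes and lowercase (unaligned) residues mean the coordinate span and the row length need not agree, so the span check applies only to ungapped rows such as these.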
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3925-3963 2.93e-12

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 63.50  E-value: 2.93e-12
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3925 LLDAQLATGGIVDPRLGFHLPLDVAYQRGYLDKDTHDQL 3963
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3514-3552 4.41e-12

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 63.12  E-value: 4.41e-12
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3514 LLEAQAATGFLVDPVRNQRLYVHEAVKAGVVGPELHEKL 3552
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
4437-4475 4.59e-12

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 63.12  E-value: 4.59e-12
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 4437 LLEAQACTGGIIDPSTGERFPVTDAVNKGLVDKIMVDRI 4475
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
2301-2625 7.36e-12

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 72.08  E-value: 7.36e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2301 EQALRQKAQVEQELTALRLQLEETDHQKSIlDEELQRLKAEVTEAARQRGQVEEELFSL--RVQMEELGKLK-------A 2371
Cdd:pfam17380  255 EYTVRYNGQTMTENEFLNQLLHIVQHQKAV-SERQQQEKFEKMEQERLRQEKEEKAREVerRRKLEEAEKARqaemdrqA 333
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2372 RIEAENRALVLRDKDSAQRLLQEEaekmKQVAEEAARLSVAAQEAARLRQLaeEDLAQQRALAEKMLKEKMQAVQEATRL 2451
Cdd:pfam17380  334 AIYAEQERMAMERERELERIRQEE----RKRELERIRQEEIAMEISRMREL--ERLQMERQQKNERVRQELEAARKVKIL 407
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2452 KAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQgfQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARR 2531
Cdd:pfam17380  408 EEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEER--AREMERVRLEEQERQQQVERLRQQEEERKRKKLELEKEKRD 485
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2532 fRKQAEDIGERLYRTELATQEKVM--------LVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQ 2603
Cdd:pfam17380  486 -RKRAEEQRRKILEKELEERKQAMieeerkrkLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRRIQEQMRKATEE 564
                          330       340
                   ....*....|....*....|..
gi 1920237946 2604 TVRQEQLLQETQALQQSFLSEK 2625
Cdd:pfam17380  565 RSRLEAMEREREMMRQIVESEK 586
SPEC cd00176
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ...
744-922 8.48e-12

Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three-helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C, and are separated by a linker of 5 residues; two copies of the repeat are present here.


Pssm-ID: 238103 [Multi-domain]  Cd Length: 213  Bit Score: 67.86  E-value: 8.48e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  744 LHGFVAAATKELMWLSDREEEEVGFDWSDRNTNMAAKKEGYSALMHELELKEKKIKEIQSTGDRLLREDHPARPTAESFQ 823
Cdd:cd00176      2 LQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAEEIQERL 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  824 AALQTQWSWMLQLCCCIEAHLKENTAYFQFFSDVREAEEQLRKLQETLRRKYTCDrsiTATRLEDLLQDAQDEKEQLSEY 903
Cdd:cd00176     82 EELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALASEDLGK---DLESVEELLKKHKELEEELEAH 158
                          170
                   ....*....|....*....
gi 1920237946  904 RGHLSGLAKRAKAIVQLKP 922
Cdd:cd00176    159 EPRLKSLNELAEELLEEGH 177
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
4092-4130 1.05e-11

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 61.96  E-value: 1.05e-11
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 4092 LLEAQAATGYVIDPIKGLKLTVEEAVRMGIVGPEFKDKL 4130
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3259-3297 1.19e-11

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 61.96  E-value: 1.19e-11
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3259 LLDAQLSTGGIVDPSKSHRVPLDVACARGYLDKETSAAL 3297
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
PTZ00121 PTZ00121
MAEBL; Provisional
1123-1678 2.90e-11

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 70.56  E-value: 2.90e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1123 EACETRTVHRLRLPLDKEPARECAQRITEQQKAqaevDGLGKGV--ARLSAEAEKVLAlPEPSPAAPTLRSELELTLGKL 1200
Cdd:PTZ00121  1285 KAEEKKKADEAKKAEEKKKADEAKKKAEEAKKA----DEAKKKAeeAKKKADAAKKKA-EEAKKAAEAAKAEAEAAADEA 1359
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1201 EQVRSLS-AIYLEKLKTISLVIRSTQEAEEVLRAhEEQLKEAQAVPATLPELEATKAALKK---LRAQAEAQQPVfdalr 1276
Cdd:PTZ00121  1360 EAAEEKAeAAEKKKEEAKKKADAAKKKAEEKKKA-DEAKKKAEEDKKKADELKKAAAAKKKadeAKKKAEEKKKA----- 1433
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1277 DELRGAQEvgERLQQRHGERDVEVERWRERVTLLLERwqavlaqtdvrQRELEQLGRQLRYYREsADPLGAWLRDAKQRQ 1356
Cdd:PTZ00121  1434 DEAKKKAE--EAKKADEAKKKAEEAKKAEEAKKKAEE-----------AKKADEAKKKAEEAKK-ADEAKKKAEEAKKKA 1499
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1357 EQIQAVPLANSQAVREQLRQEKALLEDIERHGE--KVEEcqrfAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSG 1434
Cdd:PTZ00121  1500 DEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEakKADE----AKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEED 1575
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1435 SESIIQEYVDLR----TRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERlaeveaalEKQRQLAEAHAQAKA 1510
Cdd:PTZ00121  1576 KNMALRKAEEAKkaeeARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEK--------KKVEQLKKKEAEEKK 1647
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1511 QAEReaqgLQRRMQEEVARREEVAVEAQEQKRSIQEelqhLRQSSEAEiqakarqvEAAERSRLRIEEEIRVVRLQLEAT 1590
Cdd:PTZ00121  1648 KAEE----LKKAEEENKIKAAEEAKKAEEDKKKAEE----AKKAEEDE--------KKAAEALKKEAEEAKKAEELKKKE 1711
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1591 ERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAElalrvQAEAEAAREKQRALQALEELRLQ 1670
Cdd:PTZ00121  1712 AEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLK-----KEEEKKAEEIRKEKEAVIEEELD 1786

                   ....*...
gi 1920237946 1671 AEEAERRL 1678
Cdd:PTZ00121  1787 EEDEKRRM 1794
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
2855-2893 3.14e-11

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 60.80  E-value: 3.14e-11
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 2855 LLEAQAASGFLLDPVRNRRLAVNEAVKEGIVGPELHHKL 2893
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
SPEC cd00176
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ...
1222-1418 8.65e-10

Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three-helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C, and are separated by a linker of 5 residues; two copies of the repeat are present here.


Pssm-ID: 238103 [Multi-domain]  Cd Length: 213  Bit Score: 61.69  E-value: 8.65e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1222 RSTQEAEEVLRAHEEQLKEAQaVPATLPELEATKAALKKLRAQAEAQQPVFDALrdelrgaQEVGERLQQRHGERDVEVe 1301
Cdd:cd00176      7 RDADELEAWLSEKEELLSSTD-YGDDLESVEALLKKHEALEAELAAHEERVEAL-------NELGEQLIEEGHPDAEEI- 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1302 rwRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADpLGAWLRDAKQRQEQIQavPLANSQAVREQLRQEKALL 1381
Cdd:cd00176     78 --QERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADD-LEQWLEEKEAALASED--LGKDLESVEELLKKHKELE 152
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1920237946 1382 EDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQ 1418
Cdd:cd00176    153 EELEAHEPRLKSLNELAEELLEEGHPDADEEIEEKLE 189
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
4513-4551 4.42e-09

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 54.64  E-value: 4.42e-09
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 4513 FLEVQYLTGGLIEPDTPGRVALDEALQRGTVDARTAQKL 4551
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
PLEC smart00250
Plectin repeat;
4435-4472 5.75e-09

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 54.41  E-value: 5.75e-09
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1920237946  4435 QRLLEAQACTGGIIDPSTGERFPVTDAVNKGLVDKIMV 4472
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
growth_prot_Scy NF041483
polarized growth protein Scy;
2284-2730 3.11e-08

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 60.61  E-value: 3.11e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2284 RQKQAADAEMEKHKqfAEQAL-RQKAQVEQELTALRLQLEE-TDHQksildEELQRLKAEVTEAARQRGQVEEELFSLRV 2361
Cdd:NF041483   195 RQRLGSEAESARAE--AEAILrRARKDAERLLNAASTQAQEaTDHA-----EQLRSSTAAESDQARRQAAELSRAAEQRM 267
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2362 QMEELGKLKARIEAEnRALVLRDKDSAQRLLQEEA---EKMKQVAEEAARL-SVAAQEAARLRQLAEEDLAQQRALAEKM 2437
Cdd:NF041483   268 QEAEEALREARAEAE-KVVAEAKEAAAKQLASAESaneQRTRTAKEEIARLvGEATKEAEALKAEAEQALADARAEAEKL 346
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2438 LKEKMQAVQEATRLKAEAELlqqqkelaqEQARRLQEDKEQMAQQLAQETQGfQKTLETERQRQlEMSAEAERLRLRVAE 2517
Cdd:NF041483   347 VAEAAEKARTVAAEDTAAQL---------AKAARTAEEVLTKASEDAKATTR-AAAEEAERIRR-EAEAEADRLRGEAAD 415
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2518 MS-RAQARAEEDARRFRKQAEDIGE--RLYRTElATQEKVMLVQTLETQRQQSDRDA-ERLREAIAELEHEKDKLKQEA- 2592
Cdd:NF041483   416 QAeQLKGAAKDDTKEYRAKTVELQEeaRRLRGE-AEQLRAEAVAEGERIRGEARREAvQQIEEAARTAEELLTKAKADAd 494
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2593 QLLQLKSEEMQTVRQEQLLQETQALQQSflsekDSLLQRERCiEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQ 2672
Cdd:NF041483   495 ELRSTATAESERVRTEAIERATTLRRQA-----EETLERTRA-EAERLRAEAEEQAEEVRAAAERAARELREETERAIAA 568
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2673 QLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLR----ERLQHLEEE 2730
Cdd:NF041483   569 RQAEAAEELTRLHTEAEERLTAAEEALADARAEAERIRREAAEETERLRteaaERIRTLQAQ 630
PLEC smart00250
Plectin repeat;
4166-4202 9.63e-08

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 50.94  E-value: 9.63e-08
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  4166 IRLLEAQIATGGIIDPEESHRLPVDVAYQRGLFDEEM 4202
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
PLEC smart00250
Plectin repeat;
2929-2965 7.46e-07

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 48.25  E-value: 7.46e-07
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  2929 IRLLEAQIATGGIIDPVHSHRLPVDVAYQRGYFDEEM 2965
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
4269-4297 1.07e-06

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 47.71  E-value: 1.07e-06
                           10        20
                   ....*....|....*....|....*....
gi 1920237946 4269 IVDPETGKEMSVYEAYRKGLIDHQTYLEL 4297
Cdd:pfam00681   11 IIDPVTGERLSVEEAVKRGLIDPETAQKL 39
PLEC smart00250
Plectin repeat;
3588-3624 1.79e-06

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 47.09  E-value: 1.79e-06
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  3588 IRLLEAQIATGGIIDPVHSHRVPVDVAYQRGYFDEEM 3624
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
PLEC smart00250
Plectin repeat;
3847-3882 3.91e-06

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 46.32  E-value: 3.91e-06
                            10        20        30
                    ....*....|....*....|....*....|....*.
gi 1920237946  3847 RQLLEAQAATGFLLDPVKGERLAVDEAVRKGLVGPE 3882
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE 36
PLEC smart00250
Plectin repeat;
4052-4089 5.10e-06

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 45.94  E-value: 5.10e-06
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1920237946  4052 QRFLEGTSSIAGVLVDATKERLSVYQAMKKGIIRPGTA 4089
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
4054-4092 9.47e-06

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 45.01  E-value: 9.47e-06
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 4054 FLEGTSSIAGVLVDATKERLSVYQAMKKGIIRPGTAFEL 4092
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
growth_prot_Scy NF041483
polarized growth protein Scy;
1969-2722 1.07e-05

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 52.14  E-value: 1.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1969 SEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQ 2048
Cdd:NF041483    22 AEMDRLKTEREKAVQHAEDLGYQVEVLRAKLHEARRSLASRPAYDGADIGYQAEQLLRNAQIQADQLRADAERELRDARA 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2049 VEEEIlaLKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQ----------AEQEAARQRQLAAEEERRRREAEERVQK 2118
Cdd:NF041483   102 QTQRI--LQEHAEHQARLQAELHTEAVQRRQQLDQELAERRQtveshvnenvAWAEQLRARTESQARRLLDESRAEAEQA 179
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2119 SLAAEEEAARqrkAALEEVERLKAKVEEARRLRE----RAEQESARQLQLAQEAAQKRLQAEEKAHAF-------AVQQK 2187
Cdd:NF041483   180 LAAARAEAER---LAEEARQRLGSEAESARAEAEailrRARKDAERLLNAASTQAQEATDHAEQLRSStaaesdqARRQA 256
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2188 EQELQQTLQQEQSVLERLR-----SEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAE 2262
Cdd:NF041483   257 AELSRAAEQRMQEAEEALRearaeAEKVVAEAKEAAAKQLASAESANEQRTRTAKEEIARLVGEATKEAEALKAEAEQAL 336
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2263 KLRKEAEQEAARRAQAEQAALRQKQAAdAEMEKHKQFAEQALRQKAQVEQELTalRLQLEETDHQKSILDEELQRLKAEV 2342
Cdd:NF041483   337 ADARAEAEKLVAEAAEKARTVAAEDTA-AQLAKAARTAEEVLTKASEDAKATT--RAAAEEAERIRREAEAEADRLRGEA 413
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2343 TEAARQ-RGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAAR-----LSVAAQEA 2416
Cdd:NF041483   414 ADQAEQlKGAAKDDTKEYRAKTVELQEEARRLRGEAEQLRAEAVAEGERIRGEARREAVQQIEEAARtaeelLTKAKADA 493
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2417 ARLRQLAEEDLAQQRALA-EKMLKEKMQAVQEATRLKAEAELLQQQkelAQEQARRLQEDKEQMAQQLAQETQGFQKTLE 2495
Cdd:NF041483   494 DELRSTATAESERVRTEAiERATTLRRQAEETLERTRAEAERLRAE---AEEQAEEVRAAAERAARELREETERAIAARQ 570
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2496 TERQRQLE-MSAEAERlRLRVAEMSRAQARAEedARRFRKQAEDIGERLyRTELAtqekvmlvQTLETQRQQSDRDAERL 2574
Cdd:NF041483   571 AEAAEELTrLHTEAEE-RLTAAEEALADARAE--AERIRREAAEETERL-RTEAA--------ERIRTLQAQAEQEAERL 638
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2575 R-EAIAELEHEkdKLKQEAQLLQLKSEEMQtvRQEQLLQETQALQQSFLSEKDSLLQRercIEQEKAKleqlfqdevaka 2653
Cdd:NF041483   639 RtEAAADASAA--RAEGENVAVRLRSEAAA--EAERLKSEAQESADRVRAEAAAAAER---VGTEAAE------------ 699
                          730       740       750       760       770       780       790
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2654 qalreeqqrqqqqmqqekqQLAASMEEARRRQHEAEEGVRRQQEEL-QRLAQQQQQQEKLLAEENQRLRE 2722
Cdd:NF041483   700 -------------------ALAAAQEEAARRRREAEETLGSARAEAdQERERAREQSEELLASARKRVEE 750
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1076-1646 1.35e-05

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 51.66  E-value: 1.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1076 DRLQAEREYGSCSRHYQQLLQSLEQGEQEESrcqrcISELKDIRL-QLEACETRTVHRLRlpldkepaRECAQRITEQQK 1154
Cdd:pfam15921  364 ERDQFSQESGNLDDQLQKLLADLHKREKELS-----LEKEQNKRLwDRDTGNSITIDHLR--------RELDDRNMEVQR 430
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1155 AQAEVDGL-----GKGVARLSAEAEKVLALPEPSPAAPTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEE 1229
Cdd:pfam15921  431 LEALLKAMksecqGQMERQMAAIQGKNESLEKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKER 510
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1230 VlraheeqlkeaqavpatlpeLEATKAALKKLRAQAEAQQPVFDALRDE---LRGAQEVGERLQQRHGERDVEVERWRER 1306
Cdd:pfam15921  511 A--------------------IEATNAEITKLRSRVDLKLQELQHLKNEgdhLRNVQTECEALKLQMAEKDKVIEILRQQ 570
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1307 VTLLLE-------RWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLR--DAKQRQEQIQAVPLANSQAvrEQLRQE 1377
Cdd:pfam15921  571 IENMTQlvgqhgrTAGAMQVEKAQLEKEINDRRLELQEFKILKDKKDAKIRelEARVSDLELEKVKLVNAGS--ERLRAV 648
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1378 KALLEDIERHGEKVEECQrfaKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQsgsesiiqeyvdLRTRYSELSTLTS 1457
Cdd:pfam15921  649 KDIKQERDQLLNEVKTSR---NELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQ------------LKSAQSELEQTRN 713
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1458 qyirfiseTLRRMEEEERLAeqqraeererlaeVEAALEKQRQLAEAHAQAKAqaereaqglqrrMQEEVARREEVAVEA 1537
Cdd:pfam15921  714 --------TLKSMEGSDGHA-------------MKVAMGMQKQITAKRGQIDA------------LQSKIQFLEEAMTNA 760
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1538 QEQKRSIQEELQHLRQsseaeiqakarqveaaersrlrieeeirvvRLQLEATERQRggAEGELQALRA---RAEEAEAQ 1614
Cdd:pfam15921  761 NKEKHFLKEEKNKLSQ------------------------------ELSTVATEKNK--MAGELEVLRSqerRLKEKVAN 808
                          570       580       590
                   ....*....|....*....|....*....|..
gi 1920237946 1615 KRQAQEEAERLRRQVQDETQRKRQAEAELALR 1646
Cdd:pfam15921  809 MEVALDKASLQFAECQDIIQRQEQESVRLKLQ 840
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
1395-1726 1.97e-05

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 51.17  E-value: 1.97e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1395 QRFAKQYINAIKDYelqlvtYKAQLEPVASPAKKPKvQSGSESIIQEYVDLRTRY-SELSTLTSQyirfiSETLRRMEEE 1473
Cdd:NF033838    53 NESQKEHAKEVESH------LEKILSEIQKSLDKRK-HTQNVALNKKLSDIKTEYlYELNVLKEK-----SEAELTSKTK 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1474 ERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQ----RRMQEEVArreEVAVEAQEQKRSIQEELQ 1549
Cdd:NF033838   121 KELDAAFEQFKKDTLEPGKKVAEATKKVEEAEKKAKDQKEEDRRNYPtntyKTLELEIA---ESDVEVKKAELELVKEEA 197
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1550 HLRQSSEAEIQAKAR-QVEAAERSRLrieEEIRVVRLQLEATERQRGGAE-GELQALRARAEEAEAQKRQA--------- 1618
Cdd:NF033838   198 KEPRDEEKIKQAKAKvESKKAEATRL---EKIKTDREKAEEEAKRRADAKlKEAVEKNVATSEQDKPKRRAkrgvlgepa 274
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1619 -----QEEAERLRRQVQDET-------QRKRQAEAE-LALRVQAEAEAAREKQR---ALQALEELRLQAEEAERRLRQAE 1682
Cdd:NF033838   275 tpdkkENDAKSSDSSVGEETlpspslkPEKKVAEAEkKVEEAKKKAKDQKEEDRrnyPTNTYKTLELEIAESDVKVKEAE 354
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 1920237946 1683 A----ERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEH 1726
Cdd:NF033838   355 LelvkEEAKEPRNEEKIKQAKAKVESKKAEATRLEKIKTDRKKAEEEA 402
PLEC smart00250
Plectin repeat;
3923-3959 2.18e-05

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 44.01  E-value: 2.18e-05
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  3923 LRLLDAQLATGGIVDPRLGFHLPLDVAYQRGYLDKDT 3959
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
MARTX_Nterm NF012221
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model ...
2283-2488 2.37e-05

MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model describes the N-terminal 1900 amino acids of MARTX family multifunctional-autoprocessing repeats-in-toxin holotoxins, which contain both repeat regions that facilitate their entry into eukaryotic target cells, and multiple effector domains.


Pssm-ID: 467957 [Multi-domain]  Cd Length: 1848  Bit Score: 50.99  E-value: 2.37e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2283 LRQKQAADA-----EMEKHKQFAEQALRQKaqveqeltalrlQLEETDH---------QKSILDEELQRLKAEVTEAARQ 2348
Cdd:NF012221  1561 LADKERAEAdrqrlEQEKQQQLAAISGSQS------------QLESTDQnaletngqaQRDAILEESRAVTKELTTLAQG 1628
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2349 RGQVEEElfslRVQMEELGKlKARIEAENRAL--VLRDKDSAQRLLQEEAEKMKQ--------VAEEAARLSVAAQEAAR 2418
Cdd:NF012221  1629 LDALDSQ----ATYAGESGD-QWRNPFAGGLLdrVQEQLDDAKKISGKQLADAKQrhvdnqqkVKDAVAKSEAGVAQGEQ 1703
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2419 LRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQ 2488
Cdd:NF012221  1704 NQANAEQDIDDAKADAEKRKDDALAKQNEAQQAESDANAAANDAQSRGEQDASAAENKANQAQADAKGAK 1773
PLEC smart00250
Plectin repeat;
3476-3511 4.24e-05

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 43.24  E-value: 4.24e-05
                            10        20        30
                    ....*....|....*....|....*....|....*.
gi 1920237946  3476 LLQGSGCLAGIYLEDSKEKVTIYEAMRRGLLRPSTA 3511
Cdd:smart00250    3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3476-3514 4.34e-05

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 43.47  E-value: 4.34e-05
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3476 LLQGSGCLAGIYLEDSKEKVTIYEAMRRGLLRPSTATLL 3514
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
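
Each hit carries a one-line summary of the form `Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...`. As a hedged sketch, assuming the field order and spacing seen in this listing, the line could be parsed as follows. The names `HitSummary` and `parse_summary` are illustrative and do not belong to any NCBI tool.

```python
import re
from typing import NamedTuple, Optional


class HitSummary(NamedTuple):
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Pattern assumed from the summary lines in this listing; the
# "[Multi-domain]" flag is optional.
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s+Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)


def parse_summary(line: str) -> Optional[HitSummary]:
    """Parse one summary line; return None if the line does not match."""
    m = _SUMMARY_RE.search(line)
    if m is None:
        return None
    return HitSummary(
        pssm_id=int(m.group(1)),
        multi_domain=m.group(2) is not None,
        cd_length=int(m.group(3)),
        bit_score=float(m.group(4)),
        e_value=float(m.group(5)),
    )


hit = parse_summary(
    "Pssm-ID: 459901  Cd Length: 39  Bit Score: 43.47  E-value: 4.34e-05"
)
print(hit)
```

Returning `None` for non-matching lines lets a caller scan the whole listing and keep only the summary rows.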
PLEC smart00250
Plectin repeat;
4262-4290 4.68e-05

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 43.24  E-value: 4.68e-05
                            10        20
                    ....*....|....*....|....*....
gi 1920237946  4262 VRKRRVVIVDPETGKEMSVYEAYRKGLID 4290
Cdd:smart00250    6 AQSAIGGIIDPETGQKLSVEEALRRGLID 34
PLEC smart00250
Plectin repeat;
3257-3293 5.98e-05

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 42.85  E-value: 5.98e-05
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  3257 LRLLDAQLSTGGIVDPSKSHRVPLDVACARGYLDKET 3293
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
GBP_C cd16269
Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal ...
2393-2491 9.15e-05

Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal domain. Guanylate-binding proteins (GBPs) are synthesized after activation of the cell by interferons. The biochemical properties of GBPs are clearly different from those of Ras-like and heterotrimeric GTP-binding proteins. They bind guanine nucleotides with low affinity (micromolar range), are stable in their absence, and have high-turnover GTPase activity. In addition to binding GDP/GTP, they have the unique ability to bind GMP with equal affinity and hydrolyze GTP not only to GDP, but also to GMP. This C-terminal domain has been shown to mediate inhibition of endothelial cell proliferation by inflammatory cytokines.


Pssm-ID: 293879 [Multi-domain]  Cd Length: 291  Bit Score: 47.96  E-value: 9.15e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2393 QEEAEKMKQVAEEAARLSVAAQEAARL----RQLAEEDLAQQRALAE--KMLKEKMQavQEATRLKAEAELLQQQKElaQ 2466
Cdd:cd16269    191 QALTEKEKEIEAERAKAEAAEQERKLLeeqqRELEQKLEDQERSYEEhlRQLKEKME--EERENLLKEQERALESKL--K 266
                           90       100
                   ....*....|....*....|....*
gi 1920237946 2467 EQARRLQEDKEQMAQQLAQETQGFQ 2491
Cdd:cd16269    267 EQEALLEEGFKEQAELLQEEIRSLK 291
PLEC smart00250
Plectin repeat;
3220-3256 1.31e-04

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 42.08  E-value: 1.31e-04
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  3220 KLLSAEKAVTGYKDPYSGQSVSLFQALKKGLIPREQG 3256
Cdd:smart00250    2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
PLEC smart00250
Plectin repeat;
4092-4126 1.45e-04

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 41.70  E-value: 1.45e-04
                            10        20        30
                    ....*....|....*....|....*....|....*
gi 1920237946  4092 LLEAQAATGYVIDPIKGLKLTVEEAVRMGIVGPEF 4126
Cdd:smart00250    3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
PLEC smart00250
Plectin repeat;
4401-4434 1.96e-04

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 41.31  E-value: 1.96e-04
                            10        20        30
                    ....*....|....*....|....*....|....
gi 1920237946  4401 EETGPVAGILDTETLEKVSITEAMHRNLVDNITG 4434
Cdd:smart00250    5 EAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
MARTX_Nterm NF012221
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model ...
1804-2016 3.11e-04

MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model describes the N-terminal 1900 amino acids of MARTX family multifunctional-autoprocessing repeats-in-toxin holotoxins, which contain both repeat regions that facilitate their entry into eukaryotic target cells, and multiple effector domains.


Pssm-ID: 467957 [Multi-domain]  Cd Length: 1848  Bit Score: 47.52  E-value: 3.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1804 KAEEQAVRQRELAEQ-----ELEKQRQLAEGTAQQrlaAEQELIRLRAETEQGEQQRQLLEEElarlqreaaaatqkRRE 1878
Cdd:NF012221  1555 DAAQNALADKERAEAdrqrlEQEKQQQLAAISGSQ---SQLESTDQNALETNGQAQRDAILEE--------------SRA 1617
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1879 LEAELAKVRAEMEVLLAS-------------------KARAEEESRSTSEKSKQRLEAEAGRF----RELAEEAARLRAL 1935
Cdd:NF012221  1618 VTKELTTLAQGLDALDSQatyagesgdqwrnpfagglLDRVQEQLDDAKKISGKQLADAKQRHvdnqQKVKDAVAKSEAG 1697
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1936 AEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRlLEEQAAQHKA 2015
Cdd:NF012221  1698 VAQGEQNQANAEQDIDDAKADAEKRKDDALAKQNEAQQAESDANAAANDAQSRGEQDASAAENKANQAQ-ADAKGAKQDE 1776

                   .
gi 1920237946 2016 D 2016
Cdd:NF012221  1777 S 1777
PLEC smart00250
Plectin repeat;
3143-3180 3.58e-04

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 40.54  E-value: 3.58e-04
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1920237946  3143 RRALRGSGVIAGVWLEEAGQKLSIYEALRKDLLQPEAA 3180
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
PLEC smart00250
Plectin repeat;
3551-3587 3.91e-04

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 40.54  E-value: 3.91e-04
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  3551 KLLSAEKAVTGYRDPYSGSTISLFQAMKKGLVLREHG 3587
Cdd:smart00250    2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
PLEC smart00250
Plectin repeat;
3512-3547 5.04e-04

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 40.16  E-value: 5.04e-04
                            10        20        30
                    ....*....|....*....|....*....|....*.
gi 1920237946  3512 TLLLEAQAATGFLVDPVRNQRLYVHEAVKAGVVGPE 3547
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE 36
PLEC smart00250
Plectin repeat;
3183-3216 5.35e-04

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 40.16  E-value: 5.35e-04
                            10        20        30
                    ....*....|....*....|....*....|....
gi 1920237946  3183 LLEAQAGTGHIIDPTTSARLTVDEAVRAGLVGPE 3216
Cdd:smart00250    3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE 36
MARTX_Nterm NF012221
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model ...
2410-2688 6.35e-04

MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model describes the N-terminal 1900 amino acids of MARTX family multifunctional-autoprocessing repeats-in-toxin holotoxins, which contain both repeat regions that facilitate their entry into eukaryotic target cells, and multiple effector domains.


Pssm-ID: 467957 [Multi-domain]  Cd Length: 1848  Bit Score: 46.37  E-value: 6.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2410 SVAAQEAARLRQLAEEDLAQQRALAEKmlkekmqavqeatrlkaeaellqqqkELAQEQARRLQEDKeqmAQQLAqETQG 2489
Cdd:NF012221  1538 SESSQQADAVSKHAKQDDAAQNALADK--------------------------ERAEADRQRLEQEK---QQQLA-AISG 1587
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2490 FQKTLETERQRQLEMSAEAERlrlrvaemsraqARAEEDARRFRKQAEDIGERLyrtelatqekvmlvQTLETQRQQSDR 2569
Cdd:NF012221  1588 SQSQLESTDQNALETNGQAQR------------DAILEESRAVTKELTTLAQGL--------------DALDSQATYAGE 1641
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2570 DAERLREAIAE--LEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSllqrerciEQEKAKLEQLFQ 2647
Cdd:NF012221  1642 SGDQWRNPFAGglLDRVQEQLDDAKKISGKQLADAKQRHVDNQQKVKDAVAKSEAGVAQG--------EQNQANAEQDID 1713
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|..
gi 1920237946 2648 DEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRR-QHEA 2688
Cdd:NF012221  1714 DAKADAEKRKDDALAKQNEAQQAESDANAAANDAQSRgEQDA 1755
SPEC smart00150
Spectrin repeats;
745-837 7.39e-04

Spectrin repeats;


Pssm-ID: 197544 [Multi-domain]  Cd Length: 101  Bit Score: 41.93  E-value: 7.39e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946   745 HGFVAAATKELMWLSDREEEEVGFDWSDRNTNMAAKKEGYSALMHELELKEKKIKEIQSTGDRLLREDHPARPTAESFQA 824
Cdd:smart00150    1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLIEEGHPDAEEIEERLE 80
                            90
                    ....*....|...
gi 1920237946   825 ALQTQWSWMLQLC 837
Cdd:smart00150   81 ELNERWEELKELA 93
GBP_C cd16269
Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal ...
1478-1605 7.82e-04

Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal domain. Guanylate-binding proteins (GBPs) are synthesized after activation of the cell by interferons. The biochemical properties of GBPs are clearly different from those of Ras-like and heterotrimeric GTP-binding proteins. They bind guanine nucleotides with low affinity (micromolar range), are stable in their absence, and have high-turnover GTPase activity. In addition to binding GDP/GTP, they have the unique ability to bind GMP with equal affinity and hydrolyze GTP not only to GDP, but also to GMP. This C-terminal domain has been shown to mediate inhibition of endothelial cell proliferation by inflammatory cytokines.


Pssm-ID: 293879 [Multi-domain]  Cd Length: 291  Bit Score: 44.88  E-value: 7.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1478 EQQRAEERERLAEVEAALEKQRQLAEAHAQAKAqAEREaqglqRRMQEEVARREEVavEAQEQKRSIQEELQHLRQSSEA 1557
Cdd:cd16269    177 QSKEAEAEAILQADQALTEKEKEIEAERAKAEA-AEQE-----RKLLEEQQRELEQ--KLEDQERSYEEHLRQLKEKMEE 248
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1920237946 1558 EIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRggaegELQALR 1605
Cdd:cd16269    249 ERENLLKEQERALESKLKEQEALLEEGFKEQAELLQE-----EIRSLK 291
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
1803-2185 9.24e-04

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 45.39  E-value: 9.24e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1803 GKAEEQAVRQRELAEQELEK-----QRQLAEGTAQQRLAAEQELIRLRAE--TEQGEQQRQLLEEELARLQREAAAATQK 1875
Cdd:NF033838    50 SSGNESQKEHAKEVESHLEKilseiQKSLDKRKHTQNVALNKKLSDIKTEylYELNVLKEKSEAELTSKTKKELDAAFEQ 129
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1876 RRELEAELAKVRAEMEVLLA-----SKARAEEESRSTSEKSKQRLEAEAGRFrELAEEAARLRALAEEAKRQRqlaEEDA 1950
Cdd:NF033838   130 FKKDTLEPGKKVAEATKKVEeaekkAKDQKEEDRRNYPTNTYKTLELEIAES-DVEVKKAELELVKEEAKEPR---DEEK 205
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1951 VRQrAEAErVLAEKlaaiSEATRLKteaEIALKEKEAENERLRRLaedEAFQRRLLEEQAAQHKADIEARLAqlRKASES 2030
Cdd:NF033838   206 IKQ-AKAK-VESKK----AEATRLE---KIKTDREKAEEEAKRRA---DAKLKEAVEKNVATSEQDKPKRRA--KRGVLG 271
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2031 ELERQKGLVEDTLRQRRQVEEEIL---ALKGSFEKAAAGKAELELElGRIRGTAEDTLR-----SKEQAEQEAARqrqla 2102
Cdd:NF033838   272 EPATPDKKENDAKSSDSSVGEETLpspSLKPEKKVAEAEKKVEEAK-KKAKDQKEEDRRnyptnTYKTLELEIAE----- 345
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2103 aeEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKA---KVEEARRLRERAEQESARQLQLAQEAAQKrlQAEEKA 2179
Cdd:NF033838   346 --SDVKVKEAELELVKEEAKEPRNEEKIKQAKAKVESKKAeatRLEKIKTDRKKAEEEAKRKAAEEDKVKEK--PAEQPQ 421

                   ....*.
gi 1920237946 2180 HAFAVQ 2185
Cdd:NF033838   422 PAPAPQ 427
PLEC smart00250
Plectin repeat;
4511-4548 9.46e-04

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 39.39  E-value: 9.46e-04
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1920237946  4511 QRFLEVQYLTGGLIEPDTPGRVALDEALQRGTVDARTA 4548
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
PLEC smart00250
Plectin repeat;
2892-2928 1.32e-03

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 39.00  E-value: 1.32e-03
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  2892 KLLSAERAVTGYKDPYTGEQISLFQAMKKDLIVREHG 2928
Cdd:smart00250    2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
SPEC smart00150
Spectrin repeats;
648-742 1.69e-03

Spectrin repeats;


Pssm-ID: 197544 [Multi-domain]  Cd Length: 101  Bit Score: 40.78  E-value: 1.69e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946   648 LRYLQDLLAWVEENQRRLDSAEWGVDLPSVEAQLGSHRGLHQSVEEFRTKIERARTDEGQL---SPATRGAYRDCLGRLD 724
Cdd:smart00150    4 LRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLieeGHPDAEEIEERLEELN 83
                            90
                    ....*....|....*...
gi 1920237946   725 LQYAKLLSSSKARLRSLE 742
Cdd:smart00150   84 ERWEELKELAEERRQKLE 101
PLEC smart00250
Plectin repeat;
2855-2888 3.30e-03

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 37.85  E-value: 3.30e-03
                            10        20        30
                    ....*....|....*....|....*....|....
gi 1920237946  2855 LLEAQAASGFLLDPVRNRRLAVNEAVKEGIVGPE 2888
Cdd:smart00250    3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE 36
PLEC smart00250
Plectin repeat;
3886-3917 3.30e-03

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 37.85  E-value: 3.30e-03
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1920237946  3886 RLLSAERAVTGYRDPYTEQTISLFQAMKKDLI 3917
Cdd:smart00250    2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLI 33
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3811-3849 6.52e-03

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 37.31  E-value: 6.52e-03
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3811 YLYGTGCVAGIYRPGSRQTLTIYQALKKGQLSAEVARQL 3849
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
PLEC smart00250
Plectin repeat;
3810-3846 8.01e-03

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 37.08  E-value: 8.01e-03
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  3810 RYLYGTGCVAGIYRPGSRQTLTIYQALKKGQLSAEVA 3846
Cdd:smart00250    2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
 
Name Accession Description Interval E-value
CH_PLEC-like_rpt1 cd21188
first calponin homology (CH) domain found in the plectin/dystonin/MACF1 family; This family ...
183-287 2.52e-76

first calponin homology (CH) domain found in the plectin/dystonin/MACF1 family; This family includes plectin, dystonin and microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 (MACF1). Plectin, also called PCN, PLTN, hemidesmosomal protein 1 (HD1), or plectin-1, is a structural component of muscle. It interlinks intermediate filaments with microtubules and microfilaments, and anchors intermediate filaments to desmosomes or hemidesmosomes. It could also bind muscle proteins such as actin to membrane complexes in muscle. Dystonin, also called 230 kDa bullous pemphigoid antigen, 230/240 kDa bullous pemphigoid antigen, bullous pemphigoid antigen 1 (BPA or BPAG1), dystonia musculorum protein, or hemidesmosomal plaque protein, is a cytoskeletal linker protein that acts as an integrator of intermediate filaments, actin, and microtubule cytoskeleton networks. It is required for anchoring either intermediate filaments to the actin cytoskeleton in neural and muscle cells, or keratin-containing intermediate filaments to hemidesmosomes in epithelial cells. MACF1, also called 620 kDa actin-binding protein (ABP620), actin cross-linking family protein 7 (ACF7), macrophin-1, or trabeculin-alpha, is a large protein containing numerous spectrin and leucine-rich repeat (LRR) domains. It facilitates actin-microtubule interactions at the cell periphery and couples the microtubule network to cellular junctions. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409037  Cd Length: 105  Bit Score: 248.86  E-value: 2.52e-76
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  183 DRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRHRQVKLVNI 262
Cdd:cd21188      1 DAVQKKTFTKWVNKHLIKARRRVVDLFEDLRDGHNLISLLEVLSGESLPRERGRMRFHRLQNVQTALDFLKYRKIKLVNI 80
                           90       100
                   ....*....|....*....|....*
gi 1920237946  263 RNDDIADGNPKLTLGLIWTIILHFQ 287
Cdd:cd21188     81 RAEDIVDGNPKLTLGLIWTIILHFQ 105
CH_PLEC_rpt1 cd21235
first calponin homology (CH) domain found in plectin and similar proteins; Plectin, also ...
180-298 5.94e-74

first calponin homology (CH) domain found in plectin and similar proteins; Plectin, also called PCN, PLTN, hemidesmosomal protein 1 (HD1), or plectin-1, is a structural component of muscle. It interlinks intermediate filaments with microtubules and microfilaments, and anchors intermediate filaments to desmosomes or hemidesmosomes. It can also bind muscle proteins such as actin to membrane complexes in muscle. Plectin contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409084  Cd Length: 119  Bit Score: 242.62  E-value: 5.94e-74
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  180 DERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRHRQVKL 259
Cdd:cd21235      1 DERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRHRQVKL 80
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1920237946  260 VNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVSGQSE 298
Cdd:cd21235     81 VNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVSGQSE 119
CH_DYST_rpt1 cd21236
first calponin homology (CH) domain found in dystonin and similar proteins; Dystonin, also ...
178-296 2.90e-72

first calponin homology (CH) domain found in dystonin and similar proteins; Dystonin, also called 230 kDa bullous pemphigoid antigen, 230/240 kDa bullous pemphigoid antigen, bullous pemphigoid antigen 1 (BPA or BPAG1), dystonia musculorum protein, or hemidesmosomal plaque protein, is a cytoskeletal linker protein that acts as an integrator of intermediate filaments, actin, and microtubule cytoskeleton networks. It is required for anchoring either intermediate filaments to the actin cytoskeleton in neural and muscle cells, or keratin-containing intermediate filaments to hemidesmosomes in epithelial cells. Dystonin contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409085  Cd Length: 128  Bit Score: 238.35  E-value: 2.90e-72
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  178 AADERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRHRQV 257
Cdd:cd21236     10 YKDERDKVQKKTFTKWINQHLMKVRKHVNDLYEDLRDGHNLISLLEVLSGDTLPREKGRMRFHRLQNVQIALDYLKRRQV 89
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1920237946  258 KLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVSGQ 296
Cdd:cd21236     90 KLVNIRNDDITDGNPKLTLGLIWTIILHFQISDIHVTGE 128
CH_PLEC_rpt2 cd21238
second calponin homology (CH) domain found in plectin and similar proteins; Plectin, also ...
300-405 1.17e-69

second calponin homology (CH) domain found in plectin and similar proteins; Plectin, also called PCN, PLTN, hemidesmosomal protein 1 (HD1), or plectin-1, is a structural component of muscle. It interlinks intermediate filaments with microtubules and microfilaments and anchors intermediate filaments to desmosomes or hemidesmosomes. It can also bind muscle proteins such as actin to membrane complexes in muscle. Plectin contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409087  Cd Length: 106  Bit Score: 229.91  E-value: 1.17e-69
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  300 MTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 379
Cdd:cd21238      1 MTAKEKLLLWSQRMVEGYQGLRCDNFTSSWRDGRLFNAIIHRHKPMLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 80
                           90       100
                   ....*....|....*....|....*.
gi 1920237946  380 PEDVDVPQPDEKSIITYVSSLYDAMP 405
Cdd:cd21238     81 PEDVDVPQPDEKSIITYVSSLYDAMP 106
CH_PLEC-like_rpt2 cd21189
second calponin homology (CH) domain found in the plectin/dystonin/MACF1 family; This family ...
301-405 4.00e-64

second calponin homology (CH) domain found in the plectin/dystonin/MACF1 family; This family includes plectin, dystonin and microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 (MACF1). Plectin, also called PCN, PLTN, hemidesmosomal protein 1 (HD1), or plectin-1, is a structural component of muscle. It interlinks intermediate filaments with microtubules and microfilaments, and anchors intermediate filaments to desmosomes or hemidesmosomes. It could also bind muscle proteins such as actin to membrane complexes in muscle. Dystonin, also called 230 kDa bullous pemphigoid antigen, 230/240 kDa bullous pemphigoid antigen, bullous pemphigoid antigen 1 (BPA or BPAG1), dystonia musculorum protein, or hemidesmosomal plaque protein, is a cytoskeletal linker protein that acts as an integrator of intermediate filaments, actin, and microtubule cytoskeleton networks. It is required for anchoring either intermediate filaments to the actin cytoskeleton in neural and muscle cells, or keratin-containing intermediate filaments to hemidesmosomes in epithelial cells. MACF1, also called 620 kDa actin-binding protein (ABP620), actin cross-linking family protein 7 (ACF7), macrophin-1, or trabeculin-alpha, is a large protein containing numerous spectrin and leucine-rich repeat (LRR) domains. It facilitates actin-microtubule interactions at the cell periphery and couples the microtubule network to cellular junctions. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409038  Cd Length: 105  Bit Score: 213.79  E-value: 4.00e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  301 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 380
Cdd:cd21189      1 SAKEALLLWARRTTEGYPGVRVTNFTSSWRDGLAFNAIIHRNRPDLIDFRSVRNQSNRENLENAFNVAEKEFGVTRLLDP 80
                           90       100
                   ....*....|....*....|....*
gi 1920237946  381 EDVDVPQPDEKSIITYVSSLYDAMP 405
Cdd:cd21189     81 EDVDVPEPDEKSIITYVSSLYDVFP 105
CH_MACF1_rpt1 cd21237
first calponin homology (CH) domain found in microtubule-actin cross-linking factor 1, ...
180-297 1.97e-63

first calponin homology (CH) domain found in microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 (MACF1) and similar proteins; MACF1, also called 620 kDa actin-binding protein (ABP620), actin cross-linking family protein 7 (ACF7), macrophin-1, or trabeculin-alpha, is a large protein containing numerous spectrin and leucine-rich repeat (LRR) domains. It facilitates actin-microtubule interactions at the cell periphery and couples the microtubule network to cellular junctions. MACF1 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409086  Cd Length: 118  Bit Score: 212.59  E-value: 1.97e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  180 DERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRHRQVKL 259
Cdd:cd21237      1 DERDRVQKKTFTKWVNKHLMKVRKHINDLYEDLRDGHNLISLLEVLSGVKLPREKGRMRFHRLQNVQIALDFLKQRQVKL 80
                           90       100       110
                   ....*....|....*....|....*....|....*...
gi 1920237946  260 VNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVSGQS 297
Cdd:cd21237     81 VNIRNDDITDGNPKLTLGLIWTIILHFQISDIYISGES 118
CH_DYST_rpt2 cd21239
second calponin homology (CH) domain found in dystonin and similar proteins; Dystonin, also ...
301-405 1.47e-56

second calponin homology (CH) domain found in dystonin and similar proteins; Dystonin, also called 230 kDa bullous pemphigoid antigen, 230/240 kDa bullous pemphigoid antigen, bullous pemphigoid antigen 1 (BPA or BPAG1), dystonia musculorum protein, or hemidesmosomal plaque protein, is a cytoskeletal linker protein that acts as an integrator of intermediate filaments, actin, and microtubule cytoskeleton networks. It is required for anchoring either intermediate filaments to the actin cytoskeleton in neural and muscle cells, or keratin-containing intermediate filaments to hemidesmosomes in epithelial cells. Dystonin contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409088  Cd Length: 104  Bit Score: 192.51  E-value: 1.47e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  301 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLLDP 380
Cdd:cd21239      1 SAKERLLLWSQQMTEGYTGIRCENFTTCWRDGRLFNAIIHKYRPDLIDMNTVAVQSNLANLEHAFYVAEK-LGVTRLLDP 79
                           90       100
                   ....*....|....*....|....*
gi 1920237946  381 EDVDVPQPDEKSIITYVSSLYDAMP 405
Cdd:cd21239     80 EDVDVSSPDEKSVITYVSSLYDVFP 104
CH_DMD-like_rpt1 cd21186
first calponin homology (CH) domain found in the dystrophin family; The dystrophin family ...
185-288 1.23e-50

first calponin homology (CH) domain found in the dystrophin family; The dystrophin family includes dystrophin and its paralog, utrophin. Dystrophin, encoded by the DMD gene, is a large, submembrane cytoskeletal protein that is the main component of the dystrophin-glycoprotein complex (DGC) in skeletal muscles. It links the transmembrane DGC to the actin cytoskeleton through binding strongly to the cytoplasmic tail of beta-dystroglycan, the transmembrane subunit of a highly O-glycosylated cell-surface protein. Dystrophin is also involved in maintaining the structural integrity of cells, as well as in the formation of the blood-brain barrier (BBB). Utrophin, also called dystrophin-related protein 1 (DRP-1), is an autosomal dystrophin homolog that increases dystrophic muscle function and reduces pathology. It is broadly expressed at both the mRNA and protein levels, and occurs in the cerebrovascular endothelium. Utrophin forms the utrophin-glycoprotein complex (UGC) by interacting with dystroglycans (DGs) and sarcoglycan-dystroglycans, as well as sarcoglycan and sarcospan (SG-SSPN) subcomplexes. It may act as a scaffolding protein that stabilizes lipid microdomains and clusters mechanosensitive channel subunits, and links the F-actin cytoskeleton to the cell membrane via the associated glycoprotein complex. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409035  Cd Length: 107  Bit Score: 175.65  E-value: 1.23e-50
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  185 VQKKTFTKWVNKHLIKAQR-HISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRHRQVKLVNIR 263
Cdd:cd21186      2 VQKKTFTKWINSQLSKANKpPIKDLFEDLRDGTRLLALLEVLTGKKLKPEKGRMRVHHLNNVNRALQVLEQNNVKLVNIS 81
                           90       100
                   ....*....|....*....|....*
gi 1920237946  264 NDDIADGNPKLTLGLIWTIILHFQI 288
Cdd:cd21186     82 SNDIVDGNPKLTLGLVWSIILHWQV 106
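
Every hit in this listing has a one-line summary of the form `Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...`, optionally tagged `[Multi-domain]`. The sketch below, which assumes that exact field order and spacing tolerance, pulls the fields into a dictionary. `parse_hit_summary` is a hypothetical helper name introduced for illustration.

```python
import re

# Assumed field order, as seen in the summary lines of this listing.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?:\[(?P<tag>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<evalue>\S+)"
)


def parse_hit_summary(line):
    """Return the summary fields of one domain hit as a dict."""
    match = SUMMARY.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "tag": match["tag"],  # None when no bracketed tag is present
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "evalue": float(match["evalue"]),
    }
```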
CH_MACF1_rpt2 cd21240
second calponin homology (CH) domain found in microtubule-actin cross-linking factor 1, ...
299-405 5.06e-50

second calponin homology (CH) domain found in microtubule-actin cross-linking factor 1, isoforms 1/2/3/5 (MACF1) and similar proteins; MACF1, also called 620 kDa actin-binding protein (ABP620), actin cross-linking family protein 7 (ACF7), macrophin-1, or trabeculin-alpha, is a large protein containing numerous spectrin and leucine-rich repeat (LRR) domains. It facilitates actin-microtubule interactions at the cell periphery and couples the microtubule network to cellular junctions. MACF1 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409089  Cd Length: 107  Bit Score: 173.69  E-value: 5.06e-50
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  299 DMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLL 378
Cdd:cd21240      2 DMSAKEKLLLWTQKVTAGYTGIKCTNFSSCWSDGKMFNALIHRYRPDLVDMERVQIQSNRENLEQAFEVAER-LGVTRLL 80
                           90       100
                   ....*....|....*....|....*..
gi 1920237946  379 DPEDVDVPQPDEKSIITYVSSLYDAMP 405
Cdd:cd21240     81 DAEDVDVPSPDEKSVITYVSSIYDAFP 107
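
The paired rows above can be compared column by column. The following sketch estimates the fraction of identical residues over aligned columns, skipping any column that holds a gap in either row. It assumes that lowercase letters mark residues outside the aligned model positions and therefore compares case-insensitively; `aligned_identity` is a hypothetical helper and its result is not a score reported by CD-Search.

```python
def aligned_identity(query, subject):
    """Fraction of identical residues over columns with no gap in either row."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same number of columns")
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            continue  # gap column: not counted as aligned
        aligned += 1
        if q.upper() == s.upper():
            identical += 1
    if aligned == 0:
        raise ValueError("no aligned columns")
    return identical / aligned


# Usage with the final block of the cd21240 alignment:
fraction = aligned_identity(
    "DPEDVDVPQPDEKSIITYVSSLYDAMP",
    "DAEDVDVPSPDEKSVITYVSSIYDAFP",
)
```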
CH_SPTB-like_rpt1 cd21246
first calponin homology (CH) domain found in the beta-I spectrin-like subfamily; The beta-I ...
179-284 2.83e-48

first calponin homology (CH) domain found in the beta-I spectrin-like subfamily; The beta-I spectrin-like family includes beta-I, -II, -III and -IV spectrins. Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. Beta-I spectrin, also called spectrin beta chain, erythrocytic (SPTB), may be involved in anaemia pathogenesis. Beta-II spectrin, also called spectrin beta chain, non-erythrocytic 1 (SPTBN1), or fodrin beta chain, is a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. Beta-III spectrin, also called spectrin beta chain, non-erythrocytic 2 (SPTBN2), or spinocerebellar ataxia 5 protein (SCA5), may play a crucial role as a longer actin-membrane cross-linker or fulfill the need for greater extensible flexibility than can be provided by the other smaller conventional spectrins. Beta-IV spectrin is also called spectrin, non-erythroid beta chain 3 (SPTBN3) or spectrin beta chain, non-erythrocytic 4 (SPTBN4). Its mutation associates with congenital myopathy, neuropathy, and central deafness. Members of this subfamily contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409095  Cd Length: 117  Bit Score: 169.08  E-value: 2.83e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  179 ADERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR-EKGRMRFHKLQNVQIALDYLRHRQV 257
Cdd:cd21246     10 ADEREAVQKKTFTKWVNSHLARVGCRINDLYTDLRDGRMLIKLLEVLSGERLPKpTKGKMRIHCLENVDKALQFLKEQRV 89
                           90       100
                   ....*....|....*....|....*..
gi 1920237946  258 KLVNIRNDDIADGNPKLTLGLIWTIIL 284
Cdd:cd21246     90 HLENMGSHDIVDGNHRLTLGLIWTIIL 116
SAC6 COG5069
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton];
179-512 2.47e-45



Pssm-ID: 227401 [Multi-domain]  Cd Length: 612  Bit Score: 176.28  E-value: 2.47e-45
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  179 ADERDRVQKKTFTKWVNKHLIKA-QRHISDLYEDLRDGHNLISLLEVLSGDSLPR--EKGRMRFHKLQNVQIALDYLRHR 255
Cdd:COG5069      3 AKKWQKVQKKTFTKWTNEKLISGgQKEFGDLDTDLKDGVKLAQLLEALQKDNAGEynETPETRIHVMENVSGRLEFIKGK 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  256 QVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQvsgQSEDMTAKEKLLLWSQRMVEGCQ-GLRCDNFTTSWRDGRL 334
Cdd:COG5069     83 GVKLFNIGPQDIVDGNPKLILGLIWSLISRLTIATIN---EEGELTKHINLLLWCDEDTGGYKpEVDTFDFFRSWRDGLA 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  335 FNAIIHRHKPTLIDMNKVYRQTNLE--NLDQAFSVAERDLGVTRLLDPEDV-DVPQPDEKSIITYVS------SLYD--- 402
Cdd:COG5069    160 FSALIHDSRPDTLDPNVLDLQKKNKalNNFQAFENANKVIGIARLIGVEDIvNVSIPDERSIMTYVSwyiirfGLLEkid 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  403 -AMPRVPDVQDGVKANElQLRwQEYRELVLLLLQWIRAHTAGFEERRFPSSFEEIEILWCQFLKFKETE--LPAKEAD-K 478
Cdd:COG5069    240 iALHRVYRLLEADETLI-QLR-LPYEIILLRLLNLIHLKQANWKVVNFSKDVSDGENYTDLLNQLNALCsrAPLETTDlH 317
                          330       340       350
                   ....*....|....*....|....*....|....
gi 1920237946  479 NRSKGIYQSLEgAVQAGQLKVPPGYHPLDVEKEW 512
Cdd:COG5069    318 SLAGQILQNAE-KYDCRKYLPPAGNPKLDLAFVA 350
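
For a multi-domain model such as this one, the alignment may span only part of the model. A minimal sketch of the arithmetic, assuming the first and last `Cdd:` coordinates of the alignment and the reported `Cd Length` are used as inputs (`model_coverage` is a hypothetical helper):

```python
def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model spanned by the alignment."""
    if not 1 <= model_start <= model_end <= cd_length:
        raise ValueError("coordinates must satisfy 1 <= start <= end <= length")
    return (model_end - model_start + 1) / cd_length


# COG5069 rows above run from model position 3 to 350; Cd Length is 612.
cog5069_coverage = model_coverage(3, 350, 612)
```

Here the alignment spans 348 of 612 model positions, so a little over half of the fimbrin/plastin model is matched by the query.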
CH_beta_spectrin_rpt2 cd21194
second calponin homology (CH) domain found in the beta spectrin family; The beta spectrin ...
301-401 6.87e-45

second calponin homology (CH) domain found in the beta spectrin family; The beta spectrin family includes beta-I, -II, -III, -IV and -V spectrins. Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. Beta-I spectrin, also called spectrin beta chain, erythrocytic (SPTB), may be involved in anaemia pathogenesis. Beta-II spectrin, also called spectrin beta chain, non-erythrocytic 1 (SPTBN1), or fodrin beta chain, is a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. Beta-IV spectrin is also called spectrin, non-erythroid beta chain 3 (SPTBN3) or spectrin beta chain, non-erythrocytic 4 (SPTBN4). Its mutation associates with congenital myopathy, neuropathy, and central deafness. Beta-III spectrin is also called spectrin beta chain, non-erythrocytic 2 (SPTBN2), or spinocerebellar ataxia 5 protein (SCA5). Beta-V spectrin, also called spectrin beta chain, non-erythrocytic 5 (SPTBN5), is a mammalian ortholog of Drosophila beta H spectrin. Beta-III and Beta-V spectrins may play crucial roles as longer actin-membrane cross-linkers or fulfill the need for greater extensible flexibility than can be provided by the other smaller conventional spectrins. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409043  Cd Length: 105  Bit Score: 159.11  E-value: 6.87e-45
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  301 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 380
Cdd:cd21194      2 SAKDALLLWCQRKTAGYPGVNIQNFTTSWRDGLAFNALIHAHRPDLIDYNRLDPNDHLGNLNNAFDVAEQELGIAKLLDA 81
                           90       100
                   ....*....|....*....|.
gi 1920237946  381 EDVDVPQPDEKSIITYVSSLY 401
Cdd:cd21194     82 EDVDVARPDEKSIMTYVASYY 102
S10_plectin pfam03501
Plectin/S10 domain; This presumed domain is found at the N-terminus of some isoforms of the ...
7-99 1.03e-44

Plectin/S10 domain; This presumed domain is found at the N-terminus of some isoforms of the cytoskeletal muscle protein plectin as well as the ribosomal S10 protein. This domain may be involved in RNA binding.


Pssm-ID: 427337  Cd Length: 92  Bit Score: 158.07  E-value: 1.03e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946    7 MPLDQLRTIYEVLFREGVMVAKKDRRPrSLHPHVpGVTNLQVTRAMASLRARGLVRETFAWRHFYWYLTNEGIAHLRQYL 86
Cdd:pfam03501    1 IPKENRKAIYEYLFKEGVLVAKKDFNL-PKHPEL-NVPNLQVIKAMQSLKSRGYVKEQFAWRHYYWYLTNEGIEYLREYL 78
                           90
                   ....*....|...
gi 1920237946   87 HLPPEIVPASLQR 99
Cdd:pfam03501   79 HLPAEIVPATLKR 91
CH_SYNE1_rpt1 cd21241
first calponin homology (CH) domain found in synaptic nuclear envelope protein 1 and similar ...
181-288 2.89e-44

first calponin homology (CH) domain found in synaptic nuclear envelope protein 1 and similar proteins; Synaptic nuclear envelope protein 1 (SYNE-1), also called nesprin-1, enaptin, KASH domain-containing protein 1 (KASH1), myocyte nuclear envelope protein 1 (MYNE-1), or nuclear envelope spectrin repeat protein 1, is a multi-isomeric modular protein which forms a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. SYNE-1 also acts as a component of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. SYNE-1 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409090  Cd Length: 113  Bit Score: 157.54  E-value: 2.89e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  181 ERDRVQKKTFTKWVNKHLIKAQR--HISDLYEDLRDGHNLISLLEVLSGDSLPREKGRM--RFHKLQNVQIALDYLRHRQ 256
Cdd:cd21241      1 EQERVQKKTFTNWINSYLAKRKPpmKVEDLFEDIKDGTKLLALLEVLSGEKLPCEKGRRlkRVHFLSNINTALKFLESKK 80
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1920237946  257 VKLVNIRNDDIADGNPKLTLGLIWTIILHFQI 288
Cdd:cd21241     81 IKLVNINPTDIVDGKPSIVLGLIWTIILYFQI 112
CH_beta_spectrin_rpt1 cd21193
first calponin homology (CH) domain found in the beta spectrin family; The beta spectrin ...
179-284 8.13e-44

first calponin homology (CH) domain found in the beta spectrin family; The beta spectrin family includes beta-I, -II, -III, -IV and -V spectrins. Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. Beta-I spectrin, also called spectrin beta chain, erythrocytic (SPTB), may be involved in anaemia pathogenesis. Beta-II spectrin, also called spectrin beta chain, non-erythrocytic 1 (SPTBN1), or fodrin beta chain, is a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. Beta-IV spectrin is also called spectrin, non-erythroid beta chain 3 (SPTBN3) or spectrin beta chain, non-erythrocytic 4 (SPTBN4). Its mutation associates with congenital myopathy, neuropathy, and central deafness. Beta-III spectrin is also called spectrin beta chain, non-erythrocytic 2 (SPTBN2), or spinocerebellar ataxia 5 protein (SCA5). Beta-V spectrin, also called spectrin beta chain, non-erythrocytic 5 (SPTBN5), is a mammalian ortholog of Drosophila beta H spectrin. Beta-III and Beta-V spectrins may play crucial roles as longer actin-membrane cross-linkers or fulfill the need for greater extensible flexibility than can be provided by the other smaller conventional spectrins. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409042  Cd Length: 116  Bit Score: 156.30  E-value: 8.13e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  179 ADERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR-EKGRMRFHKLQNVQIALDYLrHRQV 257
Cdd:cd21193     10 QEERINIQKKTFTKWINSFLEKANLEIGDLFTDLSDGKLLLKLLEIISGEKLGKpNRGRLRVQKIENVNKALAFL-KTKV 88
                           90       100
                   ....*....|....*....|....*..
gi 1920237946  258 KLVNIRNDDIADGNPKLTLGLIWTIIL 284
Cdd:cd21193     89 RLENIGAEDIVDGNPRLILGLIWTIIL 115
CH_SYNE-like_rpt1 cd21190
first calponin homology (CH) domain found in the synaptic nuclear envelope protein family; The ...
181-288 8.49e-44

first calponin homology (CH) domain found in the synaptic nuclear envelope protein family; The synaptic nuclear envelope (SYNE) family includes SYNE-1, -2 and calmin. SYNE-1 (also called nesprin-1, enaptin, KASH domain-containing protein 1, KASH1, myocyte nuclear envelope protein 1, MYNE-1, or nuclear envelope spectrin repeat protein 1) and SYNE-2 (also called nesprin-2, KASH domain-containing protein 2, KASH2, nuclear envelope spectrin repeat protein 2, nucleus and actin connecting element protein, or protein NUANCE) may act redundantly. They are multi-isomeric modular proteins which form a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. They also act as components of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. Calmin, also called calponin-like transmembrane domain protein, is a protein with calponin homology (CH) and transmembrane domains expressed in maturing spermatogenic cells. It may be involved in the development and/or maintenance of neuronal functions. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409039  Cd Length: 113  Bit Score: 156.19  E-value: 8.49e-44
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  181 ERDRVQKKTFTKWVNKHLikaQRH-----ISDLYEDLRDGHNLISLLEVLSGDSLPREKGRM--RFHKLQNVQIALDYLR 253
Cdd:cd21190      1 EQERVQKKTFTNWINSHL---AKLsqpivINDLFVDIKDGTALLRLLEVLSGQKLPIESGRVlqRAHKLSNIRNALDFLT 77
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 1920237946  254 HRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQI 288
Cdd:cd21190     78 KRCIKLVNINSTDIVDGKPSIVLGLIWTIILYFQI 112
CH_SPTB_like_rpt2 cd21248
second calponin homology (CH) domain found in the beta-I spectrin-like subfamily; The beta-I ...
301-401 4.77e-43

second calponin homology (CH) domain found in the beta-I spectrin-like subfamily; The beta-I spectrin-like family includes beta-I, -II, -III and -IV spectrins. Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. Beta-I spectrin, also called spectrin beta chain, erythrocytic (SPTB), may be involved in anaemia pathogenesis. Beta-II spectrin, also called spectrin beta chain, non-erythrocytic 1 (SPTBN1), or fodrin beta chain, is a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. Beta-III spectrin, also called spectrin beta chain, non-erythrocytic 2 (SPTBN2), or spinocerebellar ataxia 5 protein (SCA5), may play a crucial role as a longer actin-membrane cross-linker or fulfill the need for greater extensible flexibility than can be provided by the other smaller conventional spectrins. Beta-IV spectrin is also called spectrin, non-erythroid beta chain 3 (SPTBN3) or spectrin beta chain, non-erythrocytic 4 (SPTBN4). Its mutation associates with congenital myopathy, neuropathy, and central deafness. Members of this subfamily contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409097  Cd Length: 105  Bit Score: 153.71  E-value: 4.77e-43
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  301 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 380
Cdd:cd21248      2 SAKDALLLWCQMKTAGYPNVNVRNFTTSWRDGLAFNALIHKHRPDLIDYDKLSKSNALYNLQNAFNVAEQKLGLTKLLDP 81
                           90       100
                   ....*....|....*....|.
gi 1920237946  381 EDVDVPQPDEKSIITYVSSLY 401
Cdd:cd21248     82 EDVNVEQPDEKSIITYVVTYY 102
CH_SYNE1_rpt2 cd21243
second calponin homology (CH) domain found in synaptic nuclear envelope protein 1 (SYNE-1) and ...
300-405 5.42e-41

second calponin homology (CH) domain found in synaptic nuclear envelope protein 1 (SYNE-1) and similar proteins; SYNE-1, also called nesprin-1, enaptin, KASH domain-containing protein 1 (KASH1), myocyte nuclear envelope protein 1 (MYNE-1), or nuclear envelope spectrin repeat protein 1, is a multi-isomeric modular protein which forms a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. SYNE-1 also acts as a component of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. SYNE-1 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409092  Cd Length: 109  Bit Score: 147.85  E-value: 5.42e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  300 MTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 379
Cdd:cd21243      4 GGAKKALLKWVQNAAAKRFGIEVKDFGPSWRDGVAFNAIIHSIRPDLVDMESLKRRSNRENLETAFTVAEKELGIPRLLD 83
                           90       100
                   ....*....|....*....|....*.
gi 1920237946  380 PEDVDVPQPDEKSIITYVSSLYDAMP 405
Cdd:cd21243     84 PEDVDVDKPDEKSIMTYVAQFLKKYP 109
CH_SPTBN4_rpt1 cd21318
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 4 (SPTBN4) ...
170-284 1.14e-40

first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 4 (SPTBN4) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN4, also called beta-IV spectrin, or spectrin, non-erythroid beta chain 3 (SPTBN3), is a novel spectrin isolated as an interactor of the receptor tyrosine phosphatase-like protein ICA512. Its mutation associates with congenital myopathy, neuropathy, and central deafness. SPTBN4 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409167  Cd Length: 139  Bit Score: 148.25  E-value: 1.14e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  170 RPGPEPAPA------------ADERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR-EKGR 236
Cdd:cd21318     11 RPWDEPAATaklfecsrikalADEREAVQKKTFTKWVNSHLARVPCRINDLYTDLRDGYVLTRLLEVLSGEQLPKpTRGR 90
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1920237946  237 MRFHKLQNVQIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIIL 284
Cdd:cd21318     91 MRIHSLENVDKALQFLKEQRVHLENVGSHDIVDGNHRLTLGLIWTIIL 138
CH_ACTN_rpt2 cd21216
second calponin homology (CH) domain found in the alpha-actinin family; The alpha-actinin ...
288-403 8.91e-40

second calponin homology (CH) domain found in the alpha-actinin family; The alpha-actinin (ACTN) family includes alpha-actinin-1, -2, -3, and -4. They are F-actin cross-linking proteins which are thought to anchor actin to a variety of intracellular structures. ACTN1 mutations cause congenital macrothrombocytopenia. ACTN2 mutations are associated with cardiomyopathies, as well as skeletal muscle disorder. ACTN3 is critical in anchoring the myofibrillar actin filaments and plays a key role in muscle contraction. ACTN4 is associated with cell motility and cancer invasion. It is probably involved in vesicular trafficking via its association with the CART complex, which is necessary for efficient transferrin receptor recycling but not for epidermal growth factor receptor (EGFR) degradation. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409065  Cd Length: 115  Bit Score: 144.81  E-value: 8.91e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  288 ISDIQVsgqsEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSV 367
Cdd:cd21216      1 IQDISV----EELSAKEGLLLWCQRKTAPYKNVNVQNFHTSWKDGLAFCALIHRHRPDLLDYDKLRKDDPRENLNLAFDV 76
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1920237946  368 AERDLGVTRLLDPED-VDVPQPDEKSIITYVSSLYDA 403
Cdd:cd21216     77 AEKHLDIPKMLDAEDiVNTPRPDERSVMTYVSCYYHA 113
CH_SPTBN2_rpt1 cd21317
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 2 (SPTBN2) ...
179-284 2.34e-39

first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 2 (SPTBN2) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN2, also called beta-III spectrin, or spinocerebellar ataxia 5 protein (SCA5), probably plays an important role in the neuronal membrane skeleton. Mutations in SPTBN2 are associated with spinocerebellar ataxia type 5. SPTBN2 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409166  Cd Length: 132  Bit Score: 144.43  E-value: 2.34e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  179 ADERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR-EKGRMRFHKLQNVQIALDYLRHRQV 257
Cdd:cd21317     25 ADEREAVQKKTFTKWVNSHLARVTCRIGDLYTDLRDGRMLIRLLEVLSGEQLPKpTKGRMRIHCLENVDKALQFLKEQKV 104
                           90       100
                   ....*....|....*....|....*..
gi 1920237946  258 KLVNIRNDDIADGNPKLTLGLIWTIIL 284
Cdd:cd21317    105 HLENMGSHDIVDGNHRLTLGLIWTIIL 131
CH_ACTN_rpt1 cd21214
first calponin homology (CH) domain found in the alpha-actinin family; The alpha-actinin (ACTN) ...
183-284 8.00e-39

first calponin homology (CH) domain found in the alpha-actinin family; The alpha-actinin (ACTN) family includes alpha-actinin-1, -2, -3, and -4. They are F-actin cross-linking proteins which are thought to anchor actin to a variety of intracellular structures. ACTN1 mutations cause congenital macrothrombocytopenia. ACTN2 mutations are associated with cardiomyopathies, as well as skeletal muscle disorder. ACTN3 is critical in anchoring the myofibrillar actin filaments and plays a key role in muscle contraction. ACTN4 is associated with cell motility and cancer invasion. It is probably involved in vesicular trafficking via its association with the CART complex, which is necessary for efficient transferrin receptor recycling but not for epidermal growth factor receptor (EGFR) degradation. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409063  Cd Length: 105  Bit Score: 141.76  E-value: 8.00e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  183 DRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR-EKGRMRFHKLQNVQIALDYLRHRQVKLVN 261
Cdd:cd21214      3 EKQQRKTFTAWCNSHLRKAGTQIENIEEDFRDGLKLMLLLEVISGERLPKpERGKMRFHKIANVNKALDFIASKGVKLVS 82
                           90       100
                   ....*....|....*....|...
gi 1920237946  262 IRNDDIADGNPKLTLGLIWTIIL 284
Cdd:cd21214     83 IGAEEIVDGNLKMTLGMIWTIIL 105
CH_SYNE2_rpt1 cd21242
first calponin homology (CH) domain found in synaptic nuclear envelope protein 2; Synaptic ...
181-288 1.26e-38

first calponin homology (CH) domain found in synaptic nuclear envelope protein 2; Synaptic nuclear envelope protein 2 (SYNE-2), also called nesprin-2, KASH domain-containing protein 2 (KASH2), nuclear envelope spectrin repeat protein 2, nucleus and actin connecting element protein, or protein NUANCE, is a multi-isomeric modular protein which forms a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. SYNE-2 also acts as a component of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. SYNE-2 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409091  Cd Length: 111  Bit Score: 141.51  E-value: 1.26e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  181 ERDRVQKKTFTKWVNKHLIKAQ--RHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRHRQVK 258
Cdd:cd21242      1 EQEQTQKRTFTNWINSQLAKHSppSVVSDLFTDIQDGHRLLDLLEVLSGQQLPREKGHNVFQCRSNIETALSFLKNKSIK 80
                           90       100       110
                   ....*....|....*....|....*....|
gi 1920237946  259 LVNIRNDDIADGNPKLTLGLIWTIILHFQI 288
Cdd:cd21242     81 LINIHVPDIIEGKPSIILGLIWTIILHFHI 110
CH_SpAIN1-like_rpt1 cd21215
first calponin homology (CH) domain found in Schizosaccharomyces pombe alpha-actinin-like ...
185-286 1.95e-38

first calponin homology (CH) domain found in Schizosaccharomyces pombe alpha-actinin-like protein 1 and similar proteins; Schizosaccharomyces pombe alpha-actinin-like protein 1 (SpAIN1) binds to actin and is involved in actin-ring formation and organization. It plays a role in cytokinesis and is involved in septation. Members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409064  Cd Length: 107  Bit Score: 140.61  E-value: 1.95e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  185 VQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR--EKGRMRFHKLQNVQIALDYLRHRQVKLVNI 262
Cdd:cd21215      4 VQKKTFTKWLNTKLSSRGLSITDLVTDLSDGVRLIQLLEIIGDESLGRynKNPKMRVQKLENVNKALEFIKSRGVKLTNI 83
                           90       100
                   ....*....|....*....|....
gi 1920237946  263 RNDDIADGNPKLTLGLIWTIILHF 286
Cdd:cd21215     84 GAEDIVDGNLKLILGLLWTLILRF 107
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1444-2019 3.65e-38

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 158.18  E-value: 3.65e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1444 DLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEErERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRM 1523
Cdd:COG1196    226 EAELLLLKLRELEAELEELEAELEELEAELEELEAELAELE-AELEELRLELEELELELEEAQAEEYELLAELARLEQDI 304
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1524 QEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLrIEEEIRVVRLQLEATERQRGGAEGELQA 1603
Cdd:COG1196    305 ARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEE-AEAELAEAEEALLEAEAELAEAEEELEE 383
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1604 LRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEA 1683
Cdd:COG1196    384 LAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLE 463
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1684 ERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEAtrraqqqaEAERARAEAERELERWQLK 1763
Cdd:COG1196    464 LLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLL--------AGLRGLAGAVAVLIGVEAA 535
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1764 ANEALRLRLQAeevAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIR 1843
Cdd:COG1196    536 YEAALEAALAA---ALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREAD 612
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1844 LRAETEQGEqqrqLLEEELARLQREAAAATqkRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfr 1923
Cdd:COG1196    613 ARYYVLGDT----LLGRTLVAARLEAALRR--AVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELE---- 682
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1924 ELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQR 2003
Cdd:COG1196    683 ELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDL 762
                          570
                   ....*....|....*.
gi 1920237946 2004 RLLEEQAAQHKADIEA 2019
Cdd:COG1196    763 EELERELERLEREIEA 778
CH_SPTBN2_rpt2 cd21321
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 2 (SPTBN2) ...
297-401 2.13e-37

second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 2 (SPTBN2) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN2, also called beta-III spectrin, or spinocerebellar ataxia 5 protein (SCA5), probably plays an important role in the neuronal membrane skeleton. Mutations in SPTBN2 are associated with spinocerebellar ataxia type 5. SPTBN2 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409170  Cd Length: 119  Bit Score: 138.27  E-value: 2.13e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  297 SEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTR 376
Cdd:cd21321      1 KEKKSAKDALLLWCQMKTAGYPNVNVHNFTTSWRDGLAFNAIVHKHRPDLIDFETLKKSNAHYNLQNAFNVAEKELGLTK 80
                           90       100
                   ....*....|....*....|....*
gi 1920237946  377 LLDPEDVDVPQPDEKSIITYVSSLY 401
Cdd:cd21321     81 LLDPEDVNVDQPDEKSIITYVATYY 105
PTZ00121 PTZ00121
MAEBL; Provisional
1472-2177 2.44e-37

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 156.84  E-value: 2.44e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1472 EEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSiqEELQHL 1551
Cdd:PTZ00121  1101 EEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEEARKA--EDAKKA 1178
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1552 RQSSEAEIQAKARQVEAAERSRlRIEEEIRVVRLQlEATERQRGGAEGELQALRaRAEEAeaqkRQAQEEAERLRRQVQD 1631
Cdd:PTZ00121  1179 EAARKAEEVRKAEELRKAEDAR-KAEAARKAEEER-KAEEARKAEDAKKAEAVK-KAEEA----KKDAEEAKKAEEERNN 1251
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1632 ETQRKRQAEAELALRVQAEAEAAREKQRAlqalEELRlQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASF 1711
Cdd:PTZ00121  1252 EEIRKFEEARMAHFARRQAAIKAEEARKA----DELK-KAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEE 1326
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1712 AEKTAQ-LERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEAlrlRLQAEEVAQQKSLTQaeaek 1790
Cdd:PTZ00121  1327 AKKKADaAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAA---KKKAEEKKKADEAKK----- 1398
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1791 qkeeaerearrrgKAEEQAVRQRELAEQELEKQR-QLAEGTAQQRLAAEqELIRLRAETEQGEQQRQLLE-----EELAR 1864
Cdd:PTZ00121  1399 -------------KAEEDKKKADELKKAAAAKKKaDEAKKKAEEKKKAD-EAKKKAEEAKKADEAKKKAEeakkaEEAKK 1464
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1865 LQREAAAATQKRRELE----AELAKVRAEmevllASKARAEEESRSTSEKSKqrleAEAGRFRELAEEAARLRAlAEEAK 1940
Cdd:PTZ00121  1465 KAEEAKKADEAKKKAEeakkADEAKKKAE-----EAKKKADEAKKAAEAKKK----ADEAKKAEEAKKADEAKK-AEEAK 1534
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1941 RQRQLAEEDAVRQRAEAERvlAEKLAAISEatrlKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEAR 2020
Cdd:PTZ00121  1535 KADEAKKAEEKKKADELKK--AEELKKAEE----KKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMK 1608
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2021 LAQLRKASESELERQKGLVEDTLRQRrqveEEILALKGSFEKAAAGKAELELELGRIRGT-----AEDTLRSKEQA--EQ 2093
Cdd:PTZ00121  1609 AEEAKKAEEAKIKAEELKKAEEEKKK----VEQLKKKEAEEKKKAEELKKAEEENKIKAAeeakkAEEDKKKAEEAkkAE 1684
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2094 EAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEE----VERLKAKVEEARRLRERAEQESARQLQLAQEAA 2169
Cdd:PTZ00121  1685 EDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEEnkikAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKK 1764

                   ....*...
gi 1920237946 2170 QKRLQAEE 2177
Cdd:PTZ00121  1765 EEEKKAEE 1772
CH_SPTB_rpt2 cd21319
second calponin homology (CH) domain found in spectrin beta chain, erythrocytic (SPTB) and ...
297-401 4.61e-37

second calponin homology (CH) domain found in spectrin beta chain, erythrocytic (SPTB) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTB, also called beta-I spectrin, may be involved in anaemia pathogenesis. SPTB contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409168  Cd Length: 112  Bit Score: 137.06  E-value: 4.61e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  297 SEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTR 376
Cdd:cd21319      1 RETRSAKDALLLWCQMKTAGYPNVNVTNFTSSWKDGLAFNALIHKHRPDLVDFGKLKKSNARHNLEHAFNVAERQLGITK 80
                           90       100
                   ....*....|....*....|....*
gi 1920237946  377 LLDPEDVDVPQPDEKSIITYVSSLY 401
Cdd:cd21319     81 LLDPEDVFTENPDEKSIITYVVAFY 105
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1477-2078 4.82e-37

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 154.32  E-value: 4.82e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1477 AEQQRAEERERLAEVEAAL-EKQRQLAEAHAQAKaQAEReAQGLQrrmqeevarreevaveAQEQKRSIQEELQHLRQSs 1555
Cdd:COG1196    177 AERKLEATEENLERLEDILgELERQLEPLERQAE-KAER-YRELK----------------EELKELEAELLLLKLREL- 237
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1556 EAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQR 1635
Cdd:COG1196    238 EAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEER 317
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1636 KRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALEtaQRSAEAELQSEHASFAEKT 1715
Cdd:COG1196    318 LEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAE--AEEELEELAEELLEALRAA 395
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1716 AQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEA 1795
Cdd:COG1196    396 AELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALL 475
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1796 EREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIR------LRAETEQGEQQRQLLEEELARLQREA 1869
Cdd:COG1196    476 EAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAgavavlIGVEAAYEAALEAALAAALQNIVVED 555
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1870 AAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEED 1949
Cdd:COG1196    556 DEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAA 635
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1950 AVRQRAEAERVLAEKLAA--ISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKA 2027
Cdd:COG1196    636 LRRAVTLAGRLREVTLEGegGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEE 715
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1920237946 2028 SESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIR 2078
Cdd:COG1196    716 RLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELE 766
CH_DMD_rpt1 cd21231
first calponin homology (CH) domain found in dystrophin and similar proteins; Dystrophin, ...
180-288 6.03e-37

first calponin homology (CH) domain found in dystrophin and similar proteins; Dystrophin, encoded by the DMD gene, is a large, submembrane cytoskeletal protein that is the main component of the dystrophin-glycoprotein complex (DGC) in skeletal muscles. It links the transmembrane DGC to the actin cytoskeleton through binding strongly to the cytoplasmic tail of beta-dystroglycan, the transmembrane subunit of a highly O-glycosylated cell-surface protein. It is involved in maintaining the structural integrity of cells, as well as in the formation of the blood-brain barrier (BBB). Mutations in dystrophin lead to Duchenne muscular dystrophy (DMD). Moreover, dystrophin deficiency is associated with abnormal cerebral diffusion and perfusion, as well as with acute Trypanosoma cruzi infection. The dystrophin subfamily has been characterized by a compact cluster of domains comprising four EF-hand-like motifs and a ZZ-domain, followed by a looser region with two coiled-coils. These domains are believed to be involved in protein-protein interactions. In addition, dystrophin contains two syntrophin binding sites (SBSs) and a long N-terminal extension that comprises two actin-binding calponin homology (CH) domains, approximately 24 spectrin repeats (SRs) and a WW domain. This model corresponds to the first CH domain.


Pssm-ID: 409080  Cd Length: 111  Bit Score: 136.59  E-value: 6.03e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  180 DERDRVQKKTFTKWVNKHLIKAQR-HISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRHRQVK 258
Cdd:cd21231      1 YEREDVQKKTFTKWINAQFAKFGKpPIEDLFTDLQDGRRLLELLEGLTGQKLVKEKGSTRVHALNNVNKALQVLQKNNVD 80
                           90       100       110
                   ....*....|....*....|....*....|
gi 1920237946  259 LVNIRNDDIADGNPKLTLGLIWTIILHFQI 288
Cdd:cd21231     81 LVNIGSADIVDGNHKLTLGLIWSIILHWQV 110
CH_SPTBN5_rpt2 cd21249
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 5 (SPTBN5) ...
300-401 1.15e-35

second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 5 (SPTBN5) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN5, also called beta-V spectrin, is a mammalian ortholog of Drosophila beta H spectrin that may play a crucial role as a longer actin-membrane cross-linker, or may provide greater extensible flexibility than the other, smaller conventional spectrins can. SPTBN5 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409098  Cd Length: 109  Bit Score: 132.68  E-value: 1.15e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  300 MTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 379
Cdd:cd21249      3 RSAKEALLIWCQRKTAGYTNVNVQDFSRSWRDGLAFNALIHAHRPDLIDYGSLRPDRPLYNLANAFLVAEQELGISQLLD 82
                           90       100
                   ....*....|....*....|..
gi 1920237946  380 PEDVDVPQPDEKSIITYVSSLY 401
Cdd:cd21249     83 PEDVAVPHPDERSIMTYVSLYY 104
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1604-2204 1.18e-35

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 150.09  E-value: 1.18e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1604 LRARAEEAEAQKRQAQEEAERL---RRQVqdETQR---KRQAE-AELALRVQAEAEAaREKQRALQALEELRLQAEEAER 1676
Cdd:COG1196    170 YKERKEEAERKLEATEENLERLediLGEL--ERQLeplERQAEkAERYRELKEELKE-LEAELLLLKLRELEAELEELEA 246
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1677 RLRQAEAERARqvqvaLETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAvvQLREEATRRAQQQAEAERARAEAERE 1756
Cdd:COG1196    247 ELEELEAELEE-----LEAELAELEAELEELRLELEELELELEEAQAEEYEL--LAELARLEQDIARLEERRRELEERLE 319
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1757 LERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEaerearrrgKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLA 1836
Cdd:COG1196    320 ELEEELAELEEELEELEEELEELEEELEEAEEELEEAE---------AELAEAEEALLEAEAELAEAEEELEELAEELLE 390
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1837 AEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLE 1916
Cdd:COG1196    391 ALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLE 470
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1917 AEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLA----AISEATRLKTEAEIALKEKEAENERL 1992
Cdd:COG1196    471 EAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRglagAVAVLIGVEAAYEAALEAALAAALQN 550
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1993 RRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELEL 2072
Cdd:COG1196    551 IVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAA 630
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2073 ELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRE 2152
Cdd:COG1196    631 RLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELA 710
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2153 RAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLER 2204
Cdd:COG1196    711 EAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDL 762
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1807-2375 9.02e-35

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 147.01  E-value: 9.02e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1807 EQAVRQRELAEQELEKQRQLAegtAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV 1886
Cdd:COG1196    210 EKAERYRELKEELKELEAELL---LLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEA 286
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1887 RAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQlAEEDAVRQRAEAERVLAEKLA 1966
Cdd:COG1196    287 QAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEE-ELEEAEEELEEAEAELAEAEE 365
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1967 AISEATRLKTEAEIALKEKEAENERLRRlaedEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQR 2046
Cdd:COG1196    366 ALLEAEAELAEAEEELEELAEELLEALR----AAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEE 441
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2047 RQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEA 2126
Cdd:COG1196    442 EALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRG 521
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2127 ARQRKAALEEVERLKAKVEEAR---RLRERAEQESARQLQLAQEAAQKRL-----QAEEKAHAFAVQQKEQELQQTLQQE 2198
Cdd:COG1196    522 LAGAVAVLIGVEAAYEAALEAAlaaALQNIVVEDDEVAAAAIEYLKAAKAgratfLPLDKIRARAALAAALARGAIGAAV 601
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2199 QSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQA 2278
Cdd:COG1196    602 DLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAEL 681
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2279 EQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFS 2358
Cdd:COG1196    682 EELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPD 761
                          570
                   ....*....|....*..
gi 1920237946 2359 LRVQMEELGKLKARIEA 2375
Cdd:COG1196    762 LEELERELERLEREIEA 778
CH_SPTBN4_rpt2 cd21322
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 4 (SPTBN4) ...
285-401 1.07e-34

second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 4 (SPTBN4) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN4, also called beta-IV spectrin, or spectrin, non-erythroid beta chain 3 (SPTBN3), is a novel spectrin isolated as an interactor of the receptor tyrosine phosphatase-like protein ICA512. Mutations in SPTBN4 are associated with congenital myopathy, neuropathy, and central deafness. SPTBN4 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409171  Cd Length: 130  Bit Score: 130.94  E-value: 1.07e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  285 HFQISDIQVSGQSEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQA 364
Cdd:cd21322      1 QIQVIKIETEDNRETRSAKDALLLWCQMKTAGYPEVNIQNFTTSWRDGLAFNALIHRHRPDLIDFSKLTKSNATYNLQQA 80
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1920237946  365 FSVAERDLGVTRLLDPEDVDVPQPDEKSIITYVSSLY 401
Cdd:cd21322     81 FNTAEQHLGLTKLLDPEDVNMEAPDEKSIITYVVSFY 117
CH_DMD-like_rpt2 cd21187
second calponin homology (CH) domain found in the dystrophin family; The dystrophin family ...
304-405 1.26e-34

second calponin homology (CH) domain found in the dystrophin family; The dystrophin family includes dystrophin and its paralog, utrophin. Dystrophin, encoded by the DMD gene, is a large, submembrane cytoskeletal protein that is the main component of the dystrophin-glycoprotein complex (DGC) in skeletal muscles. It links the transmembrane DGC to the actin cytoskeleton through binding strongly to the cytoplasmic tail of beta-dystroglycan, the transmembrane subunit of a highly O-glycosylated cell-surface protein. Dystrophin is also involved in maintaining the structural integrity of cells, as well as in the formation of the blood-brain barrier (BBB). Utrophin, also called dystrophin-related protein 1 (DRP-1), is an autosomal dystrophin homolog that increases dystrophic muscle function and reduces pathology. It is broadly expressed at both the mRNA and protein levels, and occurs in the cerebrovascular endothelium. Utrophin forms the utrophin-glycoprotein complex (UGC) by interacting with dystroglycans (DGs) and sarcoglycan-dystroglycans, as well as sarcoglycan and sarcospan (SG-SSPN) subcomplexes. It may act as a scaffolding protein that stabilizes lipid microdomains and clusters mechanosensitive channel subunits, and link the F-actin cytoskeleton to the cell membrane via the associated glycoprotein complex. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409036  Cd Length: 104  Bit Score: 129.47  E-value: 1.26e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  304 EKLLL-WSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 382
Cdd:cd21187      2 EKTLLaWCRQSTRGYEQVDVKNFTTSWRDGLAFNALIHRHRPDLFDFDSLVKDSPESRLEHAFTVAHEHLGIEKLLDPED 81
                           90       100
                   ....*....|....*....|...
gi 1920237946  383 VDVPQPDEKSIITYVSSLYDAMP 405
Cdd:cd21187     82 VNVEQPDKKSILMYVTSLFQVLP 104
CH_dFLNA-like_rpt1 cd21311
first calponin homology (CH) domain found in Drosophila melanogaster filamin-A (dFLNA) and ...
184-289 7.12e-34

first calponin homology (CH) domain found in Drosophila melanogaster filamin-A (dFLNA) and similar proteins; Drosophila melanogaster filamin-A (dFLNA or dFLN-A), also called actin-binding protein 280 (ABP-280) or filamin-1, is involved in germline ring canal formation. It may tether actin microfilaments within the ovarian ring canal to the cell membrane and contributes to actin microfilament organization. dFLNA contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409160  Cd Length: 124  Bit Score: 128.34  E-value: 7.12e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  184 RVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGR--MRFHKLQNVQIALDYLRHRQ-VKLV 260
Cdd:cd21311     14 RIQQNTFTRWANEHLKTANKHIADLETDLSDGLRLIALVEVLSGKKFPKFNKRptFRSQKLENVSVALKFLEEDEgIKIV 93
                           90       100
                   ....*....|....*....|....*....
gi 1920237946  261 NIRNDDIADGNPKLTLGLIWTIILHFQIS 289
Cdd:cd21311     94 NIDSSDIVDGKLKLILGLIWTLILHYSIS 122
CH_jitterbug-like_rpt1 cd21227
first calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and ...
185-288 5.68e-33

first calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and similar proteins; Protein jitterbug (Jbug) is an actin-meshwork organizing protein. It is required to maintain the shape and cell orientation of the Drosophila notum epithelium during flight muscle attachment to tendon cells. Jbug contains three copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409076  Cd Length: 109  Bit Score: 125.09  E-value: 5.68e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  185 VQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR--EKGRMRFHKLQNVQIALDYLRHRQVKLVNI 262
Cdd:cd21227      4 IQKNTFTNWVNEQLKPTGMSVEDLATDLEDGVKLIALVEILQGRKLGRviKKPLNQHQKLENVTLALKAMAEDGIKLVNI 83
                           90       100
                   ....*....|....*....|....*.
gi 1920237946  263 RNDDIADGNPKLTLGLIWTIILHFQI 288
Cdd:cd21227     84 GNEDIVNGNLKLILGLIWHLILRYQI 109
CH_SPTBN1_rpt1 cd21316
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 1 (SPTBN1) ...
179-284 1.63e-32

first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 1 (SPTBN1) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta- subunits. SPTBN1, also called beta-II spectrin, fodrin beta chain, or spectrin, non-erythroid beta chain 1, is also a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. SPTBN1 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409165  Cd Length: 154  Bit Score: 125.54  E-value: 1.63e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  179 ADERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR-EKGRMRFHKLQNVQIALDYLRHRQV 257
Cdd:cd21316     47 ADEREAVQKKTFTKWVNSHLARVSCRITDLYMDLRDGRMLIKLLEVLSGERLPKpTKGRMRIHCLENVDKALQFLKEQRV 126
                           90       100
                   ....*....|....*....|....*..
gi 1920237946  258 KLVNIRNDDIADGNPKLTLGLIWTIIL 284
Cdd:cd21316    127 HLENMGSHDIVDGNHRLTLGLIWTIIL 153
CH_UTRN_rpt1 cd21232
first calponin homology (CH) domain found in utrophin and similar proteins; Utrophin, also ...
185-288 1.77e-32

first calponin homology (CH) domain found in utrophin and similar proteins; Utrophin, also called dystrophin-related protein 1 (DRP-1), is an autosomal dystrophin homolog that increases dystrophic muscle function and reduces pathology. It is broadly expressed at both the mRNA and protein levels, and occurs in the cerebrovascular endothelium. Utrophin forms the utrophin-glycoprotein complex (UGC) by interacting with dystroglycans (DGs) and sarcoglycan-dystroglycans, as well as sarcoglycan and sarcospan (SG-SSPN) subcomplexes. It may act as a scaffolding protein that stabilizes lipid microdomains and clusters mechanosensitive channel subunits, and link the F-actin cytoskeleton to the cell membrane via the associated glycoprotein complex. Like dystrophin, utrophin has a compact cluster of domains comprising four EF-hand-like motifs and a ZZ-domain, followed by a looser region with two coiled-coils. These domains are believed to be involved in protein-protein interactions. In addition, it contains two syntrophin binding sites (SBSs) and a long N-terminal extension that comprises two actin-binding calponin homology (CH) domains, up to 24 spectrin repeats (SRs), and a WW domain. However, utrophin lacks the intrinsic microtubule binding activity of dystrophin SRs. This model corresponds to the first CH domain.


Pssm-ID: 409081  Cd Length: 107  Bit Score: 123.58  E-value: 1.77e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  185 VQKKTFTKWVNKHLIKAQR-HISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRFHKLQNVQIALDYLRHRQVKLVNIR 263
Cdd:cd21232      2 VQKKTFTKWINARFSKSGKpPIKDMFTDLRDGRKLLDLLEGLTGKSLPKERGSTRVHALNNVNRVLQVLHQNNVELVNIG 81
                           90       100
                   ....*....|....*....|....*
gi 1920237946  264 NDDIADGNPKLTLGLIWTIILHFQI 288
Cdd:cd21232     82 GTDIVDGNHKLTLGLLWSIILHWQV 106
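Each hit in this listing carries a one-line summary of the form "Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...", sometimes with a "[Multi-domain]" flag. The following is a minimal sketch, in Python, of how such a line could be read into a record. The function name `parse_hit_line` and the field names are hypothetical and chosen only for illustration; they are not part of any NCBI tool, and the pattern assumes the line layout seen on this page.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
# The "[Multi-domain]" flag is optional.
HIT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_line(line):
    """Return a dict of the summary fields, or None if the line does not match."""
    m = HIT_LINE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


if __name__ == "__main__":
    print(parse_hit_line(
        "Pssm-ID: 409081  Cd Length: 107  Bit Score: 123.58  E-value: 1.77e-32"
    ))
```

Records parsed this way can be sorted by `e_value` or filtered on `multi_domain` when comparing hits that cover the same interval.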
CH_SPTBN1_rpt2 cd21320
second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 1 (SPTBN1) ...
301-401 5.83e-32

second calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 1 (SPTBN1) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha- and beta-subunits. SPTBN1, also called beta-II spectrin, fodrin beta chain, or spectrin, non-erythroid beta chain 1, is also a component of fodrin, which is the general spectrin-like protein that seems to be involved in secretion. Fodrin interacts with calmodulin in a calcium-dependent manner and is thus a candidate for the calcium-dependent movement of the cytoskeleton at the membrane. SPTBN1 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409169  Cd Length: 108  Bit Score: 122.13  E-value: 5.83e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  301 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 380
Cdd:cd21320      2 SAKDALLLWCQMKTAGYPNVNIHNFTTSWRDGMAFNALIHKHRPDLIDFDKLKKSNAHYNLQNAFNLAEQHLGLTKLLDP 81
                           90       100
                   ....*....|....*....|.
gi 1920237946  381 EDVDVPQPDEKSIITYVSSLY 401
Cdd:cd21320     82 EDISVDHPDEKSIITYVVTYY 102
PTZ00034 PTZ00034
40S ribosomal protein S10; Provisional
5-114 1.72e-31

40S ribosomal protein S10; Provisional


Pssm-ID: 173331  Cd Length: 124  Bit Score: 121.28  E-value: 1.72e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946    5 MLMPLDQLRTIYEVLFREGVMVAKKDRrPRSLHPHVpGVTNLQVTRAMASLRARGLVRETFAWRHFYWYLTNEGIAHLRQ 84
Cdd:PTZ00034     2 VYVPKANRKAIYRYLFKEGVIVCKKDP-KGPWHPEL-NVPNLHVMMLMRSLKSRGLVKEQFAWQHYYYYLTDEGIEYLRT 79
                           90       100       110
                   ....*....|....*....|....*....|
gi 1920237946   85 YLHLPPEIVPASLQRVRRPVAMVMPARRTP 114
Cdd:PTZ00034    80 YLHLPPDVFPATHKKKSVNFERKTEEEGSR 109
CH_SYNE-like_rpt2 cd21192
second calponin homology (CH) domain found in the synaptic nuclear envelope protein (SYNE) ...
300-398 3.84e-31

second calponin homology (CH) domain found in the synaptic nuclear envelope protein (SYNE) family; The SYNE family includes SYNE-1, -2 and calmin. SYNE-1 (also called nesprin-1, enaptin, KASH domain-containing protein 1, KASH1, myocyte nuclear envelope protein 1, MYNE-1, or nuclear envelope spectrin repeat protein 1) and SYNE-2 (also called nesprin-2, KASH domain-containing protein 2, KASH2, nuclear envelope spectrin repeat protein 2, nucleus and actin connecting element protein, or protein NUANCE) may act redundantly. They are multi-isomeric modular proteins which form a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. They also act as components of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. Calmin, also called calponin-like transmembrane domain protein, is a protein with calponin homology (CH) and transmembrane domains expressed in maturing spermatogenic cells. It may be involved in the development and/or maintenance of neuronal functions. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409041  Cd Length: 107  Bit Score: 119.84  E-value: 3.84e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  300 MTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 379
Cdd:cd21192      2 GSAEKALLKWVQAEIGKYYGIRVTDFDKSWRDGVAFLALIHAIRPDLVDMKTVKNRSPRDNLELAFRIAEQHLNIPRLLE 81
                           90
                   ....*....|....*....
gi 1920237946  380 PEDVDVPQPDEKSIITYVS 398
Cdd:cd21192     82 VEDVLVDKPDERSIMTYVS 100
PTZ00121 PTZ00121
MAEBL; Provisional
1465-2069 4.02e-31

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 136.04  E-value: 4.02e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1465 ETLRRMEEEERLAEQQRAE------------ERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREE 1532
Cdd:PTZ00121  1209 EEERKAEEARKAEDAKKAEavkkaeeakkdaEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEE 1288
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1533 V--AVEAQ--EQKRSIQEELQHLRQSSEAEiQAKARQVEA---AERSRLRIEEEirvvRLQLEATERQRGGAEGELQALR 1605
Cdd:PTZ00121  1289 KkkADEAKkaEEKKKADEAKKKAEEAKKAD-EAKKKAEEAkkkADAAKKKAEEA----KKAAEAAKAEAEAAADEAEAAE 1363
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1606 ARAEEAEAQKRQAQEEAERLRRQVQ-----DETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRlQAEEAERRLRQ 1680
Cdd:PTZ00121  1364 EKAEAAEKKKEEAKKKADAAKKKAEekkkaDEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKK-KADEAKKKAEE 1442
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1681 A--------EAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLR--EEATRRAQQQAEAERAR 1750
Cdd:PTZ00121  1443 AkkadeakkKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKkaAEAKKKADEAKKAEEAK 1522
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1751 AEAERELERWQLKANEALRlrlqAEEVAQQKSLTQAEAEKQKEEAEREARRRgKAEEqavrQRELAEQELEKQRQLAEGT 1830
Cdd:PTZ00121  1523 KADEAKKAEEAKKADEAKK----AEEKKKADELKKAEELKKAEEKKKAEEAK-KAEE----DKNMALRKAEEAKKAEEAR 1593
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1831 AQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEK 1910
Cdd:PTZ00121  1594 IEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEED 1673
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1911 SKQRLEAEagrfRELAEEAARLRALAEEAKRQRQLAEedaVRQRAEAERVLAEKLAAISEATRLKteAEIALKEKEAENE 1990
Cdd:PTZ00121  1674 KKKAEEAK----KAEEDEKKAAEALKKEAEEAKKAEE---LKKKEAEEKKKAEELKKAEEENKIK--AEEAKKEAEEDKK 1744
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 1991 RLRRLAEDEAFQRRLleeqaAQHKADIEARLAQLRKASESELErqKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAE 2069
Cdd:PTZ00121  1745 KAEEAKKDEEEKKKI-----AHLKKEEEKKAEEIRKEKEAVIE--EELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKE 1816
PTZ00121 PTZ00121
MAEBL; Provisional
1465-2097 5.77e-31

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 135.65  E-value: 5.77e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1465 ETLRRMEEEERLAEQQRAEERERLAEVEAALE--KQRQLAEAHAQAKAQAEREAQGLQR----------RMQEEVARREE 1532
Cdd:PTZ00121  1161 EDARKAEEARKAEDAKKAEAARKAEEVRKAEElrKAEDARKAEAARKAEEERKAEEARKaedakkaeavKKAEEAKKDAE 1240
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1533 VAVEAQEQKRSIQ-EELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRlQLEATERQRGGAEGELQALRAR-AEE 1610
Cdd:PTZ00121  1241 EAKKAEEERNNEEiRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKAD-EAKKAEEKKKADEAKKKAEEAKkADE 1319
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1611 AEAQKRQAQEEAERLRRQVQdETQRKRQAEAELALRVQAEAEAAREKQRALQ-ALEELRLQAEEAERRlrqaeAERARQV 1689
Cdd:PTZ00121  1320 AKKKAEEAKKKADAAKKKAE-EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEkKKEEAKKKADAAKKK-----AEEKKKA 1393
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1690 QVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARaeaerelerwqlKANEALR 1769
Cdd:PTZ00121  1394 DEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAE------------EAKKAEE 1461
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1770 LRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQElIRLRAETE 1849
Cdd:PTZ00121  1462 AKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEE-AKKADEAK 1540
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1850 QGEQQR---QLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEaEAGRFRELA 1926
Cdd:PTZ00121  1541 KAEEKKkadELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAE-EAKKAEEAK 1619
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1927 EEAARLRALAEEAKRQRQLAEEDA--------VRQRAEAERVLAEKLAAISEATrlKTEAEIALKEKEAENERLRRLAED 1998
Cdd:PTZ00121  1620 IKAEELKKAEEEKKKVEQLKKKEAeekkkaeeLKKAEEENKIKAAEEAKKAEED--KKKAEEAKKAEEDEKKAAEALKKE 1697
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1999 EAFQRRLleEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVeEEILALKGSFEKAAAGKAELELELGRIR 2078
Cdd:PTZ00121  1698 AEEAKKA--EELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKA-EEAKKDEEEKKKIAHLKKEEEKKAEEIR 1774
                          650
                   ....*....|....*....
gi 1920237946 2079 GTAEDTLRSKEQAEQEAAR 2097
Cdd:PTZ00121  1775 KEKEAVIEEELDEEDEKRR 1793
CH_SYNE2_rpt2 cd21244
second calponin homology (CH) domain found in synaptic nuclear envelope protein 2 (SYNE-2) and ...
300-398 1.18e-30

second calponin homology (CH) domain found in synaptic nuclear envelope protein 2 (SYNE-2) and similar proteins; SYNE-2, also called nesprin-2, KASH domain-containing protein 2 (KASH2), nuclear envelope spectrin repeat protein 2, nucleus and actin connecting element protein, or protein NUANCE, is a multi-isomeric modular protein which forms a linking network between organelles and the actin cytoskeleton to maintain subcellular spatial organization. SYNE-2 also acts as a component of the LINC (LInker of Nucleoskeleton and Cytoskeleton) complex, which is involved in the connection between the nuclear lamina and the cytoskeleton. SYNE-2 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409093  Cd Length: 109  Bit Score: 118.40  E-value: 1.18e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  300 MTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 379
Cdd:cd21244      4 MSARKALLLWAQEQCAKVGSISVTDFKSSWRNGLAFLAIIHALRPGLVDMEKLKGRSNRENLEEAFRIAEQELKIPRLLE 83
                           90
                   ....*....|....*....
gi 1920237946  380 PEDVDVPQPDEKSIITYVS 398
Cdd:cd21244     84 PEDVDVVNPDEKSIMTYVA 102
CH_SpAIN1-like_rpt2 cd21291
second calponin homology (CH) domain found in Schizosaccharomyces pombe alpha-actinin-like ...
288-403 1.65e-30

second calponin homology (CH) domain found in Schizosaccharomyces pombe alpha-actinin-like protein 1 and similar proteins; Schizosaccharomyces pombe alpha-actinin-like protein 1 (SpAIN1) binds to actin and is involved in actin-ring formation and organization. It plays a role in cytokinesis and is involved in septation. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409140  Cd Length: 115  Bit Score: 118.40  E-value: 1.65e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  288 ISDIQvsgqSEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSV 367
Cdd:cd21291      1 IADIN----EEGLTAKEGLLLWCQRKTAGYDEVDVQDFTTSWTDGLAFCALIHRHRPDLIDYDKLDKKDHRGNMQLAFDI 76
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1920237946  368 AERDLGVTRLLDPEDV-DVPQPDEKSIITYVSSLYDA 403
Cdd:cd21291     77 ASKEIGIPQLLDVEDVcDVAKPDERSIMTYVAYYFHA 113
CH_UTRN_rpt2 cd21234
second calponin homology (CH) domain found in utrophin and similar proteins; Utrophin, also ...
304-405 1.85e-30

second calponin homology (CH) domain found in utrophin and similar proteins; Utrophin, also called dystrophin-related protein 1 (DRP-1), is an autosomal dystrophin homolog that increases dystrophic muscle function and reduces pathology. It is broadly expressed at both the mRNA and protein levels, and occurs in the cerebrovascular endothelium. Utrophin forms the utrophin-glycoprotein complex (UGC) by interacting with dystroglycans (DGs) and sarcoglycan-dystroglycans, as well as sarcoglycan and sarcospan (SG-SSPN) subcomplexes. It may act as a scaffolding protein that stabilizes lipid microdomains and clusters mechanosensitive channel subunits, and links the F-actin cytoskeleton to the cell membrane via the associated glycoprotein complex. Like dystrophin, utrophin has a compact cluster of domains comprising four EF-hand-like motifs and a ZZ-domain, followed by a looser region with two coiled-coils. These domains are believed to be involved in protein-protein interactions. In addition, it contains two syntrophin binding sites (SBSs) and a long N-terminal extension that comprises two actin-binding calponin homology (CH) domains, up to 24 spectrin repeats (SRs), and a WW domain. However, utrophin lacks the intrinsic microtubule binding activity of dystrophin SRs. This model corresponds to the second CH domain.


Pssm-ID: 409083 [Multi-domain]  Cd Length: 104  Bit Score: 117.75  E-value: 1.85e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  304 EKLLL-WSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 382
Cdd:cd21234      2 EKILLsWVRQSTRPYSQVNVLNFTTSWTDGLAFNAVLHRHKPDLFSWDKVVKMSPVERLEHAFSKAKNHLGIEKLLDPED 81
                           90       100
                   ....*....|....*....|...
gi 1920237946  383 VDVPQPDEKSIITYVSSLYDAMP 405
Cdd:cd21234     82 VAVQLPDKKSIIMYLTSLFEVLP 104
CH_CLMN_rpt1 cd21191
first calponin homology (CH) domain found in calmin and similar proteins; Calmin, also called ...
181-290 2.06e-30

first calponin homology (CH) domain found in calmin and similar proteins; Calmin, also called calponin-like transmembrane domain protein, is a protein with calponin homology (CH) and transmembrane domains expressed in maturing spermatogenic cells. It may be involved in the development and/or maintenance of neuronal functions. Calmin contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409040  Cd Length: 114  Bit Score: 118.07  E-value: 2.06e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  181 ERDRVQKKTFTKWVNKHLIKAQR--HISDLYEDLRDGHNLISLLEVLSGDSLPRE--KGRMRFHKLQNVQIALDYLRHRQ 256
Cdd:cd21191      1 ERENVQKRTFTRWINLHLEKCNPplEVKDLFVDIQDGKILMALLEVLSGQNLLQEykPSSHRIFRLNNIAKALKFLEDSN 80
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1920237946  257 VKLVNIRNDDIADGNPKLTLGLIWTIILHFQISD 290
Cdd:cd21191     81 VKLVSIDAAEIADGNPSLVLGLIWNIILFFQIKE 114
Spectrin_like pfam18373
Spectrin like domain; Desmoplakin (DP) is an integral part of desmosomes, where it links ...
1020-1097 2.57e-30

Spectrin like domain; Desmoplakin (DP) is an integral part of desmosomes, where it links desmosomal cadherins to the intermediate filaments. The N-terminal region of DP contains a plakin domain common to members of the plakin family. Plakin domains contain multiple copies of spectrin repeats (SRs; pfam00435). SRs consist of three alpha-helices (A, B, and C) that form an antiparallel triple-helical bundle. This entry describes SR6, which has a divergent structure relative to the other SRs. Helices A and B of SR6 are significantly shorter than in other repeats. Structural comparison revealed that SR6 is more similar to other three-helix-bundle proteins, including target of Myb1 and the syntaxin Habc domain, than to other SR proteins. Because of these differences from other spectrin repeats, this region is termed a spectrin-like repeat.


Pssm-ID: 465730  Cd Length: 78  Bit Score: 116.16  E-value: 2.57e-30
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237946 1020 LAWQSLGRDMQLIRSWSLATFRTLKPEEQRQALRSLELHYQAFLRDSQDAGGFGPEDRLQAEREYGSCSRHYQQLLQS 1097
Cdd:pfam18373    1 VSWQYLLKDIQRINSWTISMLKTMRPEEYRQVLKNLETHYQDFLRDSQESEMFGAEDRRQLEREVNSAQQHYQTLLVS 78
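The alignment rows in this listing pair a query segment with the domain model, one column per position. As a rough aid for reading them, the sketch below computes percent identity over the aligned columns of two such rows. It rests on assumptions not stated on this page: that "-" marks a gap, and that lowercase letters mark residues not aligned to the model, which are compared case-insensitively here if they face a residue rather than a gap. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query_row, model_row):
    """Percent of identical residues over columns where neither row has a gap.

    Both arguments are the residue strings of one alignment block (or of
    several blocks concatenated), without the leading label and coordinates.
    """
    if len(query_row) != len(model_row):
        raise ValueError("aligned rows must have the same number of columns")
    # Keep only columns where both rows carry a residue.
    pairs = [
        (q, s)
        for q, s in zip(query_row.upper(), model_row.upper())
        if q != "-" and s != "-"
    ]
    if not pairs:
        return 0.0
    matches = sum(1 for q, s in pairs if q == s)
    return 100.0 * matches / len(pairs)


if __name__ == "__main__":
    # Last block of the CH_UTRN_rpt1 alignment in this listing.
    print(percent_identity("NDDIADGNPKLTLGLIWTIILHFQI",
                           "GTDIVDGNHKLTLGLLWSIILHWQV"))
```

Percent identity is only a crude summary; the bit score and E-value reported with each hit are derived from the position-specific scoring model and are not reproduced by this calculation.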
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1874-2585 3.54e-30

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 132.37  E-value: 3.54e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1874 QKRRELEAELAKVRAEMEvllaskaRAEEEsrsTSEKSKQ--RLEAEAgrfrELAEEAARLRALAEEAKRQRQLAEedav 1951
Cdd:COG1196    172 ERKEEAERKLEATEENLE-------RLEDI---LGELERQlePLERQA----EKAERYRELKEELKELEAELLLLK---- 233
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1952 RQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAqhkadiEARLAQLRKASESE 2031
Cdd:COG1196    234 LRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYEL------LAELARLEQDIARL 307
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2032 LERQkglvEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRRE 2111
Cdd:COG1196    308 EERR----RELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEE 383
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2112 AEERVQKSLAAEEEAARQRKAALEEVERLKAkvEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQEL 2191
Cdd:COG1196    384 LAEELLEALRAAAELAAQLEELEEAEEALLE--RLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEAL 461
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2192 QQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQE 2271
Cdd:COG1196    462 LELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALE 541
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2272 AARRAQAEQAALRQKQAAdaemekhkqfAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRgq 2351
Cdd:COG1196    542 AALAAALQNIVVEDDEVA----------AAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLR-- 609
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2352 vEEELFSLRVQMEELGklkarieaenRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQR 2431
Cdd:COG1196    610 -EADARYYVLGDTLLG----------RTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAE 678
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2432 ALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERL 2511
Cdd:COG1196    679 AELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPE 758
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2512 RLRVAEMSRAQARAEEDARRFRK-------QAEDIGERLyrTELATQekvmlVQTLETQRqqsdrdaERLREAIAELEHE 2584
Cdd:COG1196    759 PPDLEELERELERLEREIEALGPvnllaieEYEELEERY--DFLSEQ-----REDLEEAR-------ETLEEAIEEIDRE 824

                   .
gi 1920237946 2585 K 2585
Cdd:COG1196    825 T 825
CH_FLN-like_rpt1 cd21183
first calponin homology (CH) domain found in the filamin family; The filamin family includes ...
184-286 4.61e-30

first calponin homology (CH) domain found in the filamin family; The filamin family includes filamin-A (FLN-A), filamin-B (FLN-B) and filamin-C (FLN-C). Filamins function to anchor various transmembrane proteins to the actin cytoskeleton. FLN-A is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-B is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton and may also promote orthogonal branching of actin filaments as well as link actin filaments to membrane glycoproteins. FLN-C, also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. This family also includes Drosophila melanogaster protein jitterbug (Jbug), which is an actin-meshwork organizing protein containing three copies of the CH domain. Other members of this family contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409032  Cd Length: 108  Bit Score: 116.81  E-value: 4.61e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  184 RVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR---EKGRMRFHKLQNVQIALDYLRHRQVKLV 260
Cdd:cd21183      3 RIQANTFTRWCNEHLKERGMQIHDLATDFSDGLCLIALLENLSTRPLKRsynRRPAFQQHYLENVSTALKFIEADHIKLV 82
                           90       100
                   ....*....|....*....|....*.
gi 1920237946  261 NIRNDDIADGNPKLTLGLIWTIILHF 286
Cdd:cd21183     83 NIGSGDIVNGNIKLILGLIWTLILHY 108
CH_DMD_rpt2 cd21233
second calponin homology (CH) domain found in dystrophin and similar proteins; Dystrophin, ...
304-406 3.02e-29

second calponin homology (CH) domain found in dystrophin and similar proteins; Dystrophin, encoded by the DMD gene, is a large, submembrane cytoskeletal protein that is the main component of the dystrophin-glycoprotein complex (DGC) in skeletal muscles. It links the transmembrane DGC to the actin cytoskeleton by binding strongly to the cytoplasmic tail of beta-dystroglycan, the transmembrane subunit of a highly O-glycosylated cell-surface protein. It is involved in maintaining the structural integrity of cells, as well as in the formation of the blood-brain barrier (BBB). Mutations in dystrophin lead to Duchenne muscular dystrophy (DMD). Moreover, dystrophin deficiency is associated with abnormal cerebral diffusion and perfusion, as well as with acute Trypanosoma cruzi infection. The dystrophin subfamily is characterized by a compact cluster of domains comprising four EF-hand-like motifs and a ZZ-domain, followed by a looser region with two coiled-coils. These domains are believed to be involved in protein-protein interactions. In addition, dystrophin contains two syntrophin binding sites (SBSs) and a long N-terminal extension that comprises two actin-binding calponin homology (CH) domains, approximately 24 spectrin repeats (SRs) and a WW domain. This model corresponds to the second CH domain.


Pssm-ID: 409082  Cd Length: 111  Bit Score: 114.64  E-value: 3.02e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  304 EKLLL-WSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTN-LENLDQAFSVAERDLGVTRLLDPE 381
Cdd:cd21233      2 EKILLsWVRQSTRNYPQVNVINFTSSWSDGLAFNALIHSHRPDLFDWNSVVSQQSaTERLDHAFNIARQHLGIEKLLDPE 81
                           90       100
                   ....*....|....*....|....*
gi 1920237946  382 DVDVPQPDEKSIITYVSSLYDAMPR 406
Cdd:cd21233     82 DVATAHPDKKSILMYVTSLFQVLPQ 106
CH_MICALL2 cd21253
calponin homology (CH) domain found in MICAL-like protein 2 and similar proteins; MICAL-like ...
306-401 3.55e-29

calponin homology (CH) domain found in MICAL-like protein 2 and similar proteins; MICAL-like protein 2 (MICAL-L2), also called junctional Rab13-binding protein (JRAB), or molecule interacting with CasL-like 2, acts as an effector of small Rab GTPases and is involved in junctional complex assembly through the regulation of cell adhesion molecule transport to the plasma membrane, and in actin cytoskeleton reorganization. It regulates the endocytic recycling of occludins, claudins, and E-cadherin to the plasma membrane and may thereby regulate the establishment of tight junctions and adherens junctions. Members of this subfamily contain a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409102  Cd Length: 106  Bit Score: 113.98  E-value: 3.55e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  306 LLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED-VD 384
Cdd:cd21253      6 LQQWCRQQTEGYRDVKVTNMTTSWRDGLAFCAIIHRFRPDLIDFDSLSKENVYENNKLAFTVAEKELGIPALLDAEDmVA 85
                           90
                   ....*....|....*..
gi 1920237946  385 VPQPDEKSIITYVSSLY 401
Cdd:cd21253     86 LKVPDKLSILTYVSQYY 102
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1634-2349 1.29e-28

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 126.98  E-value: 1.29e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1634 QRKRQAEAELAlrvQAEAEAAR--------EKQralqaLEELRLQAEEAERRLRQAEAERARQVQVALetaqrsaeAELQ 1705
Cdd:COG1196    172 ERKEEAERKLE---ATEENLERledilgelERQ-----LEPLERQAEKAERYRELKEELKELEAELLL--------LKLR 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1706 SEHASFAEKTAQLERTLKEEHVAVVQLREEATRRaqqqaeaeraraeaerelerwqlkanEALRLRLQAEEvaqqksltq 1785
Cdd:COG1196    236 ELEAELEELEAELEELEAELEELEAELAELEAEL--------------------------EELRLELEELE--------- 280
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1786 aeaekqkeeaerearrrgKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARL 1865
Cdd:COG1196    281 ------------------LELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEEL 342
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1866 QREAAAATQKRRELEAELAKVRAEmevLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQL 1945
Cdd:COG1196    343 EEELEEAEEELEEAEAELAEAEEA---LLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERL 419
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1946 AEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLR 2025
Cdd:COG1196    420 EEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEA 499
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2026 KASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEE 2105
Cdd:COG1196    500 EADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPL 579
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2106 ERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQ 2185
Cdd:COG1196    580 DKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAG 659
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2186 QKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLR 2265
Cdd:COG1196    660 GSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLE 739
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2266 KEAEQEAARRAQAEQAALRQKQAADAEMEKHKqfAEQALRQKAQV----EQELTALRLQLEETDHQKSILDEELQRLK-- 2339
Cdd:COG1196    740 ELLEEEELLEEEALEELPEPPDLEELERELER--LEREIEALGPVnllaIEEYEELEERYDFLSEQREDLEEARETLEea 817
                          730
                   ....*....|.
gi 1920237946 2340 -AEVTEAARQR 2349
Cdd:COG1196    818 iEEIDRETRER 828
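
Each hit record above carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal sketch, not an NCBI tool, of how that line could be read into a record when post-processing a saved copy of this page. The function name `parse_summary` and the dictionary keys are hypothetical, and the sketch assumes the field labels appear exactly as they do on this page.

```python
import re

# Assumed layout: labels exactly as printed in the hit summary lines on this page.
# The bracketed flag (for example "[Multi-domain]") is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    line = ("Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  "
            "Bit Score: 126.98  E-value: 1.29e-28")
    print(parse_summary(line))
```

Returning None for non-matching lines lets the same function be applied to every line of a saved page, keeping only the lines that parse.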
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1320-1938 1.37e-28

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 126.98  E-value: 1.37e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1320 QTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVpLANSQAVREQLRQEKALLED-IERHGEKveecqrfa 1398
Cdd:COG1196    219 KEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAE-LAELEAELEELRLELEELELeLEEAQAE-------- 289
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1399 kqyinaikdyELQLVTYKAQLEPVASPAKkpkvqsgsESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEEERLAE 1478
Cdd:COG1196    290 ----------EYELLAELARLEQDIARLE--------ERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEE 351
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1479 QQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAqglqrrmqeEVARREEVAVEAQEQKRSIQEELQHLRQSSEAE 1558
Cdd:COG1196    352 ELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELL---------EALRAAAELAAQLEELEEAEEALLERLERLEEE 422
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1559 IQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRqvQDETQRKRQ 1638
Cdd:COG1196    423 LEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAA--RLLLLLEAE 500
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1639 AEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAE----K 1714
Cdd:COG1196    501 ADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATflplD 580
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1715 TAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEE 1794
Cdd:COG1196    581 KIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGG 660
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1795 AEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQ 1874
Cdd:COG1196    661 SLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEE 740
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 1875 KRRELEAELAKVRAEMEvllaskaraEEESRSTSEKSKQRLEAEAGRF--------RELAEEAARLRALAEE 1938
Cdd:COG1196    741 LLEEEELLEEEALEELP---------EPPDLEELERELERLEREIEALgpvnllaiEEYEELEERYDFLSEQ 803
CH_ACTN4_rpt2 cd21290
second calponin homology (CH) domain found in alpha-actinin-4; Alpha-actinin-4 (ACTN4), also ...
286-403 3.08e-28

second calponin homology (CH) domain found in alpha-actinin-4; Alpha-actinin-4 (ACTN4), also called non-muscle alpha-actinin 4, is an F-actin cross-linking protein which is thought to anchor actin to a variety of intracellular structures. It is associated with cell motility and cancer invasion. ACTN4 is probably involved in vesicular trafficking via its association with the CART complex, which is necessary for efficient transferrin receptor recycling but not for epidermal growth factor receptor (EGFR) degradation. It contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409139  Cd Length: 125  Bit Score: 112.10  E-value: 3.08e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  286 FQISDIQVsgqsEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAF 365
Cdd:cd21290      2 FAIQDISV----EETSAKEGLLLWCQRKTAPYKNVNVQNFHISWKDGLAFNALIHRHRPELIEYDKLRKDDPVTNLNNAF 77
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1920237946  366 SVAERDLGVTRLLDPED-VDVPQPDEKSIITYVSSLYDA 403
Cdd:cd21290     78 EVAEKYLDIPKMLDAEDiVNTARPDEKAIMTYVSSFYHA 116
PTZ00121 PTZ00121
MAEBL; Provisional
1848-2738 2.06e-27

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 124.10  E-value: 2.06e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1848 TEQGEQQRQLLEEELARLQREAAAATqkRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFREL-- 1925
Cdd:PTZ00121  1033 TEYGNNDDVLKEKDIIDEDIDGNHEG--KAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETgk 1110
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1926 AEEAARlralAEEAKRQrqlAEEdaVRQRAEAERvlAEKLAAISEATRLKTE--AEIALKEKEAENERLRRLAEDeafQR 2003
Cdd:PTZ00121  1111 AEEARK----AEEAKKK---AED--ARKAEEARK--AEDARKAEEARKAEDAkrVEIARKAEDARKAEEARKAED---AK 1176
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2004 RLLEEQAAqhkadIEARLA-QLRKASESelerqkglvedtlrqrRQVEEeilalkgsfekaaAGKAELELELGRIRgTAE 2082
Cdd:PTZ00121  1177 KAEAARKA-----EEVRKAeELRKAEDA----------------RKAEA-------------ARKAEEERKAEEAR-KAE 1221
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2083 DTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERlkaKVEEARRLRERAEQESARQL 2162
Cdd:PTZ00121  1222 DAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEAR---KADELKKAEEKKKADEAKKA 1298
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2163 QLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERlrseaeaarraaeeaeaareraereaaqsRRQVEEAER 2242
Cdd:PTZ00121  1299 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEE-----------------------------AKKAAEAAK 1349
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2243 LKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQaLRQKAQVEQELTALRLQLE 2322
Cdd:PTZ00121  1350 AEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADE-LKKAAAAKKKADEAKKKAE 1428
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2323 ETDHQksildEELQRLKAEVTEAARQRGQVEEelfslRVQMEELGKlkaRIEAENRALVLRDKDSAQRLLQE---EAEKM 2399
Cdd:PTZ00121  1429 EKKKA-----DEAKKKAEEAKKADEAKKKAEE-----AKKAEEAKK---KAEEAKKADEAKKKAEEAKKADEakkKAEEA 1495
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2400 KQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLK----EKMQAVQEATRLKAEAELLQQQKELAQEQARRLQED 2475
Cdd:PTZ00121  1496 KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKadeaKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEED 1575
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2476 KeQMAQQLAQETQgfqktlETERQRQLEMSAEAERLRLRVAEmsraQARAEEDARrfrKQAEDIGErlyrtelaTQEKVM 2555
Cdd:PTZ00121  1576 K-NMALRKAEEAK------KAEEARIEEVMKLYEEEKKMKAE----EAKKAEEAK---IKAEELKK--------AEEEKK 1633
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2556 LVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQalqqsflsekdslLQRErci 2635
Cdd:PTZ00121  1634 KVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA-------------LKKE--- 1697
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2636 EQEKAKLEQL---FQDEVAKAQALREEQQRQQQqmqqekqqlaaSMEEARRRQHE----AEEgVRRQQEELQRLAQQQQQ 2708
Cdd:PTZ00121  1698 AEEAKKAEELkkkEAEEKKKAEELKKAEEENKI-----------KAEEAKKEAEEdkkkAEE-AKKDEEEKKKIAHLKKE 1765
                          890       900       910
                   ....*....|....*....|....*....|....
gi 1920237946 2709 QEKLLAEENQR----LRERLQHLEEERRAALARS 2738
Cdd:PTZ00121  1766 EEKKAEEIRKEkeavIEEELDEEDEKRRMEVDKK 1799
CH_FLN_rpt1 cd21228
first calponin homology (CH) domain found in filamins; The filamin family includes filamin-A ...
184-286 2.12e-27

first calponin homology (CH) domain found in filamins; The filamin family includes filamin-A (FLN-A), filamin-B (FLN-B) and filamin-C (FLN-C). Filamins function to anchor various transmembrane proteins to the actin cytoskeleton. FLN-A is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-B is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton and may also promote orthogonal branching of actin filaments as well as link actin filaments to membrane glycoproteins. FLN-C, also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. Members of this family contain two copies of the CH domain. The model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409077  Cd Length: 108  Bit Score: 109.11  E-value: 2.12e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  184 RVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR---EKGRMRFHKLQNVQIALDYLRHRQVKLV 260
Cdd:cd21228      3 KIQQNTFTRWCNEHLKCVNKRIYNLETDLSDGLRLIALLEVLSQKRMYKkynKRPTFRQMKLENVSVALEFLERESIKLV 82
                           90       100
                   ....*....|....*....|....*.
gi 1920237946  261 NIRNDDIADGNPKLTLGLIWTIILHF 286
Cdd:cd21228     83 SIDSSAIVDGNLKLILGLIWTLILHY 108
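
The paired "gi" and "Cdd:" rows in each alignment block can be compared column by column. The sketch below counts identical residues over aligned columns only. It assumes that lowercase letters mark unaligned residues and that "-" marks a gap, so both are skipped; the function name `aligned_identity` is hypothetical, and the result is a rough per-block count, not the score reported by CD-Search.

```python
def aligned_identity(query_row, subject_row):
    """Count identical residues over aligned columns of one alignment block.

    Both arguments are the residue strings only (no labels or coordinates).
    Columns where either row has a gap ('-') or a lowercase (assumed
    unaligned) residue are ignored.
    Returns a tuple: (identical_columns, aligned_columns).
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        # str.isupper() is False for '-' and for lowercase letters.
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


if __name__ == "__main__":
    # Residue strings copied from the last block of the cd21228 alignment above.
    query = "NIRNDDIADGNPKLTLGLIWTIILHF"
    subject = "SIDSSAIVDGNLKLILGLIWTLILHY"
    identical, aligned = aligned_identity(query, subject)
    print(f"{identical} identical of {aligned} aligned columns")
```

Summing the two counts across all blocks of a hit before dividing gives a percent identity for the whole aligned interval.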
CH_MICAL_EHBP-like cd22198
calponin homology (CH) domain found in the MICAL and EHBP families; This group is composed of ...
304-403 9.47e-27

calponin homology (CH) domain found in the MICAL and EHBP families; This group is composed of the molecule interacting with CasL protein (MICAL) and EH domain-binding protein (EHBP) families. MICAL is a large, multidomain, cytosolic protein with a single LIM domain, a calponin homology (CH) domain and a flavoprotein monooxygenase (MO) domain. In Drosophila, MICAL is expressed in axons, interacts with the neuronal plexin A (PlexA) receptor and is required for Semaphorin 1a (Sema-1a)-PlexA-mediated repulsive axon guidance. The LIM and CH domains mediate interactions with the cytoskeleton, cytoskeletal adaptor proteins, and other signaling proteins. The flavoprotein MO is required for semaphorin-plexin repulsive axon guidance during axonal pathfinding in the Drosophila neuromuscular system. The EHBP family includes EHBP1 and EHBP1-like protein (EHBP1L1). EHBP1 is a regulator of endocytic recycling and may play a role in actin reorganization by linking clathrin-mediated endocytosis to the actin cytoskeleton. It may act as an effector of small GTPases, including RAB-10 (Rab10), and play a role in vesicle trafficking. EHBP proteins contain a single CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409188  Cd Length: 105  Bit Score: 107.37  E-value: 9.47e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  304 EKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED- 382
Cdd:cd22198      3 EELLSWCQEQTEGYRGVKVTDLTSSWRSGLALCAIIHRFRPDLIDFSSLDPENIAENNQLAFDVAEQELGIPPVMTGQEm 82
                           90       100
                   ....*....|....*....|.
gi 1920237946  383 VDVPQPDEKSIITYVSSLYDA 403
Cdd:cd22198     83 ASLAVPDKLSMVSYLSQFYEA 103
CH_FLNC_rpt1 cd21310
first calponin homology (CH) domain found in filamin-C (FLN-C) and similar proteins; Filamin-C ...
184-289 1.26e-26

first calponin homology (CH) domain found in filamin-C (FLN-C) and similar proteins; Filamin-C (FLN-C), also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. FLN-C contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409159  Cd Length: 125  Bit Score: 107.81  E-value: 1.26e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  184 RVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRE---KGRMRFHKLQNVQIALDYLRHRQVKLV 260
Cdd:cd21310     15 KIQQNTFTRWCNEHLKCVQKRLNDLQKDLSDGLRLIALLEVLSQKKMYRKyhpRPNFRQMKLENVSVALEFLDREHIKLV 94
                           90       100
                   ....*....|....*....|....*....
gi 1920237946  261 NIRNDDIADGNPKLTLGLIWTIILHFQIS 289
Cdd:cd21310     95 SIDSKAIVDGNLKLILGLIWTLILHYSIS 123
CH_ACTN1_rpt2 cd21287
second calponin homology (CH) domain found in alpha-actinin-1; Alpha-actinin-1 (ACTN1), also ...
288-403 1.61e-26

second calponin homology (CH) domain found in alpha-actinin-1; Alpha-actinin-1 (ACTN1), also called alpha-actinin cytoskeletal isoform, or non-muscle alpha-actinin-1, is an F-actin cross-linking protein which is thought to anchor actin to a variety of intracellular structures. ACTN1 is a bundling protein. Mutations in ACTN1 cause congenital macrothrombocytopenia. ACTN1 contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409136  Cd Length: 124  Bit Score: 107.48  E-value: 1.61e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  288 ISDIQVsgqsEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSV 367
Cdd:cd21287      1 IQDISV----EETSAKEGLLLWCQRKTAPYKNVNIQNFHISWKDGLGFCALIHRHRPELIDYGKLRKDDPLTNLNTAFDV 76
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1920237946  368 AERDLGVTRLLDPED-VDVPQPDEKSIITYVSSLYDA 403
Cdd:cd21287     77 AEKYLDIPKMLDAEDiVGTARPDEKAIMTYVSSFYHA 113
PTZ00121 PTZ00121
MAEBL; Provisional
2014-2740 3.40e-26

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 119.86  E-value: 3.40e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2014 KADIEARLAQLRKASESELERQKGLVEDTlRQRRQVEEEilalKGSFE---KAAAGKAELELELGRIRGTAEDTLRSKE- 2089
Cdd:PTZ00121  1059 KAEAKAHVGQDEGLKPSYKDFDFDAKEDN-RADEATEEA----FGKAEeakKTETGKAEEARKAEEAKKKAEDARKAEEa 1133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2090 -QAE-----QEAARQRQLAAEEERRRREAEERVQKSLAAEE----EAARQ----RKAA----LEEVERLKA--KVEEARR 2149
Cdd:PTZ00121  1134 rKAEdarkaEEARKAEDAKRVEIARKAEDARKAEEARKAEDakkaEAARKaeevRKAEelrkAEDARKAEAarKAEEERK 1213
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2150 LRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAERE 2229
Cdd:PTZ00121  1214 AEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKAD 1293
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2230 AAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADA--EMEKHKQFAEQALRQK 2307
Cdd:PTZ00121  1294 EAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAadEAEAAEEKAEAAEKKK 1373
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2308 AQVEQELTALRLQLEETDHQksildEELQRLKAEVTEAARQRGQVEEElfslrvqmeelgKLKARiEAENRALVLRDKDS 2387
Cdd:PTZ00121  1374 EEAKKKADAAKKKAEEKKKA-----DEAKKKAEEDKKKADELKKAAAA------------KKKAD-EAKKKAEEKKKADE 1435
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2388 AQRLLQE--EAEKMKQVAEEAARLSVAAQEAARLRQlAEEdlAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELA 2465
Cdd:PTZ00121  1436 AKKKAEEakKADEAKKKAEEAKKAEEAKKKAEEAKK-ADE--AKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKA 1512
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2466 QEqARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRlRVAEMSRAQA--RAEEDARRFRKQAEDigerL 2543
Cdd:PTZ00121  1513 DE-AKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELK-KAEEKKKAEEakKAEEDKNMALRKAEE----A 1586
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2544 YRTELATQEKVM-LVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFL 2622
Cdd:PTZ00121  1587 KKAEEARIEEVMkLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEE 1666
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2623 SEKDsllqrerciEQEKAKLEQLFQDEVAKAQAlreeqqrqqqqmqqeKQQLAASMEEARRrqheAEEgVRRQQEELQRL 2702
Cdd:PTZ00121  1667 AKKA---------EEDKKKAEEAKKAEEDEKKA---------------AEALKKEAEEAKK----AEE-LKKKEAEEKKK 1717
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|
gi 1920237946 2703 AQQQQQQEkllaEENQRLRERLQHLEEE--RRAALARSEE 2740
Cdd:PTZ00121  1718 AEELKKAE----EENKIKAEEAKKEAEEdkKKAEEAKKDE 1753
PTZ00121 PTZ00121
MAEBL; Provisional
1540-2342 5.68e-26

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 119.09  E-value: 5.68e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1540 QKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEaEAQKRQAQ 1619
Cdd:PTZ00121  1043 KEKDIIDEDIDGNHEGKAEAKAHVGQDEGLKPSYKDFDFDAKEDNRADEATEEAFGKAEEAKKTETGKAEE-ARKAEEAK 1121
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1620 EEAERLR-----RQVQD--ETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAE----AERARQ 1688
Cdd:PTZ00121  1122 KKAEDARkaeeaRKAEDarKAEEARKAEDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVRKAEelrkAEDARK 1201
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1689 VQVAletaqRSAEAELQSEHASFAEKTAQLERTLKEEHVAvvQLREEATRRAQQQAEAERARAEAERELERWQLKANEAL 1768
Cdd:PTZ00121  1202 AEAA-----RKAEEERKAEEARKAEDAKKAEAVKKAEEAK--KDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKA 1274
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1769 RLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQavRQRELAEQELEKQRQLAEGTAQQrlaAEQEliRLRAET 1848
Cdd:PTZ00121  1275 EEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEA--KKADEAKKKAEEAKKKADAAKKK---AEEA--KKAAEA 1347
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1849 EQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEmEVLLA--SKARAEEESRSTSEKSKQrlEAEAGRFRELA 1926
Cdd:PTZ00121  1348 AKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAE-EKKKAdeAKKKAEEDKKKADELKKA--AAAKKKADEAK 1424
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1927 EEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVlAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLL 2006
Cdd:PTZ00121  1425 KKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK-AEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAK 1503
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2007 EEQAAQHKADiEARLAQLRKASEsELERQKglvedtlrQRRQVEEeilaLKGSFEKAAAGKAELELELGRirgtAEDTlR 2086
Cdd:PTZ00121  1504 KAAEAKKKAD-EAKKAEEAKKAD-EAKKAE--------EAKKADE----AKKAEEKKKADELKKAEELKK----AEEK-K 1564
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2087 SKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRlrerAEQESARQLQLaq 2166
Cdd:PTZ00121  1565 KAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKK----AEEEKKKVEQL-- 1638
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2167 eaaqKRLQAEEKAHAFAVQQKEQELQQTLqqeqsvlERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQs 2246
Cdd:PTZ00121  1639 ----KKKEAEEKKKAEELKKAEEENKIKA-------AEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEE- 1706
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2247 aeeQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADaemEKHKQFAEQALRQKAQVEQELTALRLQLEETDH 2326
Cdd:PTZ00121  1707 ---LKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAE---EAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAV 1780
                          810
                   ....*....|....*.
gi 1920237946 2327 QKSILDEELQRLKAEV 2342
Cdd:PTZ00121  1781 IEEELDEEDEKRRMEV 1796
CH_ACTN3_rpt2 cd21289
second calponin homology (CH) domain found in alpha-actinin-3; Alpha-actinin-3 (ACTN3), also ...
288-403 7.26e-26

second calponin homology (CH) domain found in alpha-actinin-3; Alpha-actinin-3 (ACTN3), also called alpha-actinin skeletal muscle isoform 3, is an F-actin cross-linking protein which is thought to anchor actin to a variety of intracellular structures. ACTN3 is a bundling protein. It is critical in anchoring the myofibrillar actin filaments and plays a key role in muscle contraction. It contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409138  Cd Length: 124  Bit Score: 105.58  E-value: 7.26e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  288 ISDIQVsgqsEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSV 367
Cdd:cd21289      1 IQDISV----EETSAKEGLLLWCQRKTAPYRNVNVQNFHTSWKDGLALCALIHRHRPDLIDYAKLRKDDPIGNLNTAFEV 76
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1920237946  368 AERDLGVTRLLDPED-VDVPQPDEKSIITYVSSLYDA 403
Cdd:cd21289     77 AEKYLDIPKMLDAEDiVNTPKPDEKAIMTYVSCFYHA 113
CH_SPTBN5_rpt1 cd21247
first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 5 (SPTBN5) ...
180-288 1.49e-25

first calponin homology (CH) domain found in spectrin beta chain, non-erythrocytic 5 (SPTBN5) and similar proteins; Spectrin is an actin crosslinking and molecular scaffold protein that links the plasma membrane to the actin cytoskeleton, and functions in the determination of cell shape, arrangement of transmembrane proteins, and organization of organelles. It is composed of two antiparallel dimers of alpha and beta subunits. SPTBN5, also called beta-V spectrin, is a mammalian ortholog of Drosophila beta H spectrin. It may play a crucial role as a longer actin-membrane cross-linker, or may fulfill a need for greater extensibility and flexibility than the smaller conventional spectrins can provide. SPTBN5 contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409096  Cd Length: 125  Bit Score: 104.45  E-value: 1.49e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  180 DERDRVQKKTFTKWVNKHLIKAQR--HISDLYEDLRDGHNLISLLEVLSGDSLPR-EKGRMRFHKLQNVQIALDYLRHR- 255
Cdd:cd21247     15 EQRMTMQKKTFTKWMNNVFSKNGAkiEITDIYTELKDGIHLLRLLELISGEQLPRpSRGKMRVHFLENNSKAITFLKTKv 94
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1920237946  256 QVKLVNIRNddIADGNPKLTLGLIWTIILHFQI 288
Cdd:cd21247     95 PVKLIGPEN--IVDGDRTLILGLIWIIILRFQI 125
CH smart00033
Calponin homology domain; Actin binding domains present in duplicate at the N-termini of ...
188-285 2.25e-25

Calponin homology domain; Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeast Cdc24p.


Pssm-ID: 214479 [Multi-domain]  Cd Length: 101  Bit Score: 103.16  E-value: 2.25e-25
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946   188 KTFTKWVNKHLIKA-QRHISDLYEDLRDGHNLISLLEVLSGDSLPREK---GRMRFHKLQNVQIALDYLRHRQVKLVNIR 263
Cdd:smart00033    1 KTLLRWVNSLLAEYdKPPVTNFSSDLKDGVALCALLNSLSPGLVDKKKvaaSLSRFKKIENINLALSFAEKLGGKVVLFE 80
                            90       100
                    ....*....|....*....|..
gi 1920237946   264 NDDIADGnPKLTLGLIWTIILH 285
Cdd:smart00033   81 PEDLVEG-PKLILGVIWTLISL 101
growth_prot_Scy NF041483
polarized growth protein Scy;
1477-2688 2.54e-25

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 116.85  E-value: 2.54e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1477 AEQQRAEERERLAEVEAalEKQRQLAE-AHAQAKAQAEREAQGLQRRMQ--EEVARREEvAVEAQEQKRSIQEELQHLRQ 1553
Cdd:NF041483    85 ADQLRADAERELRDARA--QTQRILQEhAEHQARLQAELHTEAVQRRQQldQELAERRQ-TVESHVNENVAWAEQLRART 161
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1554 SSEA-----EIQAKARQVEAAERSRlrieeeirVVRLQLEAteRQRGGAEGElqalRARAEeAEAQKRQAQEEAERLRRQ 1628
Cdd:NF041483   162 ESQArrlldESRAEAEQALAAARAE--------AERLAEEA--RQRLGSEAE----SARAE-AEAILRRARKDAERLLNA 226
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1629 VQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQaeEAERRLRQAEAERARQVQVALETA-QRSAEAELQSE 1707
Cdd:NF041483   227 ASTQAQEATDHAEQLRSSTAAESDQARRQAAELSRAAEQRMQ--EAEEALREARAEAEKVVAEAKEAAaKQLASAESANE 304
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1708 hasfaektaQLERTLKEEhvaVVQLREEATRRAQQQAEAERARAEAERELERWQL-KANEALRLRLQAEEVAQQKSLTQA 1786
Cdd:NF041483   305 ---------QRTRTAKEE---IARLVGEATKEAEALKAEAEQALADARAEAEKLVaEAAEKARTVAAEDTAAQLAKAART 372
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1787 EAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQ-RLAAEQELIRLRAETEQgeqqrqlLEEELARL 1865
Cdd:NF041483   373 AEEVLTKASEDAKATTRAAAEEAERIRREAEAEADRLRGEAADQAEQlKGAAKDDTKEYRAKTVE-------LQEEARRL 445
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1866 QREAAaatQKRRELEAELAKVRAEmevllaskARAEeesrstsekSKQRLEAEAGRFRELAEEAarlRALAEEAKRQrql 1945
Cdd:NF041483   446 RGEAE---QLRAEAVAEGERIRGE--------ARRE---------AVQQIEEAARTAEELLTKA---KADADELRST--- 499
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1946 aeedavrQRAEAERVLAEklaAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAA-QHKADIEARLAQL 2024
Cdd:NF041483   500 -------ATAESERVRTE---AIERATTLRRQAEETLERTRAEAERLRAEAEEQAEEVRAAAERAArELREETERAIAAR 569
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2025 RKASESELERQKGLVEdtlrQRRQVEEEilALKGSFEKAAAGKAELELELGRIRGTAEDTLRS-KEQAEQEAARQRqlaa 2103
Cdd:NF041483   570 QAEAAEELTRLHTEAE----ERLTAAEE--ALADARAEAERIRREAAEETERLRTEAAERIRTlQAQAEQEAERLR---- 639
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2104 eeerrRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEE-ARRLR-------ERAEQESARQLQLAQ-EAAQKRLQ 2174
Cdd:NF041483   640 -----TEAAADASAARAEGENVAVRLRSEAAAEAERLKSEAQEsADRVRaeaaaaaERVGTEAAEALAAAQeEAARRRRE 714
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2175 AEE---KAHAFAVQQKEQELQQTLQQEQSVLERLrseaeaarraaeeaeaareraEREAAQSRRQVEEAERlkqsaeeqa 2251
Cdd:NF041483   715 AEEtlgSARAEADQERERAREQSEELLASARKRV---------------------EEAQAEAQRLVEEADR--------- 764
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2252 qaqaqaqaaaeklrkeaeqeaarraqaeqaalRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEET-DHQKSI 2330
Cdd:NF041483   765 --------------------------------RATELVSAAEQTAQQVRDSVAGLQEQAEEEIAGLRSAAEHAaERTRTE 812
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2331 LDEELQRLKAEvteAARQRGQVEEELFSLRVQ-MEELGKLKARIEAEnralVLRDKDSAQRLLQEEAEKMKQVAEEAA-R 2408
Cdd:NF041483   813 AQEEADRVRSD---AYAERERASEDANRLRREaQEETEAAKALAERT----VSEAIAEAERLRSDASEYAQRVRTEASdT 885
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2409 LSVAAQEAARLRQLAEEDLAQQRALA---EKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAq 2485
Cdd:NF041483   886 LASAEQDAARTRADAREDANRIRSDAaaqADRLIGEATSEAERLTAEARAEAERLRDEARAEAERVRADAAAQAEQLIA- 964
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2486 etqgfqktleterqrqlEMSAEAERLRLRVAE-MSRAQARAE---EDARRFRKQAEDIGERLyRTELATQEKVMLVQTLE 2561
Cdd:NF041483   965 -----------------EATGEAERLRAEAAEtVGSAQQHAErirTEAERVKAEAAAEAERL-RTEAREEADRTLDEARK 1026
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2562 TQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLL----QETQALQQSFLSEKDSLLQRERcieq 2637
Cdd:NF041483  1027 DANKRRSEAAEQADTLITEAAAEADQLTAKAQEEALRTTTEAEAQADTMVgaarKEAERIVAEATVEGNSLVEKAR---- 1102
                         1210      1220      1230      1240      1250
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1920237946 2638 ekAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLaasMEEARRRQHEA 2688
Cdd:NF041483  1103 --TDADELLVGARRDATAIRERAEELRDRITGEIEEL---HERARRESAEQ 1148
PTZ00121 PTZ00121
MAEBL; Provisional
1820-2643 2.94e-25

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 116.78  E-value: 2.94e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1820 LEKQRQLAEGTAQQRLAAEQELIRLRAE-TEQGEQQRQLLEEELARLQREAA----AATQKRRELEA-ELAKVRAEmEVL 1893
Cdd:PTZ00121  1026 IEKIEELTEYGNNDDVLKEKDIIDEDIDgNHEGKAEAKAHVGQDEGLKPSYKdfdfDAKEDNRADEAtEEAFGKAE-EAK 1104
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1894 LASKARAEEESRStsEKSKQRLEaEAGRFREL--AEEAARlralAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAIS-- 1969
Cdd:PTZ00121  1105 KTETGKAEEARKA--EEAKKKAE-DARKAEEArkAEDARK----AEEARKAEDAKRVEIARKAEDARKAEEARKAEDAkk 1177
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1970 -EATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEA--RLAQLRKASESELERQKGLVEDTLRQR 2046
Cdd:PTZ00121  1178 aEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAvkKAEEAKKDAEEAKKAEEERNNEEIRKF 1257
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2047 RQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRqlaAEEERRRreaeervqkslaaeEEA 2126
Cdd:PTZ00121  1258 EEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKK---AEEAKKA--------------DEA 1320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2127 ARQRKAALEEVERLKAKVEEARRLRERAEQEsarqlqlAQEAAQKRLQAEEKAHAfavqqkeqelqqtlqqeqsvLERLR 2206
Cdd:PTZ00121  1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAE-------AEAAADEAEAAEEKAEA--------------------AEKKK 1373
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2207 SEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKlRKEAEQEAARRAQAEQAALRQK 2286
Cdd:PTZ00121  1374 EEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEK-KKADEAKKKAEEAKKADEAKKK 1452
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2287 QAADAEMEKHKQFAEQA-----LRQKAQVEQELTALRLQLEETDHQKsildEELQRlKAEVTEAARQRGQVEEelfslRV 2361
Cdd:PTZ00121  1453 AEEAKKAEEAKKKAEEAkkadeAKKKAEEAKKADEAKKKAEEAKKKA----DEAKK-AAEAKKKADEAKKAEE-----AK 1522
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2362 QMEELGKLKARIEAEnRALVLRDKDSAQRLlqEEAEKMKQvAEEAARLSVAAQEAARlRQLAEEDLAQQRALAEKMLKEK 2441
Cdd:PTZ00121  1523 KADEAKKAEEAKKAD-EAKKAEEKKKADEL--KKAEELKK-AEEKKKAEEAKKAEED-KNMALRKAEEAKKAEEARIEEV 1597
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2442 MQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQqlaqetqgFQKTLETERQRQLEMSAEAERLRLRVAEMSRA 2521
Cdd:PTZ00121  1598 MKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQ--------LKKKEAEEKKKAEELKKAEEENKIKAAEEAKK 1669
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2522 QARAEEDARRFRKQAED---IGERLYRTElatqEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLK 2598
Cdd:PTZ00121  1670 AEEDKKKAEEAKKAEEDekkAAEALKKEA----EEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKK 1745
                          810       820       830       840       850
                   ....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2599 SEEMQTVRQE-----QLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLE 2643
Cdd:PTZ00121  1746 AEEAKKDEEEkkkiaHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRME 1795
CH_ACTN2_rpt2 cd21288
second calponin homology (CH) domain found in alpha-actinin-2; Alpha-actinin-2 (ACTN2), also ...
288-403 2.06e-24

second calponin homology (CH) domain found in alpha-actinin-2; Alpha-actinin-2 (ACTN2), also called alpha-actinin skeletal muscle isoform 2, is an F-actin cross-linking protein which is thought to anchor actin to a variety of intracellular structures. ACTN2 is a bundling protein. Its mutations are associated with cardiomyopathies, as well as skeletal muscle disorders. It contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409137  Cd Length: 124  Bit Score: 101.30  E-value: 2.06e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  288 ISDIQVsgqsEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSV 367
Cdd:cd21288      1 IQDISV----EETSAKEGLLLWCQRKTAPYRNVNIQNFHTSWKDGLGLCALIHRHRPDLIDYSKLNKDDPIGNINLAMEI 76
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1920237946  368 AERDLGVTRLLDPED-VDVPQPDEKSIITYVSSLYDA 403
Cdd:cd21288     77 AEKHLDIPKMLDAEDiVNTPKPDERAIMTYVSCFYHA 113
CH pfam00307
Calponin homology (CH) domain; The CH domain is found in both cytoskeletal proteins and signal ...
185-288 2.30e-24

Calponin homology (CH) domain; The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However, in calponins there is evidence that the CH domain is not involved in actin binding activity. Most member proteins have from two to four copies of the CH domain; however, some proteins, such as calponin, have only a single copy.


Pssm-ID: 425596 [Multi-domain]  Cd Length: 109  Bit Score: 100.44  E-value: 2.30e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  185 VQKKTFTKWVNKHLIKAQRH--ISDLYEDLRDGHNLISLLEVLSGDSLP-REKGRMRFHKLQNVQIALDYLRHRQ-VKLV 260
Cdd:pfam00307    2 ELEKELLRWINSHLAEYGPGvrVTNFTTDLRDGLALCALLNKLAPGLVDkKKLNKSEFDKLENINLALDVAEKKLgVPKV 81
                           90       100
                   ....*....|....*....|....*...
gi 1920237946  261 NIRNDDIADGNPKLTLGLIWTIILHFQI 288
Cdd:pfam00307   82 LIEPEDLVEGDNKSVLTYLASLFRRFQA 109
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1514-2450 2.47e-24

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 113.23  E-value: 2.47e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1514 REAQGLQRRMQEEVARREEVAVEAQEQKRSIQ------EELQHLR-QSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQ 1586
Cdd:TIGR02168  175 KETERKLERTRENLDRLEDILNELERQLKSLErqaekaERYKELKaELRELELALLVLRLEELREELEELQEELKEAEEE 254
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1587 LEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEE 1666
Cdd:TIGR02168  255 LEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDE 334
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1667 LRLQAEEAERRLrqaeaerarqvqvaletaqrsaeAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATrraqqqaea 1746
Cdd:TIGR02168  335 LAEELAELEEKL-----------------------EELKEELESLEAELEELEAELEELESRLEELEEQLE--------- 382
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1747 eraraeaerelerwQLKANEALRLRLQAEEVAQQKSLTQAEaekqkeeaerearrrgkaeEQAVRQRELAEQELEKQRQl 1826
Cdd:TIGR02168  383 --------------TLRSKVAQLELQIASLNNEIERLEARL-------------------ERLEDRRERLQQEIEELLK- 428
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1827 aEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRS 1906
Cdd:TIGR02168  429 -KLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEG 507
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1907 TSE--KSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQlaeEDAVRQRAEAERVLAEKLAAiSEATRLKTEAEIALKE 1984
Cdd:TIGR02168  508 VKAllKNQSGLSGILGVLSELISVDEGYEAAIEAALGGRL---QAVVVENLNAAKKAIAFLKQ-NELGRVTFLPLDSIKG 583
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1985 KEAENERLRRLAEDEAFQRRL--LEEQAAQHKADIEARLAQLRKASEselerqkglVEDTLRQRRQVEEEIL-------- 2054
Cdd:TIGR02168  584 TEIQGNDREILKNIEGFLGVAkdLVKFDPKLRKALSYLLGGVLVVDD---------LDNALELAKKLRPGYRivtldgdl 654
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2055 -----ALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQlaaeeerRRREAEERVQKSLAAEEEAARQ 2129
Cdd:TIGR02168  655 vrpggVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRK-------ELEELEEELEQLRKELEELSRQ 727
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2130 RKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAfaVQQKEQELQQTLQQEQSVLERLRSEA 2209
Cdd:TIGR02168  728 ISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAE--AEAEIEELEAQIEQLKEELKALREAL 805
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2210 EAARRAAEEAEAARERAEREAAQSRRQVEEAERLkqsaeeqaqaqaqaqaaaekLRKEAEQEAARRAQAEQAALRQKQAA 2289
Cdd:TIGR02168  806 DELRAELTLLNEEAANLRERLESLERRIAATERR--------------------LEDLEEQIEELSEDIESLAAEIEELE 865
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2290 DAEMEKHKQFaEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEEL-GK 2368
Cdd:TIGR02168  866 ELIEELESEL-EALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLqER 944
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2369 L--KARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAarLRQLAE--EDLAQQRALAEKMLKEKMQA 2444
Cdd:TIGR02168  945 LseEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIKELGPVNLAAIEE--YEELKEryDFLTAQKEDLTEAKETLEEA 1022

                   ....*.
gi 1920237946 2445 VQEATR 2450
Cdd:TIGR02168 1023 IEEIDR 1028
CH pfam00307
Calponin homology (CH) domain; The CH domain is found in both cytoskeletal proteins and signal ...
300-406 6.37e-24

Calponin homology (CH) domain; The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However, in calponins there is evidence that the CH domain is not involved in actin binding activity. Most member proteins have from two to four copies of the CH domain; however, some proteins, such as calponin, have only a single copy.


Pssm-ID: 425596 [Multi-domain]  Cd Length: 109  Bit Score: 99.28  E-value: 6.37e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  300 MTAKEKLLLWSQRMVEGC-QGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVY--RQTNLENLDQAFSVAERDLGVTR 376
Cdd:pfam00307    1 LELEKELLRWINSHLAEYgPGVRVTNFTTDLRDGLALCALLNKLAPGLVDKKKLNksEFDKLENINLALDVAEKKLGVPK 80
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1920237946  377 -LLDPEDVDvpQPDEKSIITYVSSLYDAMPR 406
Cdd:pfam00307   81 vLIEPEDLV--EGDNKSVLTYLASLFRRFQA 109
CH_MICALL cd21197
calponin homology (CH) domain found in the MICAL-like protein family; The MICAL-L family ...
306-401 6.41e-24

calponin homology (CH) domain found in the MICAL-like protein family; The MICAL-L family includes MICAL-L1 and MICAL-L2. MICAL-L1, also called molecule interacting with Rab13 (MIRab13), is a probable lipid-binding protein with higher affinity for phosphatidic acid, a lipid enriched in recycling endosome membranes. It is a tubular endosomal membrane hub that connects Rab35 and Arf6 with Rab8a. It may be involved in a late step of receptor-mediated endocytosis regulating endocytosed-EGF receptor trafficking. Alternatively, it may regulate slow endocytic recycling of endocytosed proteins back to the plasma membrane. MICAL-L1 may indirectly play a role in neurite outgrowth. MICAL-L2, also called junctional Rab13-binding protein (JRAB), or molecule interacting with CasL-like 2, acts as an effector of small Rab GTPases that is involved in junctional complex assembly through the regulation of cell adhesion molecule transport to the plasma membrane and actin cytoskeleton reorganization. It regulates the endocytic recycling of occludins, claudins, and E-cadherin to the plasma membrane and may thereby regulate the establishment of tight junctions and adherens junctions. Members of this family contain a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409046  Cd Length: 105  Bit Score: 99.15  E-value: 6.41e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  306 LLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED-VD 384
Cdd:cd21197      5 LLRWCRRQCEGYPGVNITNLTSSFRDGLAFCAILHRHRPELIDFHSLKKDNWLENNRLAFRVAETSLGIPALLDAEDmVT 84
                           90
                   ....*....|....*..
gi 1920237946  385 VPQPDEKSIITYVSSLY 401
Cdd:cd21197     85 MHVPDRLSIITYVSQYY 101
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1440-2176 1.51e-23

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 110.92  E-value: 1.51e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1440 QEYVDLRTRYSELS-TLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAAL--------EKQRQLAEA------ 1504
Cdd:TIGR02168  213 ERYKELKAELRELElALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLeelrlevsELEEEIEELqkelya 292
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1505 HAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQE------ELQHLRQSSEAEIQAKARQVEAAERSRLRIEE 1578
Cdd:TIGR02168  293 LANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDElaeelaELEEKLEELKEELESLEAELEELEAELEELES 372
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1579 EIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVqdETQRKRQAEAELALRVQAEAEAAREKQ 1658
Cdd:TIGR02168  373 RLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEI--EELLKKLEEAELKELQAELEELEEELE 450
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1659 RALQALEELRLQAEEAERRLRQAEAERarqvqvaletaqRSAEAELQsEHASFAEKTAQLERTLKEEHVAVVQLREEAtr 1738
Cdd:TIGR02168  451 ELQEELERLEEALEELREELEEAEQAL------------DAAERELA-QLQARLDSLERLQENLEGFSEGVKALLKNQ-- 515
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1739 raQQQAEAERARAEAERELERWQLKANEALRLRLQA---EEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQREL 1815
Cdd:TIGR02168  516 --SGLSGILGVLSELISVDEGYEAAIEAALGGRLQAvvvENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTEIQGNDREI 593
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1816 AEQELEKQRQLAE---GTAQQRLAAEQELIRLR-AETEQG--EQQRQLLEEELA------------RLQREAAAATQKRR 1877
Cdd:TIGR02168  594 LKNIEGFLGVAKDlvkFDPKLRKALSYLLGGVLvVDDLDNalELAKKLRPGYRIvtldgdlvrpggVITGGSAKTNSSIL 673
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1878 ELEAELAKVRAEMEvLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEA 1957
Cdd:TIGR02168  674 ERRREIEELEEKIE-ELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQL 752
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1958 ERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAED-----EAFQRRLLEEQAAQHKADIEARLAQLRKAS-ESE 2031
Cdd:TIGR02168  753 SKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQlkeelKALREALDELRAELTLLNEEAANLRERLESlERR 832
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2032 LERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELgrirgtaEDTLRSKEQAEQEAARQRQLAAEEERRRRE 2111
Cdd:TIGR02168  833 IAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESEL-------EALLNERASLEEALALLRSELEELSEELRE 905
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237946 2112 AEERVQKSLAAEEEAARQRKAALEEVERLKAKVEE-ARRLRERA--EQESARQLQLAQEAAQKRLQAE 2176
Cdd:TIGR02168  906 LESKRSELRRELEELREKLAQLELRLEGLEVRIDNlQERLSEEYslTLEEAEALENKIEDDEEEARRR 973
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1832-2624 1.98e-23

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 110.53  E-value: 1.98e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1832 QQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAAtQKRRELEAELAKVRAEmevLLASKARAEEESRSTSEKS 1911
Cdd:TIGR02168  172 ERRKETERKLERTRENLDRLEDILNELERQLKSLERQAEKA-ERYKELKAELRELELA---LLVLRLEELREELEELQEE 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1912 KQRLEAEagrFRELAEEAARLRALAEEAKRQRQLAEEDAvrqrAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENER 1991
Cdd:TIGR02168  248 LKEAEEE---LEELTAELQELEEKLEELRLEVSELEEEI----EELQKELYALANEISRLEQQKQILRERLANLERQLEE 320
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1992 LRRLAEDEAFQRRLLEEQAAQHKADIEaRLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFekaaagkAELE 2071
Cdd:TIGR02168  321 LEAQLEELESKLDELAEELAELEEKLE-ELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKV-------AQLE 392
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2072 LELGRIRGTAEDTLRSKEQAEQEAARQRQ-LAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRL 2150
Cdd:TIGR02168  393 LQIASLNNEIERLEARLERLEDRRERLQQeIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEE 472
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2151 RERAEQESARQLQLAQE--AAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAER 2228
Cdd:TIGR02168  473 AEQALDAAERELAQLQArlDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSELISVDEGYEAAIEAALGGRLQAVVV 552
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2229 EAAQSRRQVEEAerLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAaDAEMEKHKQFAeqaLRQKA 2308
Cdd:TIGR02168  553 ENLNAAKKAIAF--LKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKF-DPKLRKALSYL---LGGVL 626
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2309 QVEQELTALRLQlEETDHQKSI--LDEELQRLKAEVTEAARQRGQVeeeLFSLRVQMEELGKLKARIEAENRAL--VLRD 2384
Cdd:TIGR02168  627 VVDDLDNALELA-KKLRPGYRIvtLDGDLVRPGGVITGGSAKTNSS---ILERRREIEELEEKIEELEEKIAELekALAE 702
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2385 KDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKM-----------QAVQEATRLKA 2453
Cdd:TIGR02168  703 LRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEaeieeleerleEAEEELAEAEA 782
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2454 EAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRlrvaEMSRAQARAEEDARRFR 2533
Cdd:TIGR02168  783 EIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLE----DLEEQIEELSEDIESLA 858
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2534 KQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTvRQEQLLQE 2613
Cdd:TIGR02168  859 AEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL-RLEGLEVR 937
                          810
                   ....*....|.
gi 1920237946 2614 TQALQQSFLSE 2624
Cdd:TIGR02168  938 IDNLQERLSEE 948
growth_prot_Scy NF041483
polarized growth protein Scy;
1468-2179 2.45e-23

Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 110.30  E-value: 2.45e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1468 RRMEEEERLAEQQRAEERERLAEVEAALEKQRQLA-EAHAQAKAQAEREAQGLQRRMQEEVARreeVAVEAQEQKRSIQE 1546
Cdd:NF041483   254 RQAAELSRAAEQRMQEAEEALREARAEAEKVVAEAkEAAAKQLASAESANEQRTRTAKEEIAR---LVGEATKEAEALKA 330
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1547 ELQHLRQSSEAEiqAKARQVEAAERSRLRIEEEirvvrlqlEATERQRGGAEGElQALRARAEEAEAQKRQAQEEAERLR 1626
Cdd:NF041483   331 EAEQALADARAE--AEKLVAEAAEKARTVAAED--------TAAQLAKAARTAE-EVLTKASEDAKATTRAAAEEAERIR 399
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1627 RQVQDETQRKRQAEAELALRVQAEAEAAREKQRAlqalEELRLQaEEAeRRLRqAEAERARqvqvaletaqrsaeaelqs 1706
Cdd:NF041483   400 REAEAEADRLRGEAADQAEQLKGAAKDDTKEYRA----KTVELQ-EEA-RRLR-GEAEQLR------------------- 453
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1707 ehasfAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELErwqlkANEALRLRLQAEEVAqqKSLTQA 1786
Cdd:NF041483   454 -----AEAVAEGERIRGEARREAVQQIEEAARTAEELLTKAKADADELRSTA-----TAESERVRTEAIERA--TTLRRQ 521
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1787 EAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLA-AEQELIRLRAETEQ----GEQQRQLLEEE 1861
Cdd:NF041483   522 AEETLERTRAEAERLRAEAEEQAEEVRAAAERAARELREETERAIAARQAeAAEELTRLHTEAEErltaAEEALADARAE 601
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1862 LARLQREAAAATQKRRELEAE-----LAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfrelAEEAARLRALA 1936
Cdd:NF041483   602 AERIRREAAEETERLRTEAAErirtlQAQAEQEAERLRTEAAADASAARAEGENVAVRLRSEA------AAEAERLKSEA 675
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1937 EE-AKRQRQLAEEDAVRQRAEAERVLAeklAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAfqrrllEEQAAQHKA 2015
Cdd:NF041483   676 QEsADRVRAEAAAAAERVGTEAAEALA---AAQEEAARRRREAEETLGSARAEADQERERAREQS------EELLASARK 746
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2016 DIEARLAQLRKASESELERQKGLV---EDTLRQRR--------QVEEEILALKGSFEKAAA-GKAELELELGRIRGtaeD 2083
Cdd:NF041483   747 RVEEAQAEAQRLVEEADRRATELVsaaEQTAQQVRdsvaglqeQAEEEIAGLRSAAEHAAErTRTEAQEEADRVRS---D 823
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2084 TLRSKEQAEQEAARQRQlaaeeerrrreaeERVQKSLAAEEEAARQRKAALEEVERLKAKVEE-ARRLRERAeqeSARQL 2162
Cdd:NF041483   824 AYAERERASEDANRLRR-------------EAQEETEAAKALAERTVSEAIAEAERLRSDASEyAQRVRTEA---SDTLA 887
                          730
                   ....*....|....*..
gi 1920237946 2163 QLAQEAAQKRLQAEEKA 2179
Cdd:NF041483   888 SAEQDAARTRADAREDA 904
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1811-2581 3.85e-23

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 109.38  E-value: 3.85e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1811 RQRELAEQELEKQRQLAEgtaqqrlaAEQELIRLRAETeqgeqqrqlLEEELARLQREAAAATQKRRELEAELAKVRAEM 1890
Cdd:TIGR02168  207 RQAEKAERYKELKAELRE--------LELALLVLRLEE---------LREELEELQEELKEAEEELEELTAELQELEEKL 269
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1891 EVLLASKARAEEEsrstsekskqrLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISE 1970
Cdd:TIGR02168  270 EELRLEVSELEEE-----------IEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEE 338
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1971 ATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQH---KADIEARLAQLRKASeSELERQKGLVEDTLRQRR 2047
Cdd:TIGR02168  339 LAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLetlRSKVAQLELQIASLN-NEIERLEARLERLEDRRE 417
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2048 QVEEEILALKGSFEKAAagKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRrreaeervQKSLAAEEEAA 2127
Cdd:TIGR02168  418 RLQQEIEELLKKLEEAE--LKELQAELEELEEELEELQEELERLEEALEELREELEEAEQA--------LDAAERELAQL 487
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2128 RQRKAALEEV-ERLKAKVEEARRLRERAEQ------------ESARQLQLAQEAA-QKRLQA------EEKAHAFAVQQK 2187
Cdd:TIGR02168  488 QARLDSLERLqENLEGFSEGVKALLKNQSGlsgilgvlseliSVDEGYEAAIEAAlGGRLQAvvvenlNAAKKAIAFLKQ 567
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2188 EQELQQTLQQEQSVLER-LRSEAEAARRAAEEAEAARERAEREAAQSRRQVEE-------AERLKQSAEEQAQAQAQAQA 2259
Cdd:TIGR02168  568 NELGRVTFLPLDSIKGTeIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYllggvlvVDDLDNALELAKKLRPGYRI 647
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2260 AAEKLRKEAEQEAARRAQAEQAALRQKQaaDAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLK 2339
Cdd:TIGR02168  648 VTLDGDLVRPGGVITGGSAKTNSSILER--RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELS 725
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2340 AEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAEnRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQ----- 2414
Cdd:TIGR02168  726 RQISALRKDLARLEAEVEQLEERIAQLSKELTELEAE-IEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKalrea 804
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2415 ------------EAARLRQLAEEDLAQQRALAEKML----KEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQ 2478
Cdd:TIGR02168  805 ldelraeltllnEEAANLRERLESLERRIAATERRLedleEQIEELSEDIESLAAEIEELEELIEELESELEALLNERAS 884
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2479 MAQQLAQ---ETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDigerLYRTELATQEKvm 2555
Cdd:TIGR02168  885 LEEALALlrsELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSE----EYSLTLEEAEA-- 958
                          810       820
                   ....*....|....*....|....*.
gi 1920237946 2556 LVQTLETQRQQSDRDAERLREAIAEL 2581
Cdd:TIGR02168  959 LENKIEDDEEEARRRLKRLENKIKEL 984
growth_prot_Scy NF041483
polarized growth protein Scy;
1464-2180 3.97e-23

Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 109.53  E-value: 3.97e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1464 SETLRRMEEEERL---AEQQRAE---ERERLaEVEAALEKQRQLAEAHAQAK---AQAEREAQGLQRRMQEEVAR-REEV 1533
Cdd:NF041483   433 AKTVELQEEARRLrgeAEQLRAEavaEGERI-RGEARREAVQQIEEAARTAEellTKAKADADELRSTATAESERvRTEA 511
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1534 AVEAQEQKRSIQEELQHLRqsSEAEiQAKARQVEAAERSRLRIEEEIRVVRLQLE-ATERQRGGAEGELQALRARAEE-- 1610
Cdd:NF041483   512 IERATTLRRQAEETLERTR--AEAE-RLRAEAEEQAEEVRAAAERAARELREETErAIAARQAEAAEELTRLHTEAEErl 588
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1611 --AEAQKRQAQEEAERLRRQVQDETQRKRQAEAE--LALRVQAEAEAAREKQRALQALEELRLQAEEAERRLR-QAEAER 1685
Cdd:NF041483   589 taAEEALADARAEAERIRREAAEETERLRTEAAEriRTLQAQAEQEAERLRTEAAADASAARAEGENVAVRLRsEAAAEA 668
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1686 ARQVQVALETAQRsaeaeLQSEHASFAEKTAQlertlkeehvavvqlreeatrraqqqaeaeraraeaerelerwqlKAN 1765
Cdd:NF041483   669 ERLKSEAQESADR-----VRAEAAAAAERVGT---------------------------------------------EAA 698
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1766 EALrlrlqaeevaqqksltqaeaekqkeeaerearrrGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELI--- 1842
Cdd:NF041483   699 EAL----------------------------------AAAQEEAARRRREAEETLGSARAEADQERERAREQSEELLasa 744
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1843 RLRAETEQGEQQRqLLEEELARLQREAAAATQKRRELEAELAKV--RAEMEV--LLASKARAEEESRSTSEKSKQRLEAE 1918
Cdd:NF041483   745 RKRVEEAQAEAQR-LVEEADRRATELVSAAEQTAQQVRDSVAGLqeQAEEEIagLRSAAEHAAERTRTEAQEEADRVRSD 823
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1919 AGRFRELA-EEAARLRALA-EEAKRQRQLAEEDAVRQRAEAERVLAEklaAISEATRLKTEAEIALKEKEAENERLRRLA 1996
Cdd:NF041483   824 AYAERERAsEDANRLRREAqEETEAAKALAERTVSEAIAEAERLRSD---ASEYAQRVRTEASDTLASAEQDAARTRADA 900
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1997 EDEAFQRRllEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAElelelgR 2076
Cdd:NF041483   901 REDANRIR--SDAAAQADRLIGEATSEAERLTAEARAEAERLRDEARAEAERVRADAAAQAEQLIAEATGEAE------R 972
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2077 IRGTAEDTLRSkeqAEQEAARQRQLAAEEERRrreaeervqkslaAEEEAARQRKAALEEVERL--KAKVEEARRLRERA 2154
Cdd:NF041483   973 LRAEAAETVGS---AQQHAERIRTEAERVKAE-------------AAAEAERLRTEAREEADRTldEARKDANKRRSEAA 1036
                          730       740
                   ....*....|....*....|....*.
gi 1920237946 2155 EQESARQLQLAQEAAQKRLQAEEKAH 2180
Cdd:NF041483  1037 EQADTLITEAAAEADQLTAKAQEEAL 1062
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
2003-2644 5.86e-23

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 108.49  E-value: 5.86e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2003 RRLLEEQA--AQHKADIEARLAQLRKASE---------SELERQKglveDTLRQRRQVEEEILALKGSFEKAaagkaELE 2071
Cdd:COG1196    158 RAIIEEAAgiSKYKERKEEAERKLEATEEnlerledilGELERQL----EPLERQAEKAERYRELKEELKEL-----EAE 228
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2072 LELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLR 2151
Cdd:COG1196    229 LLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLE 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2152 ERAEQESARQLQLAQEAAQKRLQAEEKAhafavqqkeqelqqtlqqeqsvlERLRSEAEAARRAAEEAEAARERAEREAA 2231
Cdd:COG1196    309 ERRRELEERLEELEEELAELEEELEELE-----------------------EELEELEEELEEAEEELEEAEAELAEAEE 365
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2232 QSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKeaeqeaarraqAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVE 2311
Cdd:COG1196    366 ALLEAEAELAEAEEELEELAEELLEALRAAAELAA-----------QLEELEEAEEALLERLERLEEELEELEEALAELE 434
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2312 QELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRvqmEELGKLKARIEAENRALVLRDKDSAQRL 2391
Cdd:COG1196    435 EEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELL---EELAEAAARLLLLLEAEADYEGFLEGVK 511
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2392 LQEEAEKMKQVA---------EEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAV--QEATRLKAEAELLQQ 2460
Cdd:COG1196    512 AALLLAGLRGLAgavavligvEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRAtfLPLDKIRARAALAAA 591
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2461 QKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIG 2540
Cdd:COG1196    592 LARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELL 671
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2541 ERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQS 2620
Cdd:COG1196    672 AALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEE 751
                          650       660
                   ....*....|....*....|....
gi 1920237946 2621 FLSEKDSLLQRERcIEQEKAKLEQ 2644
Cdd:COG1196    752 ALEELPEPPDLEE-LERELERLER 774
CH_MICALL1 cd21252
calponin homology (CH) domain found in MICAL-like protein 1; MICAL-like protein 1 (MICAL-L1), ...
302-401 1.04e-22

calponin homology (CH) domain found in MICAL-like protein 1; MICAL-like protein 1 (MICAL-L1), also called molecule interacting with Rab13 (MIRab13), is a probable lipid-binding protein with higher affinity for phosphatidic acid, a lipid enriched in recycling endosome membranes. It is a tubular endosomal membrane hub that connects Rab35 and Arf6 with Rab8a. It may be involved in a late step of receptor-mediated endocytosis regulating endocytosed-EGF receptor trafficking. Alternatively, it may regulate slow endocytic recycling of endocytosed proteins back to the plasma membrane. MICAL-L1 may indirectly play a role in neurite outgrowth. It contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409101  Cd Length: 107  Bit Score: 95.71  E-value: 1.04e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  302 AKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPE 381
Cdd:cd21252      1 ARRALQAWCRRQCEGYPGVEIRDLSSSFRDGLAFCAILHRHRPDLIDFDSLSKDNVYENNRLAFEVAERELGIPALLDPE 80
                           90       100
                   ....*....|....*....|.
gi 1920237946  382 D-VDVPQPDEKSIITYVSSLY 401
Cdd:cd21252     81 DmVSMKVPDCLSIMTYVSQYY 101
CH_CTX_rpt2 cd21226
second calponin homology (CH) domain found in cortexillin; Cortexillins are actin-bundling ...
304-404 1.34e-22

second calponin homology (CH) domain found in cortexillin; Cortexillins are actin-bundling proteins that play a critical role in regulating cell morphology and actin cytoskeleton reorganization. They play a major role in cytokinesis and contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409075  Cd Length: 103  Bit Score: 95.22  E-value: 1.34e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  304 EKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPEDV 383
Cdd:cd21226      3 DGLLAWCRQTTEGYDGVNITSFKSSFNDGRAFLALLHAYDPELFKQAAIEQMDAEARLNLAFDFAEKKLGIPKLLEAEDV 82
                           90       100
                   ....*....|....*....|.
gi 1920237946  384 DVPQPDEKSIITYVSSLYDAM 404
Cdd:cd21226     83 MTGNPDERSIVLYTSLFYHAF 103
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1915-2730 1.66e-22

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 107.45  E-value: 1.66e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1915 LEAEAGRFRELAEEAA---RLRALAEEAKRQRQLAEEDAVR------------QRAEAERVLAEKLAAISEATRlKTEAE 1979
Cdd:TIGR02168  150 IEAKPEERRAIFEEAAgisKYKERRKETERKLERTRENLDRledilnelerqlKSLERQAEKAERYKELKAELR-ELELA 228
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1980 IALKEKEAENERLRRLAEDEAFQRRLLEEQAAQhKADIEARLAQLRKAS---ESELERQKGLVEDTLRQRRQVEEEILAL 2056
Cdd:TIGR02168  229 LLVLRLEELREELEELQEELKEAEEELEELTAE-LQELEEKLEELRLEVselEEEIEELQKELYALANEISRLEQQKQIL 307
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2057 KGSFEKAAAGKAELELELgrirgtaEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEE 2136
Cdd:TIGR02168  308 RERLANLERQLEELEAQL-------EELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQ 380
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2137 VERLKAKVEEARR--LRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQ-QKEQELQQTLQQEQSVLERLRSEAEAAR 2213
Cdd:TIGR02168  381 LETLRSKVAQLELqiASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEeAELKELQAELEELEEELEELQEELERLE 460
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2214 RAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQaeqaaLRQKQAADAEM 2293
Cdd:TIGR02168  461 EALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGV-----LSELISVDEGY 535
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2294 EKHKQFAEQALRQKAQVEQELTALRLQleetDHQKsilDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARI 2373
Cdd:TIGR02168  536 EAAIEAALGGRLQAVVVENLNAAKKAI----AFLK---QNELGRVTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLV 608
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2374 EAENRA-----------LVLRDKDSAQRLLQEEAEKMKQVAEEAARLS---VAAQEAARLRQLAeedLAQQRALAEkmLK 2439
Cdd:TIGR02168  609 KFDPKLrkalsyllggvLVVDDLDNALELAKKLRPGYRIVTLDGDLVRpggVITGGSAKTNSSI---LERRREIEE--LE 683
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2440 EKMQAVQEAtrlkaEAELLQQQKELAQEQarrlqedkeqmaQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMS 2519
Cdd:TIGR02168  684 EKIEELEEK-----IAELEKALAELRKEL------------EELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLE 746
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2520 RAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKS 2599
Cdd:TIGR02168  747 ERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERL 826
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2600 EEMQTvRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLfQDEVAKAQALREEQQRQQQQMQQEKQQLAASME 2679
Cdd:TIGR02168  827 ESLER-RIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEEL-ESELEALLNERASLEEALALLRSELEELSEELR 904
                          810       820       830       840       850
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1920237946 2680 EARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEE 2730
Cdd:TIGR02168  905 ELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEE 955
CH_FLN-like_rpt2 cd21184
second calponin homology (CH) domain found in the filamin family; The filamin family includes ...
301-399 2.15e-22

second calponin homology (CH) domain found in the filamin family; The filamin family includes filamin-A (FLN-A), filamin-B (FLN-B) and filamin-C (FLN-C). Filamins function to anchor various transmembrane proteins to the actin cytoskeleton. FLN-A is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-B is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton and may also promote orthogonal branching of actin filaments as well as link actin filaments to membrane glycoproteins. FLN-C, also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. This family also includes Drosophila melanogaster protein jitterbug (Jbug), which is an actin-meshwork organizing protein containing three copies of the CH domain. Other members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409033  Cd Length: 103  Bit Score: 94.61  E-value: 2.15e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  301 TAKEKLLLWSQRMVEGCqglRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVY-RQTNLENLDQAFSVAERDLGVTRLLD 379
Cdd:cd21184      1 SGKSLLLEWVNSKIPEY---KVKNFTTDWNDGKALAALVDALKPGLIPDNESLdKENPLENATKAMDIAEEELGIPKIIT 77
                           90       100
                   ....*....|....*....|
gi 1920237946  380 PEDVDVPQPDEKSIITYVSS 399
Cdd:cd21184     78 PEDMVSPNVDELSVMTYLSY 97
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1142-1706 2.70e-22

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 106.56  E-value: 2.70e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1142 ARECAQRITEQQKAQAEVDGLGKGVARLSAEAEKvlalpepspaaptLRSELELTLGKLEQVRSLSAIYLEKLKTISLVI 1221
Cdd:COG1196    252 EAELEELEAELAELEAELEELRLELEELELELEE-------------AQAEEYELLAELARLEQDIARLEERRRELEERL 318
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1222 RSTQEAEEVLRAHEEQLKEAQAvpATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVE 1301
Cdd:COG1196    319 EELEEELAELEEELEELEEELE--ELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAA 396
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1302 RWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALL 1381
Cdd:COG1196    397 ELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLE 476
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1382 EDIERhgekveecqrfAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLtsqyir 1461
Cdd:COG1196    477 AALAE-----------LLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAA------ 539
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1462 fisetlrrMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEA----HAQAKAQAEREAQGLQRRMQEEVARREEVAVEA 1537
Cdd:COG1196    540 --------LEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRAtflpLDKIRARAALAAALARGAIGAAVDLVASDLREA 611
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1538 QEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQ 1617
Cdd:COG1196    612 DARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEE 691
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1618 AQEEAERLRRQvqdETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVA-LETA 1696
Cdd:COG1196    692 ELELEEALLAE---EEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEeLERE 768
                          570
                   ....*....|
gi 1920237946 1697 QRSAEAELQS 1706
Cdd:COG1196    769 LERLEREIEA 778
CH_EHBP cd21198
calponin homology (CH) domain found in the EH domain-binding protein (EHBP) family; The EHBP ...
301-401 3.07e-22

calponin homology (CH) domain found in the EH domain-binding protein (EHBP) family; The EHBP family includes EHBP1 and EHBP1-like protein (EHBP1L1). EHBP1 is a regulator of endocytic recycling and may play a role in actin reorganization by linking clathrin-mediated endocytosis to the actin cytoskeleton. It may act as an effector of small GTPases, including RAB-10 (Rab10), and play a role in vesicle trafficking. EHBP1 is associated with aggressive prostate cancer and insulin-stimulated trafficking and cell migration. EHBP1L1 may also act as a Rab effector protein and play a role in vesicle trafficking. It coordinates Rab8 and Bin1 to regulate apical-directed transport in polarized epithelial cells. Members of this family contain a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409047  Cd Length: 105  Bit Score: 94.41  E-value: 3.07e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  301 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLLDP 380
Cdd:cd21198      1 SSGQDLLEWCQEVTKGYRGVKITNLTTSWRNGLAFCAILHHFRPDLIDFSSLSPHDIKENCKLAFDAAAK-LGIPRLLDP 79
                           90       100
                   ....*....|....*....|..
gi 1920237946  381 EDVDVPQ-PDEKSIITYVSSLY 401
Cdd:cd21198     80 ADMVLLSvPDKLSVMTYLHQIR 101
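
Each hit's summary line (Pssm-ID, Cd Length, Bit Score, E-value) follows a fixed layout, so it can be read into a record for sorting or filtering. The following is a minimal Python sketch that assumes the plain-text layout shown above; the function name `parse_hit_stats` and the returned keys are illustrative and are not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears in this text dump.
# Treating the bracketed tag as optional is an assumption based on the
# lines seen here (some hits carry "[Multi-domain]", others do not).
_STATS = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


def parse_hit_stats(line):
    """Return a dict of the summary fields, or None if the line does not match."""
    m = _STATS.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("tag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    print(parse_hit_stats(
        "Pssm-ID: 409047  Cd Length: 105  Bit Score: 94.41  E-value: 3.07e-22"
    ))
```

Lines that do not match the pattern return `None`, so the function can be run over every line of the dump without pre-filtering.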
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
2118-2733 3.57e-22

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 106.17  E-value: 3.57e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2118 KSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQ 2197
Cdd:COG1196    203 EPLERQAEKAERYRELKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELE 282
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2198 EQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQ 2277
Cdd:COG1196    283 LEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAE 362
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2278 AEQAALRQKQAADAEMEKHKQFAEQALRQkaqvEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELF 2357
Cdd:COG1196    363 AEEALLEAEAELAEAEEELEELAEELLEA----LRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEE 438
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2358 SLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKM 2437
Cdd:COG1196    439 EEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAG 518
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2438 LKEKMQAVQEATRLKAEAELLQQQKELA--QEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRV 2515
Cdd:COG1196    519 LRGLAGAVAVLIGVEAAYEAALEAALAAalQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIG 598
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2516 AEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLL 2595
Cdd:COG1196    599 AAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAE 678
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2596 QLKSEEMQTVRQEQLLQETQALQQsflsekdsLLQRERCIEQEKAKLEQlfqdevakaqalreeqqrqqqqmqqekqqlA 2675
Cdd:COG1196    679 AELEELAERLAEEELELEEALLAE--------EEEERELAEAEEERLEE------------------------------E 720
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237946 2676 ASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRA 2733
Cdd:COG1196    721 LEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLEREIEA 778
COG5045 COG5045
Ribosomal protein S10E [Translation, ribosomal structure and biogenesis];
5-112 3.73e-22

Ribosomal protein S10E [Translation, ribosomal structure and biogenesis];


Pssm-ID: 227378  Cd Length: 105  Bit Score: 94.22  E-value: 3.73e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946    5 MLMPLDQLRTIYEVLFREGVMVAKKDRRpRSLHPHVpGVTNLQVTRAMASLRARGLVRETFAWRHFYWYLTNEGIAHLRQ 84
Cdd:COG5045      1 MLVPKENRYKIHQRLFQKGVAVAKKDFN-LGKHREL-EIPNLHVIKAMQSLISYGYVKTIHVWRHSYYTLTPEGVEYLRE 78
                           90       100
                   ....*....|....*....|....*...
gi 1920237946   85 YLHLPPEIVPASLQRVRRPVAmvMPARR 112
Cdd:COG5045     79 YLVLPDEGVPSTEAPAVSPTQ--RPQRR 104
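
The paired rows in each alignment block (the query `gi` row above, the `Cdd:` model row below) can be compared column by column. The sketch below rests on several assumptions: gaps are written as `-`, lowercase letters mark residues that fall outside the aligned region in this display, and identity is counted only over columns where both rows hold an uppercase residue. `aligned_identity` is a hypothetical helper, not an NCBI function, and its counts are not the statistics CD-Search itself reports.

```python
def aligned_identity(query_row, model_row):
    """Return (identical, compared) for two equal-length alignment rows.

    Only columns where both rows hold an uppercase residue are compared;
    gap characters and lowercase (assumed unaligned) residues are skipped.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    compared = 0
    for q, m in zip(query_row, model_row):
        if q.isupper() and m.isupper():
            compared += 1
            if q == m:
                identical += 1
    return identical, compared


if __name__ == "__main__":
    # Sequence portions of the last COG5045 alignment row pair shown above.
    query = "YLHLPPEIVPASLQRVRRPVAmvMPARR"
    model = "YLVLPDEGVPSTEAPAVSPTQ--RPQRR"
    same, total = aligned_identity(query, model)
    print(f"{same} identical of {total} compared columns")
```

Only the sequence portion of each row should be passed in; the leading identifier and the start and end coordinates need to be stripped first.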
CH_FLNA_rpt1 cd21308
first calponin homology (CH) domain found in filamin-A (FLN-A) and similar proteins; Filamin-A ...
184-289 5.31e-22

first calponin homology (CH) domain found in filamin-A (FLN-A) and similar proteins; Filamin-A (FLN-A) is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also anchors various transmembrane proteins to the actin cytoskeleton and serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-A contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409157  Cd Length: 129  Bit Score: 94.77  E-value: 5.31e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  184 RVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR---EKGRMRFHKLQNVQIALDYLRHRQVKLV 260
Cdd:cd21308     19 KIQQNTFTRWCNEHLKCVSKRIANLQTDLSDGLRLIALLEVLSQKKMHRkhnQRPTFRQMQLENVSVALEFLDRESIKLV 98
                           90       100
                   ....*....|....*....|....*....
gi 1920237946  261 NIRNDDIADGNPKLTLGLIWTIILHFQIS 289
Cdd:cd21308     99 SIDSKAIVDGNLKLILGLIWTLILHYSIS 127
CH_FLNB_rpt1 cd21309
first calponin homology (CH) domain found in filamin-B (FLN-B) and similar proteins; Filamin-B ...
184-289 5.94e-22

first calponin homology (CH) domain found in filamin-B (FLN-B) and similar proteins; Filamin-B (FLN-B) is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton. It may promote orthogonal branching of actin filaments and link actin filaments to membrane glycoproteins. It anchors various transmembrane proteins to the actin cytoskeleton. FLN-B contains two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409158  Cd Length: 131  Bit Score: 94.38  E-value: 5.94e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  184 RVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR---EKGRMRFHKLQNVQIALDYLRHRQVKLV 260
Cdd:cd21309     16 KIQQNTFTRWCNEHLKCVNKRIGNLQTDLSDGLRLIALLEVLSQKRMYRkyhQRPTFRQMQLENVSVALEFLDRESIKLV 95
                           90       100
                   ....*....|....*....|....*....
gi 1920237946  261 NIRNDDIADGNPKLTLGLIWTIILHFQIS 289
Cdd:cd21309     96 SIDSKAIVDGNLKLILGLVWTLILHYSIS 124
CH_CLMN_rpt2 cd21245
second calponin homology (CH) domain found in calmin and similar proteins; Calmin, also called ...
301-405 6.93e-22

second calponin homology (CH) domain found in calmin and similar proteins; Calmin, also called calponin-like transmembrane domain protein, is a protein with calponin homology (CH) and transmembrane domains expressed in maturing spermatogenic cells. It may be involved in the development and/or maintenance of neuronal functions. Calmin contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409094  Cd Length: 106  Bit Score: 93.32  E-value: 6.93e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  301 TAKEKLLLWSQRMVEGcQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 380
Cdd:cd21245      3 KAIKALLNWVQRRTRK-YGVAVQDFGSSWRSGLAFLALIKAIDPSLVDMRQALEKSPRENLEDAFRIAQESLGIPPLLEP 81
                           90       100
                   ....*....|....*....|....*
gi 1920237946  381 EDVDVPQPDEKSIITYVSSLYDAMP 405
Cdd:cd21245     82 EDVMVDSPDEQSIMTYVAQFLEHFP 106
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1223-2027 2.95e-21

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 103.21  E-value: 2.95e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1223 STQEAEEVLRAHEEQLKEAQAvpatlpELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVER 1302
Cdd:TIGR02168  247 ELKEAEEELEELTAELQELEE------KLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEE 320
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1303 WRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLE 1382
Cdd:TIGR02168  321 LEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNN 400
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1383 DIERhgekveecqrfAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLTSQyirf 1462
Cdd:TIGR02168  401 EIER-----------LEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEE---- 465
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1463 ISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQE-EVARREEVAVEAQ--- 1538
Cdd:TIGR02168  466 LREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSELiSVDEGYEAAIEAAlgg 545
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1539 -------EQKRSIQEELQHLRQSSE------AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALR 1605
Cdd:TIGR02168  546 rlqavvvENLNAAKKAIAFLKQNELgrvtflPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGV 625
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1606 ARAEEAEAQKRQAQEEAERLR-------------------RQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEE 1666
Cdd:TIGR02168  626 LVVDDLDNALELAKKLRPGYRivtldgdlvrpggvitggsAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRK 705
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1667 LRLQAEEAERRLRQAEAERARQVqvaleTAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQqqaea 1746
Cdd:TIGR02168  706 ELEELEEELEQLRKELEELSRQI-----SALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEE----- 775
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1747 eraraeaerelerwQLKANEALRLRLQAeEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELaEQELEKQRQL 1826
Cdd:TIGR02168  776 --------------ELAEAEAEIEELEA-QIEQLKEELKALREALDELRAELTLLNEEAANLRERLESL-ERRIAATERR 839
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1827 AEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLL--ASKARAE-EE 1903
Cdd:TIGR02168  840 LEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELEskRSELRRElEE 919
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1904 SRSTSEKSKQRLEAEAGRFRELAEE-AARLRALAEEAKRQRQLAEEDAVRQRAEAERvLAEKLAAISEATRLkteaeiAL 1982
Cdd:TIGR02168  920 LREKLAQLELRLEGLEVRIDNLQERlSEEYSLTLEEAEALENKIEDDEEEARRRLKR-LENKIKELGPVNLA------AI 992
                          810       820       830       840
                   ....*....|....*....|....*....|....*....|....*
gi 1920237946 1983 KEKEAENERLRRLaedeafqrrlleeqaAQHKADIEARLAQLRKA 2027
Cdd:TIGR02168  993 EEYEELKERYDFL---------------TAQKEDLTEAKETLEEA 1022
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
2124-2743 1.41e-20

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 100.78  E-value: 1.41e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2124 EEAARQRKAALEEVERLKAKVEEARR----LRERAEQ-ESARQLQlaqeAAQKRLQAEEKAHAFAVQQKEqelqqtlqqe 2198
Cdd:COG1196    175 EEAERKLEATEENLERLEDILGELERqlepLERQAEKaERYRELK----EELKELEAELLLLKLRELEAE---------- 240
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2199 qsvLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQsaeeqaqaqaqaqAAAEKLRKEAEQEAARRAQA 2278
Cdd:COG1196    241 ---LEELEAELEELEAELEELEAELAELEAELEELRLELEELELELE-------------EAQAEEYELLAELARLEQDI 304
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2279 EQAALRQKQAADAEMEKHKQfAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFS 2358
Cdd:COG1196    305 ARLEERRRELEERLEELEEE-LAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEE 383
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2359 LRVQMEELgklkARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKML 2438
Cdd:COG1196    384 LAEELLEA----LRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEE 459
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2439 KEKmQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAER---LRLRV 2515
Cdd:COG1196    460 ALL-ELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIgveAAYEA 538
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2516 AEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLL 2595
Cdd:COG1196    539 ALEAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVL 618
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2596 Q--LKSEEMQTVRQEQLLQETQALQQSFLS---EKDSLLQRERCIEQEKAKLEQlfqdEVAKAQALREEQQRQQQQMQQE 2670
Cdd:COG1196    619 GdtLLGRTLVAARLEAALRRAVTLAGRLREvtlEGEGGSAGGSLTGGSRRELLA----ALLEAEAELEELAERLAEEELE 694
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237946 2671 KQQLAASMEEARRRQHEAEEGVRRQQEELQRLAqqqqqqEKLLAEENQRLRERLQHLEEERRAALARSEEIAP 2743
Cdd:COG1196    695 LEEALLAEEEEERELAEAEEERLEEELEEEALE------EQLEAEREELLEELLEEEELLEEEALEELPEPPD 761
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1225-2025 1.92e-20

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 100.52  E-value: 1.92e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1225 QEAEEVLRAHEEQLKEAQAVpatlpeLEATKAALKKLRAQAEAQQPvFDALRDELRgaqevgerlqqrHGERDVEVERWR 1304
Cdd:TIGR02168  175 KETERKLERTRENLDRLEDI------LNELERQLKSLERQAEKAER-YKELKAELR------------ELELALLVLRLE 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1305 ErvtlLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVRE---QLRQEKALL 1381
Cdd:TIGR02168  236 E----LREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRleqQKQILRERL 311
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1382 EDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPV-----ASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLT 1456
Cdd:TIGR02168  312 ANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEElesleAELEELEAELEELESRLEELEEQLETLRSKVAQL 391
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1457 SQYIRFISETLRRMEEE-ERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAV 1535
Cdd:TIGR02168  392 ELQIASLNNEIERLEARlERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELE 471
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1536 EAQEQKRSIQEELQHLR------QSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAE----GELQALR 1605
Cdd:TIGR02168  472 EAEQALDAAERELAQLQarldslERLQENLEGFSEGVKALLKNQSGLSGILGVLSELISVDEGYEAAIEaalgGRLQAVV 551
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1606 ARAEEAEAQKRQAQEEAERLRR----------QVQDETQRKRQAEAELALRVQAEAEAAREK-QRALQALEELRLQAEEA 1674
Cdd:TIGR02168  552 VENLNAAKKAIAFLKQNELGRVtflpldsikgTEIQGNDREILKNIEGFLGVAKDLVKFDPKlRKALSYLLGGVLVVDDL 631
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1675 ERRLRQAEAERARQVQVALE---------TAQRSAEAEL-----QSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRA 1740
Cdd:TIGR02168  632 DNALELAKKLRPGYRIVTLDgdlvrpggvITGGSAKTNSsilerRREIEELEEKIEELEEKIAELEKALAELRKELEELE 711
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1741 QQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQkeeaerearrrgkaeeqavrqRELAEQEL 1820
Cdd:TIGR02168  712 EELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAE---------------------IEELEERL 770
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1821 EKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARA 1900
Cdd:TIGR02168  771 EEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEEL 850
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1901 EEEsRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEI 1980
Cdd:TIGR02168  851 SED-IESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL 929
                          810       820       830       840
                   ....*....|....*....|....*....|....*....|....*.
gi 1920237946 1981 ALKEKEAENERLR-RLAEDEafqrRLLEEQAAQHKADIEARLAQLR 2025
Cdd:TIGR02168  930 RLEGLEVRIDNLQeRLSEEY----SLTLEEAEALENKIEDDEEEAR 971
growth_prot_Scy NF041483
polarized growth protein Scy;
1471-2207 3.88e-20

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 99.52  E-value: 3.88e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1471 EEEERLAEQQRAEERERLAEVEAALEKQRQLAEahaQAKAQAEREAQGLQRRMQEEVARreeVAVEAQEQKRSIQEELQh 1550
Cdd:NF041483   560 EETERAIAARQAEAAEELTRLHTEAEERLTAAE---EALADARAEAERIRREAAEETER---LRTEAAERIRTLQAQAE- 632
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1551 lrqsSEAEiqaKARQVEAAERSRLRIEEEIRVVRLQLEA-TERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQV 1629
Cdd:NF041483   633 ----QEAE---RLRTEAAADASAARAEGENVAVRLRSEAaAEAERLKSEAQESADRVRAEAAAAAERVGTEAAEALAAAQ 705
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1630 QDETQRKRQAEAELAlrvQAEAEAAREKQRALQALEELrlqAEEAERRLRQAEAERARQVqvalETAQRSAeaelqSEHA 1709
Cdd:NF041483   706 EEAARRRREAEETLG---SARAEADQERERAREQSEEL---LASARKRVEEAQAEAQRLV----EEADRRA-----TELV 770
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1710 SFAEKTAQLERTlkeehvAVVQLREEATRraqqqaeaeraraeaerelerwqlkanEALRLRLQAEEVAQQksLTQAEAE 1789
Cdd:NF041483   771 SAAEQTAQQVRD------SVAGLQEQAEE---------------------------EIAGLRSAAEHAAER--TRTEAQE 815
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1790 KQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQrlaAEQELIRLRAETEQGEQQ--------------- 1854
Cdd:NF041483   816 EADRVRSDAYAERERASEDANRLRREAQEETEAAKALAERTVSE---AIAEAERLRSDASEYAQRvrteasdtlasaeqd 892
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1855 ----RQLLEEELARLQREAAA-----ATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfrel 1925
Cdd:NF041483   893 aartRADAREDANRIRSDAAAqadrlIGEATSEAERLTAEARAEAERLRDEARAEAERVRADAAAQAEQLIAEA------ 966
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1926 AEEAARLRALAEEAKRQrqlAEEDAVRQRAEAERVLAEklaAISEATRLKTEAEialkekeAENERLRRLAEDEAFQRR- 2004
Cdd:NF041483   967 TGEAERLRAEAAETVGS---AQQHAERIRTEAERVKAE---AAAEAERLRTEAR-------EEADRTLDEARKDANKRRs 1033
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2005 ---------LLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELG 2075
Cdd:NF041483  1034 eaaeqadtlITEAAAEADQLTAKAQEEALRTTTEAEAQADTMVGAARKEAERIVAEATVEGNSLVEKARTDADELLVGAR 1113
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2076 R----IRGTAEDtLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLK--------AK 2143
Cdd:NF041483  1114 RdataIRERAEE-LRDRITGEIEELHERARRESAEQMKSAGERCDALVKAAEEQLAEAEAKAKELVSDANseaskvriAA 1192
                          730       740       750       760       770       780       790
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2144 VEEARRLRERAEQESARQLQLAQ------EAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRS 2207
Cdd:NF041483  1193 VKKAEGLLKEAEQKKAELVREAEkikaeaEAEAKRTVEEGKRELDVLVRRREDINAEISRVQDVLEALES 1262
CH_MICAL2_3-like cd21195
calponin homology (CH) domain found in molecule interacting with CasL protein 2 (MICAL-2), ...
305-402 1.58e-19

calponin homology (CH) domain found in molecule interacting with CasL protein 2 (MICAL-2), MICAL-3, and similar proteins; Molecule interacting with CasL protein (MICAL) is a large, multidomain, cytosolic protein with a single LIM domain, a calponin homology (CH) domain and a flavoprotein monooxygenase (MO) domain. In Drosophila, MICAL is expressed in axons, interacts with the neuronal plexin A (PlexA) receptor and is required for Semaphorin 1a (Sema-1a)-PlexA-mediated repulsive axon guidance. The LIM and CH domains mediate interactions with the cytoskeleton, cytoskeletal adaptor proteins, and other signaling proteins. The flavoprotein MO is required for semaphorin-plexin repulsive axon guidance during axonal pathfinding in the Drosophila neuromuscular system. In addition, MICAL interacts with Rab13 and Rab8 to coordinate the assembly of tight junctions and adherens junctions in epithelial cells. Thus, MICAL is also called junctional Rab13-binding protein (JRAB). Members of this family, which includes MICAL-2, MICAL-3, and similar proteins, contain one CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409044 [Multi-domain]  Cd Length: 110  Bit Score: 87.02  E-value: 1.58e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  305 KLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD-PEDV 383
Cdd:cd21195      8 KLLTWCQQQTEGYQHVNVTDLTTSWRSGLALCAIIHRFRPELINFDSLNEDDAVENNQLAFDVAEREFGIPPVTTgKEMA 87
                           90
                   ....*....|....*....
gi 1920237946  384 DVPQPDEKSIITYVSSLYD 402
Cdd:cd21195     88 SAQEPDKLSMVMYLSKFYE 106
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
2295-2742 2.66e-19

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 96.54  E-value: 2.66e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2295 KHKQFAEQALRQKAQVEQELTALRLQLEEtdhqksiLDEELQRLKAEVtEAARQRGQVEEELFSLRVQmeelgklkarie 2374
Cdd:COG1196    169 KYKERKEEAERKLEATEENLERLEDILGE-------LERQLEPLERQA-EKAERYRELKEELKELEAE------------ 228
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2375 aenraLVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAE 2454
Cdd:COG1196    229 -----LLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQD 303
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2455 AELLQQQ-KELAQEQAR------RLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEE 2527
Cdd:COG1196    304 IARLEERrRELEERLEEleeelaELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEE 383
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2528 DARR-FRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVR 2606
Cdd:COG1196    384 LAEElLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLE 463
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2607 QEQLLQETQALQQSFLSEKDSLLQRERcieQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQH 2686
Cdd:COG1196    464 LLAELLEEAALLEAALAELLEELAEAA---ARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAAL 540
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 2687 EAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLrERLQHLEEERRAALARSEEIA 2742
Cdd:COG1196    541 EAALAAALQNIVVEDDEVAAAAIEYLKAAKAGRA-TFLPLDKIRARAALAAALARG 595
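Each hit on this page carries a one-line summary giving the Pssm-ID, an optional [Multi-domain] flag, the Cd Length, the Bit Score, and the E-value. As a minimal sketch, assuming every summary line follows the layout seen here, such a line could be read into a record as follows. The function name parse_summary and the field names are hypothetical and are not part of any NCBI tool.

```python
import re

# Hypothetical pattern for the per-hit summary line as it appears on this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" flag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return a dict of the summary fields, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


if __name__ == "__main__":
    # Summary line of the Smc COG1196 hit, copied from this page.
    line = "Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 96.54  E-value: 2.66e-19"
    print(parse_summary(line))
```

Collecting these records for all hits would allow sorting or filtering by E-value without re-reading the alignment blocks.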
CH_NAV2-like cd21212
calponin homology (CH) domain found in neuron navigator (NAV) 2, NAV3, and similar proteins; ...
186-286 1.04e-18

calponin homology (CH) domain found in neuron navigator (NAV) 2, NAV3, and similar proteins; This family includes neuron navigator 2 (NAV2) and NAV3, both of which contain a single copy of the CH domain at the N-terminus. CH domains are actin filament (F-actin) binding motifs. NAV2, also called helicase APC down-regulated 1 (HELAD1), pore membrane and/or filament-interacting-like protein 2 (POMFIL2), retinoic acid inducible in neuroblastoma 1 (RAINB1), Steerin-2 (STEERIN2), or Unc-53 homolog 2 (unc53H2), possesses 3' to 5' helicase activity and exonuclease activity. It is involved in neuronal development, specifically in the development of different sensory organs. NAV3, also called pore membrane and/or filament-interacting-like protein 1 (POMFIL1), Steerin-3 (STEERIN3), or Unc-53 homolog 3 (unc53H3), may regulate IL2 production by T-cells. It may be involved in neuron regeneration.


Pssm-ID: 409061  Cd Length: 105  Bit Score: 84.17  E-value: 1.04e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  186 QKKTFTKWVNKHLIKA--QRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGR--MRFHKLQNVQIALDYLRHRQVKLVN 261
Cdd:cd21212      1 EIEIYTDWANHYLEKGghKRIITDLQKDLGDGLTLVNLIEAVAGEKVPGIHSRpkTRAQKLENIQACLQFLAALGVDVQG 80
                           90       100
                   ....*....|....*....|....*
gi 1920237946  262 IRNDDIADGNPKLTLGLIWTIILHF 286
Cdd:cd21212     81 ITAEDIVDGNLKAILGLFFSLSRYK 105
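Every alignment row above gives a label, a start coordinate, the aligned residues, and an end coordinate. A rough sketch of a sanity check on such rows follows, assuming that "-" is the only gap character and that every letter (upper or lower case) counts as one residue of that sequence. The names parse_row and row_is_consistent are hypothetical.

```python
import re

# Hypothetical pattern for one alignment row: label, start, residues, end.
ROW_RE = re.compile(
    r"^(?P<label>gi \d+|Cdd:\S+)\s+(?P<start>\d+)\s+(?P<seq>[A-Za-z-]+)\s+(?P<end>\d+)\s*$"
)


def parse_row(line):
    """Split an alignment row into (label, start, seq, end); None if it is not a row."""
    m = ROW_RE.match(line.strip())
    if m is None:
        return None
    return m.group("label"), int(m.group("start")), m.group("seq"), int(m.group("end"))


def row_is_consistent(start, seq, end):
    """True when the number of non-gap characters equals end - start + 1."""
    residues = sum(1 for c in seq if c != "-")
    return residues == end - start + 1


if __name__ == "__main__":
    # Last query row of the CH_NAV2-like hit, copied from this page.
    row = parse_row("gi 1920237946  262 IRNDDIADGNPKLTLGLIWTIILHF 286")
    print(row, row_is_consistent(row[1], row[2], row[3]))
```

A row that fails this check usually points to text lost or merged during extraction rather than to a problem in the alignment itself.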
CH_EHBP1L1 cd21255
calponin homology (CH) domain found in EH domain-binding protein 1-like protein 1 and similar ...
301-400 1.65e-18

calponin homology (CH) domain found in EH domain-binding protein 1-like protein 1 and similar proteins; EHBP1L1 may act as a Rab effector protein and play a role in vesicle trafficking. It coordinates Rab8 and Bin1 to regulate apical-directed transport in polarized epithelial cells. Members of this subfamily contain a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409104  Cd Length: 105  Bit Score: 83.68  E-value: 1.65e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  301 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLLDP 380
Cdd:cd21255      1 SSSQSLLEWCQEVTAGYRGVRVTNFTTSWRNGLAFCAILHHFHPDLVDYESLDPLDIKENNKKAFEAFAS-LGVPRLLEP 79
                           90       100
                   ....*....|....*....|.
gi 1920237946  381 ED-VDVPQPDEKSIITYVSSL 400
Cdd:cd21255     80 ADmVLLPIPDKLIVMTYLCQL 100
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1465-2053 1.80e-18



Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 94.21  E-value: 1.80e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1465 ETLRRMEEEERLAEQQR------AEERERLAEVEAALEKQRQLAEAHAQAKAQAERE-AQGLQRRMQEEVARREEVAVEA 1537
Cdd:COG4913    235 DDLERAHEALEDAREQIellepiRELAERYAAARERLAELEYLRAALRLWFAQRRLElLEAELEELRAELARLEAELERL 314
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1538 QEQKRSIQEELQHLRQ-----------SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATE-----------RQRG 1595
Cdd:COG4913    315 EARLDALREELDELEAqirgnggdrleQLEREIERLERELEERERRRARLEALLAALGLPLPASAeefaalraeaaALLE 394
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1596 GAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELrLQAEEAE 1675
Cdd:COG4913    395 ALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIPARLLALRDALAEALGLDEAELPFVGEL-IEVRPEE 473
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1676 RRLRQAeAERarqvqvALETAQRSaeaeLQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAER 1755
Cdd:COG4913    474 ERWRGA-IER------VLGGFALT----LLVPPEHYAAALRWVNRLHLRGRLVYERVRTGLPDPERPRLDPDSLAGKLDF 542
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1756 ELERWQLKANEALRLR---LQAEEVAQ----QKSLTQAEAEKQKEEAEREARRRGKAEE-----QAVRQRELAEQELEK- 1822
Cdd:COG4913    543 KPHPFRAWLEAELGRRfdyVCVDSPEElrrhPRAITRAGQVKGNGTRHEKDDRRRIRSRyvlgfDNRAKLAALEAELAEl 622
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1823 QRQLAEGTAQ-QRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEA---ELAKVRAEMEVLLASKA 1898
Cdd:COG4913    623 EEELAEAEERlEALEAELDALQERREALQRLAEYSWDEIDVASAEREIAELEAELERLDAssdDLAALEEQLEELEAELE 702
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1899 RAEEESRSTSEKsKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRlKTEA 1978
Cdd:COG4913    703 ELEEELDELKGE-IGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRENLEERID-ALRA 780
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1979 EIALKEKEAENERLRRLAEDEAFQ-------------RRLLEEQAAQHKADIEARLAQLRKasESELERQKGLVEDTLRQ 2045
Cdd:COG4913    781 RLNRAEEELERAMRAFNREWPAETadldadleslpeyLALLDRLEEDGLPEYEERFKELLN--ENSIEFVADLLSKLRRA 858

                   ....*...
gi 1920237946 2046 RRQVEEEI 2053
Cdd:COG4913    859 IREIKERI 866
CH_SF cd00014
calponin homology (CH) domain superfamily; CH domains are actin filament (F-actin) binding ...
187-284 3.40e-18

calponin homology (CH) domain superfamily; CH domains are actin filament (F-actin) binding motifs, which may be present as a single copy or in tandem repeats (which increase binding affinity). They either function as autonomous actin binding motifs or serve a regulatory function. CH domains are found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, as well as proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).


Pssm-ID: 409031 [Multi-domain]  Cd Length: 103  Bit Score: 82.77  E-value: 3.40e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  187 KKTFTKWVNKHL-IKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPRE--KGRMRFHKLQNVQIALDYLRHRQV-KLVNI 262
Cdd:cd00014      1 EEELLKWINEVLgEELPVSITDLFESLRDGVLLCKLINKLSPGSIPKInkKPKSPFKKRENINLFLNACKKLGLpELDLF 80
                           90       100
                   ....*....|....*....|...
gi 1920237946  263 RNDDI-ADGNPKLTLGLIWTIIL 284
Cdd:cd00014     81 EPEDLyEKGNLKKVLGTLWALAL 103
SH3_10 pfam17902
SH3 domain; This entry represents an SH3 domain.
919-985 3.49e-18



Pssm-ID: 407754  Cd Length: 65  Bit Score: 81.15  E-value: 3.49e-18
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237946  919 QLKPRSpaHPMRGRVPLLAVCDYKQVEVTVHKGDECQMVGPAQPFYWKVLGSSCSEAAMPSVCFLVP 985
Cdd:pfam17902    1 PLKQRR--SPVTRPIPVKALCDYKQGEVTVEKGEECTLLDNSDREKWKVQTSSGVEKLVPSVCFLIP 65
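The paired rows above can be compared column by column. The sketch below counts identical residues, assuming that lowercase letters mark residues left unaligned and that "-" marks a gap; this is a reading of the display convention, not something stated on this page. The function name count_identities is hypothetical, and the result is a naive count, not the Bit Score or E-value reported for the hit.

```python
def count_identities(query_row, subject_row):
    """Return (identical, aligned) over columns where both rows hold an aligned residue.

    Columns containing a gap ('-') or a lowercase (assumed unaligned) residue
    in either row are skipped.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("rows must have the same number of columns")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


if __name__ == "__main__":
    # Rows of the SH3_10 (pfam17902) hit, copied from this page.
    query = "QLKPRSpaHPMRGRVPLLAVCDYKQVEVTVHKGDECQMVGPAQPFYWKVLGSSCSEAAMPSVCFLVP"
    model = "PLKQRR--SPVTRPIPVKALCDYKQGEVTVEKGEECTLLDNSDREKWKVQTSSGVEKLVPSVCFLIP"
    identical, aligned = count_identities(query, model)
    print(f"{identical} identical of {aligned} aligned columns")
```

For hits split across several 80-column blocks, the rows of each block would need to be concatenated per sequence before calling the function.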
CH_MICAL3 cd21251
calponin homology (CH) domain found in molecule interacting with CasL protein 3; MICAL-3 is a ...
297-402 3.55e-18

calponin homology (CH) domain found in molecule interacting with CasL protein 3; MICAL-3 is a [F-actin]-monooxygenase that promotes depolymerization of F-actin by mediating oxidation of specific methionine residues on actin to form methionine-sulfoxide, resulting in actin filament disassembly and preventing repolymerization. In the absence of actin, it also functions as a NADPH oxidase producing H(2)O(2). MICAL-3 seems to act as a Rab effector protein and plays a role in vesicle trafficking. It is involved in exocytic vesicle tethering and fusion. MICAL-3 contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409100 [Multi-domain]  Cd Length: 111  Bit Score: 83.07  E-value: 3.55e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  297 SEDMTAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTR 376
Cdd:cd21251      1 NESVARSSKLLGWCQRQTEGYAGVNVTDLTMSWKSGLALCAIIHRYRPDLIDFDSLDEQDVEKNNQLAFDIAEKEFGISP 80
                           90       100
                   ....*....|....*....|....*..
gi 1920237946  377 LLDPEDV-DVPQPDEKSIITYVSSLYD 402
Cdd:cd21251     81 IMTGKEMaSVGEPDKLSMVMYLTQFYE 107
CH_SMTN-like cd21200
calponin homology (CH) domain found in the smoothelin family; The smoothelin family includes ...
301-401 4.20e-18

calponin homology (CH) domain found in the smoothelin family; The smoothelin family includes smoothelin and smoothelin-like proteins. Smoothelins are actin-binding cytoskeletal proteins that are abundantly expressed in healthy visceral (smoothelin-A) and vascular (smoothelin-B) smooth muscle. SMTNL1, also called calponin homology-associated smooth muscle protein (CHASM), plays a role in the regulation of contractile properties of both striated and smooth muscles. It can bind to calmodulin and tropomyosin. When it is unphosphorylated, SMTNL1 may inhibit myosin dephosphorylation. SMTNL2 is highly expressed in skeletal muscle and could be associated with differentiating myocytes. Members of this family contain a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409049  Cd Length: 107  Bit Score: 82.78  E-value: 4.20e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  301 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDP 380
Cdd:cd21200      1 SIKQMLLEWCQAKTRGYEHVDITNFSSSWSDGMAFCALIHHFFPDAFDYSSLDPKNRRKNFELAFSTAEELADIAPLLEV 80
                           90       100
                   ....*....|....*....|...
gi 1920237946  381 EDVDV--PQPDEKSIITYVSSLY 401
Cdd:cd21200     81 EDMVRmgNRPDWKCVFTYVQSLY 103
CH smart00033
Calponin homology domain; Actin binding domains present in duplicate at the N-termini of ...
304-400 4.76e-18

Calponin homology domain; Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeast Cdc24p.


Pssm-ID: 214479 [Multi-domain]  Cd Length: 101  Bit Score: 82.36  E-value: 4.76e-18
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946   304 EKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTN----LENLDQAFSVAERDLGVTRLLD 379
Cdd:smart00033    1 KTLLRWVNSLLAEYDKPPVTNFSSDLKDGVALCALLNSLSPGLVDKKKVAASLSrfkkIENINLALSFAEKLGGKVVLFE 80
                            90       100
                    ....*....|....*....|.
gi 1920237946   380 PEDVDVPQPDEKSIITYVSSL 400
Cdd:smart00033   81 PEDLVEGPKLILGVIWTLISL 101
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1463-2034 5.38e-18



Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 92.41  E-value: 5.38e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1463 ISETLRRMEEEERLAEQQR----------AEERERLAEVEAALEKQRQlaeahaqAKAQAEREAQGLQRRMQEEVARREE 1532
Cdd:PRK02224   218 LDEEIERYEEQREQARETRdeadevleehEERREELETLEAEIEDLRE-------TIAETEREREELAEEVRDLRERLEE 290
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1533 vaveaqeqkrsIQEELQHLRQSSE---AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAE 1609
Cdd:PRK02224   291 -----------LEEERDDLLAEAGlddADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAE 359
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1610 EAEAQKRQAQEEAERLRRQVqdETQRKRQAEAELALRVQAEAEAAREKQRalQALEELRLQAEEAERRLRQAEAErarqv 1689
Cdd:PRK02224   360 ELREEAAELESELEEAREAV--EDRREEIEELEEEIEELRERFGDAPVDL--GNAEDFLEELREERDELREREAE----- 430
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1690 qvaLETAQRSAEAELQSEHASFAE-KTAQLERTLKE-EHVAVVQLREEatrraqqqaeaeraraeaerelerwQLKANEA 1767
Cdd:PRK02224   431 ---LEATLRTARERVEEAEALLEAgKCPECGQPVEGsPHVETIEEDRE-------------------------RVEELEA 482
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1768 LRLRLQAEEVAQQKSLTQAEAEKqkeeaerearrrgKAEEQAVR---QRELAEQELEKQRQLAEGTAQQRLAAEQELIRL 1844
Cdd:PRK02224   483 ELEDLEEEVEEVEERLERAEDLV-------------EAEDRIERleeRREDLEELIAERRETIEEKRERAEELRERAAEL 549
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1845 RAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAeMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRE 1924
Cdd:PRK02224   550 EAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLER-IRTLLAAIADAEDEIERLREKREALAELNDERRER 628
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1925 LAEEAARLRALAEEAKRQRqLAEEDAVRQRAEA--ERVlAEKLAAISEAtRLKTEAEIALKEKEAEN-ERLR-RLAEDEA 2000
Cdd:PRK02224   629 LAEKRERKRELEAEFDEAR-IEEAREDKERAEEylEQV-EEKLDELREE-RDDLQAEIGAVENELEElEELReRREALEN 705
                          570       580       590
                   ....*....|....*....|....*....|....*.
gi 1920237946 2001 FQRRL--LEEQAAQHKADIEARLAQLRKASESELER 2034
Cdd:PRK02224   706 RVEALeaLYDEAEELESMYGDLRAELRQRNVETLER 741
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1523-2428 6.95e-18

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 92.34  E-value: 6.95e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1523 MQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERsrlriEEEIRVVRLQLEATERQRGGAEGELQ 1602
Cdd:pfam02463  158 IEEEAAGSRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQA-----KKALEYYQLKEKLELEEEYLLYLDYL 232
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1603 ALRARAEEAEAQKRQAQEEAERLRRQVQDetqrKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAE 1682
Cdd:pfam02463  233 KLNEERIDLLQELLRDEQEEIESSKQEIE----KEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERR 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1683 AERARQVQVALETAQRSAEAELQSEHASFAEKtaqleRTLKEEHVAVVQLREEatrraqqqaeaeraraeaerelerwql 1762
Cdd:pfam02463  309 KVDDEEKLKESEKEKKKAEKELKKEKEEIEEL-----EKELKELEIKREAEEE--------------------------- 356
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1763 kANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELI 1842
Cdd:pfam02463  357 -EEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEE 435
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1843 RLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEA----- 1917
Cdd:pfam02463  436 EESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKvllal 515
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1918 -EAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLA 1996
Cdd:pfam02463  516 iKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIA 595
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1997 EDEAFQRRLLeeqAAQHKADIEARLAQ---------LRKASESELERQKGLVEDTLRQRRQVEEEILALKG---SFEKAA 2064
Cdd:pfam02463  596 VLEIDPILNL---AQLDKATLEADEDDkrakvvegiLKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEvkaSLSELT 672
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2065 AGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKV 2144
Cdd:pfam02463  673 KELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEE 752
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2145 EEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAhafAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARE 2224
Cdd:pfam02463  753 EKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLK---VEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEK 829
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2225 RAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQaadAEMEKHKQFAEQAL 2304
Cdd:pfam02463  830 IKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKE---EKEKEEKKELEEES 906
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2305 RQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEvteaarQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRD 2384
Cdd:pfam02463  907 QKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLE------EADEKEKEENNKEEEEERNKRLLLAKEELGKVNLMAI 980
                          890       900       910       920
                   ....*....|....*....|....*....|....*....|....
gi 1920237946 2385 KDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLA 2428
Cdd:pfam02463  981 EEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQRLKEFLE 1024
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1092-2001 7.89e-18

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 92.05  E-value: 7.89e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1092 QQLLQSLEQGEQEESRCQRCISELKDiRLQLEACETRTVHRLRLPLDKEPARECAQRITEQQKAQAEVDGLGKGVARLSA 1171
Cdd:TIGR02169  173 EKALEELEEVEENIERLDLIIDEKRQ-QLERLRREREKAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEE 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1172 EAEKVLALpepspaaptlRSELELTLGKLEQVRSLSAIYLEKL---------KTISLVIRSTQEAEEVLRAHEEQLKEAQ 1242
Cdd:TIGR02169  252 ELEKLTEE----------ISELEKRLEEIEQLLEELNKKIKDLgeeeqlrvkEKIGELEAEIASLERSIAEKERELEDAE 321
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1243 AVPATL-PELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRERvtlllerwqavlaqT 1321
Cdd:TIGR02169  322 ERLAKLeAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDE--------------L 387
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1322 DVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAvplansqavreqlrqekalleDIERHGEKVEECQRFAKQY 1401
Cdd:TIGR02169  388 KDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNA---------------------AIAGIEAKINELEEEKEDK 446
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1402 INAIKDYELQLVTYKAQLepvaspakkpkvqsgsESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQR 1481
Cdd:TIGR02169  447 ALEIKKQEWKLEQLAADL----------------SKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGR 510
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1482 AEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGlqrRMQEEVARREEVAVEAQEQKRS-------------IQEEL 1548
Cdd:TIGR02169  511 AVEEVLKASIQGVHGTVAQLGSVGERYATAIEVAAGN---RLNNVVVEDDAVAKEAIELLKRrkagratflplnkMRDER 587
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1549 QHLRQSSEA--------------EIQAKARQV-------EAAERSRlRIEEEIRVVRLQLEATERQ---RGGA-EGELQA 1603
Cdd:TIGR02169  588 RDLSILSEDgvigfavdlvefdpKYEPAFKYVfgdtlvvEDIEAAR-RLMGKYRMVTLEGELFEKSgamTGGSrAPRGGI 666
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1604 LRARAEEAEAQkrQAQEEAERLRRQVQDETQRKRQAEAELalrvqaeAEAAREKQRALQALEELRLQAEEAERRlRQAEA 1683
Cdd:TIGR02169  667 LFSRSEPAELQ--RLRERLEGLKRELSSLQSELRRIENRL-------DELSQELSDASRKIGEIEKEIEQLEQE-EEKLK 736
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1684 ERARQVQVALETAQRSAEAElQSEHASFAEKTAQLERTLKEEHVAVVQLREEatrraqqqaeaeraraeaeRELERWQLK 1763
Cdd:TIGR02169  737 ERLEELEEDLSSLEQEIENV-KSELKELEARIEELEEDLHKLEEALNDLEAR-------------------LSHSRIPEI 796
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1764 ANEalrLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRlAAEQELIR 1843
Cdd:TIGR02169  797 QAE---LSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKE-ELEEELEE 872
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1844 LRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRStsekskqrLEAEAGRFR 1923
Cdd:TIGR02169  873 LEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSE--------IEDPKGEDE 944
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1924 ELAEEAARLRALAEEAKR-QRQLAEEDAVRQRA--EAERVLAEKLAAISEATRLKTEAEiALKEKEAENERLRRLAEDEA 2000
Cdd:TIGR02169  945 EIPEEELSLEDVQAELQRvEEEIRALEPVNMLAiqEYEEVLKRLDELKEKRAKLEEERK-AILERIEEYEKKKREVFMEA 1023

                   .
gi 1920237946 2001 F 2001
Cdd:TIGR02169 1024 F 1024
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1470-2341 8.03e-18

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 91.96  E-value: 8.03e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1470 MEEEERLAEQQRAEERERLAEVEaaLEKQRQLAEAHAQAKAQAEREAQGLqrRMQEEVARREEVAVEAQEQKRSIQEELQ 1549
Cdd:pfam02463  179 IEETENLAELIIDLEELKLQELK--LKEQAKKALEYYQLKEKLELEEEYL--LYLDYLKLNEERIDLLQELLRDEQEEIE 254
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1550 HLRQSSEAEIQakarqvEAAERSRLRIEEE--IRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRR 1627
Cdd:pfam02463  255 SSKQEIEKEEE------KLAQVLKENKEEEkeKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEK 328
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1628 QVQDETQRKRQAEAELAL--RVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERA--RQVQVALETAQRSAEAE 1703
Cdd:pfam02463  329 ELKKEKEEIEELEKELKEleIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAakLKEEELELKSEEEKEAQ 408
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1704 LQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSL 1783
Cdd:pfam02463  409 LLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLEL 488
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1784 TQAEAEKQKEEAEREARRRGKAEEQAVRQRE---LAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEE 1860
Cdd:pfam02463  489 LLSRQKLEERSQKESKARSGLKVLLALIKDGvggRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVR 568
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1861 ELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEaeagrfRELAEEAARLRALAEEAK 1940
Cdd:pfam02463  569 ALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDKRAKV------VEGILKDTELTKLKESAK 642
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1941 RQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEAR 2020
Cdd:pfam02463  643 AKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEEL 722
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2021 LAQLRKaseSELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQ 2100
Cdd:pfam02463  723 LADRVQ---EAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQ 799
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2101 laaeeerrrreaeervQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAH 2180
Cdd:pfam02463  800 ----------------EEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEI 863
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2181 AFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAA 2260
Cdd:pfam02463  864 TKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLE 943
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2261 AEKLRKEAEQEAARRAQAEQAALRQKQAadAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKA 2340
Cdd:pfam02463  944 EADEKEKEENNKEEEEERNKRLLLAKEE--LGKVNLMAIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQRLKE 1021

                   .
gi 1920237946 2341 E 2341
Cdd:pfam02463 1022 F 1022
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1460-2094 1.64e-17

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 90.90  E-value: 1.64e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1460 IRFISETLRRMEEE--------ERLAEQQRAEERERLAEVEAALEK----QRQLAEAHAQaKAQAEREAQGLQRRMQEEV 1527
Cdd:TIGR02169  193 IDEKRQQLERLRRErekaeryqALLKEKREYEGYELLKEKEALERQkeaiERQLASLEEE-LEKLTEEISELEKRLEEIE 271
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1528 ARREEVAVE----AQEQKRSIQEELQHL---RQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRG----- 1595
Cdd:TIGR02169  272 QLLEELNKKikdlGEEEQLRVKEKIGELeaeIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEeerkr 351
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1596 ---------GAEGELQALRARAEEAEAQKR--------------QAQEEAERLRRQVQDETQRKRQAEAELAlRVQAEAE 1652
Cdd:TIGR02169  352 rdklteeyaELKEELEDLRAELEEVDKEFAetrdelkdyrekleKLKREINELKRELDRLQEELQRLSEELA-DLNAAIA 430
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1653 AAREKQRALQA-LEELRLQAEEAERRLRQAEAER--ARQVQVALETAQRSAEAELQS--EHASFAEKTAQLERTLKEEHV 1727
Cdd:TIGR02169  431 GIEAKINELEEeKEDKALEIKKQEWKLEQLAADLskYEQELYDLKEEYDRVEKELSKlqRELAEAEAQARASEERVRGGR 510
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1728 AVVQLREEatrraqQQAEAERARAEAERELERWQLKANEALRLRLQA-----EEVAQQK---------------SLTQAE 1787
Cdd:TIGR02169  511 AVEEVLKA------SIQGVHGTVAQLGSVGERYATAIEVAAGNRLNNvvvedDAVAKEAiellkrrkagratflPLNKMR 584
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1788 AEKQKEEAEREARRRGKA---------EEQAVR---QRELAEQELEKQRQL--------------------------AEG 1829
Cdd:TIGR02169  585 DERRDLSILSEDGVIGFAvdlvefdpkYEPAFKyvfGDTLVVEDIEAARRLmgkyrmvtlegelfeksgamtggsraPRG 664
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1830 TAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLL----ASKARAEEESR 1905
Cdd:TIGR02169  665 GILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEqeeeKLKERLEELEE 744
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1906 STSEKSKQRLEAEAgrfrELAEEAARLRALaEEAKRQRQLAEED--------AVRQRAEAERVLAEKLAAISEATRlktE 1977
Cdd:TIGR02169  745 DLSSLEQEIENVKS----ELKELEARIEEL-EEDLHKLEEALNDlearlshsRIPEIQAELSKLEEEVSRIEARLR---E 816
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1978 AEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKAsESELERQKGLVEDTLRQRRQVEEEILALK 2057
Cdd:TIGR02169  817 IEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEEL-EEELEELEAALRDLESRLGDLKKERDELE 895
                          730       740       750
                   ....*....|....*....|....*....|....*..
gi 1920237946 2058 GSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQE 2094
Cdd:TIGR02169  896 AQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEE 932
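Each hit on this page carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", with an optional "[Multi-domain]" tag. The following is a minimal Python sketch for pulling those fields out of such a line. It assumes the layout seen in the lines on this page; the names HitSummary and parse_hit_summary are hypothetical helpers and not part of any NCBI tool.

```python
import re
from typing import NamedTuple, Optional


class HitSummary(NamedTuple):
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumed layout, taken from the summary lines shown on this page.
_SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<bits>[0-9.]+)\s*"
    r"E-value:\s*(?P<evalue>\S+)"
)


def parse_hit_summary(line: str) -> Optional[HitSummary]:
    """Return the parsed summary, or None if the line is not a summary line."""
    m = _SUMMARY_RE.search(line)
    if m is None:
        return None
    return HitSummary(
        pssm_id=int(m.group("pssm")),
        multi_domain=m.group("multi") is not None,
        cd_length=int(m.group("length")),
        bit_score=float(m.group("bits")),
        e_value=float(m.group("evalue")),
    )


if __name__ == "__main__":
    line = ("Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  "
            "Bit Score: 90.90  E-value: 1.64e-17")
    print(parse_hit_summary(line))
```

Returning None for non-matching lines lets the same function be applied to every line of the page, keeping only the summaries.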
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1250-2176 4.52e-17

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher-order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 89.65  E-value: 4.52e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1250 ELEATKAALKKLRAQAEAQQPVFDALRDELRGaqevgerlqqrhgerdveverwrervtlllerwqavlaqtdvRQRELE 1329
Cdd:pfam02463  167 LKRKKKEALKKLIEETENLAELIIDLEELKLQ------------------------------------------ELKLKE 204
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1330 QLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALL-EDIERHGEKVEECQRFAKQYINAIKDY 1408
Cdd:pfam02463  205 QAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEIESSkQEIEKEEEKLAQVLKENKEEEKEKKLQ 284
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1409 ELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRtryselstltsqyIRFISETLRRMEEEERLAEQQRAEERERL 1488
Cdd:pfam02463  285 EEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKE-------------KKKAEKELKKEKEEIEELEKELKELEIKR 351
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1489 AEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQsseaeiQAKARQVEA 1568
Cdd:pfam02463  352 EAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQ------LEDLLKEEK 425
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1569 AERSRLRIEEEIRVVRLQLEATERQrggaegelqaLRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQ 1648
Cdd:pfam02463  426 KEELEILEEEEESIELKQGKLTEEK----------EELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKL 495
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1649 AEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQ-LERTLKEEHV 1727
Cdd:pfam02463  496 EERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQkLVRALTELPL 575
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1728 AVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGK--A 1805
Cdd:pfam02463  576 GARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVslE 655
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1806 EEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAK 1885
Cdd:pfam02463  656 EGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKIN 735
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1886 VRAEMEVLLASKARAEEESRSTSEKSKQRLEAEagrfRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKL 1965
Cdd:pfam02463  736 EELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSE----LSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELK 811
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1966 AAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDT-LR 2044
Cdd:pfam02463  812 EEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELeSK 891
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2045 QRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAArQRQLAAEEERRRREAEERVQKSLAAEE 2124
Cdd:pfam02463  892 EEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLE-EADEKEKEENNKEEEEERNKRLLLAKE 970
                          890       900       910       920       930
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2125 EAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAE 2176
Cdd:pfam02463  971 ELGKVNLMAIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQRLKEF 1022
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1268-2178 6.75e-17

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains. It is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone of the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 89.08  E-value: 6.75e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1268 QQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTLLLERWQA----------VLAQTDVRQRELEQLGRQLRY 1337
Cdd:pfam01576    3 QEEEMQAKEEELQKVKERQQKAESELKELEKKHQQLCEEKNALQEQLQAetelcaeaeeMRARLAARKQELEEILHELES 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1338 YRESADPLGAWLRDAKQR-QEQIQAVP--LANSQAVREQLRQEKALLE-DIERHGEKVEECQRFAKQYINAIKDYELQLV 1413
Cdd:pfam01576   83 RLEEEEERSQQLQNEKKKmQQHIQDLEeqLDEEEAARQKLQLEKVTTEaKIKKLEEDILLLEDQNSKLSKERKLLEERIS 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1414 TYKAQLEPVASPAKK-PKVQSGSESIIQEyVDLRTRYSElstltsqyirfisETLRRMEEEERLAEQQRAEERERLAEVE 1492
Cdd:pfam01576  163 EFTSNLAEEEEKAKSlSKLKNKHEAMISD-LEERLKKEE-------------KGRQELEKAKRKLEGESTDLQEQIAELQ 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1493 AalekqrQLAEAHAQAkAQAEREAQGLQRRMQEEVARREevavEAQEQKRSIQEELQHLRQSSEAEIQAKARqveaAERS 1572
Cdd:pfam01576  229 A------QIAELRAQL-AKKEEELQAALARLEEETAQKN----NALKKIRELEAQISELQEDLESERAARNK----AEKQ 293
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1573 RLRIEEEIRVVRLQLEATErqrgGAEGELQALRARAE-EAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELalrvqaeA 1651
Cdd:pfam01576  294 RRDLGEELEALKTELEDTL----DTTAAQQELRSKREqEVTELKKALEEETRSHEAQLQEMRQKHTQALEEL-------T 362
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1652 EAAREKQRALQALEELRlQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEktAQLERTLKEEHVAVVQ 1731
Cdd:pfam01576  363 EQLEQAKRNKANLEKAK-QALESENAELQAELRTLQQAKQDSEHKRKKLEGQLQELQARLSE--SERQRAELAEKLSKLQ 439
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1732 LREEATRRAQQQAEAERARAEAERELERWQLKANEALRlrlqAEEVAQQKSLTQAEAEKqkeeaerearrrgkaEEQAVR 1811
Cdd:pfam01576  440 SELESVSSLLNEAEGKNIKLSKDVSSLESQLQDTQELL----QEETRQKLNLSTRLRQL---------------EDERNS 500
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1812 QRELAEQELEKQRQLaegtAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAK------ 1885
Cdd:pfam01576  501 LQEQLEEEEEAKRNV----ERQLSTLQAQLSDMKKKLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKlektkn 576
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1886 -VRAEMEVLLAS-------------------KARAEEESRSTS-EKSKQRLEAEAgrfRELAEEAARLRALAEEAKRQRQ 1944
Cdd:pfam01576  577 rLQQELDDLLVDldhqrqlvsnlekkqkkfdQMLAEEKAISARyAEERDRAEAEA---REKETRALSLARALEEALEAKE 653
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1945 LAEEDAVRQRAEAERVLAEKLAA---ISEATRLKTEAEIALKEKEAENERLR-RLAEDEAFQRRL---LEEQAAQHKADI 2017
Cdd:pfam01576  654 ELERTNKQLRAEMEDLVSSKDDVgknVHELERSKRALEQQVEEMKTQLEELEdELQATEDAKLRLevnMQALKAQFERDL 733
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2018 EARlaqlrkaSESELERQKGLVedtlRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAAR 2097
Cdd:pfam01576  734 QAR-------DEQGEEKRRQLV----KQVRELEAELEDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAVKQLKK 802
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2098 QRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESAR-QLQLAQEAAQKRLQAE 2176
Cdd:pfam01576  803 LQAQMKDLQRELEEARASRDEILAQSKESEKKLKNLEAELLQLQEDLAASERARRQAQQERDElADEIASGASGKSALQD 882

                   ..
gi 1920237946 2177 EK 2178
Cdd:pfam01576  883 EK 884
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1556-2482 9.99e-17

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 88.20  E-value: 9.99e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1556 EAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEgELQALRARAEEAEA-----QKRQAQEEAERLRRQVQ 1630
Cdd:TIGR02169  169 DRKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAE-RYQALLKEKREYEGyellkEKEALERQKEAIERQLA 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1631 DETQRKRQAEAELALRVQAEAEAAR------EKQRALQALEELRLQAEEAE-----RRLRQAEAERARQVQVALETaqrs 1699
Cdd:TIGR02169  248 SLEEELEKLTEEISELEKRLEEIEQlleelnKKIKDLGEEEQLRVKEKIGEleaeiASLERSIAEKERELEDAEER---- 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1700 aEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQaeaeraraeaerelerwqlkanEALRLRLQAEEVAQ 1779
Cdd:TIGR02169  324 -LAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEEL----------------------EDLRAELEEVDKEF 380
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1780 QKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQR-ELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLL 1858
Cdd:TIGR02169  381 AETRDELKDYREKLEKLKREINELKRELDRLQEElQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQL 460
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1859 EEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEE---SRSTSEKSKQRLEAEAGRFRELAEeaarlral 1935
Cdd:TIGR02169  461 AADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERvrgGRAVEEVLKASIQGVHGTVAQLGS-------- 532
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1936 aeeAKRQRQLAEEDAVRQRAEAERVLAEKLAAiseatrlktEAEIALKEKEAENER---LRRLAEDEAFQRRLLEEQAAQ 2012
Cdd:TIGR02169  533 ---VGERYATAIEVAAGNRLNNVVVEDDAVAK---------EAIELLKRRKAGRATflpLNKMRDERRDLSILSEDGVIG 600
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2013 HKADIEARLAQLRKASESELeRQKGLVEDTLRQRRQVEE-EILALKGS-FEKAAA--GKAElelelgRIRGTAEDTLRSK 2088
Cdd:TIGR02169  601 FAVDLVEFDPKYEPAFKYVF-GDTLVVEDIEAARRLMGKyRMVTLEGElFEKSGAmtGGSR------APRGGILFSRSEP 673
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2089 EQAEQEAARqrqlaaeeerrrreaeervqkslaaEEEAARQRKAALEEVERLKAKVEEARRLRERAEQ-----ESARQLQ 2163
Cdd:TIGR02169  674 AELQRLRER-------------------------LEGLKRELSSLQSELRRIENRLDELSQELSDASRkigeiEKEIEQL 728
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2164 LAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRseaeaarraAEEAEAARERAEREAAQSRRQVEEAERL 2243
Cdd:TIGR02169  729 EQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELE---------EDLHKLEEALNDLEARLSHSRIPEIQAE 799
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2244 KQSAEeqaqaqaqaqaaaeKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEE 2323
Cdd:TIGR02169  800 LSKLE--------------EEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEE 865
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2324 tdhqksiLDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRlLQEEAEKMKQVA 2403
Cdd:TIGR02169  866 -------LEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAK-LEALEEELSEIE 937
                          890       900       910       920       930       940       950
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 2404 EEAARLSVAAQEAARLRQLAEEDLAQQRALaEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQ 2482
Cdd:TIGR02169  938 DPKGEDEEIPEEELSLEDVQAELQRVEEEI-RALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKK 1015
CH_EHBP1 cd21254
calponin homology (CH) domain found in EH domain-binding protein 1 and similar proteins; EHBP1 ...
301-400 1.24e-16

calponin homology (CH) domain found in EH domain-binding protein 1 and similar proteins; EHBP1 is a regulator of endocytic recycling and may play a role in actin reorganization by linking clathrin-mediated endocytosis to the actin cytoskeleton. It may act as an effector of small GTPases, including RAB-10 (Rab10), and play a role in vesicle trafficking. EHBP1 is associated with aggressive prostate cancer and insulin-stimulated trafficking and cell migration. Members of this subfamily contain a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409103  Cd Length: 107  Bit Score: 78.36  E-value: 1.24e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  301 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLLDP 380
Cdd:cd21254      1 NASQSLLAWCKEVTKGYRGVKITNFTTSWRNGLAFCAILHHFRPDLIDYKSLNPHDIKENNKKAYDGFAS-LGISRLLEP 79
                           90       100
                   ....*....|....*....|.
gi 1920237946  381 ED-VDVPQPDEKSIITYVSSL 400
Cdd:cd21254     80 SDmVLLAVPDKLTVMTYLYQI 100
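The alignment blocks on this page pair a query row ("gi ...") with a domain-model row ("Cdd:..."), each giving a start coordinate, the aligned residues, and an end coordinate. The Python sketch below parses one such pair, checks that the number of residues in a row matches its coordinates, and counts identical positions. It assumes that "-" marks a gap and that lowercase letters mark residues outside the aligned columns (this is the usual reading of this display, stated here as an assumption); parse_row, coordinates_consistent, and identity are hypothetical helper names.

```python
import re
from typing import NamedTuple, Tuple


class AlignmentRow(NamedTuple):
    label: str   # e.g. "gi 1920237946" or "Cdd:cd21254"
    start: int   # coordinate of the first residue in the row
    seq: str     # aligned residues, with "-" for gaps
    end: int     # coordinate of the last residue in the row


# label, start coordinate, residues, end coordinate, separated by whitespace
_ROW_RE = re.compile(
    r"^(?P<label>\S.*?)\s+(?P<start>\d+)\s+(?P<seq>[A-Za-z-]+)\s+(?P<end>\d+)\s*$"
)


def parse_row(line: str) -> AlignmentRow:
    """Parse one sequence row of an alignment block; ruler lines are rejected."""
    m = _ROW_RE.match(line.strip())
    if m is None:
        raise ValueError(f"not an alignment row: {line!r}")
    return AlignmentRow(m.group("label"), int(m.group("start")),
                        m.group("seq"), int(m.group("end")))


def coordinates_consistent(row: AlignmentRow) -> bool:
    """True if the number of non-gap residues equals end - start + 1."""
    residues = sum(1 for c in row.seq if c != "-")
    return residues == row.end - row.start + 1


def identity(query: AlignmentRow, subject: AlignmentRow) -> Tuple[int, int]:
    """Return (identical, aligned) counts over columns where both rows
    hold an uppercase residue; gaps and lowercase residues are skipped."""
    if len(query.seq) != len(subject.seq):
        raise ValueError("rows have different lengths")
    identical = aligned = 0
    for q, s in zip(query.seq, subject.seq):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


if __name__ == "__main__":
    q = parse_row("gi 1920237946  381 ED-VDVPQPDEKSIITYVSSL 400")
    s = parse_row("Cdd:cd21254     80 SDmVLLAVPDKLTVMTYLYQI 100")
    print(coordinates_consistent(q), coordinates_consistent(s))
    print(identity(q, s))
```

The identity counted this way is a simple per-block tally and is not the bit score or E-value reported for the hit, which come from the position-specific scoring matrix.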
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1047-1735 2.09e-16

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 87.42  E-value: 2.09e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1047 EQRQALRSLELHYQA----FLRDSQDAGgfgPEDRLQAEREYGSCSRHYQQLLQSLEQGEQE----ESRCQRCISELKDI 1118
Cdd:TIGR02168  217 ELKAELRELELALLVlrleELREELEEL---QEELKEAEEELEELTAELQELEEKLEELRLEvselEEEIEELQKELYAL 293
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1119 RLQLEACETRTVH---RLRlPLDKEPARECAQRITEQQK---AQAEVDGLGKGVARLSAEAEkvlALPEPSPAAPTLRSE 1192
Cdd:TIGR02168  294 ANEISRLEQQKQIlreRLA-NLERQLEELEAQLEELESKldeLAEELAELEEKLEELKEELE---SLEAELEELEAELEE 369
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1193 LELTLGKL-EQVRSLSAIYLEKLKTISLvIRSTQEaeeVLRAHEEQLKEAQAVPATlpelEATKAALKKLRAQAEAQQPV 1271
Cdd:TIGR02168  370 LESRLEELeEQLETLRSKVAQLELQIAS-LNNEIE---RLEARLERLEDRRERLQQ----EIEELLKKLEEAELKELQAE 441
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1272 FDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTLL---LERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAW 1348
Cdd:TIGR02168  442 LEELEEELEELQEELERLEEALEELREELEEAEQALDAAereLAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGI 521
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1349 LRDAKQR---------------QEQIQAVPLANSQAVR---EQLRQEK----ALLEDIERHGEKVEECQRFAKQYINAIK 1406
Cdd:TIGR02168  522 LGVLSELisvdegyeaaieaalGGRLQAVVVENLNAAKkaiAFLKQNElgrvTFLPLDSIKGTEIQGNDREILKNIEGFL 601
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1407 DYELQLVTYKAQLEPVASP---------------AKKPKVQSGSESIIQEYVDLRTRYS--------ELSTL-TSQYIRF 1462
Cdd:TIGR02168  602 GVAKDLVKFDPKLRKALSYllggvlvvddldnalELAKKLRPGYRIVTLDGDLVRPGGVitggsaktNSSILeRRREIEE 681
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1463 ISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKR 1542
Cdd:TIGR02168  682 LEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEA 761
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1543 SIQEELQHLRQSSEAEIQAKARQVEAaersrlriEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEA 1622
Cdd:TIGR02168  762 EIEELEERLEEAEEELAEAEAEIEEL--------EAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRI 833
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1623 ERLRRQVQDETQRKRQAEAELAlrvqaeaeaarekqRALQALEELRLQAEEAERRLRQAEAERArQVQVALETAqRSAEA 1702
Cdd:TIGR02168  834 AATERRLEDLEEQIEELSEDIE--------------SLAAEIEELEELIEELESELEALLNERA-SLEEALALL-RSELE 897
                          730       740       750
                   ....*....|....*....|....*....|....*
gi 1920237946 1703 ELQSEHASFAEKTAQLERTLKE--EHVAVVQLREE 1735
Cdd:TIGR02168  898 ELSEELRELESKRSELRRELEElrEKLAQLELRLE 932
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1483-2045 4.62e-16

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 86.12  E-value: 4.62e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1483 EERERLAEVEAALEKQRQLAEAHAQAKaQAEReaqglQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRqsseaeIQAK 1562
Cdd:COG4913    219 EEPDTFEAADALVEHFDDLERAHEALE-DARE-----QIELLEPIRELAERYAAARERLAELEYLRAALR------LWFA 286
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1563 ARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAE-AQKRQAQEEAERLRRQvQDETQRKRQAEA 1641
Cdd:COG4913    287 QRRLELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIRGNGgDRLEQLEREIERLERE-LEERERRRARLE 365
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1642 ELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQS-EH------ASFAEK 1714
Cdd:COG4913    366 ALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASlERrksnipARLLAL 445
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1715 TAQLERTLKEEHVAV------VQLREEAtrraqqqaeaeraraeaerelERWQLKANEAL---RLRL--------QAEEV 1777
Cdd:COG4913    446 RDALAEALGLDEAELpfvgelIEVRPEE---------------------ERWRGAIERVLggfALTLlvppehyaAALRW 504
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1778 AQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEK--QRQLAEGTAQQRLAAEQELIRL-RAETEQG--- 1851
Cdd:COG4913    505 VNRLHLRGRLVYERVRTGLPDPERPRLDPDSLAGKLDFKPHPFRAwlEAELGRRFDYVCVDSPEELRRHpRAITRAGqvk 584
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1852 ------------------------EQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRST 1907
Cdd:COG4913    585 gngtrhekddrrrirsryvlgfdnRAKLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVA 664
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1908 S-EKSKQRLEAEagrFRELAEEAARLRALAEEAKRQRQlAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKE 1986
Cdd:COG4913    665 SaEREIAELEAE---LERLDASSDDLAALEEQLEELEA-ELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAE 740
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 1987 AENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRkaseSELERQKGLVEDTLRQ 2045
Cdd:COG4913    741 DLARLELRALLEERFAAALGDAVERELRENLEERIDALR----ARLNRAEEELERAMRA 795
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
1440-2096 6.15e-16

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 and 1234 amino acids in length. This family contains a P-loop motif, suggesting it is a nucleotide-binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 85.66  E-value: 6.15e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1440 QEYVDLRTRYSELSTLTSQYIRFISETLRRMEE-EERLAE--QQRAEERERLAEVEAALEKQRQLAEAhAQAKAQAEREA 1516
Cdd:pfam12128  248 QEFNTLESAELRLSHLHFGYKSDETLIASRQEErQETSAElnQLLRTLDDQWKEKRDELNGELSAADA-AVAKDRSELEA 326
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1517 QGLQRRMQEEVarREEVAVEAQEQKRSIQEELQHLRQSSEAeIQAKARQVEAA-ERSRLRIEEEIR--VVRL------QL 1587
Cdd:pfam12128  327 LEDQHGAFLDA--DIETAAADQEQLPSWQSELENLEERLKA-LTGKHQDVTAKyNRRRSKIKEQNNrdIAGIkdklakIR 403
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1588 EATERQRGGAEGELQALRAR-AEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEA-----EAAREKQ--- 1658
Cdd:pfam12128  404 EARDRQLAVAEDDLQALESElREQLEAGKLEFNEEEYRLKSRLGELKLRLNQATATPELLLQLENfderiERAREEQeaa 483
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1659 -----RALQALEELRLQAEEAERRLRQAEAeRARQVQVALETAQR-------------SAEAELQSEHASFAEKTAQLER 1720
Cdd:pfam12128  484 naeveRLQSELRQARKRRDQASEALRQASR-RLEERQSALDELELqlfpqagtllhflRKEAPDWEQSIGKVISPELLHR 562
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1721 TLKEEHVAVVQLREEATRRaqqqaeaeraraeaerelerwqlkaneALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREAR 1800
Cdd:pfam12128  563 TDLDPEVWDGSVGGELNLY---------------------------GVKLDLKRIDVPEWAASEEELRERLDKAEEALQS 615
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1801 RRGKAEEQAvRQRELAEQELEKQrQLAEGTAQQRLA-AEQELIRLraeTEQGEQQRQLLEEELARLQREaaaATQKRREL 1879
Cdd:pfam12128  616 AREKQAAAE-EQLVQANGELEKA-SREETFARTALKnARLDLRRL---FDEKQSEKDKKNKALAERKDS---ANERLNSL 687
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1880 EAELAKVraEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQraeaer 1959
Cdd:pfam12128  688 EAQLKQL--DKKHQAWLEEQKEQKREARTEKQAYWQVVEGALDAQLALLKAAIAARRSGAKAELKALETWYKRD------ 759
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1960 vLAEKlaAISEATRLKTEAEIalKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASeSELERQKG-L 2038
Cdd:pfam12128  760 -LASL--GVDPDVIAKLKREI--RTLERKIERIAVRRQEVLRYFDWYQETWLQRRPRLATQLSNIERAI-SELQQQLArL 833
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237946 2039 VEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTlrSKEQAEQEAA 2096
Cdd:pfam12128  834 IADTKLRRAKLEMERKASEKQQVRLSENLRGLRCEMSKLATLKEDA--NSEQAQGSIG 889
CH_MICAL2 cd21250
calponin homology (CH) domain found in molecule interacting with CasL protein 2; MICAL-2 is a ...
305-402 9.75e-16

calponin homology (CH) domain found in molecule interacting with CasL protein 2; MICAL-2 is a nuclear [F-actin]-monooxygenase that promotes depolymerization of F-actin by mediating oxidation of specific methionine residues on actin to form methionine-sulfoxide, resulting in actin filament disassembly and preventing repolymerization. In the absence of actin, it also functions as an NADPH oxidase producing H(2)O(2). MICAL-2 acts as a key regulator of the serum response factor (SRF) signaling pathway elicited by nerve growth factor and serum. It mediates oxidation and subsequent depolymerization of nuclear actin, leading to increased MKL1/MRTF-A presence in the nucleus and promoting SRF:MKL1/MRTF-A-dependent gene transcription. MICAL-2 contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409099 [Multi-domain]  Cd Length: 110  Bit Score: 76.07  E-value: 9.75e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  305 KLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLD-PEDV 383
Cdd:cd21250      8 KLLTWCQKQTEGYQNVNVTDLTTSWKSGLALCAIIHRFRPELIDFDSLNEDDAVKNNQLAFDVAEREFGIPPVTTgKEMA 87
                           90
                   ....*....|....*....
gi 1920237946  384 DVPQPDEKSIITYVSSLYD 402
Cdd:cd21250     88 SAEEPDKLSMVMYLSKFYE 106
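
The per-hit summary line shown with each record (Pssm-ID, optional bracketed hit type, Cd Length, Bit Score, E-value) follows a regular layout in this listing, so it can be read mechanically. The following is a minimal sketch, assuming the field order and labels seen here; `SUMMARY_RE` and `parse_summary` are hypothetical names introduced for illustration and are not part of any NCBI tool.

```python
import re

# Assumption: the layout is inferred only from the summary lines in this listing.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<hit_type>[^\]]+)\])?"      # bracketed tag is absent on some hits
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "hit_type": m.group("hit_type"),      # None when no bracketed tag is present
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


example = "Pssm-ID: 409099 [Multi-domain]  Cd Length: 110  Bit Score: 76.07  E-value: 9.75e-16"
print(parse_summary(example))
```

Returning `None` for non-matching lines lets a caller run the function over every line of the listing and keep only the summary records.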
PTZ00121 PTZ00121
MAEBL; Provisional
2119-2741 1.45e-15

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 84.81  E-value: 1.45e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2119 SLAAEEEAARQRKAALEEVERLKAKVEEARRlRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQE 2198
Cdd:PTZ00121  1075 SYKDFDFDAKEDNRADEATEEAFGKAEEAKK-TETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKR 1153
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2199 QSVLERLrseaeaarraaeeaeaareraereaaQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQA 2278
Cdd:PTZ00121  1154 VEIARKA--------------------------EDARKAEEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARK 1207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2279 EQAALRQKQAADAEMEKHKQFAEQA--LRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVE--E 2354
Cdd:PTZ00121  1208 AEEERKAEEARKAEDAKKAEAVKKAeeAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKkaE 1287
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2355 ElfslRVQMEELGKLKARIEAENRALVLRDKDSAQRLlQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALA 2434
Cdd:PTZ00121  1288 E----KKKADEAKKAEEKKKADEAKKKAEEAKKADEA-KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAA 1362
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2435 EKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQmAQQLaqetqgfqKTLETERQRQLEMSAEAERLRlr 2514
Cdd:PTZ00121  1363 EEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKK-ADEL--------KKAAAAKKKADEAKKKAEEKK-- 1431
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2515 VAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKvmlvqtletQRQQSDRDAERLREAiAELEHEKDKLKQEAQL 2594
Cdd:PTZ00121  1432 KADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAK---------KADEAKKKAEEAKKA-DEAKKKAEEAKKKADE 1501
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2595 LQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALReeqqrqqqqmqqekqql 2674
Cdd:PTZ00121  1502 AKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKK----------------- 1564
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 2675 aaSMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAA--LARSEEI 2741
Cdd:PTZ00121  1565 --KAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAeeLKKAEEE 1631
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1874-2741 1.91e-15

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 84.35  E-value: 1.91e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1874 QKRRELEAELAKVrAEMEVLLAsKARAEEEsrsTSEKSKQRLEAeagRFRELAEEAARLRALAEEAKRQRQLAEEdavRQ 1953
Cdd:TIGR02169  153 VERRKIIDEIAGV-AEFDRKKE-KALEELE---EVEENIERLDL---IIDEKRQQLERLRREREKAERYQALLKE---KR 221
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1954 RAEAERVLAEKLAAisEATRLKTEAEIAlkEKEAENERLRRLAEDeafqrrlLEEQAAQHKADIEARLAQLRKASESELE 2033
Cdd:TIGR02169  222 EYEGYELLKEKEAL--ERQKEAIERQLA--SLEEELEKLTEEISE-------LEKRLEEIEQLLEELNKKIKDLGEEEQL 290
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2034 RQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDtlrSKEQAEQEAARQRQLAAEEERRRREAE 2113
Cdd:TIGR02169  291 RVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEE---LEREIEEERKRRDKLTEEYAELKEELE 367
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2114 ERVQKSLAAEEEAARQR---KAALEEVERLKAKVEE----ARRLRERAEQESARQLQLAQ-----EAAQKRLQAEEKAHA 2181
Cdd:TIGR02169  368 DLRAELEEVDKEFAETRdelKDYREKLEKLKREINElkreLDRLQEELQRLSEELADLNAaiagiEAKINELEEEKEDKA 447
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2182 FAVQQKEQELQQTLQQEQSVLERLRseaeaarraaeeaeaareraerEAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAA 2261
Cdd:TIGR02169  448 LEIKKQEWKLEQLAADLSKYEQELY----------------------DLKEEYDRVEKELSKLQRELAEAEAQARASEER 505
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2262 EKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTAlrlqleetdhQKSIldEELQRLKA- 2340
Cdd:TIGR02169  506 VRGGRAVEEVLKASIQGVHGTVAQLGSVGERYATAIEVAAGNRLNNVVVEDDAVA----------KEAI--ELLKRRKAg 573
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2341 EVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENR-----------ALVLRDKDSAQRLLQ---------EEAEKMK 2400
Cdd:TIGR02169  574 RATFLPLNKMRDERRDLSILSEDGVIGFAVDLVEFDPKyepafkyvfgdTLVVEDIEAARRLMGkyrmvtlegELFEKSG 653
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2401 QVA----EEAARLSVAAQEAARLRQLAEE---------DLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQE 2467
Cdd:TIGR02169  654 AMTggsrAPRGGILFSRSEPAELQRLRERleglkrelsSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEE 733
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2468 QARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARaeEDARRFRKQAEDIGERLYRTE 2547
Cdd:TIGR02169  734 KLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSH--SRIPEIQAELSKLEEEVSRIE 811
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2548 LATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQalqqsfLSEKDS 2627
Cdd:TIGR02169  812 ARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRD------LESRLG 885
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2628 LLQRERC-IEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEG---------VRRQQE 2697
Cdd:TIGR02169  886 DLKKERDeLEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEElsledvqaeLQRVEE 965
                          890       900       910       920
                   ....*....|....*....|....*....|....*....|....*..
gi 1920237946 2698 ELQRLAQQQQQQEKLLAEENQR---LRERLQHLEEERRAALARSEEI 2741
Cdd:TIGR02169  966 EIRALEPVNMLAIQEYEEVLKRldeLKEKRAKLEEERKAILERIEEY 1012
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
2931-2969 2.27e-15

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 72.36  E-value: 2.27e-15
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 2931 LLEAQIATGGIIDPVHSHRLPVDVAYQRGYFDEEMNRVL 2969
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
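
The Plectin repeat hit above is a short, ungapped alignment, which makes it a convenient case for checking how closely the query tracks the domain model. The following is a minimal sketch that counts identical columns between the two rows; `percent_identity` is a hypothetical helper, and skipping gap columns (`-`) is an assumption of this sketch rather than the scoring used by CD-Search. Residue case is compared as-is, so rows containing lowercase residues would need separate handling.

```python
def percent_identity(query, subject):
    """Percent of gap-free aligned columns in which both rows carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    # Keep only columns where neither row has a gap character.
    columns = [(q, s) for q, s in zip(query, subject) if q != "-" and s != "-"]
    if not columns:
        return 0.0
    matches = sum(1 for q, s in columns if q == s)
    return 100.0 * matches / len(columns)


# Rows copied from the Plectin repeat alignment above.
query = "LLEAQIATGGIIDPVHSHRLPVDVAYQRGYFDEEMNRVL"    # gi 1920237946, residues 2931-2969
subject = "LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL"  # pfam00681, positions 1-39

print(f"{percent_identity(query, subject):.1f}% identity over {len(query)} columns")
```

Percent identity is only a rough descriptive measure. The bit score and E-value reported for each hit come from the position-specific scoring model and are not derived from this count.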
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
989-1719 2.49e-15

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 83.95  E-value: 2.49e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  989 QEAQEAIARLEAQHQALVALWHQLHTEMKSLLAWQSLGRDMQLIRSWSLATFRTLKpEEQRQALRSLELHYQAFLRDSQD 1068
Cdd:TIGR02168  291 YALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKL-EELKEELESLEAELEELEAELEE 369
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1069 AggfgpEDRLQA-EREYGSCSRHYQQLLQSLEQGEQEESRCQRCISELKDIRLQLEACETRTVHRLRLPLDKEPARECAQ 1147
Cdd:TIGR02168  370 L-----ESRLEElEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEE 444
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1148 RITEQQKAQAEVDGLGKGVARLSAE-AEKVLALPEPSPAAPTLRSELELTLGKLEQVRSLS---AIYLEKLKTISLVIRS 1223
Cdd:TIGR02168  445 LEEELEELQEELERLEEALEELREElEEAEQALDAAERELAQLQARLDSLERLQENLEGFSegvKALLKNQSGLSGILGV 524
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1224 TQEAEEVlrahEEQLKeaQAVPATLPE---------LEATKAALKKLrAQAEAQQPVFDALrDELRGAQEVGERLQQRHG 1294
Cdd:TIGR02168  525 LSELISV----DEGYE--AAIEAALGGrlqavvvenLNAAKKAIAFL-KQNELGRVTFLPL-DSIKGTEIQGNDREILKN 596
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1295 ERDV-----EVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYR------ESADPLGAWLRDAKQRqeqiQAVP 1363
Cdd:TIGR02168  597 IEGFlgvakDLVKFDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYRivtldgDLVRPGGVITGGSAKT----NSSI 672
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1364 LANSQAVREqLRQEKALLEDIERHGEKveecqrfakqyinAIKDYELQLVTYKAQLEpvaspakkpKVQSGSESIIQEYV 1443
Cdd:TIGR02168  673 LERRREIEE-LEEKIEELEEKIAELEK-------------ALAELRKELEELEEELE---------QLRKELEELSRQIS 729
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1444 DLRTRYSELST---LTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQ 1520
Cdd:TIGR02168  730 ALRKDLARLEAeveQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELR 809
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1521 ---RRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSE---AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQR 1594
Cdd:TIGR02168  810 aelTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEelsEDIESLAAEIEELEELIEELESELEALLNERASLEEAL 889
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1595 GGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELA-LRVQAEAEAAREKQRALQALEELRLQAEE 1673
Cdd:TIGR02168  890 ALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDnLQERLSEEYSLTLEEAEALENKIEDDEEE 969
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*.
gi 1920237946 1674 AERRLRQAEAERARQVQVALEtaqrsAEAELQSEHASFAEKTAQLE 1719
Cdd:TIGR02168  970 ARRRLKRLENKIKELGPVNLA-----AIEEYEELKERYDFLTAQKE 1010
CH_CTX_rpt1 cd21225
first calponin homology (CH) domain found in cortexillin; Cortexillins are actin-bundling ...
183-282 2.53e-15

first calponin homology (CH) domain found in cortexillin; Cortexillins are actin-bundling proteins that play a critical role in regulating cell morphology and actin cytoskeleton reorganization. They play a major role in cytokinesis and contain two copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409074  Cd Length: 111  Bit Score: 74.87  E-value: 2.53e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  183 DRVQKKTFTKWVNKHLIKAQ-RHISDLYEDLRDGHNLISLLEVLSGDSLPRE---KGRMRFHKLQNVQIALDYLRHR-QV 257
Cdd:cd21225      2 EKVQIKAFTAWVNSVLEKRGiPKISDLATDLSDGVRLIFFLELVSGKKFPKKfdlEPKNRIQMIQNLHLAMLFIEEDlKI 81
                           90       100
                   ....*....|....*....|....*
gi 1920237946  258 KLVNIRNDDIADGNPKLTLGLIWTI 282
Cdd:cd21225     82 RVQGIGAEDFVDNNKKLILGLLWTL 106
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1225-1725 3.29e-15

Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 83.17  E-value: 3.29e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1225 QEAEEVLRAHEEQLKEAQAVPATLPELEATKAALKKLRaqaeaqqpvfDALRDELRGAQEVGERLQQRHGERDVEVERWR 1304
Cdd:PRK02224   237 DEADEVLEEHEERREELETLEAEIEDLRETIAETERER----------EELAEEVRDLRERLEELEEERDDLLAEAGLDD 306
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1305 ERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQ---AVPLANSQAVREQLRQEKALL 1381
Cdd:PRK02224   307 ADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELReeaAELESELEEAREAVEDRREEI 386
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1382 EDIERHGEKVEEcqRFAKqyinaikdyelqlvtykaqlepvaSPAKKPKVQSGSESIIQEYVDLRTRYSELSTLtsqyir 1461
Cdd:PRK02224   387 EELEEEIEELRE--RFGD------------------------APVDLGNAEDFLEELREERDELREREAELEAT------ 434
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1462 fISETLRRMEEEERLAEQQR-----------------AEERERLAEVEAALE----KQRQLAEAHAQAK--AQAEREAQG 1518
Cdd:PRK02224   435 -LRTARERVEEAEALLEAGKcpecgqpvegsphvetiEEDRERVEELEAELEdleeEVEEVEERLERAEdlVEAEDRIER 513
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1519 LQRR---MQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRvvRLQLEATERQRG 1595
Cdd:PRK02224   514 LEERredLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNS--KLAELKERIESL 591
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1596 GAEGELQALRARAE---EAEAQKRQAQEEAERLRR-QVQDETQRKRQAEAELAlrvQAEAEAARE-KQRALQALeelrlq 1670
Cdd:PRK02224   592 ERIRTLLAAIADAEdeiERLREKREALAELNDERReRLAEKRERKRELEAEFD---EARIEEAREdKERAEEYL------ 662
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237946 1671 aEEAERRLRQAEAERARqVQVALETAQRSAE--AELQSEHASFAEKTAQLErTLKEE 1725
Cdd:PRK02224   663 -EQVEEKLDELREERDD-LQAEIGAVENELEelEELRERREALENRVEALE-ALYDE 716
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1185-1938 4.30e-15

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 83.09  E-value: 4.30e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1185 AAPTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAHEEQLkeaqavPATLPELEATKAALKKLRAQ 1264
Cdd:TIGR00618  160 AKSKEKKELLMNLFPLDQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCM------PDTYHERKQVLEKELKHLRE 233
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1265 AEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEverwrervtllLERWQAVLAQTDVRQRELEQLGRQLRYYRESAdp 1344
Cdd:TIGR00618  234 ALQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRAR-----------IEELRAQEAVLEETQERINRARKAAPLAAHIK-- 300
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1345 lgAWLRDAKQRQEQIQAvpLANSQAVREQLRQEKALLEDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVAS 1424
Cdd:TIGR00618  301 --AVTQIEQQAQRIHTE--LQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHT 376
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1425 PAKKPKVQSGSESIIQEyvDLRTRYSELSTLTSQYIRFISETLRRMEEEERL--AEQQRAEERERLAEVEAALEKQRQ-- 1500
Cdd:TIGR00618  377 LTQHIHTLQQQKTTLTQ--KLQSLCKELDILQREQATIDTRTSAFRDLQGQLahAKKQQELQQRYAELCAAAITCTAQce 454
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1501 -LAEAHAQAKAQAERE-------AQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEA--EIQAKARQVEAAE 1570
Cdd:TIGR00618  455 kLEKIHLQESAQSLKEreqqlqtKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDidNPGPLTRRMQRGE 534
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1571 RSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRaRAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAE 1650
Cdd:TIGR00618  535 QTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQ-QSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLACE 613
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1651 AEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVV 1730
Cdd:TIGR00618  614 QHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQLT 693
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1731 QLREEATRRAQQQAEAERARAEAERELERWQLkANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREArrrgkaEEQAV 1810
Cdd:TIGR00618  694 YWKEMLAQCQTLLRELETHIEEYDREFNEIEN-ASSSLGSDLAAREDALNQSLKELMHQARTVLKARTE------AHFNN 766
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1811 RQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQR-----QLLEEELARLQREAAAATQKRRELEAELAK 1885
Cdd:TIGR00618  767 NEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEipsdeDILNLQCETLVQEEEQFLSRLEEKSATLGE 846
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1920237946 1886 VRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEE 1938
Cdd:TIGR00618  847 ITHQLLKYEECSKQLAQLTQEQAKIIQLSDKLNGINQIKIQFDGDALIKFLHE 899
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1370-2034 4.70e-15

Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 82.66  E-value: 4.70e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1370 VREQLRQEKALLEDIERhgekveecqrfAKQYINAIKDYELQLVTYKAQ---LEPVaspakkpkvqsgsESIIQEYVDLR 1446
Cdd:COG4913    213 VREYMLEEPDTFEAADA-----------LVEHFDDLERAHEALEDAREQielLEPI-------------RELAERYAAAR 268
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1447 TRYSELSTLTSQY-IRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQ----LAEAHAQAKAQAEREAQGLQR 1521
Cdd:COG4913    269 ERLAELEYLRAALrLWFAQRRLELLEAELEELRAELARLEAELERLEARLDALREeldeLEAQIRGNGGDRLEQLEREIE 348
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1522 RMQEEVARREEVAVEAQEQKRSI--------------QEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQL 1587
Cdd:COG4913    349 RLERELEERERRRARLEALLAALglplpasaeefaalRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEI 428
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1588 EATERQRGGAEGELQALRARAEEAeaqkrqAQEEAERLR-----RQVQDETQRKRQAeAELALRVQA-----EAEAAREK 1657
Cdd:COG4913    429 ASLERRKSNIPARLLALRDALAEA------LGLDEAELPfvgelIEVRPEEERWRGA-IERVLGGFAltllvPPEHYAAA 501
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1658 QRALQALE-ELRLQAEEAERRLRQAEAER------ARQVQVALETAQRSAEAELQSehaSFAEKTAQLERTLKEEHVAV- 1729
Cdd:COG4913    502 LRWVNRLHlRGRLVYERVRTGLPDPERPRldpdslAGKLDFKPHPFRAWLEAELGR---RFDYVCVDSPEELRRHPRAIt 578
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1730 --VQLREEATrraqqqaeaERARAEAERELERWQL-KANEALRLRLQAEEVAQQKSLTQAEAEKQKEeaerearrrgKAE 1806
Cdd:COG4913    579 raGQVKGNGT---------RHEKDDRRRIRSRYVLgFDNRAKLAALEAELAELEEELAEAEERLEAL----------EAE 639
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1807 EQAVRQRELAEQELEKQRQL---AEGTAQQRLAAEQELIRLRAETEQGEQqrqlLEEELARLQREAAAATQKRRELEAEL 1883
Cdd:COG4913    640 LDALQERREALQRLAEYSWDeidVASAEREIAELEAELERLDASSDDLAA----LEEQLEELEAELEELEEELDELKGEI 715
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1884 AKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAE 1963
Cdd:COG4913    716 GRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRENLEERIDALRARLNRAEEELERAMRA 795
                          650       660       670       680       690       700       710
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 1964 -KLAAISEATRLKTEAEiALKEKEAENERLR--RLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELER 2034
Cdd:COG4913    796 fNREWPAETADLDADLE-SLPEYLALLDRLEedGLPEYEERFKELLNENSIEFVADLLSKLRRAIREIKERIDP 868
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1465-1956 4.90e-15

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 82.12  E-value: 4.90e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1465 ETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHA--QAKAQAEREAQGLQRRMQEEVARREEVAvEAQEQKR 1542
Cdd:COG4717     88 EEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPlyQELEALEAELAELPERLEELEERLEELR-ELEEELE 166
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1543 SIQEELQHLRQSSEaeiqakarqvEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEA 1622
Cdd:COG4717    167 ELEAELAELQEELE----------ELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENEL 236
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1623 ER--LRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSA 1700
Cdd:COG4717    237 EAaaLEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGKEAEELQALPALEEL 316
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1701 EAElqsehasfaektaQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAqq 1780
Cdd:COG4717    317 EEE-------------ELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAEAGV-- 381
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1781 ksltqaeaekqkeeaerearrrgKAEEQAVRQRELAEQELEKQRQLAEgtAQQRLAAEQELIRLRAETEQGEQqrqlLEE 1860
Cdd:COG4717    382 -----------------------EDEEELRAALEQAEEYQELKEELEE--LEEQLEELLGELEELLEALDEEE----LEE 432
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1861 ELARLQREAAAATQKRRELEAELAKVRAEMEVLlaskaraeeESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAK 1940
Cdd:COG4717    433 ELEELEEELEELEEELEELREELAELEAELEQL---------EEDGELAELLQELEELKAELRELAEEWAALKLALELLE 503
                          490
                   ....*....|....*....
gi 1920237946 1941 RQRQLAEED---AVRQRAE 1956
Cdd:COG4717    504 EAREEYREErlpPVLERAS 522
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
4168-4206 5.20e-15

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 71.59  E-value: 5.20e-15
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 4168 LLEAQIATGGIIDPEESHRLPVDVAYQRGLFDEEMNEIL 4206
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
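As a rough aid for reading alignment blocks like the one above, the following Python sketch splits a query/subject row pair into its fields and counts identical residues. The function names `parse_row` and `identity` are hypothetical helpers, not part of any NCBI tool. The sketch assumes that lowercase letters and `-` mark unaligned or gapped columns, so only uppercase pairs are compared.

```python
def parse_row(line):
    """Split one alignment row into (label, start, sequence, end).

    Assumes the row ends with "<start> <sequence> <end>", as in
    "gi 1920237946 4168 LLEAQ... 4206" or "Cdd:pfam00681 1 LLEAQ... 39".
    """
    parts = line.split()
    start, seq, end = parts[-3], parts[-2], parts[-1]
    label = " ".join(parts[:-3])
    return label, int(start), seq, int(end)


def identity(query_seq, subject_seq):
    """Return (identical, aligned) counts over uppercase column pairs.

    Assumption: lowercase letters and '-' denote unaligned or gapped
    columns and are skipped.
    """
    if len(query_seq) != len(subject_seq):
        raise ValueError("rows must have the same number of columns")
    aligned = [(q, s) for q, s in zip(query_seq, subject_seq)
               if q.isupper() and s.isupper()]
    same = sum(1 for q, s in aligned if q == s)
    return same, len(aligned)


# The two rows of the pfam00681 hit at residues 4168-4206.
QUERY_ROW = "gi 1920237946 4168 LLEAQIATGGIIDPEESHRLPVDVAYQRGLFDEEMNEIL 4206"
SUBJECT_ROW = "Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39"

_, q_start, q_seq, q_end = parse_row(QUERY_ROW)
_, s_start, s_seq, s_end = parse_row(SUBJECT_ROW)
same, aligned = identity(q_seq, s_seq)
print(f"query {q_start}-{q_end}, model {s_start}-{s_end}: "
      f"{same}/{aligned} identical aligned residues")
```

For longer hits that span several 80-column blocks, the sequence fields of successive rows would need to be concatenated before calling `identity`.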
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3590-3628 5.20e-15

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 71.59  E-value: 5.20e-15
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3590 LLEAQIATGGIIDPVHSHRVPVDVAYQRGYFDEEMNRVL 3628
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1459-2178 7.31e-15

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 82.32  E-value: 7.31e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1459 YIRFISETLRRMEEEERLAEQQRAEERERLAEVEAaLEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQ 1538
Cdd:TIGR00618  140 YKTFTRVVLLPQGEFAQFLKAKSKEKKELLMNLFP-LDQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYH 218
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1539 EQKRSIQEELQHLR--QSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQAL---RARAEEAEA 1613
Cdd:TIGR00618  219 ERKQVLEKELKHLReaLQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERInraRKAAPLAAH 298
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1614 QKRQAQEEAERLRRQVQ-DETQRKRQAEAELALRVQAEAEAAREKQRALQAL--EELRLQAEEAERRLRQAEAERARQVQ 1690
Cdd:TIGR00618  299 IKAVTQIEQQAQRIHTElQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLhsQEIHIRDAHEVATSIREISCQQHTLT 378
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1691 VALETAQRSAEAELQSEHASFAEKTaqlertlkeehvavvQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRL 1770
Cdd:TIGR00618  379 QHIHTLQQQKTTLTQKLQSLCKELD---------------ILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELC 443
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1771 RLQAEEVAQQKSLTQAEAEKQKEEAerearrrgKAEEQAVRQRELAEQELEKQRQLAEgtaqQRLAAEQELIRLRAETEQ 1850
Cdd:TIGR00618  444 AAAITCTAQCEKLEKIHLQESAQSL--------KEREQQLQTKEQIHLQETRKKAVVL----ARLLELQEEPCPLCGSCI 511
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1851 GEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLAS----KARAEEESRSTSEKSKQR---------LEA 1917
Cdd:TIGR00618  512 HPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQraslKEQMQEIQQSFSILTQCDnrskedipnLQN 591
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1918 EAGRFRELAEEAARLR-ALAEEAKRQ-RQLAEEDAVRQRAEAERVLAEKLAaiSEATRLKTEAEIALKEKEAENERLRRL 1995
Cdd:TIGR00618  592 ITVRLQDLTEKLSEAEdMLACEQHALlRKLQPEQDLQDVRLHLQQCSQELA--LKLTALHALQLTLTQERVREHALSIRV 669
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1996 AEDEAFQRRLLEEQAAQHKADieaRLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELG 2075
Cdd:TIGR00618  670 LPKELLASRQLALQKMQSEKE---QLTYWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLK 746
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2076 RIRGTAEDTLRSKEQAEQEAARQrqlaaeeerrRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAE 2155
Cdd:TIGR00618  747 ELMHQARTVLKARTEAHFNNNEE----------VTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDE 816
                          730       740
                   ....*....|....*....|....
gi 1920237946 2156 QE-SARQLQLAQEAAQKRLQAEEK 2178
Cdd:TIGR00618  817 DIlNLQCETLVQEEEQFLSRLEEK 840
CH_SMTNB cd21259
calponin homology (CH) domain found in smoothelin-B and similar proteins; Smoothelins are ...
303-414 9.03e-15

calponin homology (CH) domain found in smoothelin-B and similar proteins; Smoothelins are actin-binding cytoskeletal proteins that are abundantly expressed in healthy visceral (smoothelin-A) and vascular (smoothelin-B) smooth muscle. The human SMTN gene encodes smoothelin-A and smoothelin-B. This model corresponds to the single CH domain of smoothelin-B. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409108  Cd Length: 112  Bit Score: 73.49  E-value: 9.03e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  303 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 382
Cdd:cd21259      3 KQMLLDWCRAKTRGYENVDIQNFSSSWSDGMAFCALVHNFFPEAFDYSQLSPQNRRHNFEVAFSSAEKHADCPQLLDVED 82
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1920237946  383 -VDVPQPDEKSIITYVSSLYDAMprvpdVQDGV 414
Cdd:cd21259     83 mVRMREPDWKCVYTYIQEFYRCL-----VQKGL 110
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1452-2363 1.02e-14

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains; it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone of the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 81.76  E-value: 1.02e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1452 LSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEV----EAALEKQRQLAEAHAQAkAQAEREAQGLQRRMQEEV 1527
Cdd:pfam01576  178 LSKLKNKHEAMISDLEERLKKEEKGRQELEKAKRKLEGEStdlqEQIAELQAQIAELRAQL-AKKEEELQAALARLEEET 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1528 ARREEvaveAQEQKRSIQEELQHLRQSSEAEIQAKARqveaAERSRLRIEEEIRVVRLQLEATErqrgGAEGELQALRAR 1607
Cdd:pfam01576  257 AQKNN----ALKKIRELEAQISELQEDLESERAARNK----AEKQRRDLGEELEALKTELEDTL----DTTAAQQELRSK 324
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1608 AE-EAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAlrvqaeaEAAREKQRALQALEELRlQAEEAERRLRQAEAERA 1686
Cdd:pfam01576  325 REqEVTELKKALEEETRSHEAQLQEMRQKHTQALEELT-------EQLEQAKRNKANLEKAK-QALESENAELQAELRTL 396
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1687 RQVQVALETAQRSAEAELQSEHASFAEktAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANE 1766
Cdd:pfam01576  397 QQAKQDSEHKRKKLEGQLQELQARLSE--SERQRAELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLESQLQDTQ 474
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1767 ALRlrlqAEEVAQQKSLTQAEAEKqkeeaerearrrgkaEEQAVRQRELAEQELEKQRQLAegtaQQRLAAEQELIRLRA 1846
Cdd:pfam01576  475 ELL----QEETRQKLNLSTRLRQL---------------EDERNSLQEQLEEEEEAKRNVE----RQLSTLQAQLSDMKK 531
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1847 ETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAK-------VRAEMEVLLAS-------------------KARA 1900
Cdd:pfam01576  532 KLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKlektknrLQQELDDLLVDldhqrqlvsnlekkqkkfdQMLA 611
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1901 EEESRSTS-EKSKQRLEAEAgrfRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAA---ISEATRLKT 1976
Cdd:pfam01576  612 EEKAISARyAEERDRAEAEA---REKETRALSLARALEEALEAKEELERTNKQLRAEMEDLVSSKDDVgknVHELERSKR 688
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1977 EAEIALKEKEAENERLR-RLAEDEAFQRRL---LEEQAAQHKADIEARlaqlrkaSESELERQKGLVedtlRQRRQVEEE 2052
Cdd:pfam01576  689 ALEQQVEEMKTQLEELEdELQATEDAKLRLevnMQALKAQFERDLQAR-------DEQGEEKRRQLV----KQVRELEAE 757
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2053 ILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKA 2132
Cdd:pfam01576  758 LEDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAVKQLKKLQAQMKDLQRELEEARASRDEILAQSKESEKKLKN 837
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2133 ALEEVERLKAKVEEARRLRERAEQESAR-QLQLAQEAAQKRLQAEEKAHAFA-VQQKEQELQQTLQQEQSVLERLRSEAE 2210
Cdd:pfam01576  838 LEAELLQLQEDLAASERARRQAQQERDElADEIASGASGKSALQDEKRRLEArIAQLEEELEEEQSNTELLNDRLRKSTL 917
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2211 AARR---AAEEAEAARERAEREAAQSRRQVEEAeRLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQ 2287
Cdd:pfam01576  918 QVEQlttELAAERSTSQKSESARQQLERQNKEL-KAKLQEMEGTVKSKFKSSIAALEAKIAQLEEQLEQESRERQAANKL 996
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2288 AADAE---------MEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFS 2358
Cdd:pfam01576  997 VRRTEkklkevllqVEDERRHADQYKDQAEKGNSRMKQLKRQLEEAEEEASRANAARRKLQRELDDATESNESMNREVST 1076

                   ....*
gi 1920237946 2359 LRVQM 2363
Cdd:pfam01576 1077 LKSKL 1081
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1277-2056 1.04e-14

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 81.65  E-value: 1.04e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1277 DELRGAQEVGERLQQRHGERDvEVERWRERVTLLLERwqaVLAQTDVRQRELEQLGRqlryYREsadplgawLRDAKQRQ 1356
Cdd:TIGR02169  160 DEIAGVAEFDRKKEKALEELE-EVEENIERLDLIIDE---KRQQLERLRREREKAER----YQA--------LLKEKREY 223
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1357 EQiqAVPLANSQAVREQLRQEKALLEDIERHGEKVEEcqrfakqyinAIKDYELQLVTYKAQLEPVASPAKKpkvqSGSE 1436
Cdd:TIGR02169  224 EG--YELLKEKEALERQKEAIERQLASLEEELEKLTE----------EISELEKRLEEIEQLLEELNKKIKD----LGEE 287
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1437 SIIQEYVDLRTRYSELSTLTSQyIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREA 1516
Cdd:TIGR02169  288 EQLRVKEKIGELEAEIASLERS-IAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEEL 366
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1517 QGLQRRMQEEVARREEVAVEAQEQKRSIqEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGG 1596
Cdd:TIGR02169  367 EDLRAELEEVDKEFAETRDELKDYREKL-EKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKED 445
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1597 AEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAlRVQAEAEAAREKQRALQALEEL---RLQA-- 1671
Cdd:TIGR02169  446 KALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELA-EAEAQARASEERVRGGRAVEEVlkaSIQGvh 524
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1672 ----------EEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFA-----EKTAQLERTLKEEHVA-------- 1728
Cdd:TIGR02169  525 gtvaqlgsvgERYATAIEVAAGNRLNNVVVEDDAVAKEAIELLKRRKAGRAtflplNKMRDERRDLSILSEDgvigfavd 604
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1729 --------------------VVQLREEATRRAQQQAEAERA--------------RAEAERELERWQLKAnEALRLRLQA 1774
Cdd:TIGR02169  605 lvefdpkyepafkyvfgdtlVVEDIEAARRLMGKYRMVTLEgelfeksgamtggsRAPRGGILFSRSEPA-ELQRLRERL 683
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1775 EEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAV---RQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQG 1851
Cdd:TIGR02169  684 EGLKRELSSLQSELRRIENRLDELSQELSDASRKIGeieKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKEL 763
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1852 EQQRQLLEEELARLQreAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTsEKSKQRLEAEAGRFRELAEEAAR 1931
Cdd:TIGR02169  764 EARIEELEEDLHKLE--EALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREI-EQKLNRLTLEKEYLEKEIQELQE 840
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1932 LRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAA 2011
Cdd:TIGR02169  841 QRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLS 920
                          810       820       830       840       850
                   ....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2012 QHKADIEA---RLAQLRKASESELE-RQKGLVEDTLRQRRQ-VEEEILAL 2056
Cdd:TIGR02169  921 ELKAKLEAleeELSEIEDPKGEDEEiPEEELSLEDVQAELQrVEEEIRAL 970
CH_PLS_FIM_rpt3 cd21219
third calponin homology (CH) domain found in the plastin/fimbrin family; This family includes ...
179-285 1.05e-14

third calponin homology (CH) domain found in the plastin/fimbrin family; This family includes plastin and fimbrin. Plastin has three isoforms, plastin-1, -2, and -3. Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. Plastin-2, also called L-plastin, or LC64P, or lymphocyte cytosolic protein 1 (LCP-1), is an actin-binding protein that plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-3, also called T-plastin, is an actin-bundling protein found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Fimbrin has been found in plants and fungi. Arabidopsis thaliana fimbrin (AtFIM) includes fimbrin-1, -2, -3, -4, and -5, which cross-link actin filaments (F-actin) in a calcium independent manner. They stabilize and prevent F-actin depolymerization mediated by profilin. They act as key regulators of actin cytoarchitecture, probably involved in cell cycle, cell division, cell elongation and cytoplasmic tractus. AtFIM5 is an actin bundling factor that is required for pollen germination and pollen tube growth. Fungal fimbrin binds to actin, and functionally associates with actin structures involved in the development and maintenance of cell polarity. Members of this family contain four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409068  Cd Length: 113  Bit Score: 73.08  E-value: 1.05e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  179 ADERDrvqKKTFTKWVNKHLIKAQrhISDLYEDLRDGhnlISLLEVLsgDSL-P---------REKGRMRFHKLQNVQIA 248
Cdd:cd21219      1 EGSRE---ERAFRMWLNSLGLDPL--INNLYEDLRDG---LVLLQVL--DKIqPgcvnwkkvnKPKPLNKFKKVENCNYA 70
                           90       100       110
                   ....*....|....*....|....*....|....*..
gi 1920237946  249 LDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTIILH 285
Cdd:cd21219     71 VDLAKKLGFSLVGIGGKDIADGNRKLTLALVWQLMRY 107
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
1075-1626 1.58e-14

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 81.14  E-value: 1.58e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1075 EDRLQAEREYGSCSRHYQQLLQSLEQGEQEESRCQRCISELKDIRLQLEACETRTVHRLrlpldkeparecAQRITEQQK 1154
Cdd:COG1196    274 LELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEEL------------EELEEELEE 341
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1155 AQAEVDGLGKGVARLSAEAEKVLAlpepspaapTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAH 1234
Cdd:COG1196    342 LEEELEEAEEELEEAEAELAEAEE---------ALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEAL 412
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1235 EEQLKEAQAvpatlpELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTLLLERw 1314
Cdd:COG1196    413 LERLERLEE------ELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEE- 485
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1315 qavlaqtdvRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEkALLEDIERHGEKVEEC 1394
Cdd:COG1196    486 ---------LAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAA-LEAALAAALQNIVVED 555
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1395 QRFAKQYINAIKDYELQLVT-YKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEE 1473
Cdd:COG1196    556 DEVAAAAIEYLKAAKAGRATfLPLDKIRARAALAAALARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAA 635
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1474 ERLAEQQRAEERERLAEVEAALEKQRqLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQ 1553
Cdd:COG1196    636 LRRAVTLAGRLREVTLEGEGGSAGGS-LTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEE 714
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1554 SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQ-------AQEEAERLR 1626
Cdd:COG1196    715 ERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLEREIEAlgpvnllAIEEYEELE 794
CH_SMTNL2 cd21261
calponin homology (CH) domain found in smoothelin-like protein 2; Smoothelin-like protein 2 ...
303-402 2.01e-14

calponin homology (CH) domain found in smoothelin-like protein 2; Smoothelin-like protein 2 (SMTNL2) is highly expressed in skeletal muscle and could be associated with differentiating myocytes. It contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409110  Cd Length: 107  Bit Score: 72.31  E-value: 2.01e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  303 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 382
Cdd:cd21261      3 KQILLEWCRSKTIGYKNIDLQNFSSSWSDGMAFCALVHSFFPEAFDYDSLSPSNRKHNFELAFSMAEKLANCDRLIEVED 82
                           90       100
                   ....*....|....*....|..
gi 1920237946  383 VDV--PQPDEKSIITYVSSLYD 402
Cdd:cd21261     83 MMVmgRKPDPMCVFTYVQSLYN 104
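One quantity that can be read off a hit like the one above is how much of the domain model the alignment spans: the subject rows give the first and last model positions, and the summary line gives the model length (`Cd Length`). The Python sketch below computes that fraction. The helper name `model_coverage` is hypothetical, and the span is measured end to end, so internal gaps are not subtracted.

```python
def model_coverage(subject_start, subject_end, cd_length):
    """Fraction of the domain model spanned by the alignment.

    subject_start and subject_end are the first and last model positions
    on the "Cdd:" rows; cd_length is the "Cd Length" value. Gaps inside
    the span are not subtracted (a simplifying assumption).
    """
    if not 1 <= subject_start <= subject_end <= cd_length:
        raise ValueError("positions must satisfy 1 <= start <= end <= cd_length")
    return (subject_end - subject_start + 1) / cd_length


# cd21261 hit above: model positions 3-104 of a 107-residue model.
smtnl2 = model_coverage(3, 104, 107)
# pfam00681 (Plectin repeat) hits: model positions 1-39 of a 39-residue model.
plectin = model_coverage(1, 39, 39)
print(f"cd21261 span: {smtnl2:.1%}; pfam00681 span: {plectin:.1%}")
```

A low fraction on a multi-domain model such as COG4913 or TIGR00618 is one hint that the hit reflects a shared coiled-coil region rather than the whole model.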
CH_DIXDC1 cd21213
calponin homology (CH) domain found in Dixin and similar proteins; Dixin, also called ...
186-286 3.19e-14

calponin homology (CH) domain found in Dixin and similar proteins; Dixin, also called coiled-coil protein DIX1, coiled-coil-DIX1, or DIX domain-containing protein 1, is a positive effector of the Wnt signaling pathway. It activates WNT3A signaling via DVL2 and regulates JNK activation by AXIN1 and DVL2. Members of this family contain a single copy of the CH domain at the N-terminus. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409062  Cd Length: 107  Bit Score: 71.56  E-value: 3.19e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  186 QKKTFTKWVNKHLIK--AQRHISDLYEDLRDGHNLISLLEVLSGDSL------PREKGRMRfhklQNVQIALDYLRHRQV 257
Cdd:cd21213      1 QLQAYVAWVNSQLKKrpGIRPVQDLRRDLRDGVALAQLIEILAGEKLpgidwnPTTDAERK----ENVEKVLQFMASKRI 76
                           90       100
                   ....*....|....*....|....*....
gi 1920237946  258 KLVNIRNDDIADGNPKLTLGLIWTIILHF 286
Cdd:cd21213     77 RMHQTSAKDIVDGNLKAIMRLILALAAHF 105
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1201-1704 4.25e-14

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 79.57  E-value: 4.25e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1201 EQVRSLSAI--YLEKLKTISLVIRSTQEAEEVLRAHEEQLKEAQAVPatlpELEATKAALKKLRAQAEAQQPVFDALRDE 1278
Cdd:COG4913    249 EQIELLEPIreLAERYAAARERLAELEYLRAALRLWFAQRRLELLEA----ELEELRAELARLEAELERLEARLDALREE 324
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1279 LRGAQEV-----GERLQQRhgERDVE-VERWRERVTLLLERWQAVLAQTDVR---------------QRELEQLGRQLRY 1337
Cdd:COG4913    325 LDELEAQirgngGDRLEQL--EREIErLERELEERERRRARLEALLAALGLPlpasaeefaalraeaAALLEALEEELEA 402
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1338 YRESADPLGAWLRDAKQRQEQIQA-----------VPlANSQAVREQLRQEKALLED---------------------IE 1385
Cdd:COG4913    403 LEEALAEAEAALRDLRRELRELEAeiaslerrksnIP-ARLLALRDALAEALGLDEAelpfvgelievrpeeerwrgaIE 481
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1386 R--HGEK----VEEcQRFAK--QYINAIKDyELQLVTYKAqlEPVASPAKKPKVQSGS--------ESIIQEYVD--LRT 1447
Cdd:COG4913    482 RvlGGFAltllVPP-EHYAAalRWVNRLHL-RGRLVYERV--RTGLPDPERPRLDPDSlagkldfkPHPFRAWLEaeLGR 557
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1448 RYS--------ELST----LTSQYIRFISETLRRMEEEERL---------AEQQRAEERERLAEVEAALEKQRQLAEAHA 1506
Cdd:COG4913    558 RFDyvcvdspeELRRhpraITRAGQVKGNGTRHEKDDRRRIrsryvlgfdNRAKLAALEAELAELEEELAEAEERLEALE 637
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1507 QAKAQAEREAQGLQRRmqEEVARREEVAVEAQEQKRSIQEELQHLRQSSeAEIQAKARQVEAAERSRLRIEEEIRVVRLQ 1586
Cdd:COG4913    638 AELDALQERREALQRL--AEYSWDEIDVASAEREIAELEAELERLDASS-DDLAALEEQLEELEAELEELEEELDELKGE 714
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1587 LEATERQRGGAEGELQALRARAEEAE------------------AQKRQAQEEAERLRRQVQDETQRKRQAEAEL----- 1643
Cdd:COG4913    715 IGRLEKELEQAEEELDELQDRLEAAEdlarlelralleerfaaaLGDAVERELRENLEERIDALRARLNRAEEELeramr 794
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237946 1644 ---------ALRVQAEAEAAREKQRALQALEELRL---QAEEAERRLRQAEAERArQVQVALETAQRSAEAEL 1704
Cdd:COG4913    795 afnrewpaeTADLDADLESLPEYLALLDRLEEDGLpeyEERFKELLNENSIEFVA-DLLSKLRRAIREIKERI 866
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1761-2621 4.91e-14

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 79.73  E-value: 4.91e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1761 QLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLA---- 1836
Cdd:TIGR02169  231 EKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLersi 310
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1837 --AEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQR 1914
Cdd:TIGR02169  311 aeKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDY 390
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1915 LEAEAGRFRELAEEAARLRALAEEAKRQRQlaeedavrQRAEAERVLAEKLAAISEatrLKTEAEIALKEKEAENERLRR 1994
Cdd:TIGR02169  391 REKLEKLKREINELKRELDRLQEELQRLSE--------ELADLNAAIAGIEAKINE---LEEEKEDKALEIKKQEWKLEQ 459
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1995 LAED-EAFQRRLLEEQAAQhkADIEARLAQLRKaSESELERQKGLVEDTLRQRRQVEEeilalkgsfekaaagkaELELE 2073
Cdd:TIGR02169  460 LAADlSKYEQELYDLKEEY--DRVEKELSKLQR-ELAEAEAQARASEERVRGGRAVEE-----------------VLKAS 519
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2074 LGRIRGTAEDTLRSKEQ---AEQEAARQRqlaaeeerrrrEAEERVQKSLAAEE--EAARQRK---AALEEVERLKAKVE 2145
Cdd:TIGR02169  520 IQGVHGTVAQLGSVGERyatAIEVAAGNR-----------LNNVVVEDDAVAKEaiELLKRRKagrATFLPLNKMRDERR 588
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2146 EARRLRERAEQESARQLqlaQEAAQKRlqaeEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARER 2225
Cdd:TIGR02169  589 DLSILSEDGVIGFAVDL---VEFDPKY----EPAFKYVFGDTLVVEDIEAARRLMGKYRMVTLEGELFEKSGAMTGGSRA 661
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2226 AEREAAQSRRQVEEAERL---KQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQ 2302
Cdd:TIGR02169  662 PRGGILFSRSEPAELQRLrerLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEE 741
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2303 ALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQveEELFSLRVQMEELGKLKARIEAENRALvl 2382
Cdd:TIGR02169  742 LEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSH--SRIPEIQAELSKLEEEVSRIEARLREI-- 817
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2383 rDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLaEEDLAQQRALAEKmLKEKMQAVQEatrLKAEAELLQQQK 2462
Cdd:TIGR02169  818 -EQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEI-ENLNGKKEELEEE-LEELEAALRD---LESRLGDLKKER 891
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2463 ELAQEQARRLQEDKEQMAQQlaqetqgfqktLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRfRKQAEDIGER 2542
Cdd:TIGR02169  892 DELEAQLRELERKIEELEAQ-----------IEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEE-ELSLEDVQAE 959
                          810       820       830       840       850       860       870
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 2543 LYRTELAtqekvmlVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSF 2621
Cdd:TIGR02169  960 LQRVEEE-------IRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKKKREVFMEAFEAINENF 1031
CH_FLN_rpt2 cd21230
second calponin homology (CH) domain found in filamins; The filamin family includes filamin-A ...
301-398 5.26e-14

second calponin homology (CH) domain found in filamins; The filamin family includes filamin-A (FLN-A), filamin-B (FLN-B) and filamin-C (FLN-C). Filamins function to anchor various transmembrane proteins to the actin cytoskeleton. FLN-A is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-B is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton and may also promote orthogonal branching of actin filaments as well as link actin filaments to membrane glycoproteins. FLN-C, also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. Members of this family contain two copies of the CH domain. The model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409079  Cd Length: 103  Bit Score: 70.87  E-value: 5.26e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  301 TAKEKLLLWSQRMVEGcqgLRCDNFTTSWRDGRLFNAIIHRHKPTLI----DMNKVYRqtnLENLDQAFSVAERDLGVTR 376
Cdd:cd21230      1 TPKQRLLGWIQNKIPQ---LPITNFTTDWNDGRALGALVDSCAPGLCpdweTWDPNDA---LENATEAMQLAEDWLGVPQ 74
                           90       100
                   ....*....|....*....|..
gi 1920237946  377 LLDPEDVDVPQPDEKSIITYVS 398
Cdd:cd21230     75 LITPEEIINPNVDEMSVMTYLS 96
CH_SMTNA cd21258
calponin homology (CH) domain found in smoothelin-A and similar proteins; Smoothelins are ...
303-406 5.34e-14

calponin homology (CH) domain found in smoothelin-A and similar proteins; Smoothelins are actin-binding cytoskeletal proteins that are abundantly expressed in healthy visceral (smoothelin-A) and vascular (smoothelin-B) smooth muscle. This model corresponds to the single CH domain of smoothelin-A. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409107  Cd Length: 111  Bit Score: 71.23  E-value: 5.34e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  303 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 382
Cdd:cd21258      3 KQMLLDWCRAKTRGYEHVDIQNFSSSWSDGMAFCALVHNFFPDAFDYSQLSPQNRRQNFEVAFSAAEMLADCVPLVEVED 82
                           90       100
                   ....*....|....*....|....*.
gi 1920237946  383 VDV--PQPDEKSIITYVSSLYDAMPR 406
Cdd:cd21258     83 MMImgKKPDSKCVFTYVQSLYNHLRR 108
CH_CYTS cd21199
calponin homology (CH) domain found in the cytospin family; The cytospin family includes ...
306-401 6.24e-14

calponin homology (CH) domain found in the cytospin family; The cytospin family includes cytospin-A and cytospin-B. Cytospin-A, also called renal carcinoma antigen NY-REN-22, sperm antigen with calponin homology and coiled-coil domains 1-like, or SPECC1-like (SPECC1L) protein, is involved in cytokinesis and spindle organization. It may play a role in actin cytoskeleton organization and microtubule stabilization and hence, is required for proper cell adhesion and migration. Cytospin-B, also called nuclear structure protein 5 (NSP5), sperm antigen HCMOGT-1, or sperm antigen with calponin homology and coiled-coil domains 1 (SPECC1), is a novel fusion partner to PDGFRB in juvenile myelomonocytic leukemia with translocation t(5;17)(q33;p11.2). Members of this family contain a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409048  Cd Length: 112  Bit Score: 70.85  E-value: 6.24e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  306 LLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAErDLGVTRLLDPED-VD 384
Cdd:cd21199     13 LLKWCQEKTQGYKGIDITNFSSSWNDGLAFCALLHSYLPDKIPYSELNPQDKRRNFTLAFKAAE-SVGIPTTLTIDEmVS 91
                           90
                   ....*....|....*..
gi 1920237946  385 VPQPDEKSIITYVSSLY 401
Cdd:cd21199     92 MERPDWQSVMSYVTAIY 108
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1192-1714 8.01e-14

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 78.80  E-value: 8.01e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1192 ELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEvLRAHEEQLKEAQAvpaTLPELEATKAALKKLRAQAEAQQpv 1271
Cdd:COG4913    266 AARERLAELEYLRAALRLWFAQRRLELLEAELEELRAE-LARLEAELERLEA---RLDALREELDELEAQIRGNGGDR-- 339
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1272 FDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTLLLERWQAVLAQTdvrQRELEQLGRQLRYYRESADPLGAWLRD 1351
Cdd:COG4913    340 LEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEA---AALLEALEEELEALEEALAEAEAALRD 416
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1352 AKQRQEQIQA-----------VPlANSQAVREQLRQEKALLED---------------------IER--HGEK----VEE 1393
Cdd:COG4913    417 LRRELRELEAeiaslerrksnIP-ARLLALRDALAEALGLDEAelpfvgelievrpeeerwrgaIERvlGGFAltllVPP 495
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1394 cQRFAK--QYINAIKDyELQLVTYKAQLEPVASPAKKPKVQSGSEsiiqeyvdlrtrysELSTLTSQYIRFISETLRRM- 1470
Cdd:COG4913    496 -EHYAAalRWVNRLHL-RGRLVYERVRTGLPDPERPRLDPDSLAG--------------KLDFKPHPFRAWLEAELGRRf 559
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1471 -----EEEERLAEQQRAEERERLA-EVEAALEK--QRQLAEAH------AQAKAQAEREAQGLQRRMQEEVARREEVAvE 1536
Cdd:COG4913    560 dyvcvDSPEELRRHPRAITRAGQVkGNGTRHEKddRRRIRSRYvlgfdnRAKLAALEAELAELEEELAEAEERLEALE-A 638
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1537 AQEQKRSIQEELQHLRQSSEAEIQakarqVEAAERSRLRIEEEIRvvrlQLEAterqrggAEGELQALRARAEEAEAQKR 1616
Cdd:COG4913    639 ELDALQERREALQRLAEYSWDEID-----VASAEREIAELEAELE----RLDA-------SSDDLAALEEQLEELEAELE 702
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1617 QAQEEAERLRRQVQDETQRKRQAEAEL-ALRVQAEAEAAREKQRALQALEELRlqAEEAERRLRQAEAERARQVQVALET 1695
Cdd:COG4913    703 ELEEELDELKGEIGRLEKELEQAEEELdELQDRLEAAEDLARLELRALLEERF--AAALGDAVERELRENLEERIDALRA 780
                          570
                   ....*....|....*....
gi 1920237946 1696 AQRSAEAELQSEHASFAEK 1714
Cdd:COG4913    781 RLNRAEEELERAMRAFNRE 799
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
1842-2655 8.04e-14

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 78.86  E-value: 8.04e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1842 IRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAE--MEVLLASKARAEEESRSTSEKSKQRLEAea 1919
Cdd:TIGR00618   94 LRCTRSHRKTEQPEQLYLEQKKGRGRILAAKKSETEEVIHDLLKLDYKtfTRVVLLPQGEFAQFLKAKSKEKKELLMN-- 171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1920 grfrelAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAErvlAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDE 1999
Cdd:TIGR00618  172 ------LFPLDQYTQLALMEFAKKKSLHGKAELLTLRSQ---LLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSH 242
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2000 AFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEeilalkgsfeKAAAGKAELELELGRIRG 2079
Cdd:TIGR00618  243 AYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRARKAAP----------LAAHIKAVTQIEQQAQRI 312
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2080 TAEdtLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESA 2159
Cdd:TIGR00618  313 HTE--LQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQQKTT 390
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2160 rQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEE 2239
Cdd:TIGR00618  391 -LTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLK 469
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2240 AERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRL 2319
Cdd:TIGR00618  470 EREQQLQTKEQIHLQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYH 549
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2320 QLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLkarIEAENRAlvlrdKDSAQRLLQEEAEKm 2399
Cdd:TIGR00618  550 QLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDL---TEKLSEA-----EDMLACEQHALLRK- 620
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2400 KQVAEEAARLSVAAQEAARLRQLAEEDLAQqraLAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQ--EDKE 2477
Cdd:TIGR00618  621 LQPEQDLQDVRLHLQQCSQELALKLTALHA---LQLTLTQERVREHALSIRVLPKELLASRQLALQKMQSEKEQltYWKE 697
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2478 QMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEdigERLYRTELATQEKVMLV 2557
Cdd:TIGR00618  698 MLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELMHQARTVLK---ARTEAHFNNNEEVTAAL 774
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2558 QTLeTQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQ 2637
Cdd:TIGR00618  775 QTG-AELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLK 853
                          810
                   ....*....|....*...
gi 1920237946 2638 EKAKLEQlfQDEVAKAQA 2655
Cdd:TIGR00618  854 YEECSKQ--LAQLTQEQA 869
mukB PRK04863
chromosome partition protein MukB;
1482-2588 8.80e-14

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 78.85  E-value: 8.80e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1482 AEERERLaeVEAALEKQRQLAEAHAQAKAQAEReaqglqrrmQEEVARREEvavEAQEQKRSIQEELQ----HLRQSSEA 1557
Cdd:PRK04863   278 ANERRVH--LEEALELRRELYTSRRQLAAEQYR---------LVEMARELA---ELNEAESDLEQDYQaasdHLNLVQTA 343
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1558 EIQAKA--RQVEAAERSRLRIEEEIRVVRlqlEATERQrggaegelqalraraEEAEAQKRQAQEEAERLRRQVQDETQr 1635
Cdd:PRK04863   344 LRQQEKieRYQADLEELEERLEEQNEVVE---EADEQQ---------------EENEARAEAAEEEVDELKSQLADYQQ- 404
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1636 krqaeaelALRVQAEAeaAREKQRALQALEELR-------LQAEEAERRLRQAEAERARQVQVALETAQRSAEAElqsEH 1708
Cdd:PRK04863   405 --------ALDVQQTR--AIQYQQAVQALERAKqlcglpdLTADNAEDWLEEFQAKEQEATEELLSLEQKLSVAQ---AA 471
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1709 ASFAEKTAQLERTLKEEHVavvqlREEAtrraqqqaeaeraraeaerelerWQlKANEALRlrlqaeevaqqksltqaea 1788
Cdd:PRK04863   472 HSQFEQAYQLVRKIAGEVS-----RSEA-----------------------WD-VARELLR------------------- 503
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1789 ekqkeeaerearrrgkaeeQAVRQRELAEQELEKQRQLAEgtAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQRE 1868
Cdd:PRK04863   504 -------------------RLREQRHLAEQLQQLRMRLSE--LEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEE 562
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1869 AAAAtqkRRELEAELAKVRAEMEVLlaskaRAEEESrstSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEE 1948
Cdd:PRK04863   563 LEAR---LESLSESVSEARERRMAL-----RQQLEQ---LQARIQRLAARAPAWLAAQDALARLREQSGEEFEDSQDVTE 631
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1949 dAVRQRAEAERVLAEKLAAISEA-TRLKTEAEIALKEKEAENERLRRLAEDeaFQRRLLEEQ----AAQHKADIEARLAQ 2023
Cdd:PRK04863   632 -YMQQLLERERELTVERDELAARkQALDEEIERLSQPGGSEDPRLNALAER--FGGVLLSEIyddvSLEDAPYFSALYGP 708
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2024 LRKASeseLERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAG--KAElELELGRIRGTAEDTLR-SKEQAEQ---EAAR 2097
Cdd:PRK04863   709 ARHAI---VVPDLSDAAEQLAGLEDCPEDLYLIEGDPDSFDDSvfSVE-ELEKAVVVKIADRQWRySRFPEVPlfgRAAR 784
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2098 QRQLaaeeerrrreaeervqkslaaeEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLA---------QEA 2168
Cdd:PRK04863   785 EKRI----------------------EQLRAEREELAERYATLSFDVQKLQRLHQAFSRFIGSHLAVAfeadpeaelRQL 842
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2169 AQKRLQAEEKahafavqqkeqelqqtlqqeqsvLERLRSeaeaarraaeeaeaareraerEAAQSRRQVEEAERLKQSae 2248
Cdd:PRK04863   843 NRRRVELERA-----------------------LADHES---------------------QEQQQRSQLEQAKEGLSA-- 876
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2249 eqaqaqaqaqaaaekLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRlqleeTDhqk 2328
Cdd:PRK04863   877 ---------------LNRLLPRLNLLADETLADRVEEIREQLDEAEEAKRFVQQHGNALAQLEPIVSVLQ-----SD--- 933
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2329 silDEELQRLKAEVTEAARQRGQVEEELFSLrvqmEELGKLKARIEAENRALVLRDKDSAQRLLQ---EEAEKMKQVAEE 2405
Cdd:PRK04863   934 ---PEQFEQLKQDYQQAQQTQRDAKQQAFAL----TEVVQRRAHFSYEDAAEMLAKNSDLNEKLRqrlEQAEQERTRARE 1006
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2406 AARlsvaaQEAARLRQLAE--EDLAQQRALAEKMLKEKMQAVQEAT-RLKAEAE-LLQQQKELAQEQARRLQEDKEQMAQ 2481
Cdd:PRK04863  1007 QLR-----QAQAQLAQYNQvlASLKSSYDAKRQMLQELKQELQDLGvPADSGAEeRARARRDELHARLSANRSRRNQLEK 1081
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2482 QLaqetqgfqkTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEkvmlvqtLE 2561
Cdd:PRK04863  1082 QL---------TFCEAEMDNLTKKLRKLERDYHEMREQVVNAKAGWCAVLRLVKDNGVERRLHRRELAYLS-------AD 1145
                         1130      1140
                   ....*....|....*....|....*..
gi 1920237946 2562 TQRQQSDRDAERLREAIAELEHEKDKL 2588
Cdd:PRK04863  1146 ELRSMSDKALGALRLAVADNEHLRDVL 1172
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3849-3887 1.06e-13

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 67.74  E-value: 1.06e-13
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3849 LLEAQAATGFLLDPVKGERLAVDEAVRKGLVGPELHDRL 3887
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
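The paired rows of each alignment (the query "gi" row above the "Cdd:" model row) can be compared column by column. The sketch below counts identical residues for one pair of rows. The function name alignment_identity is hypothetical, and the code assumes that "-" marks a gap and that lowercase letters mark residues outside the aligned region, which is how the rows on this page appear to be drawn.

```python
def alignment_identity(query_row, subject_row):
    """Return (identical, aligned) column counts for two alignment rows.

    Assumptions (not confirmed by the page itself): "-" is a gap and
    lowercase letters are unaligned residues; both kinds of column are
    skipped rather than counted as mismatches.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # gap or unaligned column
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the Plectin repeat (pfam00681) hit, residues 3849-3887.
query = "LLEAQAATGFLLDPVKGERLAVDEAVRKGLVGPELHDRL"
model = "LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL"
identical, aligned = alignment_identity(query, model)
print(f"{identical}/{aligned} identical columns")
```

For hits that span several 80-column blocks, the rows of each block would first need to be concatenated before calling the function.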
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
2283-2742 1.32e-13

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 78.17  E-value: 1.32e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2283 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQ 2362
Cdd:TIGR02168  231 VLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRER 310
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2363 MEELGKLKARIEAEnRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKM 2442
Cdd:TIGR02168  311 LANLERQLEELEAQ-LEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVA 389
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2443 QAVQEATRLKAEAELLQQQKE-LAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRA 2521
Cdd:TIGR02168  390 QLELQIASLNNEIERLEARLErLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREE 469
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2522 QARAEEDARRFRKQAEDIGERLYRTE-----------------------------LATQEKV------------------ 2554
Cdd:TIGR02168  470 LEEAEQALDAAERELAQLQARLDSLErlqenlegfsegvkallknqsglsgilgvLSELISVdegyeaaieaalggrlqa 549
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2555 MLVQTLETQRQ--QSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEE-----------------------MQTVRQEQ 2609
Cdd:TIGR02168  550 VVVENLNAAKKaiAFLKQNELGRVTFLPLDSIKGTEIQGNDREILKNIEgflgvakdlvkfdpklrkalsylLGGVLVVD 629
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2610 LLQETQALQ------------------------QSFLSEKDSLLQRERCIEQEKAKLEQLfQDEVAKAQALREEQQRQQQ 2665
Cdd:TIGR02168  630 DLDNALELAkklrpgyrivtldgdlvrpggvitGGSAKTNSSILERRREIEELEEKIEEL-EEKIAELEKALAELRKELE 708
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237946 2666 QMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEEIA 2742
Cdd:TIGR02168  709 ELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIE 785
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1248-1707 1.53e-13


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 77.50  E-value: 1.53e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1248 LPELEATKAALKKLRAQAEAqqpvFDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTL--LLERWQAVLAQTDVRQ 1325
Cdd:COG4717     70 LKELKELEEELKEAEEKEEE----YAELQEELEELEEELEELEAELEELREELEKLEKLLQLlpLYQELEALEAELAELP 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1326 RELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQekaLLEDIERHGEKVEECQRFAKQYINAI 1405
Cdd:COG4717    146 ERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQD---LAEELEELQQRLAELEEELEEAQEEL 222
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1406 KDYELQLVTYKAQLEPVASPAK--KPKVQSGSESII-------QEYVDLRTRYSELSTLTSQYIRFISETLRRMEE--EE 1474
Cdd:COG4717    223 EELEEELEQLENELEAAALEERlkEARLLLLIAAALlallglgGSLLSLILTIAGVLFLVLGLLALLFLLLAREKAslGK 302
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1475 RLAEQQRAEERERL--AEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKrsIQEELQHLR 1552
Cdd:COG4717    303 EAEELQALPALEELeeEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQE--IAALLAEAG 380
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1553 QSSEAEIQAKARQVEAAErsrlRIEEEIRVVRLQLEA--TERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQ 1630
Cdd:COG4717    381 VEDEEELRAALEQAEEYQ----ELKEELEELEEQLEEllGELEELLEALDEEELEEELEELEEELEELEEELEELREELA 456
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 1631 DETQRKRQAEAElalrvqaeaeaarekqralQALEELRLQAEEAERRLRQAEAERARQ--VQVALETAQRSAEAELQSE 1707
Cdd:COG4717    457 ELEAELEQLEED-------------------GELAELLQELEELKAELRELAEEWAALklALELLEEAREEYREERLPP 516
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1469-2593 1.63e-13

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains; it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone of the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 77.91  E-value: 1.63e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1469 RMEEEERLAEQQRAEERERLAEVEAAL----EKQRQLAEAHAQAKAQAEREAqglqrrmqEEVARREEVAVEAQEQKRSI 1544
Cdd:pfam01576    2 RQEEEMQAKEEELQKVKERQQKAESELkeleKKHQQLCEEKNALQEQLQAET--------ELCAEAEEMRARLAARKQEL 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1545 QEELQHLrqssEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRggaegelQALRARAEEAEAQKRQAQEEAER 1624
Cdd:pfam01576   74 EEILHEL----ESRLEEEEERSQQLQNEKKKMQQHIQDLEEQLDEEEAAR-------QKLQLEKVTTEAKIKKLEEDILL 142
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1625 LRRQVQDETQRKRQAE---AELALRVQAEAEAA------REKQRALQALEELRLQAEEAERRlrqaEAERARQVQVALET 1695
Cdd:pfam01576  143 LEDQNSKLSKERKLLEeriSEFTSNLAEEEEKAkslsklKNKHEAMISDLEERLKKEEKGRQ----ELEKAKRKLEGEST 218
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1696 AQRSAEAELQsehASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAeaeraraeaerelerwQLKANEALRLRLQaE 1775
Cdd:pfam01576  219 DLQEQIAELQ---AQIAELRAQLAKKEEELQAALARLEEETAQKNNALK----------------KIRELEAQISELQ-E 278
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1776 EVAQQKSltqaeaekqkeeaerearrrgkAEEQAVRQRELAEQELEKQRQLAEGT-----AQQRLAA--EQELIRLR--- 1845
Cdd:pfam01576  279 DLESERA----------------------ARNKAEKQRRDLGEELEALKTELEDTldttaAQQELRSkrEQEVTELKkal 336
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1846 -AETEQGEQQRQ-----------LLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEEsRSTSEKSKQ 1913
Cdd:pfam01576  337 eEETRSHEAQLQemrqkhtqaleELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQAKQDSEHK-RKKLEGQLQ 415
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1914 RLEA---EAGRFR-ELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERV---LAEKLAAISEATRLKTEAEIALKEKE 1986
Cdd:pfam01576  416 ELQArlsESERQRaELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLesqLQDTQELLQEETRQKLNLSTRLRQLE 495
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1987 AENERLRRLAEDEAFQRRLLEEQAAQHkadiEARLAQLRKASESELERQKGLVEDtlrqRRQVEEEILALKGSFEKAAAG 2066
Cdd:pfam01576  496 DERNSLQEQLEEEEEAKRNVERQLSTL----QAQLSDMKKKLEEDAGTLEALEEG----KKRLQRELEALTQQLEEKAAA 567
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2067 KAELELELGRIRGTAEDTLRSKEQaeqeaarQRQLAAEEERRRREAeervqKSLAAEEEAARQRKAALEEVERLKAKVEE 2146
Cdd:pfam01576  568 YDKLEKTKNRLQQELDDLLVDLDH-------QRQLVSNLEKKQKKF-----DQMLAEEKAISARYAEERDRAEAEAREKE 635
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2147 ARRLRERAEQESARQLQLAQEAAQKRLQAEekahafavqqkeqelqqtlqqeqsvLERLRSEAEAARRAAEEAEAARERA 2226
Cdd:pfam01576  636 TRALSLARALEEALEAKEELERTNKQLRAE-------------------------MEDLVSSKDDVGKNVHELERSKRAL 690
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2227 EREAAQSRRQVEEAERLKQSaeeqaqaqaqaqAAAEKLRKEAEQEAARRAQAeqaalRQKQAADAEMEKHKQfaeQALRQ 2306
Cdd:pfam01576  691 EQQVEEMKTQLEELEDELQA------------TEDAKLRLEVNMQALKAQFE-----RDLQARDEQGEEKRR---QLVKQ 750
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2307 KAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKD 2386
Cdd:pfam01576  751 VRELEAELEDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAVKQLKKLQAQMKDLQRELEEARASRDEILAQSKE 830
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2387 SAQRLLQEEAEKMkQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQ 2466
Cdd:pfam01576  831 SEKKLKNLEAELL-QLQEDLAASERARRQAQQERDELADEIASGASGKSALQDEKRRLEARIAQLEEELEEEQSNTELLN 909
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2467 EQARRLQEDKEQMAQQLAQETQGFQKtLETERQrqlEMSAEAERLRLRVAEMSRAQ---------------ARAEEDARR 2531
Cdd:pfam01576  910 DRLRKSTLQVEQLTTELAAERSTSQK-SESARQ---QLERQNKELKAKLQEMEGTVkskfkssiaaleakiAQLEEQLEQ 985
                         1130      1140      1150      1160      1170      1180
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2532 FRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQ 2593
Cdd:pfam01576  986 ESRERQAANKLVRRTEKKLKEVLLQVEDERRHADQYKDQAEKGNSRMKQLKRQLEEAEEEAS 1047
PTZ00121 PTZ00121
MAEBL; Provisional
2285-2815 5.52e-13


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 76.33  E-value: 5.52e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2285 QKQAADAEMEKHKQFAEQALRqkaqVEQELTAlrlqleETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRV-QM 2363
Cdd:PTZ00121  1121 KKKAEDARKAEEARKAEDARK----AEEARKA------EDAKRVEIARKAEDARKAEEARKAEDAKKAEAARKAEEVrKA 1190
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2364 EELGKLKA--RIEAENRALVLRDKDSAQRllQEEAEKMKQV--AEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLK 2439
Cdd:PTZ00121  1191 EELRKAEDarKAEAARKAEEERKAEEARK--AEDAKKAEAVkkAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARR 1268
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2440 EKMQAVQEATrlKAEaELLQQQKELAQEQARRLQEDKEqmAQQLAQETQGFQKTLETERQRQlEMSAEAERLRLRVAEMS 2519
Cdd:PTZ00121  1269 QAAIKAEEAR--KAD-ELKKAEEKKKADEAKKAEEKKK--ADEAKKKAEEAKKADEAKKKAE-EAKKKADAAKKKAEEAK 1342
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2520 RAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLEtQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQlKS 2599
Cdd:PTZ00121  1343 KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAK-KKAEEKKKADEAKKKAEEDKKKADELKKAAAAKK-KA 1420
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2600 EEMQTvRQEQLLQETQALQQSFLSEKDSLLQRErciEQEKAKLEQLFQ--DEVAKAQALREEQQRqqqqmqqekqqlAAS 2677
Cdd:PTZ00121  1421 DEAKK-KAEEKKKADEAKKKAEEAKKADEAKKK---AEEAKKAEEAKKkaEEAKKADEAKKKAEE------------AKK 1484
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2678 MEEARRRQHEAeegvRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALAR-SEEIAPSRA-AAARALPNG 2755
Cdd:PTZ00121  1485 ADEAKKKAEEA----KKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKkAEEKKKADElKKAEELKKA 1560
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2756 QDAADGPAAAAEPEHAFDGLRR-----KVPAQRLQEVGVL-------SAEELQQLAQGRTTVAELAQREDVR 2815
Cdd:PTZ00121  1561 EEKKKAEEAKKAEEDKNMALRKaeeakKAEEARIEEVMKLyeeekkmKAEEAKKAEEAKIKAEELKKAEEEK 1632
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1226-1722 6.27e-13


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 75.72  E-value: 6.27e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1226 EAEEVLRAHEEQLKEAQAVPATLPELEATKAALKKLRAQAEAQQPVFDALRDELrgAQEVGERLQQRHGERDVEVERWRE 1305
Cdd:COG4913    239 RAHEALEDAREQIELLEPIRELAERYAAARERLAELEYLRAALRLWFAQRRLEL--LEAELEELRAELARLEAELERLEA 316
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1306 RVTLLLERWQAVLAQ-TDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLEDI 1384
Cdd:COG4913    317 RLDALREELDELEAQiRGNGGDRLEQLEREIERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEAL 396
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1385 ERHGEKVEECQRFAKQyinAIKDYELQLVTYKAQLEpvaspakkpKVQSGSESIIQEYVDLRTRYSELSTLTSQYIRFIS 1464
Cdd:COG4913    397 EEELEALEEALAEAEA---ALRDLRRELRELEAEIA---------SLERRKSNIPARLLALRDALAEALGLDEAELPFVG 464
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1465 ETLRRMEEEERLaeqQRAEER-------------ERLAEVEAALEKQR-------QLAEAHAQAKAQAEREAQGLQRRM- 1523
Cdd:COG4913    465 ELIEVRPEEERW---RGAIERvlggfaltllvppEHYAAALRWVNRLHlrgrlvyERVRTGLPDPERPRLDPDSLAGKLd 541
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1524 ----------QEEVARREEVA-VEAQEQ----KRSIQEELQ--------------HLRQ------SSEAEIQAKARQVEA 1568
Cdd:COG4913    542 fkphpfrawlEAELGRRFDYVcVDSPEElrrhPRAITRAGQvkgngtrhekddrrRIRSryvlgfDNRAKLAALEAELAE 621
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1569 AERSRLRIEEEIRVVRLQLEATERQRGGAEG---------ELQALRARAEEAEAQKRQAQE---EAERLRRQVQDETQRK 1636
Cdd:COG4913    622 LEEELAEAEERLEALEAELDALQERREALQRlaeyswdeiDVASAEREIAELEAELERLDAssdDLAALEEQLEELEAEL 701
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1637 RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQV--QVALETAQRSAEAELQSEHASFAEK 1714
Cdd:COG4913    702 EELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERfaAALGDAVERELRENLEERIDALRAR 781

                   ....*...
gi 1920237946 1715 TAQLERTL 1722
Cdd:COG4913    782 LNRAEEEL 789
YqiK COG2268
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
1451-1668 6.96e-13


Pssm-ID: 441869 [Multi-domain]  Cd Length: 439  Bit Score: 74.52  E-value: 6.96e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1451 ELSTLTSQYIR-----FISETLRRMEEEERLAEQQRAEeRERLAEVEAAlEKQRQLAEAHAQAKAQAER----EAQGLQR 1521
Cdd:COG2268    170 ELESVAITDLEdennyLDALGRRKIAEIIRDARIAEAE-AERETEIAIA-QANREAEEAELEQEREIETariaEAEAELA 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1522 RMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEirvvrlqLEATERQRGGAEGEL 1601
Cdd:COG2268    248 KKKAEERREAETARAEAEAAYEIAEANAEREVQRQLEIAEREREIELQEKEAEREEAE-------LEADVRKPAEAEKQA 320
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237946 1602 QALRARAeEAEAQKRQAQEEAERLRRQVqdETQRKRQAEAELALRVQAEAEAAREKQRALQALEELR 1668
Cdd:COG2268    321 AEAEAEA-EAEAIRAKGLAEAEGKRALA--EAWNKLGDAAILLMLIEKLPEIAEAAAKPLEKIDKIT 384
CH_jitterbug-like_rpt2 cd21229
second calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and ...
303-398 7.03e-13

second calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and similar proteins; Protein jitterbug (Jbug) is an actin-meshwork organizing protein. It is required to maintain the shape and cell orientation of the Drosophila notum epithelium during flight muscle attachment to tendon cells. Jbug contains three copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409078  Cd Length: 105  Bit Score: 67.80  E-value: 7.03e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  303 KEKLLLWSQRMVEGCqglRCDNFTTSWRDGRLFNAIIHRHKPTLI-DMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPE 381
Cdd:cd21229      5 KKLMLAWLQAVLPEL---KITNFSTDWNDGIALSALLDYCKPGLCpNWRKLDPSNSLENCRRAMDLAKREFNIPMVLSPE 81
                           90
                   ....*....|....*..
gi 1920237946  382 DVDVPQPDEKSIITYVS 398
Cdd:cd21229     82 DLSSPHLDELSGMTYLS 98
CH_ASPM_rpt1 cd21223
first calponin homology (CH) domain found in abnormal spindle-like microcephaly-associated ...
204-283 7.21e-13

first calponin homology (CH) domain found in abnormal spindle-like microcephaly-associated protein (ASPM) and similar proteins; ASPM, also called abnormal spindle protein homolog, or Asp homolog, is involved in mitotic spindle regulation and coordination of mitotic processes. It may also have a preferential role in regulating neurogenesis. Members of this family contain two copies of the CH domain in the middle region. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409072  Cd Length: 113  Bit Score: 68.00  E-value: 7.21e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  204 HISDLYEDLRDGHNLISLLEVLSGDSLPREKGRM----RFHKLQNVQIALDYLRHRQV----KLVNIRNDDIADGNPKLT 275
Cdd:cd21223     25 AVTNLAVDLRDGVRLCRLVELLTGDWSLLSKLRVpaisRLQKLHNVEVALKALKEAGVlrggDGGGITAKDIVDGHREKT 104

                   ....*...
gi 1920237946  276 LGLIWTII 283
Cdd:cd21223    105 LALLWRII 112
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3183-3221 7.46e-13

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 65.43  E-value: 7.46e-13
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3183 LLEAQAGTGHIIDPTTSARLTVDEAVRAGLVGPELHEKL 3221
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1599-2177 8.30e-13


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 75.46  E-value: 8.30e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1599 GELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRqaEAELALRVQAEAEAAREKQRALQALEELRLQAEEA--ER 1676
Cdd:PRK02224   162 GKLEEYRERASDARLGVERVLSDQRGSLDQLKAQIEEKE--EKDLHERLNGLESELAELDEEIERYEEQREQARETrdEA 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1677 RLRQAEAERARQVQVALETA---QRSAEAELQSEHASFAEKTAQLERTLKEehvavvqLREEATRRAQQQAEAERARAEA 1753
Cdd:PRK02224   240 DEVLEEHEERREELETLEAEiedLRETIAETEREREELAEEVRDLRERLEE-------LEEERDDLLAEAGLDDADAEAV 312
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1754 ERELERWQlKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEaerearrrgKAEEQAVRQRELA---EQELEKQRQLAEGT 1830
Cdd:PRK02224   313 EARREELE-DRDEELRDRLEECRVAAQAHNEEAESLREDAD---------DLEERAEELREEAaelESELEEAREAVEDR 382
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1831 AQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVR---AEMEVLLA------------ 1895
Cdd:PRK02224   383 REEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARervEEAEALLEagkcpecgqpve 462
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1896 --SKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAE--EDAVRQRAEAERVLAEKLAAISEA 1971
Cdd:PRK02224   463 gsPHVETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEErrEDLEELIAERRETIEEKRERAEEL 542
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1972 TRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIE---------ARLAQLRKASESELERQKGLVE-- 2040
Cdd:PRK02224   543 RERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIEslerirtllAAIADAEDEIERLREKREALAEln 622
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2041 ----DTLRQRRqveEEILALKGSFEKAAAGKAELELElgrirgTAEDTLRSKEQAEQEAARQRQlaaeeerrrreaeeRV 2116
Cdd:PRK02224   623 derrERLAEKR---ERKRELEAEFDEARIEEAREDKE------RAEEYLEQVEEKLDELREERD--------------DL 679
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237946 2117 QKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEE 2177
Cdd:PRK02224   680 QAEIGAVENELEELEELRERREALENRVEALEALYDEAEELESMYGDLRAELRQRNVETLE 740
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1468-1970 8.39e-13

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 75.76  E-value: 8.39e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1468 RRMEEEERLAEQQRAEERERLAE----VEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRS 1543
Cdd:COG3096    242 RMTLEAIRVTQSDRDLFKHLITEatnyVAADYMRHANERRELSERALELRRELFGARRQLAEEQYRLVEMARELEELSAR 321
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1544 iQEELQHLRQSSE---AEIQAKARQVEAAERSRLRIEEeirvVRLQLEATERQRGGAEGELqalraraEEAEAQKRQAQE 1620
Cdd:COG3096    322 -ESDLEQDYQAASdhlNLVQTALRQQEKIERYQEDLEE----LTERLEEQEEVVEEAAEQL-------AEAEARLEAAEE 389
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1621 EAERLRRQVQDETQrkrqaeaelALRVQAEAeaAREKQRALQALEELR-------LQAEEAERRLRQAEAERARQVQVAL 1693
Cdd:COG3096    390 EVDSLKSQLADYQQ---------ALDVQQTR--AIQYQQAVQALEKARalcglpdLTPENAEDYLAAFRAKEQQATEEVL 458
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1694 ETAQRSAEAElqsEHASFAEKTAQLERTLKEEHVavvqlREEAtrraqqqaeaeraraeaerelerWQlKANEALR---- 1769
Cdd:COG3096    459 ELEQKLSVAD---AARRQFEKAYELVCKIAGEVE-----RSQA-----------------------WQ-TARELLRryrs 506
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1770 LRLQAEEVAQQKSltqaeaekqkeeaerearRRGKAEEQAVRQRELAEQelekQRQLAEGTAQQRLAAEQelirLRAETE 1849
Cdd:COG3096    507 QQALAQRLQQLRA------------------QLAELEQRLRQQQNAERL----LEEFCQRIGQQLDAAEE----LEELLA 560
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1850 QGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAE----------EESRSTSEKSKQRLEAEA 1919
Cdd:COG3096    561 ELEAQLEELEEQAAEAVEQRSELRQQLEQLRARIKELAARAPAWLAAQDALErlreqsgealADSQEVTAAMQQLLERER 640
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1920237946 1920 GRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISE 1970
Cdd:COG3096    641 EATVERDELAARKQALESQIERLSQPGGAEDPRLLALAERLGGVLLSEIYD 691
CH_SMTNL1 cd21260
calponin homology (CH) domain found in smoothelin-like protein 1; Smoothelin-like protein 1 ...
303-404 9.54e-13

calponin homology (CH) domain found in smoothelin-like protein 1; Smoothelin-like protein 1 (SMTNL1), also called calponin homology-associated smooth muscle protein (CHASM), plays a role in the regulation of contractile properties of both striated and smooth muscles. It can bind to calmodulin and tropomyosin. When it is unphosphorylated, SMTNL1 may inhibit myosin dephosphorylation. SMTNL1 contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409109  Cd Length: 116  Bit Score: 67.80  E-value: 9.54e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  303 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 382
Cdd:cd21260      3 KNMLLEWCRAKTRGYEHVDIQNFSSSWSSGMAFCALIHKFFPDAFDYAELDPANRRHNFTLAFSTAEKHADCAPLLEVED 82
                           90       100
                   ....*....|....*....|...
gi 1920237946  383 -VDVPQPDEKSIITYVSSLYDAM 404
Cdd:cd21260     83 mVRMSVPDSKCVYTYIQELYRSL 105
CH_CYTSA cd21256
calponin homology (CH) domain found in cytospin-A; Cytospin-A, also called renal carcinoma ...
301-401 1.14e-12

calponin homology (CH) domain found in cytospin-A; Cytospin-A, also called renal carcinoma antigen NY-REN-22, sperm antigen with calponin homology and coiled-coil domains 1-like, or SPECC1-like protein (SPECC1L), is involved in cytokinesis and spindle organization. It may play a role in actin cytoskeleton organization and microtubule stabilization and hence is required for proper cell adhesion and migration. Cytospin-A contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409105  Cd Length: 119  Bit Score: 67.41  E-value: 1.14e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  301 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAErDLGVTRLLDP 380
Cdd:cd21256     14 SKRNALLKWCQKKTEGYQNIDITNFSSSWNDGLAFCALLHTYLPAHIPYQELNSQDKRRNFTLAFQAAE-SVGIKSTLDI 92
                           90       100
                   ....*....|....*....|..
gi 1920237946  381 ED-VDVPQPDEKSIITYVSSLY 401
Cdd:cd21256     93 NEmVRTERPDWQSVMTYVTAIY 114
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1522-2620 1.44e-12

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains; it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone of the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 74.83  E-value: 1.44e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1522 RMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEA--AERSRLRIEEEIRvVRLQLEATErqrggAEG 1599
Cdd:pfam01576    2 RQEEEMQAKEEELQKVKERQQKAESELKELEKKHQQLCEEKNALQEQlqAETELCAEAEEMR-ARLAARKQE-----LEE 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1600 ELQALRARAEEAEAQKRQAQEEAERLRRQVQDetqrkrqAEAELalrvqAEAEAAREKqralqaleeLRLQAEEAERRLR 1679
Cdd:pfam01576   76 ILHELESRLEEEEERSQQLQNEKKKMQQHIQD-------LEEQL-----DEEEAARQK---------LQLEKVTTEAKIK 134
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1680 QAEAErarqvqVALETAQRSaeaELQSEHASFAEKTAQLERTLKEEHVAVVQL-----REEATRRAQQQAEAERARAEAE 1754
Cdd:pfam01576  135 KLEED------ILLLEDQNS---KLSKERKLLEERISEFTSNLAEEEEKAKSLsklknKHEAMISDLEERLKKEEKGRQE 205
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1755 RELERWQLKAnEALRLRlqaEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEgtAQQR 1834
Cdd:pfam01576  206 LEKAKRKLEG-ESTDLQ---EQIAELQAQIAELRAQLAKKEEELQAALARLEEETAQKNNALKKIRELEAQISE--LQED 279
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1835 LAAEqelirlRAETEQGEQQRQLLEEELARLQRE---AAAATQKRRELEAelakvRAEMEVLLASKArAEEESRSTSEKS 1911
Cdd:pfam01576  280 LESE------RAARNKAEKQRRDLGEELEALKTEledTLDTTAAQQELRS-----KREQEVTELKKA-LEEETRSHEAQL 347
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1912 KQRLEAEAGRFRELAEE---AARLRALAEEAK--------------RQRQLAEEDAVRQRAEAERVLAEKLAAISEATRL 1974
Cdd:pfam01576  348 QEMRQKHTQALEELTEQleqAKRNKANLEKAKqalesenaelqaelRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQ 427
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1975 KTEAEIALKEKEAENERLRRLAEDeafqrrlLEEQAAQHKADIEARLAQLRKASE--SELERQKGLVEDTLrqrRQVEEE 2052
Cdd:pfam01576  428 RAELAEKLSKLQSELESVSSLLNE-------AEGKNIKLSKDVSSLESQLQDTQEllQEETRQKLNLSTRL---RQLEDE 497
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2053 ILALKGSFEKAAAGKAELELELGRIRGTAEDTlrsKEQAEQEAARQRQLaaeeerrrreaeERVQKSLAAEEEAARQRka 2132
Cdd:pfam01576  498 RNSLQEQLEEEEEAKRNVERQLSTLQAQLSDM---KKKLEEDAGTLEAL------------EEGKKRLQRELEALTQQ-- 560
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2133 aLEEVERLKAKVEEAR-RLRERAE-----QESARQLQLAQEAAQKR---LQAEEKA-HAFAVQQKEQELQQTLQQEQSVL 2202
Cdd:pfam01576  561 -LEEKAAAYDKLEKTKnRLQQELDdllvdLDHQRQLVSNLEKKQKKfdqMLAEEKAiSARYAEERDRAEAEAREKETRAL 639
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2203 ERLRSeaeaarraaeeaeaareraereAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAA 2282
Cdd:pfam01576  640 SLARA----------------------LEEALEAKEELERTNKQLRAEMEDLVSSKDDVGKNVHELERSKRALEQQVEEM 697
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2283 LRQKQaadaEMEKHKQFAEQA-LR-------QKAQVEQELTALRLQLEEtdhQKSILDEELQRLKAEVTEAARQRGQvee 2354
Cdd:pfam01576  698 KTQLE----ELEDELQATEDAkLRlevnmqaLKAQFERDLQARDEQGEE---KRRQLVKQVRELEAELEDERKQRAQ--- 767
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2355 eLFSLRVQME-ELGKLKARIEAENRAlvlrdKDSAQRLLQEEAEKMKQVAeeaarlsvaaqeaarlRQLAEEDLAQQRAL 2433
Cdd:pfam01576  768 -AVAAKKKLElDLKELEAQIDAANKG-----REEAVKQLKKLQAQMKDLQ----------------RELEEARASRDEIL 825
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2434 AEKMLKEKmqavqeatRLKA-EAELLQQQKELA-QEQARR-LQEDKEQMAQQLAQETQGfqKTLETERQRQLEmsaeaER 2510
Cdd:pfam01576  826 AQSKESEK--------KLKNlEAELLQLQEDLAaSERARRqAQQERDELADEIASGASG--KSALQDEKRRLE-----AR 890
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2511 LRLRVAEMSRAQARAEEDARRFRKQAEDIgERLyRTELATQEKvmLVQTLETQRQQSDRDAERLREAIAELEHE-KDKLK 2589
Cdd:pfam01576  891 IAQLEEELEEEQSNTELLNDRLRKSTLQV-EQL-TTELAAERS--TSQKSESARQQLERQNKELKAKLQEMEGTvKSKFK 966
                         1130      1140      1150
                   ....*....|....*....|....*....|.
gi 1920237946 2590 QEAQLLQLKSEEMqtvrQEQLLQETQALQQS 2620
Cdd:pfam01576  967 SSIAALEAKIAQL----EEQLEQESRERQAA 993
CH_PLS_rpt3 cd21298
third calponin homology (CH) domain found in the plastin family; The plastin family includes ...
188-289 1.99e-12

third calponin homology (CH) domain found in the plastin family; The plastin family includes plastin-1, -2, and -3. Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. Plastin-2, also called L-plastin, or LC64P, or lymphocyte cytosolic protein 1 (LCP-1), is an actin-binding protein that plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-3, also called T-plastin, is an actin-bundling protein found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Members of this family contain four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409147  Cd Length: 117  Bit Score: 66.88  E-value: 1.99e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  188 KTFTKWVNKhlIKAQRHISDLYEDLRDGHNLISLLEVLSGDSL-------PREKGRMRFHKLQNVQIALDYLRHRQVKLV 260
Cdd:cd21298      9 KTYRNWMNS--LGVNPFVNHLYSDLRDGLVLLQLYDKIKPGVVdwsrvnkPFKKLGANMKKIENCNYAVELGKKLKFSLV 86
                           90       100
                   ....*....|....*....|....*....
gi 1920237946  261 NIRNDDIADGNPKLTLGLIWTIILHFQIS 289
Cdd:cd21298     87 GIGGKDIYDGNRTLTLALVWQLMRAYTLS 115
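
Each domain hit above carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", optionally flagged "[Multi-domain]". As a minimal sketch for pulling those fields out of a saved text copy of this page, the Python below parses one such line. The function name, regular expression, and dictionary keys are illustrative assumptions, not part of any NCBI tool or API.

```python
import re

# Hypothetical pattern: field labels are copied from the summary lines on this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" marker
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse one hit summary line into a dict of typed fields."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


# Two summary lines copied from this page.
single = parse_summary(
    "Pssm-ID: 409147  Cd Length: 117  Bit Score: 66.88  E-value: 1.99e-12"
)
multi = parse_summary(
    "Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 73.67  E-value: 3.31e-12"
)
```

Keeping the E-value as a float makes it straightforward to sort or filter hits by significance.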
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3925-3963 2.93e-12

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 63.50  E-value: 2.93e-12
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3925 LLDAQLATGGIVDPRLGFHLPLDVAYQRGYLDKDTHDQL 3963
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
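
The two rows of an alignment block such as the Plectin repeat above can be compared column by column. The sketch below is an illustration under stated assumptions: the helper names are hypothetical, identity is counted only over columns where neither row has a gap (one of several common conventions), and model coverage is taken as the aligned model span divided by the Cd Length reported in the summary line.

```python
def parse_row(line):
    """Split an alignment row into (label, start, sequence, end).

    The last three whitespace-separated fields are start, sequence, end;
    whatever precedes them (e.g. "gi 1920237946") is the label.
    """
    parts = line.split()
    start, seq, end = parts[-3:]
    return " ".join(parts[:-3]), int(start), seq, int(end)


def identity(query_seq, model_seq):
    """Return (identical columns, compared columns), skipping gap columns."""
    if len(query_seq) != len(model_seq):
        raise ValueError("aligned rows must have equal length")
    pairs = [(q, m) for q, m in zip(query_seq, model_seq)
             if q != "-" and m != "-"]
    matches = sum(q.upper() == m.upper() for q, m in pairs)
    return matches, len(pairs)


# Rows copied from the Plectin repeat hit at residues 3925-3963.
q_label, q_start, q_seq, q_end = parse_row(
    "gi 1920237946 3925 LLDAQLATGGIVDPRLGFHLPLDVAYQRGYLDKDTHDQL 3963"
)
m_label, m_start, m_seq, m_end = parse_row(
    "Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39"
)

matches, columns = identity(q_seq, m_seq)
cd_length = 39  # "Cd Length" from the summary line of this hit
coverage = (m_end - m_start + 1) / cd_length

print(f"{matches}/{columns} identical columns, model coverage {coverage:.0%}")
```

For longer hits whose alignment is split across several 80-column blocks, the sequence fields of successive blocks would need to be concatenated per row before calling `identity`.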
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1812-2734 3.31e-12

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains. It is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone of the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 73.67  E-value: 3.31e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1812 QRELAEQELEKQR-QLAEGTAQQRLAA-EQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAe 1889
Cdd:pfam01576  109 EEQLDEEEAARQKlQLEKVTTEAKIKKlEEDILLLEDQNSKLSKERKLLEERISEFTSNLAEEEEKAKSLSKLKNKHEA- 187
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1890 MEVLLASKARAEEESRSTSEKSKQRLEAEAGrfrELAEEAARLRALAEEAKRQRQLAEEDavrqraeaervLAEKLAAIS 1969
Cdd:pfam01576  188 MISDLEERLKKEEKGRQELEKAKRKLEGEST---DLQEQIAELQAQIAELRAQLAKKEEE-----------LQAALARLE 253
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1970 EATRLKTEAEIALKEKEAENERLRRLAEDEAFQRrlleEQAAQHKADIEARLAQLRKASESELERQKGLVEdtLRQRRqv 2049
Cdd:pfam01576  254 EETAQKNNALKKIRELEAQISELQEDLESERAAR----NKAEKQRRDLGEELEALKTELEDTLDTTAAQQE--LRSKR-- 325
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2050 EEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLrsKEQAEQeAARQRQLAAEEERRRREAEERVQKSL----AAEEE 2125
Cdd:pfam01576  326 EQEVTELKKALEEETRSHEAQLQEMRQKHTQALEEL--TEQLEQ-AKRNKANLEKAKQALESENAELQAELrtlqQAKQD 402
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2126 AARQRKAALEEVERLKAKVEEARRLRERAEQESARqLQLAQEAAQKRLQAEEKahafavqqKEQELQQTLQQEQSVLERL 2205
Cdd:pfam01576  403 SEHKRKKLEGQLQELQARLSESERQRAELAEKLSK-LQSELESVSSLLNEAEG--------KNIKLSKDVSSLESQLQDT 473
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2206 RSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQ 2285
Cdd:pfam01576  474 QELLQEETRQKLNLSTRLRQLEDERNSLQEQLEEEEEAKRNVERQLSTLQAQLSDMKKKLEEDAGTLEALEEGKKRLQRE 553
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2286 KQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLeetDHQKSILDEELQRLKAEVTEAArqrgqvEEELFSLRVQMEe 2365
Cdd:pfam01576  554 LEALTQQLEEKAAAYDKLEKTKNRLQQELDDLLVDL---DHQRQLVSNLEKKQKKFDQMLA------EEKAISARYAEE- 623
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2366 lgklKARIEAENRalvlrDKDSAQRLLQEEAEKMKQVAEEAARlsvaaqeAARLRQLAEEDLAQQRALAEKMLKE----K 2441
Cdd:pfam01576  624 ----RDRAEAEAR-----EKETRALSLARALEEALEAKEELER-------TNKQLRAEMEDLVSSKDDVGKNVHElersK 687
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2442 MQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLEtERQRQL-----EMSAEAERLRLRVA 2516
Cdd:pfam01576  688 RALEQQVEEMKTQLEELEDELQATEDAKLRLEVNMQALKAQFERDLQARDEQGE-EKRRQLvkqvrELEAELEDERKQRA 766
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2517 EMSRAQARAEEDARRFRKQAEDIGERlyRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQ 2596
Cdd:pfam01576  767 QAVAAKKKLELDLKELEAQIDAANKG--REEAVKQLKKLQAQMKDLQRELEEARASRDEILAQSKESEKKLKNLEAELLQ 844
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2597 LKSEEMQTVRQE-QLLQETQALQQ---SFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQ 2672
Cdd:pfam01576  845 LQEDLAASERARrQAQQERDELADeiaSGASGKSALQDEKRRLEARIAQLEEELEEEQSNTELLNDRLRKSTLQVEQLTT 924
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2673 QLAAS---------------------------MEEARRRQHEA-----EEGVRRQQEELQRLAQQQQQQEKLLAEENQRL 2720
Cdd:pfam01576  925 ELAAErstsqksesarqqlerqnkelkaklqeMEGTVKSKFKSsiaalEAKIAQLEEQLEQESRERQAANKLVRRTEKKL 1004
                          970
                   ....*....|....
gi 1920237946 2721 RERLQHLEEERRAA 2734
Cdd:pfam01576 1005 KEVLLQVEDERRHA 1018
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
2002-2742 3.32e-12

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 73.47  E-value: 3.32e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2002 QRRLLEEQAA---QHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIR 2078
Cdd:pfam02463  153 ERRLEIEEEAagsRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYL 232
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2079 GTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQES 2158
Cdd:pfam02463  233 KLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDD 312
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2159 ARQLQLAQE---AAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRR 2235
Cdd:pfam02463  313 EEKLKESEKekkKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKL 392
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2236 QVEEAErlKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELT 2315
Cdd:pfam02463  393 KEEELE--LKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKS 470
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2316 ALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEE 2395
Cdd:pfam02463  471 EDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVI 550
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2396 AEKMKQVAEEAARLSVAAQEAARLRQLAEedlaqqralaekmlkekmqavqeaTRLKAEAELLQQQKELAQEQARRLQED 2475
Cdd:pfam02463  551 VEVSATADEVEERQKLVRALTELPLGARK------------------------LRLLIPKLKLPLKSIAVLEIDPILNLA 606
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2476 KEQMAQQLAQETQGFQKTLETERQRQLEMSaeaERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVM 2555
Cdd:pfam02463  607 QLDKATLEADEDDKRAKVVEGILKDTELTK---LKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQE 683
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2556 LVQTLETQRQQSdrdaERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCI 2635
Cdd:pfam02463  684 KAESELAKEEIL----RRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKK 759
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2636 EQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGvrrQQEELQRLAQQQQQQEKLLAE 2715
Cdd:pfam02463  760 EEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAE---LLEEEQLLIEQEEKIKEEELE 836
                          730       740
                   ....*....|....*....|....*..
gi 1920237946 2716 ENQRLRERLQHLEEERRAALARSEEIA 2742
Cdd:pfam02463  837 ELALELKEEQKLEKLAEEELERLEEEI 863
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1804-2025 3.75e-12

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 71.72  E-value: 3.75e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1804 KAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAEL 1883
Cdd:COG4942     20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAEL 99
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1884 AKVRAE----------------MEVLLASKARAEEESRSTSEKS-KQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLA 1946
Cdd:COG4942    100 EAQKEElaellralyrlgrqppLALLLSPEDFLDAVRRLQYLKYlAPARREQAEELRADLAELAALRAELEAERAELEAL 179
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 1947 EEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEdeafqrRLLEEQAAQHKADIEARLAQLR 2025
Cdd:COG4942    180 LAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIA------RLEAEAAAAAERTPAAGFAALK 252
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1212-2097 3.90e-12

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 73.47  E-value: 3.90e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1212 EKLKTISLVIRSTQEAEEVLRAHEEQLKEAQAVPATLPELEATKAALKKLRaqaeaqqpvfdalrdelrgaqevgerlqq 1291
Cdd:pfam02463  180 EETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLD----------------------------- 230
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1292 rhgERDVEVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESAdplgawlrDAKQRQEQIQAVPLANSQAVR 1371
Cdd:pfam02463  231 ---YLKLNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEE--------KEKKLQEEELKLLAKEEEELK 299
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1372 EQLRQEKALLEDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSgsesiIQEYVDLRTRYSE 1451
Cdd:pfam02463  300 SELLKLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEEL-----EKLQEKLEQLEEE 374
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1452 LSTLTSQYIRFISETLRR--MEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGL---QRRMQEE 1526
Cdd:pfam02463  375 LLAKKKLESERLSSAAKLkeEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGkltEEKEELE 454
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1527 VARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLR-IEEEIRVVRLQLEATERQRGGAEGELQALR 1605
Cdd:pfam02463  455 KQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESkARSGLKVLLALIKDGVGGRIISAHGRLGDL 534
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1606 ARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQ-AEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAE 1684
Cdd:pfam02463  535 GVAVENYKVAISTAVIVEVSATADEVEERQKLVrALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLE 614
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1685 RARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHvavvQLREEATRRAQQQAEAERARAEAERELERWQLKA 1764
Cdd:pfam02463  615 ADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEE----GLAEKSEVKASLSELTKELLEIQELQEKAESELA 690
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1765 NEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRL 1844
Cdd:pfam02463  691 KEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELS 770
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1845 RAETEQGEQQRQLLEEELARLQREAAAATQKrrELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRE 1924
Cdd:pfam02463  771 LKEKELAEEREKTEKLKVEEEKEEKLKAQEE--ELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKL 848
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1925 LAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRR 2004
Cdd:pfam02463  849 EKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAE 928
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2005 LLEEQAAQHKADIEARLAQLRKASESELERQKglvEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDT 2084
Cdd:pfam02463  929 ILLKYEEEPEELLLEEADEKEKEENNKEEEEE---RNKRLLLAKEELGKVNLMAIEEFEEKEERYNKDELEKERLEEEKK 1005
                          890
                   ....*....|...
gi 1920237946 2085 LRSKEQAEQEAAR 2097
Cdd:pfam02463 1006 KLIRAIIEETCQR 1018
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3514-3552 4.41e-12

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 63.12  E-value: 4.41e-12
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3514 LLEAQAATGFLVDPVRNQRLYVHEAVKAGVVGPELHEKL 3552
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1143-1728 4.55e-12

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 73.06  E-value: 4.55e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1143 RECAQRITEQQKAQAEVDGLGKGVARLSAEAEkvlalpepspaaptlrsELELTLGKLEqvrslsaiylEKLKTISLVIR 1222
Cdd:COG3096    522 AELEQRLRQQQNAERLLEEFCQRIGQQLDAAE-----------------ELEELLAELE----------AQLEELEEQAA 574
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1223 STQEAEEVLRAHEEQLKE-AQAVPATLPELEATKAALKKLRAQAEAQqpvFDALRDELRGAQEVGERLQQRHGERDvEVE 1301
Cdd:COG3096    575 EAVEQRSELRQQLEQLRArIKELAARAPAWLAAQDALERLREQSGEA---LADSQEVTAAMQQLLEREREATVERD-ELA 650
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1302 RWRERVTLLLERWQAVLAQTDVRQREL-EQLGRQL--RYYR----ESADPLGAWLRDAKQrqeqiqAVPLANSQAVREQL 1374
Cdd:COG3096    651 ARKQALESQIERLSQPGGAEDPRLLALaERLGGVLlsEIYDdvtlEDAPYFSALYGPARH------AIVVPDLSAVKEQL 724
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1375 RQEKALLED---IERHGEKVEECQRFAKQYINAI--KDYELQLVTYKAQLEPV----ASPAKKPKVQSGSESIIQEYVDL 1445
Cdd:COG3096    725 AGLEDCPEDlylIEGDPDSFDDSVFDAEELEDAVvvKLSDRQWRYSRFPEVPLfgraAREKRLEELRAERDELAEQYAKA 804
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1446 RTRYSELSTLTSQYIRFISETLRRMEEEErlAEQQRAEERERLAEVEAALEKQRQlAEAHAQAKAQAEREAQGLQRRMQE 1525
Cdd:COG3096    805 SFDVQKLQRLHQAFSQFVGGHLAVAFAPD--PEAELAALRQRRSELERELAQHRA-QEQQLRQQLDQLKEQLQLLNKLLP 881
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1526 EV---------ARREEVAVE---AQEQKRSIQEELQHLRQSSE--AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATE 1591
Cdd:COG3096    882 QAnlladetlaDRLEELREEldaAQEAQAFIQQHGKALAQLEPlvAVLQSDPEQFEQLQADYLQAKEQQRRLKQQIFALS 961
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1592 --RQR------GGAEGEL-------QALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAARE 1656
Cdd:COG3096    962 evVQRrphfsyEDAVGLLgensdlnEKLRARLEQAEEARREAREQLRQAQAQYSQYNQVLASLKSSRDAKQQTLQELEQE 1041
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237946 1657 -KQRALQALEELRLQAEEaERRLRQAEAERARQVQVALETAQRSAEAELQSEHASF--AEKTAQLERTLKEEHVA 1728
Cdd:COG3096   1042 lEELGVQADAEAEERARI-RRDELHEELSQNRSRRSQLEKQLTRCEAEMDSLQKRLrkAERDYKQEREQVVQAKA 1115
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
4437-4475 4.59e-12

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 63.12  E-value: 4.59e-12
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 4437 LLEAQACTGGIIDPSTGERFPVTDAVNKGLVDKIMVDRI 4475
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
2327-2734 5.21e-12

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 72.49  E-value: 5.21e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2327 QKSILDEELQRLKAEVTEAARQRG---QVEEELFSLRVQMEELGKLKARIEAENRAL--------VLRDKDSAQRLLQEE 2395
Cdd:COG4717     65 KPELNLKELKELEEELKEAEEKEEeyaELQEELEELEEELEELEAELEELREELEKLekllqllpLYQELEALEAELAEL 144
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2396 AEKMKQVAEEAARLSVAAQEAARLRQLAEEdlaQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQED 2475
Cdd:COG4717    145 PERLEELEERLEELRELEEELEELEAELAE---LQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEE 221
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2476 KEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEED---------------ARRFRKQAEDIG 2540
Cdd:COG4717    222 LEELEEELEQLENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTiagvlflvlgllallFLLLAREKASLG 301
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2541 ERLYRTELATQEkvmlvQTLETQRQQSDRDAERLREAI--AELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQAL- 2617
Cdd:COG4717    302 KEAEELQALPAL-----EELEEEELEELLAALGLPPDLspEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALl 376
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2618 QQSFLSEKDSLLQRERCIEQEKAKLEQLfqdEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQE 2697
Cdd:COG4717    377 AEAGVEDEEELRAALEQAEEYQELKEEL---EELEEQLEELLGELEELLEALDEEELEEELEELEEELEELEEELEELRE 453
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|..
gi 1920237946 2698 ELQRLAQQQQQQEK-----LLAEENQRLRERLQHLEEERRAA 2734
Cdd:COG4717    454 ELAELEAELEQLEEdgelaELLQELEELKAELRELAEEWAAL 495
CH_SF cd00014
calponin homology (CH) domain superfamily; CH domains are actin filament (F-actin) binding ...
303-402 5.46e-12

calponin homology (CH) domain superfamily; CH domains are actin filament (F-actin) binding motifs, which may be present as a single copy or in tandem repeats (which increase binding affinity). They either function as autonomous actin binding motifs or serve a regulatory function. CH domains are found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, as well as proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).


Pssm-ID: 409031 [Multi-domain]  Cd Length: 103  Bit Score: 65.05  E-value: 5.46e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  303 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTN---LENLDQAFSVAER-DLGVTRLL 378
Cdd:cd00014      1 EEELLKWINEVLGEELPVSITDLFESLRDGVLLCKLINKLSPGSIPKINKKPKSPfkkRENINLFLNACKKlGLPELDLF 80
                           90       100
                   ....*....|....*....|....
gi 1920237946  379 DPEDVdVPQPDEKSIITYVSSLYD 402
Cdd:cd00014     81 EPEDL-YEKGNLKKVLGTLWALAL 103
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
2301-2625 7.36e-12

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 72.08  E-value: 7.36e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2301 EQALRQKAQVEQELTALRLQLEETDHQKSIlDEELQRLKAEVTEAARQRGQVEEELFSL--RVQMEELGKLK-------A 2371
Cdd:pfam17380  255 EYTVRYNGQTMTENEFLNQLLHIVQHQKAV-SERQQQEKFEKMEQERLRQEKEEKAREVerRRKLEEAEKARqaemdrqA 333
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2372 RIEAENRALVLRDKDSAQRLLQEEaekmKQVAEEAARLSVAAQEAARLRQLaeEDLAQQRALAEKMLKEKMQAVQEATRL 2451
Cdd:pfam17380  334 AIYAEQERMAMERERELERIRQEE----RKRELERIRQEEIAMEISRMREL--ERLQMERQQKNERVRQELEAARKVKIL 407
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2452 KAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQgfQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARR 2531
Cdd:pfam17380  408 EEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEER--AREMERVRLEEQERQQQVERLRQQEEERKRKKLELEKEKRD 485
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2532 fRKQAEDIGERLYRTELATQEKVM--------LVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQ 2603
Cdd:pfam17380  486 -RKRAEEQRRKILEKELEERKQAMieeerkrkLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRRIQEQMRKATEE 564
                          330       340
                   ....*....|....*....|..
gi 1920237946 2604 TVRQEQLLQETQALQQSFLSEK 2625
Cdd:pfam17380  565 RSRLEAMEREREMMRQIVESEK 586
SPEC cd00176
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ...
744-922 8.48e-12

Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin. The spectrin repeat forms a three-helix bundle, with the second helix interrupted by proline in some sequences. The repeats are independent folding units; tandem repeats occur in differing numbers and arrange in an antiparallel manner to form dimers. The repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C, and are separated by a linker of 5 residues. Two copies of the repeat are present here.


Pssm-ID: 238103 [Multi-domain]  Cd Length: 213  Bit Score: 67.86  E-value: 8.48e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  744 LHGFVAAATKELMWLSDREEEEVGFDWSDRNTNMAAKKEGYSALMHELELKEKKIKEIQSTGDRLLREDHPARPTAESFQ 823
Cdd:cd00176      2 LQQFLRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAEEIQERL 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  824 AALQTQWSWMLQLCCCIEAHLKENTAYFQFFSDVREAEEQLRKLQETLRRKYTCDrsiTATRLEDLLQDAQDEKEQLSEY 903
Cdd:cd00176     82 EELNQRWEELRELAEERRQRLEEALDLQQFFRDADDLEQWLEEKEAALASEDLGK---DLESVEELLKKHKELEEELEAH 158
                          170
                   ....*....|....*....
gi 1920237946  904 RGHLSGLAKRAKAIVQLKP 922
Cdd:cd00176    159 EPRLKSLNELAEELLEEGH 177
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1439-1937 1.00e-11

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 71.91  E-value: 1.00e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1439 IQEYVDLRTRYSELSTLTSQYirFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAqg 1518
Cdd:COG3096    248 IRVTQSDRDLFKHLITEATNY--VAADYMRHANERRELSERALELRRELFGARRQLAEEQYRLVEMARELEELSARES-- 323
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1519 lqrrmqeevarreevAVEAQEQKRSiqeelQHLrqsseAEIQAKARQVEAAERSRLRIEEeirvVRLQLEATERQRGGAE 1598
Cdd:COG3096    324 ---------------DLEQDYQAAS-----DHL-----NLVQTALRQQEKIERYQEDLEE----LTERLEEQEEVVEEAA 374
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1599 GELqalraraEEAEAQKRQAQEEAERLRRQVQDETQrkrqaeaelALRVQaeAEAAREKQRALQALEELR-------LQA 1671
Cdd:COG3096    375 EQL-------AEAEARLEAAEEEVDSLKSQLADYQQ---------ALDVQ--QTRAIQYQQAVQALEKARalcglpdLTP 436
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1672 EEAERRLRQAEAERARQVQVALETAQRSAEAElqsEHASFAEKTAQLERTLKEEHVavvqlREEATRRAQQQAEAERARA 1751
Cdd:COG3096    437 ENAEDYLAAFRAKEQQATEEVLELEQKLSVAD---AARRQFEKAYELVCKIAGEVE-----RSQAWQTARELLRRYRSQQ 508
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1752 EAERELERWQLKANEA-LRLRLQAEEVAQQKSLtqaeaekqkeeaerearrrGKAEEQAVRQRELAEQELEKQRQLAEGT 1830
Cdd:COG3096    509 ALAQRLQQLRAQLAELeQRLRQQQNAERLLEEF-------------------CQRIGQQLDAAEELEELLAELEAQLEEL 569
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1831 AQQRLAAEQELIRLRAETEQGEQQRQLLeEELARLQREAAAATQKRRELEAE----LAKVRAEMEVLLaSKARAEEESRS 1906
Cdd:COG3096    570 EEQAAEAVEQRSELRQQLEQLRARIKEL-AARAPAWLAAQDALERLREQSGEaladSQEVTAAMQQLL-EREREATVERD 647
                          490       500       510
                   ....*....|....*....|....*....|....*
gi 1920237946 1907 TSEKSKQRLEAEAgrfRELA----EEAARLRALAE 1937
Cdd:COG3096    648 ELAARKQALESQI---ERLSqpggAEDPRLLALAE 679
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
4092-4130 1.05e-11

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 61.96  E-value: 1.05e-11
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 4092 LLEAQAATGYVIDPIKGLKLTVEEAVRMGIVGPEFKDKL 4130
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1533-1736 1.12e-11

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 70.18  E-value: 1.12e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1533 VAVEAQEQKRSIQEELQHLRQsseaEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAE 1612
Cdd:COG4942     14 AAAAQADAAAEAEAELEQLQQ----EIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELE 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1613 AQKRQAQEEAERLRRQVQDET----QRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRL-----RQAEA 1683
Cdd:COG4942     90 KEIAELRAELEAQKEELAELLralyRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLaelaaLRAEL 169
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1920237946 1684 ERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEA 1736
Cdd:COG4942    170 EAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEA 222
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3259-3297 1.19e-11

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 61.96  E-value: 1.19e-11
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3259 LLDAQLSTGGIVDPSKSHRVPLDVACARGYLDKETSAAL 3297
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1500-1870 1.23e-11

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 71.31  E-value: 1.23e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1500 QLAEAHAQAKAQAEREAQGLQRRMQEEVARREevaveaqeqKRSIQEELQHLRQSSEAEiqaKARQVEA-------AERS 1572
Cdd:pfam17380  273 QLLHIVQHQKAVSERQQQEKFEKMEQERLRQE---------KEEKAREVERRRKLEEAE---KARQAEMdrqaaiyAEQE 340
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1573 RLRIEEEIRVVRLQLEatERQRggaegELQALRaraEEAEAQKRQAQEEAERLRRQVQDETQRKRQaEAELALRVQ-AEA 1651
Cdd:pfam17380  341 RMAMERERELERIRQE--ERKR-----ELERIR---QEEIAMEISRMRELERLQMERQQKNERVRQ-ELEAARKVKiLEE 409
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1652 EAAREKQRALQALEELRLQAEEA-ERRLRQAEAERARQVQ-VALETAQRSAEAE-LQSEHASFAEKTAQLERTLKEEHVA 1728
Cdd:pfam17380  410 ERQRKIQQQKVEMEQIRAEQEEArQREVRRLEEERAREMErVRLEEQERQQQVErLRQQEEERKRKKLELEKEKRDRKRA 489
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1729 VVQLREeatrraqqqaeaeraraeaereLERWQLKANEalrlRLQAEEVAQQKSLTQAEAEKQKEEAEREARRrgKAEEQ 1808
Cdd:pfam17380  490 EEQRRK----------------------ILEKELEERK----QAMIEEERKRKLLEKEMEERQKAIYEEERRR--EAEEE 541
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 1809 AVRQrelaeQELEKQRQLAEgtaqQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAA 1870
Cdd:pfam17380  542 RRKQ-----QEMEERRRIQE----QMRKATEERSRLEAMEREREMMRQIVESEKARAEYEAT 594
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
1471-2177 1.45e-11

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 and 1234 amino acids in length. This family contains a P-loop motif, suggesting that it is a nucleotide-binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 71.41  E-value: 1.45e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1471 EEEERLAEQQRAEERERLAEVEAALEKQ-----RQLAEAHAQAKAQAEREAQGLQRRMQEEVARR-------------EE 1532
Cdd:pfam12128  104 RLDDFIKANNDFVKCETVAELGRFMKNAgiqrtNLLNTREYRSIIQNDRTLLGRERVELRSLARQfalcdsesplrhiDK 183
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1533 VAVEAQEQK------RSIQEELQHLRQSSEAEIQAKARQVEA--AERSRLRIEEEIRVVRLQLEATERQRGGAEGELQAL 1604
Cdd:pfam12128  184 IAKAMHSKEgkfrdvKSMIVAILEDDGVVPPKSRLNRQQVEHwiRDIQAIAGIMKIRPEFTKLQQEFNTLESAELRLSHL 263
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1605 RARAEEAEAQKRQAQEEAE----RLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQ 1680
Cdd:pfam12128  264 HFGYKSDETLIASRQEERQetsaELNQLLRTLDDQWKEKRDELNGELSAADAAVAKDRSELEALEDQHGAFLDADIETAA 343
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1681 AEAERARQVQVALETAQRSAEAeLQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAerelerw 1760
Cdd:pfam12128  344 ADQEQLPSWQSELENLEERLKA-LTGKHQDVTAKYNRRRSKIKEQNNRDIAGIKDKLAKIREARDRQLAVAED------- 415
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1761 qlkANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRG-KAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQ 1839
Cdd:pfam12128  416 ---DLQALESELREQLEAGKLEFNEEEYRLKSRLGELKLRLNQaTATPELLLQLENFDERIERAREEQEAANAEVERLQS 492
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1840 ELIRLRAETEQGEQQRQLLEEELARLQREAAAATQ----KRRELEAELAKVRAEMEVLLASKARAEEESRS------TSE 1909
Cdd:pfam12128  493 ELRQARKRRDQASEALRQASRRLEERQSALDELELqlfpQAGTLLHFLRKEAPDWEQSIGKVISPELLHRTdldpevWDG 572
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1910 KSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQR---AEAERVLAEKLAAISEATRLKTEAEIALKEKE 1986
Cdd:pfam12128  573 SVGGELNLYGVKLDLKRIDVPEWAASEEELRERLDKAEEALQSARekqAAAEEQLVQANGELEKASREETFARTALKNAR 652
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1987 aenERLRRLAEDEAFQRRLLEEQAAQHKA-------DIEARLAQLRKASESELERQKG---------------LVEDTLR 2044
Cdd:pfam12128  653 ---LDLRRLFDEKQSEKDKKNKALAERKDsanerlnSLEAQLKQLDKKHQAWLEEQKEqkreartekqaywqvVEGALDA 729
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2045 QRRQVEEEILALKGSFE-------------------------KAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQR 2099
Cdd:pfam12128  730 QLALLKAAIAARRSGAKaelkaletwykrdlaslgvdpdviaKLKREIRTLERKIERIAVRRQEVLRYFDWYQETWLQRR 809
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2100 QLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEevERLKAKVEEARRLRE-----RAEQESARQLQLAQEAAQKRLQ 2174
Cdd:pfam12128  810 PRLATQLSNIERAISELQQQLARLIADTKLRRAKLE--MERKASEKQQVRLSEnlrglRCEMSKLATLKEDANSEQAQGS 887

                   ...
gi 1920237946 2175 AEE 2177
Cdd:pfam12128  888 IGE 890
CH_CYTSB cd21257
calponin homology (CH) domain found in cytospin-B; Cytospin-B, also called nuclear structure ...
301-401 1.46e-11

calponin homology (CH) domain found in cytospin-B; Cytospin-B, also called nuclear structure protein 5 (NSP5), sperm antigen HCMOGT-1, or sperm antigen with calponin homology and coiled-coil domains 1 (SPECC1), contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409106  Cd Length: 112  Bit Score: 64.28  E-value: 1.46e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  301 TAKEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAErDLGVTRLLDP 380
Cdd:cd21257      8 SKRNALLKWCQKKTEGYPNIDITNFSSSWSDGLAFCALLHTYLPAHIPYQELSSQDKKRNLLLAFQAAE-SVGIKPSLEL 86
                           90       100
                   ....*....|....*....|..
gi 1920237946  381 ED-VDVPQPDEKSIITYVSSLY 401
Cdd:cd21257     87 SEmMYTDRPDWQSVMQYVAQIY 108
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
2289-2645 1.62e-11

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 71.22  E-value: 1.62e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2289 ADAEMEKHKQFAEQA--LRQKA-QVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEE 2365
Cdd:PRK02224   344 AESLREDADDLEERAeeLREEAaELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDE 423
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2366 LGKLKARIEAENRALVLRDKDsAQRLLQE-EAEKMKQVAEEAARLSVAAQEAARLRQLAEE--DLAQQRALAEKMLKEKM 2442
Cdd:PRK02224   424 LREREAELEATLRTARERVEE-AEALLEAgKCPECGQPVEGSPHVETIEEDRERVEELEAEleDLEEEVEEVEERLERAE 502
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2443 QAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQmAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSraQ 2522
Cdd:PRK02224   503 DLVEAEDRIERLEERREDLEELIAERRETIEEKRER-AEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELN--S 579
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2523 ARAEEDARrfRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERL---REAIAELEHEKDklkqEAQLLQLKS 2599
Cdd:PRK02224   580 KLAELKER--IESLERIRTLLAAIADAEDEIERLREKREALAELNDERRERLaekRERKRELEAEFD----EARIEEARE 653
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*..
gi 1920237946 2600 EEMQTVR-QEQLLQETQALQQsflsEKDSLLQRERCIEQEKAKLEQL 2645
Cdd:PRK02224   654 DKERAEEyLEQVEEKLDELRE----ERDDLQAEIGAVENELEELEEL 696
COG3899 COG3899
Predicted ATPase [General function prediction only];
1641-2177 2.26e-11

Predicted ATPase [General function prediction only];


Pssm-ID: 443106 [Multi-domain]  Cd Length: 1244  Bit Score: 70.66  E-value: 2.26e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1641 AELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAE-RARQVQVALETAQRSAEAELQSEHAS--------F 1711
Cdd:COG3899    712 ARRALARGAYAEALRYLERALELLPPDPEEEYRLALLLELAEALyLAGRFEEAEALLERALAARALAALAAlrhgnppaS 791
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1712 AEKTAQLERTLKEEHVAVVQLREEATRRAQQ--QAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQkslTQAEAE 1789
Cdd:COG3899    792 ARAYANLGLLLLGDYEEAYEFGELALALAERlgDRRLEARALFNLGFILHWLGPLREALELLREALEAGLE---TGDAAL 868
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1790 KQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREA 1869
Cdd:COG3899    869 ALLALAAAAAAAAAAAALAAAAAAAARLLAAAAAALAAAAAAAALAAAELARLAAAAAAAAALALAAAAAAAAAAALAAA 948
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1870 AAATQKRRELEAELAKVRAEMEVLLASKARAeeesrstsekskqRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEED 1949
Cdd:COG3899    949 AAAAALAAALALAAAAAAAAAAALAAAAAAA-------------AAAAAAAAAAALEAAAAALLALLAAAAAAAAAAAAL 1015
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1950 AVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASE 2029
Cdd:COG3899   1016 AAALLAAALAALAAAAAAAALLAAAAALALLAALAAAAAAAAAAAALAAAAALLAAAAAAAAAAAAAAAAAALAAALAAA 1095
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2030 SELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRR 2109
Cdd:COG3899   1096 ALAAAAAAALALAAALAALALAAALAALALAAAARAAAALLLLAAALALALAALLLLAALLLALALLLLALAALALAAAL 1175
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237946 2110 REAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEE 2177
Cdd:COG3899   1176 AALAAALLAAAAAAAAAAALLAALLALAARLAALLALALLALEAAALLLLLLLAALALAAALLALRLL 1243
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1845-2161 2.44e-11

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 70.15  E-value: 2.44e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1845 RAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAElAKVRAEMEVLLASKARAEEESRSTSEKSkqrlEAEAGRFRE 1924
Cdd:pfam17380  295 KMEQERLRQEKEEKAREVERRRKLEEAEKARQAEMDRQ-AAIYAEQERMAMERERELERIRQEERKR----ELERIRQEE 369
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1925 LAEEAARLRALaEEAKRQRQLAEEdAVRQRAEAERV--LAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEA-- 2000
Cdd:pfam17380  370 IAMEISRMREL-ERLQMERQQKNE-RVRQELEAARKvkILEEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAre 447
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2001 FQRRLLEEQAAQHKADIEARLAQLRKASESELERQKglvedtlRQRRQVEEEilaLKGSFEKAAAGKAELELELGRIRGT 2080
Cdd:pfam17380  448 MERVRLEEQERQQQVERLRQQEEERKRKKLELEKEK-------RDRKRAEEQ---RRKILEKELEERKQAMIEEERKRKL 517
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2081 AEDTLRSKEQAEQEAARQRQlaaeeerrrREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESAR 2160
Cdd:pfam17380  518 LEKEMEERQKAIYEEERRRE---------AEEERRKQQEMEERRRIQEQMRKATEERSRLEAMEREREMMRQIVESEKAR 588

                   .
gi 1920237946 2161 Q 2161
Cdd:pfam17380  589 A 589
PTZ00121 PTZ00121
MAEBL; Provisional
1123-1678 2.90e-11

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 70.56  E-value: 2.90e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1123 EACETRTVHRLRLPLDKEPARECAQRITEQQKAqaevDGLGKGV--ARLSAEAEKVLAlPEPSPAAPTLRSELELTLGKL 1200
Cdd:PTZ00121  1285 KAEEKKKADEAKKAEEKKKADEAKKKAEEAKKA----DEAKKKAeeAKKKADAAKKKA-EEAKKAAEAAKAEAEAAADEA 1359
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1201 EQVRSLS-AIYLEKLKTISLVIRSTQEAEEVLRAhEEQLKEAQAVPATLPELEATKAALKK---LRAQAEAQQPVfdalr 1276
Cdd:PTZ00121  1360 EAAEEKAeAAEKKKEEAKKKADAAKKKAEEKKKA-DEAKKKAEEDKKKADELKKAAAAKKKadeAKKKAEEKKKA----- 1433
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1277 DELRGAQEvgERLQQRHGERDVEVERWRERVTLLLERwqavlaqtdvrQRELEQLGRQLRYYREsADPLGAWLRDAKQRQ 1356
Cdd:PTZ00121  1434 DEAKKKAE--EAKKADEAKKKAEEAKKAEEAKKKAEE-----------AKKADEAKKKAEEAKK-ADEAKKKAEEAKKKA 1499
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1357 EQIQAVPLANSQAVREQLRQEKALLEDIERHGE--KVEEcqrfAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSG 1434
Cdd:PTZ00121  1500 DEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEakKADE----AKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEED 1575
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1435 SESIIQEYVDLR----TRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERlaeveaalEKQRQLAEAHAQAKA 1510
Cdd:PTZ00121  1576 KNMALRKAEEAKkaeeARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEK--------KKVEQLKKKEAEEKK 1647
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1511 QAEReaqgLQRRMQEEVARREEVAVEAQEQKRSIQEelqhLRQSSEAEiqakarqvEAAERSRLRIEEEIRVVRLQLEAT 1590
Cdd:PTZ00121  1648 KAEE----LKKAEEENKIKAAEEAKKAEEDKKKAEE----AKKAEEDE--------KKAAEALKKEAEEAKKAEELKKKE 1711
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1591 ERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAElalrvQAEAEAAREKQRALQALEELRLQ 1670
Cdd:PTZ00121  1712 AEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLK-----KEEEKKAEEIRKEKEAVIEEELD 1786

                   ....*...
gi 1920237946 1671 AEEAERRL 1678
Cdd:PTZ00121  1787 EEDEKRRM 1794
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
2855-2893 3.14e-11

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 60.80  E-value: 3.14e-11
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 2855 LLEAQAASGFLLDPVRNRRLAVNEAVKEGIVGPELHHKL 2893
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
2284-2486 3.90e-11

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 68.25  E-value: 3.90e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2284 RQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQM 2363
Cdd:COG4942     34 QEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAELLRAL 113
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2364 EELGK---LKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKE 2440
Cdd:COG4942    114 YRLGRqppLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALEAL 193
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1920237946 2441 KMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQE 2486
Cdd:COG4942    194 KAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAA 239
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1458-2175 4.77e-11

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 69.76  E-value: 4.77e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1458 QYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEA 1537
Cdd:pfam15921   82 EYSHQVKDLQRRLNESNELHEKQKFYLRQSVIDLQTKLQEMQMERDAMADIRRRESQSQEDLRNQLQNTVHELEAAKCLK 161
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1538 QEQKRSIQEELQHLRQ---SSEAEIQA-KARQVEAAERSRLRIEEEIRVVRLQLE----ATERQRGGAEGELQALRAR-- 1607
Cdd:pfam15921  162 EDMLEDSNTQIEQLRKmmlSHEGVLQEiRSILVDFEEASGKKIYEHDSMSTMHFRslgsAISKILRELDTEISYLKGRif 241
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1608 --AEEAEAQKRQAQEEAERLRRQVQDET-QRKRQAEAELAlRVQAEAEAAREKQRALQAleelrlQAEEAERRLRQAEAE 1684
Cdd:pfam15921  242 pvEDQLEALKSESQNKIELLLQQHQDRIeQLISEHEVEIT-GLTEKASSARSQANSIQS------QLEIIQEQARNQNSM 314
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1685 RARQVQvALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAeaerelerwQLKA 1764
Cdd:pfam15921  315 YMRQLS-DLESTVSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTERDQFSQESGNLDDQLQ---------KLLA 384
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1765 NEALRLRLQAEEVAQQKSLtqaeaekqkeeaerEARRRGKAEEQAVRQRELAEQELEKQRQLA---------EGTAQQRL 1835
Cdd:pfam15921  385 DLHKREKELSLEKEQNKRL--------------WDRDTGNSITIDHLRRELDDRNMEVQRLEAllkamksecQGQMERQM 450
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1836 AAEQ----ELIRLRAETEQGEQQRQLLEEELARLqreaaaaTQKRRELEAELAKVrAEMEVLLASKARAEEESRSTSEKS 1911
Cdd:pfam15921  451 AAIQgkneSLEKVSSLTAQLESTKEMLRKVVEEL-------TAKKMTLESSERTV-SDLTASLQEKERAIEATNAEITKL 522
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1912 KQRLEAEAGRFRELAEEAARLRALAEEAKRQR-QLAEEDAV----RQ----------------------RAEAERVLAEK 1964
Cdd:pfam15921  523 RSRVDLKLQELQHLKNEGDHLRNVQTECEALKlQMAEKDKVieilRQqienmtqlvgqhgrtagamqveKAQLEKEINDR 602
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1965 LAAISEATRLKTEAEIALKEKEAE---------------NERLRRLAEDEAFQRRLLEEQAAQHK------ADIEARLAQ 2023
Cdd:pfam15921  603 RLELQEFKILKDKKDAKIRELEARvsdlelekvklvnagSERLRAVKDIKQERDQLLNEVKTSRNelnslsEDYEVLKRN 682
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2024 LRKASE------SELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAG-KAELELELGRIrgtaeDTLRSK----EQAE 2092
Cdd:pfam15921  683 FRNKSEemetttNKLKMQLKSAQSELEQTRNTLKSMEGSDGHAMKVAMGmQKQITAKRGQI-----DALQSKiqflEEAM 757
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2093 QEAARQRQLAAEEERRRREAEERV---QKSLAAEEEAARQRKAALEE--------VERLKAKVEEARRLRERAEQESARq 2161
Cdd:pfam15921  758 TNANKEKHFLKEEKNKLSQELSTVateKNKMAGELEVLRSQERRLKEkvanmevaLDKASLQFAECQDIIQRQEQESVR- 836
                          810
                   ....*....|....
gi 1920237946 2162 LQLAQEAAQKRLQA 2175
Cdd:pfam15921  837 LKLQHTLDVKELQG 850
COG3899 COG3899
Predicted ATPase [General function prediction only];
1485-2010 5.06e-11


Pssm-ID: 443106 [Multi-domain]  Cd Length: 1244  Bit Score: 69.50  E-value: 5.06e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1485 RERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKAR 1564
Cdd:COG3899    722 AEALRYLERALELLPPDPEEEYRLALLLELAEALYLAGRFEEAEALLERALAARALAALAALRHGNPPASARAYANLGLL 801
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1565 QVEAAERSRLRIEEEIRVV-RLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERL--RRQVQDETQRKRQAEA 1641
Cdd:COG3899    802 LLGDYEEAYEFGELALALAeRLGDRRLEARALFNLGFILHWLGPLREALELLREALEAGLETgdAALALLALAAAAAAAA 881
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1642 ELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERT 1721
Cdd:COG3899    882 AAAALAAAAAAAARLLAAAAAALAAAAAAAALAAAELARLAAAAAAAAALALAAAAAAAAAAALAAAAAAAALAAALALA 961
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1722 LKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARR 1801
Cdd:COG3899    962 AAAAAAAAAALAAAAAAAAAAAAAAAAAALEAAAAALLALLAAAAAAAAAAAALAAALLAAALAALAAAAAAAALLAAAA 1041
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1802 RGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEA 1881
Cdd:COG3899   1042 ALALLAALAAAAAAAAAAAALAAAAALLAAAAAAAAAAAAAAAAAALAAALAAAALAAAAAAALALAAALAALALAAALA 1121
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1882 ELAKVRAEMEVLLASKARAEEESRSTsekskqRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVL 1961
Cdd:COG3899   1122 ALALAAAARAAAALLLLAAALALALA------ALLLLAALLLALALLLLALAALALAAALAALAAALLAAAAAAAAAAAL 1195
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*....
gi 1920237946 1962 AEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQA 2010
Cdd:COG3899   1196 LAALLALAARLAALLALALLALEAAALLLLLLLAALALAAALLALRLLA 1244
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1807-2181 6.96e-11


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 68.64  E-value: 6.96e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1807 EQAVRQRELAEQELEKQRQL--AEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELA 1884
Cdd:COG4717    105 EELEAELEELREELEKLEKLlqLLPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLE 184
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1885 KVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVrQRAEAERVLAEK 1964
Cdd:COG4717    185 QLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERLKEARLL-LLIAAALLALLG 263
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1965 LAAISEATRLKTEAEIAL----------------KEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKAS 2028
Cdd:COG4717    264 LGGSLLSLILTIAGVLFLvlgllallflllarekASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELL 343
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2029 ESELERQKGLVE-DTLRQRRQVEEEILALKGSFEKAAAGKAElelelgRIRGTAEDTLRSKEQAEQEAARQRQLAAEEER 2107
Cdd:COG4717    344 DRIEELQELLREaEELEEELQLEELEQEIAALLAEAGVEDEE------ELRAALEQAEEYQELKEELEELEEQLEELLGE 417
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 2108 RRREAEERVQKSLAAE-EEAARQRKAALEEVERLKAKVEEAR-RLRERAEQESARQLQLAQEAAQKRLQAEEKAHA 2181
Cdd:COG4717    418 LEELLEALDEEELEEElEELEEELEELEEELEELREELAELEaELEQLEEDGELAELLQELEELKAELRELAEEWA 493
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
2381-2737 7.22e-11


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 69.32  E-value: 7.22e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2381 VLRDKDSAQRLLQEEA---EKMKQVAEEAARLSVAAQEA-ARLRQLAEEDLAQQRAL---AEKMlkEKMQAVQEATRlKA 2453
Cdd:TIGR02168  149 IIEAKPEERRAIFEEAagiSKYKERRKETERKLERTRENlDRLEDILNELERQLKSLerqAEKA--ERYKELKAELR-EL 225
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2454 EAELLQQQKELAQEQARRLQEDKEQMAQQLAQET---QGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDAR 2530
Cdd:TIGR02168  226 ELALLVLRLEELREELEELQEELKEAEEELEELTaelQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQ 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2531 RFRKQAEDIGERLYRtelatqekvmlvqtLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQL 2610
Cdd:TIGR02168  306 ILRERLANLERQLEE--------------LEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELE 371
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2611 LQEtQALQQSFLSEKDSLLQRERCIEQEKAKLEQLfqdeVAKAQALreeqQRQQQQMQQEKQQLAASMEEARRRQHEAE- 2689
Cdd:TIGR02168  372 SRL-EELEEQLETLRSKVAQLELQIASLNNEIERL----EARLERL----EDRRERLQQEIEELLKKLEEAELKELQAEl 442
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 1920237946 2690 EGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALAR 2737
Cdd:TIGR02168  443 EELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQAR 490
CH_FIMB_rpt3 cd21300
third calponin homology (CH) domain found in Saccharomyces cerevisiae fimbrin and similar proteins; Fimbrin binds to actin, and functionally associates with actin structures involved in the development and maintenance of cell polarity. Members of this family contain four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.
181-280 7.94e-11


Pssm-ID: 409149  Cd Length: 119  Bit Score: 62.44  E-value: 7.94e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  181 ERDRvQKKTFTKWVNKhlIKAQRHISDLYEDLRDGHNLISLLEVLSGDS--------LPREKGRMRFHKLQNVQIALDYL 252
Cdd:cd21300      4 EGER-EARVFTLWLNS--LDVEPAVNDLFEDLRDGLILLQAYDKVIPGSvnwkkvnkAPASAEISRFKAVENTNYAVELG 80
                           90       100
                   ....*....|....*....|....*...
gi 1920237946  253 RHRQVKLVNIRNDDIADGNPKLTLGLIW 280
Cdd:cd21300     81 KQLGFSLVGIQGADITDGSRTLTLALVW 108
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
1538-1727 1.13e-10


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 67.14  E-value: 1.13e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1538 QEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQL--EATERQRGGAEGELQALRARAEEAEAQK 1615
Cdd:PRK09510    67 QQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQkkQAEEAAKQAALKQKQAEEAAAKAAAAAK 146
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1616 RQAQEEAERLRRQV-QDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALE 1694
Cdd:PRK09510   147 AKAEAEAKRAAAAAkKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAA 226
                          170       180       190
                   ....*....|....*....|....*....|...
gi 1920237946 1695 TAQRSAEAELQSEHASFAEKTAQLERTLKEEHV 1727
Cdd:PRK09510   227 AAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEV 259
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
2381-2736 1.17e-10


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 68.54  E-value: 1.17e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2381 VLRDKDSAQRLLQEEAEKMKQ-----VAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKM--LKEKMQAVQEA-TRLK 2452
Cdd:TIGR02168  194 ILNELERQLKSLERQAEKAERykelkAELRELELALLVLRLEELREELEELQEELKEAEEELeeLTAELQELEEKlEELR 273
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2453 AEAELLQQQKELAQE---QARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDA 2529
Cdd:TIGR02168  274 LEVSELEEEIEELQKelyALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2530 RRFRKQAEdigerlyrtelatqEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKqeAQLLQLKSE-EMQTVRQE 2608
Cdd:TIGR02168  354 ESLEAELE--------------ELEAELEELESRLEELEEQLETLRSKVAQLELQIASLN--NEIERLEARlERLEDRRE 417
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2609 QLLQETQALQQSFLSEKDSLLQRErcIEQEKAKLEQLfQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHeA 2688
Cdd:TIGR02168  418 RLQQEIEELLKKLEEAELKELQAE--LEELEEELEEL-QEELERLEEALEELREELEEAEQALDAAERELAQLQARLD-S 493
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 1920237946 2689 EEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLqHLEEERRAALA 2736
Cdd:TIGR02168  494 LERLQENLEGFSEGVKALLKNQSGLSGILGVLSELI-SVDEGYEAAIE 540
COG3899 COG3899
Predicted ATPase [General function prediction only];
1471-1971 1.18e-10


Pssm-ID: 443106 [Multi-domain]  Cd Length: 1244  Bit Score: 68.35  E-value: 1.18e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1471 EEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAER-----EAQGLQRRMQEEVARREEVAVEAQEQKRSIQ 1545
Cdd:COG3899    741 EEYRLALLLELAEALYLAGRFEEAEALLERALAARALAALAALRhgnppASARAYANLGLLLLGDYEEAYEFGELALALA 820
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1546 EELQHLRQSSEAEIqAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERL 1625
Cdd:COG3899    821 ERLGDRRLEARALF-NLGFILHWLGPLREALELLREALEAGLETGDAALALLALAAAAAAAAAAAALAAAAAAAARLLAA 899
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1626 RRQVQDETQRKRQAEAELALRVQAEAEAAREkqRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQ 1705
Cdd:COG3899    900 AAAALAAAAAAAALAAAELARLAAAAAAAAA--LALAAAAAAAAAAALAAAAAAAALAAALALAAAAAAAAAAALAAAAA 977
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1706 SEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQ 1785
Cdd:COG3899    978 AAAAAAAAAAAAALEAAAAALLALLAAAAAAAAAAAALAAALLAAALAALAAAAAAAALLAAAAALALLAALAAAAAAAA 1057
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1786 AEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARL 1865
Cdd:COG3899   1058 AAAALAAAAALLAAAAAAAAAAAAAAAAAALAAALAAAALAAAAAAALALAAALAALALAAALAALALAAAARAAAALLL 1137
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1866 QREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQL 1945
Cdd:COG3899   1138 LAAALALALAALLLLAALLLALALLLLALAALALAAALAALAAALLAAAAAAAAAAALLAALLALAARLAALLALALLAL 1217
                          490       500
                   ....*....|....*....|....*.
gi 1920237946 1946 AEEDAVRQRAEAERVLAEKLAAISEA 1971
Cdd:COG3899   1218 EAAALLLLLLLAALALAAALLALRLL 1243
CH_MICAL1 cd21196
calponin homology (CH) domain found in molecule interacting with CasL protein 1; MICAL-1, also called NEDD9-interacting protein with calponin homology and LIM domains, acts as a [F-actin]-monooxygenase that promotes depolymerization of F-actin by mediating oxidation of specific methionine residues on actin to form methionine-sulfoxide, resulting in actin filament disassembly and preventing repolymerization. In the absence of actin, it also functions as an NADPH oxidase producing H(2)O(2). MICAL-1 acts as a cytoskeletal regulator that connects NEDD9 to intermediate filaments. It also acts as a negative regulator of apoptosis via its interaction with STK38 and STK38L. MICAL-1 is a Rab effector protein that plays a role in vesicle trafficking. It contains a single copy of the CH domain. CH domains are actin filament (F-actin) binding motifs.
303-403 1.23e-10


Pssm-ID: 409045  Cd Length: 106  Bit Score: 61.21  E-value: 1.23e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  303 KEKLLLWSQRMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERDLGVTRLLDPED 382
Cdd:cd21196      5 QEELLRWCQEQTAGYPGVHVSDLSSSWADGLALCALVYRLQPGLLEPSELQGLGALEATAWALKVAENELGITPVVSAQA 84
                           90       100
                   ....*....|....*....|.
gi 1920237946  383 VdVPQPDEKSIITYVSSLYDA 403
Cdd:cd21196     85 V-VAGSDPLGLIAYLSHFHSA 104
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1807-2553 1.74e-10


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 68.02  E-value: 1.74e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1807 EQAVRQRELAEQELEKQRQLAEGTAQqrlAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV 1886
Cdd:COG4913    245 EDAREQIELLEPIRELAERYAAARER---LAELEYLRAALRLWFAQRRLELLEAELEELRAELARLEAELERLEARLDAL 321
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1887 RAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLA 1966
Cdd:COG4913    322 REELDELEAQIRGNGGDRLEQLEREIERLEREL---EERERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEE 398
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1967 AISEATRLKTEAEIALKEKEAENERLRrlAEDEAFQRRlleeqaaqhKADIEARLAQLRKAseseLERQKGLVEDTLR-- 2044
Cdd:COG4913    399 ELEALEEALAEAEAALRDLRRELRELE--AEIASLERR---------KSNIPARLLALRDA----LAEALGLDEAELPfv 463
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2045 -QRRQVEEEILALKGSFEKAaagkaelelelgrIRGTAEDTLRSKEQAEQ--EAARQRQLAAEeerrrreaeerVQKSLA 2121
Cdd:COG4913    464 gELIEVRPEEERWRGAIERV-------------LGGFALTLLVPPEHYAAalRWVNRLHLRGR-----------LVYERV 519
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2122 AEEEAARQRKAALEE--VERLKAKVEEARrlrERAEQESARQLQLAQEAAQKRLQAEEKA--------HAFAVQQKEQEL 2191
Cdd:COG4913    520 RTGLPDPERPRLDPDslAGKLDFKPHPFR---AWLEAELGRRFDYVCVDSPEELRRHPRAitragqvkGNGTRHEKDDRR 596
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2192 QQTLQqeqSVLerlrseaeaarraaeeaeaareraereAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEaeqe 2271
Cdd:COG4913    597 RIRSR---YVL---------------------------GFDNRAKLAALEAELAELEEELAEAEERLEALEAELDA---- 642
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2272 aarraqaeqaaLRQKQAADAEMEKHkQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQ 2351
Cdd:COG4913    643 -----------LQERREALQRLAEY-SWDEIDVASAEREIAELEAELERLDASSDDLAALEEQLEELEAELEELEEELDE 710
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2352 VEEELFSLRvqmEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQR 2431
Cdd:COG4913    711 LKGEIGRLE---KELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRENLEERIDALRARLNRAEE 787
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2432 ALAEKMLKEKMQAVQEATRLKAEAELLQQ-QKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLeterqrQLEMSAEAER 2510
Cdd:COG4913    788 ELERAMRAFNREWPAETADLDADLESLPEyLALLDRLEEDGLPEYEERFKELLNENSIEFVADL------LSKLRRAIRE 861
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 2511 LRLRVAEMSRA----------------QARAEEDARRFRKQAEDIGERLYRTELATQEK 2553
Cdd:COG4913    862 IKERIDPLNDSlkripfgpgrylrleaRPRPDPEVREFRQELRAVTSGASLFDEELSEA 920
mukB PRK04863
chromosome partition protein MukB;
1143-1731 1.93e-10


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 67.67  E-value: 1.93e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1143 RECAQRITEQQKAQaevdglgkgvaRLSAEAEKVLALPEPSPA-APTLRSELEltlgkleqvrslsaiylEKLKTISLVI 1221
Cdd:PRK04863   523 SELEQRLRQQQRAE-----------RLLAEFCKRLGKNLDDEDeLEQLQEELE-----------------ARLESLSESV 574
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1222 RSTQEAEEVLRAHEEQLK-EAQAVPATLPELEATKAALKKLRAQAEaqqpvfdalrDELRGAQEVGERLQQrHGERDVEV 1300
Cdd:PRK04863   575 SEARERRMALRQQLEQLQaRIQRLAARAPAWLAAQDALARLREQSG----------EEFEDSQDVTEYMQQ-LLEREREL 643
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1301 ERWRERVTlllERWQAVLAQTD-VRQRELEQLGRQLRYyresADPLGAWL----------RDAKQRQ----EQIQAVPLA 1365
Cdd:PRK04863   644 TVERDELA---ARKQALDEEIErLSQPGGSEDPRLNAL----AERFGGVLlseiyddvslEDAPYFSalygPARHAIVVP 716
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1366 NSQAVREQLRQEKALLEDI-------ERHGEKVEECQRFAKQYINAIKDYELQLVTYKAqlEPVASPAKKPK----VQSG 1434
Cdd:PRK04863   717 DLSDAAEQLAGLEDCPEDLyliegdpDSFDDSVFSVEELEKAVVVKIADRQWRYSRFPE--VPLFGRAAREKrieqLRAE 794
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1435 SESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEEErlAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAER 1514
Cdd:PRK04863   795 REELAERYATLSFDVQKLQRLHQAFSRFIGSHLAVAFEAD--PEAELRQLNRRRVELERALADHESQEQQQRSQLEQAKE 872
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1515 EAQGLQR-----------RMQEEVARREEVAVEAQEQKRSIQ---------EELQHLRQSSEAEIQAKARQVEAAE---- 1570
Cdd:PRK04863   873 GLSALNRllprlnlladeTLADRVEEIREQLDEAEEAKRFVQqhgnalaqlEPIVSVLQSDPEQFEQLKQDYQQAQqtqr 952
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1571 --RSRLRIEEEIRVVRLQLEATERQR-GGAEGELQ-ALRARAEEAEAQKRQAQEEAerlrRQVQDETQRKRQAEAELALR 1646
Cdd:PRK04863   953 daKQQAFALTEVVQRRAHFSYEDAAEmLAKNSDLNeKLRQRLEQAEQERTRAREQL----RQAQAQLAQYNQVLASLKSS 1028
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1647 VQAEAEAAREKQRALQAL---------EELRLQAEEAERRLRQAEAERArqvqvALETAQRSAEAELQsehaSFAEKTAQ 1717
Cdd:PRK04863  1029 YDAKRQMLQELKQELQDLgvpadsgaeERARARRDELHARLSANRSRRN-----QLEKQLTFCEAEMD----NLTKKLRK 1099
                          650
                   ....*....|....
gi 1920237946 1718 LERTLKEEHVAVVQ 1731
Cdd:PRK04863  1100 LERDYHEMREQVVN 1113
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1333-2003 2.05e-10


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 67.40  E-value: 2.05e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1333 RQLRYYRESADPLGAWLRDAKQRQEQIQAvpLANSQAVREQLRQEKALLEDIERHGEKVEECQRFAKQYINAIKDY---- 1408
Cdd:PRK03918   111 SSVREWVERLIPYHVFLNAIYIRQGEIDA--ILESDESREKVVRQILGLDDYENAYKNLGEVIKEIKRRIERLEKFikrt 188
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1409 ---ELQLVTYKAQLEPVASPAKK-----PKVQSGSESIIQEYVDLRTRYSELSTLTSQyIRFISETLRRMEEEERLAEQQ 1480
Cdd:PRK03918   189 eniEELIKEKEKELEEVLREINEisselPELREELEKLEKEVKELEELKEEIEELEKE-LESLEGSKRKLEEKIRELEER 267
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1481 RAEERERLAEVE---AALEKQRQLAEAHAQAK-----------------AQAEREAQGLQRRMQEEVARREEVAvEAQEQ 1540
Cdd:PRK03918   268 IEELKKEIEELEekvKELKELKEKAEEYIKLSefyeeyldelreiekrlSRLEEEINGIEERIKELEEKEERLE-ELKKK 346
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1541 KRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQE 1620
Cdd:PRK03918   347 LKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKK 426
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1621 EAERLRRQVQDETQRKRQAEAELALRVQAEAEAarEKQRALQALEELRLQAEEAERRLRQAEAERARQ--VQVALETAQ- 1697
Cdd:PRK03918   427 AIEELKKAKGKCPVCGRELTEEHRKELLEEYTA--ELKRIEKELKEIEEKERKLRKELRELEKVLKKEseLIKLKELAEq 504
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1698 -RSAEAELQSEHASFAEKTAQLERTLKEEhvaVVQLREEATRRAQqqaeaeraraeaerelerwQLKANEALRLRLQAEE 1776
Cdd:PRK03918   505 lKELEEKLKKYNLEELEKKAEEYEKLKEK---LIKLKGEIKSLKK-------------------ELEKLEELKKKLAELE 562
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1777 VAQQKsltqaeaekqkeeaerearrrgKAEEQAVRQRELAEQELEKQRQLaEGTAQQRLAAEQELIRLRaeteQGEQQRQ 1856
Cdd:PRK03918   563 KKLDE----------------------LEEELAELLKELEELGFESVEEL-EERLKELEPFYNEYLELK----DAEKELE 615
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1857 LLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLasKARAEEESRSTSEKskqrleaeagrFRELAEEAARLRALA 1936
Cdd:PRK03918   616 REEKELKKLEEELDKAFEELAETEKRLEELRKELEELE--KKYSEEEYEELREE-----------YLELSRELAGLRAEL 682
                          650       660       670       680       690       700       710
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1937 EEAKRQRQLAEEDAvrqraeaeRVLAEKLAAISEATRLKTEAEIALKEKEAENERLRR---LAEDEAFQR 2003
Cdd:PRK03918   683 EELEKRREEIKKTL--------EKLKEELEEREKAKKELEKLEKALERVEELREKVKKykaLLKERALSK 744
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
1661-2156 2.32e-10

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 67.37  E-value: 2.32e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1661 LQALEELRLQAEEAE---RRLRQAEAERARQVQVALE-TAQRSAEAELQSEHASFAEKTAQLERtLKEEHVAVVQLREEA 1736
Cdd:PRK02224   161 LGKLEEYRERASDARlgvERVLSDQRGSLDQLKAQIEeKEEKDLHERLNGLESELAELDEEIER-YEEQREQARETRDEA 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1737 TRRAQQQAEAERARAEAERELERWQLKANEALRLRLQ-AEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQREL 1815
Cdd:PRK02224   240 DEVLEEHEERREELETLEAEIEDLRETIAETEREREElAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREEL 319
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1816 AEQELEKQRQLAEGTAQQRlAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLA 1895
Cdd:PRK02224   320 EDRDEELRDRLEECRVAAQ-AHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRE 398
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1896 SKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEE----------------DAVRQRAEAER 1959
Cdd:PRK02224   399 RFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALLEAgkcpecgqpvegsphvETIEEDRERVE 478
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1960 VLAEKLAAIsEATRLKTEAEI----ALKEKEAENERLRRlaedeafQRRLLEEQAAQHKADIEA---RLAQLRKAS---E 2029
Cdd:PRK02224   479 ELEAELEDL-EEEVEEVEERLeraeDLVEAEDRIERLEE-------RREDLEELIAERRETIEEkreRAEELRERAaelE 550
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2030 SELERQKglvEDTLRQRRQVEEEILALKGSFEKAAAGKAELElELGRIRgtaeDTLRSKEQAEQEAARQRQLAAEEERRR 2109
Cdd:PRK02224   551 AEAEEKR---EAAAEAEEEAEEAREEVAELNSKLAELKERIE-SLERIR----TLLAAIADAEDEIERLREKREALAELN 622
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*..
gi 1920237946 2110 REAEERVQkslaaeeeAARQRKAALEEvERLKAKVEEARRLRERAEQ 2156
Cdd:PRK02224   623 DERRERLA--------EKRERKRELEA-EFDEARIEEAREDKERAEE 660
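Each hit in this listing carries a one-line summary of the form "Pssm-ID: ... [hit type]  Cd Length: ...  Bit Score: ...  E-value: ...". The sketch below shows one way to turn such a line into typed fields for downstream filtering. The function name `parse_hit_summary` and the field names are hypothetical and are not part of any NCBI tool; the pattern assumes the layout seen in the records on this page and is not an official format specification.

```python
import re

# Hypothetical helper (not an NCBI API): matches the per-hit summary line
# as it appears in this listing. The layout is assumed from the page text.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"\[(?P<hit_type>[^\]]+)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_hit_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "hit_type": m.group("hit_type"),
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),  # float() accepts "2.32e-10"
    }


# Summary line copied from the PRK02224 hit above.
example = "Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 67.37  E-value: 2.32e-10"
hit = parse_hit_summary(example)
```

Lines that do not match the assumed layout return `None`, so alignment rows and description text can be passed through the same function and skipped.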
mukB PRK04863
chromosome partition protein MukB;
1229-2048 2.48e-10

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 67.29  E-value: 2.48e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1229 EVLRAHEEQLKEAQAVPATLPELEATKAALKKLRAQAEAqqpvfdaLRDELRGAQEvGERLQQRHGERDVEVERWRERvt 1308
Cdd:PRK04863   294 ELYTSRRQLAAEQYRLVEMARELAELNEAESDLEQDYQA-------ASDHLNLVQT-ALRQQEKIERYQADLEELEER-- 363
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1309 llLERWQAVLAQTDVRQRELEqlgRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVReQLRQEKALLE----DI 1384
Cdd:PRK04863   364 --LEEQNEVVEEADEQQEENE---ARAEAAEEEVDELKSQLADYQQALDVQQTRAIQYQQAVQ-ALERAKQLCGlpdlTA 437
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1385 ERHGEKVEECQRFAKQYINAIKDYELQLVTYKA---QLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLTSQY-- 1459
Cdd:PRK04863   438 DNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAahsQFEQAYQLVRKIAGEVSRSEAWDVARELLRRLREQRHLAEQLqq 517
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1460 IRFISETLRRMEEEERLAEQQRAEERERL-------AEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREE 1532
Cdd:PRK04863   518 LRMRLSELEQRLRQQQRAERLLAEFCKRLgknlddeDELEQLQEELEARLESLSESVSEARERRMALRQQLEQLQARIQR 597
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1533 VAVEAQEQkRSIQEELQHLRQSSEAE----------IQAKARQVEAAERSRLRIEEEIRVVRLQLEATErQRGGAEGE-L 1601
Cdd:PRK04863   598 LAARAPAW-LAAQDALARLREQSGEEfedsqdvteyMQQLLERERELTVERDELAARKQALDEEIERLS-QPGGSEDPrL 675
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1602 QALRAR-----------------AEEAEA---QKRQA--QEEAERLRRQVQDET-------------QRKRQA-----EA 1641
Cdd:PRK04863   676 NALAERfggvllseiyddvsledAPYFSAlygPARHAivVPDLSDAAEQLAGLEdcpedlyliegdpDSFDDSvfsveEL 755
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1642 ELALRVQ-AEAE--------------AAREKQralqaLEELRLQAEEAERRLRQAEAERaRQVQVALETAQR-------- 1698
Cdd:PRK04863   756 EKAVVVKiADRQwrysrfpevplfgrAAREKR-----IEQLRAEREELAERYATLSFDV-QKLQRLHQAFSRfigshlav 829
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1699 SAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVA 1778
Cdd:PRK04863   830 AFEADPEAELRQLNRRRVELERALADHESQEQQQRSQLEQAKEGLSALNRLLPRLNLLADETLADRVEEIREQLDEAEEA 909
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1779 QQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQ--------------LAEGTAQQRLAAEQEL-IR 1843
Cdd:PRK04863   910 KRFVQQHGNALAQLEPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQqafaltevvqrrahFSYEDAAEMLAKNSDLnEK 989
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1844 LRAETEQGEQQRQLLEEEL--------------ARLQREAAAATQKRRELEAELAK--VRAEMEVLLASKARAEE----- 1902
Cdd:PRK04863   990 LRQRLEQAEQERTRAREQLrqaqaqlaqynqvlASLKSSYDAKRQMLQELKQELQDlgVPADSGAEERARARRDElharl 1069
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1903 ----ESRSTSEKSKQRLEAE----AGRFRELAEEAARLRALAEEAKRQRQLAeEDAVRQRAEAERVLAEKLAAISeATRL 1974
Cdd:PRK04863  1070 sanrSRRNQLEKQLTFCEAEmdnlTKKLRKLERDYHEMREQVVNAKAGWCAV-LRLVKDNGVERRLHRRELAYLS-ADEL 1147
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1975 KTEAEI---ALKEKEAENERLR---RLAEDEAFQ----------RRLLEEQAAQhkaDIeARLAQLRKASEsELERQKGL 2038
Cdd:PRK04863  1148 RSMSDKalgALRLAVADNEHLRdvlRLSEDPKRPerkvqfyiavYQHLRERIRQ---DI-IRTDDPVEAIE-QMEIELSR 1222
                          970
                   ....*....|
gi 1920237946 2039 VEDTLRQRRQ 2048
Cdd:PRK04863  1223 LTEELTSREQ 1232
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1818-2179 2.87e-10

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 66.71  E-value: 2.87e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1818 QELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQR--EAAAATQKRRELEAELAKVRAEMEvlla 1895
Cdd:COG4717     74 KELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKllQLLPLYQELEALEAELAELPERLE---- 149
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1896 sKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLK 1975
Cdd:COG4717    150 -ELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEE 228
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1976 TEAEIALKEKEAENERLRRL---------------AEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVE 2040
Cdd:COG4717    229 LEQLENELEAAALEERLKEArlllliaaallallgLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGKEAEELQ 308
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2041 DTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERrrreaeerVQKSL 2120
Cdd:COG4717    309 ALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAAL--------LAEAG 380
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2121 AAEEEAARQRKAALEEVERLKAKVEEA-RRLRERAEQESARQLQLAQEAAQKRLQAEEKA 2179
Cdd:COG4717    381 VEDEEELRAALEQAEEYQELKEELEELeEQLEELLGELEELLEALDEEELEEELEELEEE 440
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
1076-1725 2.99e-10

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 67.02  E-value: 2.99e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1076 DRLQAEREYgscSRHYQQLLQSLEQGEQEESrcqrcISELKDIRLQLEACEtrtvhrlrlpldkepaRECAQRITEQQKA 1155
Cdd:TIGR02169  201 ERLRREREK---AERYQALLKEKREYEGYEL-----LKEKEALERQKEAIE----------------RQLASLEEELEKL 256
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1156 QAEVDGLGKGVA----RLSAEAEKVLALPEPSPAAptLRSELELTLGKLEQVRSLSAIYLEKL--------KTISLVIRS 1223
Cdd:TIGR02169  257 TEEISELEKRLEeieqLLEELNKKIKDLGEEEQLR--VKEKIGELEAEIASLERSIAEKERELedaeerlaKLEAEIDKL 334
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1224 TQEAEEVLRAHEEQLKEAQAVPAtlpELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGE-------- 1295
Cdd:TIGR02169  335 LAEIEELEREIEEERKRRDKLTE---EYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINElkreldrl 411
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1296 ------RDVEVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQ--------- 1360
Cdd:TIGR02169  412 qeelqrLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEkelsklqre 491
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1361 -----AVPLANSQAVREQLRQEKALLEDI--------------ERHGEKVE-------------------ECQRFAKQY- 1401
Cdd:TIGR02169  492 laeaeAQARASEERVRGGRAVEEVLKASIqgvhgtvaqlgsvgERYATAIEvaagnrlnnvvveddavakEAIELLKRRk 571
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1402 --------INAIK---------------DYELQLVTYKAQLEPVASPAKKPKV-----QSGSESIIQ-----------EY 1442
Cdd:TIGR02169  572 agratflpLNKMRderrdlsilsedgviGFAVDLVEFDPKYEPAFKYVFGDTLvvediEAARRLMGKyrmvtlegelfEK 651
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1443 VDLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRR 1522
Cdd:TIGR02169  652 SGAMTGGSRAPRGGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQE 731
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1523 MQEEVARREEVAVEAQEQKRSIQEELQHLrQSSEAEIQAKARQVEAAERSRLRIE-----EEIRVVRLQLEATERQRGGA 1597
Cdd:TIGR02169  732 EEKLKERLEELEEDLSSLEQEIENVKSEL-KELEARIEELEEDLHKLEEALNDLEarlshSRIPEIQAELSKLEEEVSRI 810
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1598 EGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRK---RQAEAELALRVQAEAEAAREKQRALQALEE----LRLQ 1670
Cdd:TIGR02169  811 EARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIksiEKEIENLNGKKEELEEELEELEAALRDLESrlgdLKKE 890
                          730       740       750       760       770
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1920237946 1671 AEEAERRLRQAEaERARQVQVALETAqRSAEAELQSEHASFAEKTAQLERTLKEE 1725
Cdd:TIGR02169  891 RDELEAQLRELE-RKIEELEAQIEKK-RKRLSELKAKLEALEEELSEIEDPKGED 943
PTZ00121 PTZ00121
MAEBL; Provisional
1225-1676 3.11e-10

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 67.09  E-value: 3.11e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1225 QEAEEVLRAHEEQLK--EAQAVPATLPELEATKAALKKLRAQAEAQQPVFDAlrDELRGAQEVGERLQQRHGErdvEVER 1302
Cdd:PTZ00121  1497 KKADEAKKAAEAKKKadEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKA--DELKKAEELKKAEEKKKAE---EAKK 1571
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1303 WRERVTLLLERWQavlaqtDVRQRELEQLGRQLRYYRESADPLGAWLRdaKQRQEQIQAvplansqavrEQLRQEKALLE 1382
Cdd:PTZ00121  1572 AEEDKNMALRKAE------EAKKAEEARIEEVMKLYEEEKKMKAEEAK--KAEEAKIKA----------EELKKAEEEKK 1633
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1383 DIERHGEKVEECQRFAKQyinaIKDYELQLVTYKAQLEPVASPAKKPkvqsgSESIIQEYVDLRtRYSELSTLTSQYIRF 1462
Cdd:PTZ00121  1634 KVEQLKKKEAEEKKKAEE----LKKAEEENKIKAAEEAKKAEEDKKK-----AEEAKKAEEDEK-KAAEALKKEAEEAKK 1703
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1463 ISETLRRMEEEERLAEQQRAEERERLAEVEaalekqrqlaeahaQAKAQAEREaqglqRRMQEEVARREEVAVEAQEQKR 1542
Cdd:PTZ00121  1704 AEELKKKEAEEKKKAEELKKAEEENKIKAE--------------EAKKEAEED-----KKKAEEAKKDEEEKKKIAHLKK 1764
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1543 SIQEELQHLRQSSEAEIQAKARqvEAAERSRLRIEEEIRVVRLQLEATerQRGGAEGELQALRARAEEAEAQKRQAqEEA 1622
Cdd:PTZ00121  1765 EEEKKAEEIRKEKEAVIEEELD--EEDEKRRMEVDKKIKDIFDNFANI--IEGGKEGNLVINDSKEMEDSAIKEVA-DSK 1839
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 1623 ERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRaLQALEELRLQAEEAER 1676
Cdd:PTZ00121  1840 NMQLEEADAFEKHKFNKNNENGEDGNKEADFNKEKDL-KEDDEEEIEEADEIEK 1892
CH_PLS_FIM_rpt1 cd21217
first calponin homology (CH) domain found in the plastin/fimbrin family; This family includes ...
187-283 3.84e-10

first calponin homology (CH) domain found in the plastin/fimbrin family; This family includes plastin and fimbrin. Plastin has three isoforms, plastin-1, -2, and -3, which are all actin-bundling proteins. Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. Plastin-2, also called L-plastin, LC64P, or lymphocyte cytosolic protein 1 (LCP-1), plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-3, also called T-plastin, is found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Fimbrin has been found in plants and fungi. Arabidopsis thaliana fimbrin (AtFIM) includes fimbrin-1, -2, -3, -4, and -5; they cross-link actin filaments (F-actin) in a calcium independent manner. They stabilize and prevent F-actin depolymerization mediated by profilin. They act as key regulators of actin cytoarchitecture, probably involved in cell cycle, cell division, cell elongation and cytoplasmic tractus. AtFIM5 is an actin bundling factor that is required for pollen germination and pollen tube growth. Fungal fimbrin binds to actin, and functionally associates with actin structures involved in the development and maintenance of cell polarity. Members of this family contain four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409066 [Multi-domain]  Cd Length: 114  Bit Score: 60.28  E-value: 3.84e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  187 KKTFTKWVN---------KHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMR-----FHKLQNVQIALDYL 252
Cdd:cd21217      3 KEAFVEHINslladdpdlKHLLPIDPDGDDLFEALRDGVLLCKLINKIVPGTIDERKLNKKkpkniFEATENLNLALNAA 82
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1920237946  253 RHRQVKLVNIRNDDIADGNPKLTLGLIWTII 283
Cdd:cd21217     83 KKIGCKVVNIGPQDILDGNPHLVLGLLWQII 113
PRK05035 PRK05035
electron transport complex protein RnfC; Provisional
1896-2167 5.45e-10

electron transport complex protein RnfC; Provisional


Pssm-ID: 235334 [Multi-domain]  Cd Length: 695  Bit Score: 65.74  E-value: 5.45e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1896 SKARAEEESRSTSEKSKQRLEAEAGRF-RELAEEAARlralAEEAKRQRQLAEEDAVrqrAEAERVLAEKLAAISEATRL 1974
Cdd:PRK05035   436 AEIRAIEQEKKKAEEAKARFEARQARLeREKAAREAR----HKKAAEARAAKDKDAV---AAALARVKAKKAAATQPIVI 508
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1975 KTEAEIALKEKEAEnERLRRLAEDEAFQRRLLEEQAAQHKADIEARL--AQLRKASESELERQKGLVEDTlrQRRQVEEE 2052
Cdd:PRK05035   509 KAGARPDNSAVIAA-REARKAQARARQAEKQAAAAADPKKAAVAAAIarAKAKKAAQQAANAEAEEEVDP--KKAAVAAA 585
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2053 ILALKGSFEKAAAGKAELELELgrirgTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKA 2132
Cdd:PRK05035   586 IARAKAKKAAQQAASAEPEEQV-----AEVDPKKAAVAAAIARAKAKKAEQQANAEPEEPVDPRKAAVAAAIARAKARKA 660
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 1920237946 2133 ALEEVERLKAKVEEARRLRERAEQESARQLQLAQE 2167
Cdd:PRK05035   661 AQQQANAEPEEAEDPKKAAVAAAIARAKAKKAAQQ 695
PTZ00121 PTZ00121
MAEBL; Provisional
2394-2815 5.89e-10

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 66.32  E-value: 5.89e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2394 EEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLK-AEAELLQQQKELAQ--EQAR 2470
Cdd:PTZ00121  1091 EATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEdAKRVEIARKAEDARkaEEAR 1170
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2471 RLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRlRVAEMSRAQ-ARAEEDARRF---RKQAEDIGERLYRT 2546
Cdd:PTZ00121  1171 KAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEER-KAEEARKAEdAKKAEAVKKAeeaKKDAEEAKKAEEER 1249
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2547 ELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQllqlKSEEMQTVrqEQLLQETQALQQSFLSEKD 2626
Cdd:PTZ00121  1250 NNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAK----KAEEKKKA--DEAKKKAEEAKKADEAKKK 1323
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2627 SLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASME-----------EARRRQHEAE---EGV 2692
Cdd:PTZ00121  1324 AEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAkkkadaakkkaEEKKKADEAKkkaEED 1403
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2693 RRQQEELQRLAQQQQQQEKLL--AEENQRLRERLQHLEEERRAalarseEIAPSRAAAARALPNGQDAADGPAAAAEPEH 2770
Cdd:PTZ00121  1404 KKKADELKKAAAAKKKADEAKkkAEEKKKADEAKKKAEEAKKA------DEAKKKAEEAKKAEEAKKKAEEAKKADEAKK 1477
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*
gi 1920237946 2771 AFDGLRRKVPAQRLQEVGVLSAEELQQLAQGRTTVAELAQREDVR 2815
Cdd:PTZ00121  1478 KAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAK 1522
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1370-1960 6.14e-10

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 65.86  E-value: 6.14e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1370 VREQLRQEKALLEDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDlrtRY 1449
Cdd:PRK03918   219 LREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAE---EY 295
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1450 SELSTLTSQYIrfisETLRRMEEEERLAEQQRAEERERLAEVEaalEKQRQLaeahaqakaqaeREAQGLQRRMQEEVAR 1529
Cdd:PRK03918   296 IKLSEFYEEYL----DELREIEKRLSRLEEEINGIEERIKELE---EKEERL------------EELKKKLKELEKRLEE 356
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1530 REEvAVEAQEQKRSIQEELQHLRQS-SEAEIQAKARQVEAAERSRLRIEEEIRVV---RLQLEATERQRGGAEGELQALR 1605
Cdd:PRK03918   357 LEE-RHELYEEAKAKKEELERLKKRlTGLTPEKLEKELEELEKAKEEIEEEISKItarIGELKKEIKELKKAIEELKKAK 435
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1606 ARA---------EEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAlrvqaEAEAAREKQRALQALEELRLQAEEAER 1676
Cdd:PRK03918   436 GKCpvcgrelteEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELR-----ELEKVLKKESELIKLKELAEQLKELEE 510
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1677 RLRQAEAERarqvqvaLETAQRSAEaELQSEHASFAEKTAQLERTLKEEHvavvQLREEATRRAQQQAEAERARAEAERE 1756
Cdd:PRK03918   511 KLKKYNLEE-------LEKKAEEYE-KLKEKLIKLKGEIKSLKKELEKLE----ELKKKLAELEKKLDELEEELAELLKE 578
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1757 LERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEgtAQQRLA 1836
Cdd:PRK03918   579 LEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELE--ELEKKY 656
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1837 AEQELIRLRAETEQgeqqrqlLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLlaSKARAEEESrstSEKSKQRLE 1916
Cdd:PRK03918   657 SEEEYEELREEYLE-------LSRELAGLRAELEELEKRREEIKKTLEKLKEELEER--EKAKKELEK---LEKALERVE 724
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1917 AEAGRFRELAEEAARlRALAEEAKRQRQLAEE------DAVRQRAEAERV 1960
Cdd:PRK03918   725 ELREKVKKYKALLKE-RALSKVGEIASEIFEEltegkySGVRVKAEENKV 773
SPEC cd00176
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ...
1222-1418 8.65e-10

Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three-helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.


Pssm-ID: 238103 [Multi-domain]  Cd Length: 213  Bit Score: 61.69  E-value: 8.65e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1222 RSTQEAEEVLRAHEEQLKEAQaVPATLPELEATKAALKKLRAQAEAQQPVFDALrdelrgaQEVGERLQQRHGERDVEVe 1301
Cdd:cd00176      7 RDADELEAWLSEKEELLSSTD-YGDDLESVEALLKKHEALEAELAAHEERVEAL-------NELGEQLIEEGHPDAEEI- 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1302 rwRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADpLGAWLRDAKQRQEQIQavPLANSQAVREQLRQEKALL 1381
Cdd:cd00176     78 --QERLEELNQRWEELRELAEERRQRLEEALDLQQFFRDADD-LEQWLEEKEAALASED--LGKDLESVEELLKKHKELE 152
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1920237946 1382 EDIERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQ 1418
Cdd:cd00176    153 EELEAHEPRLKSLNELAEELLEEGHPDADEEIEEKLE 189
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
2283-2741 8.93e-10

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 65.38  E-value: 8.93e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2283 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILdeelqRLKAEVTEAARQRGQVEEELFSLRVQ 2362
Cdd:TIGR00618  245 LTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQERINRARKAA-----PLAAHIKAVTQIEQQAQRIHTELQSK 319
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2363 MEELGKLKARIEA--ENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAA---RLRQLAEE---DLAQQRALA 2434
Cdd:TIGR00618  320 MRSRAKLLMKRAAhvKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTltqHIHTLQQQkttLTQKLQSLC 399
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2435 EKMLKEKMQAVQEATRLKAEAELlQQQKELAQEQARRLQEDKEQMAQQLAQETQ-----------GFQKTLETERQRQ-- 2501
Cdd:TIGR00618  400 KELDILQREQATIDTRTSAFRDL-QGQLAHAKKQQELQQRYAELCAAAITCTAQceklekihlqeSAQSLKEREQQLQtk 478
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2502 ---LEMSAEAERLRLRVAEMSRAQARAEEDARRF----RKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERL 2574
Cdd:TIGR00618  479 eqiHLQETRKKAVVLARLLELQEEPCPLCGSCIHpnpaRQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQR 558
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2575 REAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEqlLQETQALQQSFLSEKDSLLQRERCIE---------QEKAKLEQL 2645
Cdd:TIGR00618  559 ASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNI--TVRLQDLTEKLSEAEDMLACEQHALLrklqpeqdlQDVRLHLQQ 636
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2646 FQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEAR---------------------------------RRQHEAEEGV 2692
Cdd:TIGR00618  637 CSQELALKLTALHALQLTLTQERVREHALSIRVLPKEllasrqlalqkmqsekeqltywkemlaqcqtllRELETHIEEY 716
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*....
gi 1920237946 2693 RRQQEELQRLAQQQQQQeklLAEENQRLRERLQHLEEERRAALARSEEI 2741
Cdd:TIGR00618  717 DREFNEIENASSSLGSD---LAAREDALNQSLKELMHQARTVLKARTEA 762
CH_PARV_rpt2 cd21222
second calponin homology (CH) domain found in the parvin family; The parvin family includes ...
178-286 9.63e-10

second calponin homology (CH) domain found in the parvin family; The parvin family includes alpha-parvin, beta-parvin, and gamma-parvin. Alpha-parvin, also called actopaxin, calponin-like integrin-linked kinase-binding protein (CH-ILKBP), or matrix-remodeling-associated protein 2, plays a role in sarcomere organization and in smooth muscle cell contraction. It is required for normal development of the embryonic cardiovascular system, and for normal septation of the heart outflow tract. Beta-parvin, also called affixin, is an adapter protein that plays a role in integrin signaling via ILK and in activation of the GTPases Cdc42 and Rac1 by guanine nucleotide exchange factors, such as ARHGEF6. Both alpha-parvin and beta-parvin are involved in the reorganization of the actin cytoskeleton and the formation of lamellipodia, and both play roles in cell adhesion, cell spreading, establishment or maintenance of cell polarity, and cell migration. Gamma-parvin probably plays a role in the regulation of cell adhesion and cytoskeleton organization. Members of this family contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409071  Cd Length: 121  Bit Score: 59.14  E-value: 9.63e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  178 AADERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLP----REKGRMRFHKLQNVQIALDYLR 253
Cdd:cd21222      9 EAPEKLAEVKELLLQFVNKHLAKLNIEVTDLATQFHDGVYLILLIGLLEGFFVPlheyHLTPSTDDEKLHNVKLALELME 88
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1920237946  254 HRQVKLVNIRNDDIADGNPKLTLGLIWTIILHF 286
Cdd:cd21222     89 DAGISTPKIRPEDIVNGDLKSILRVLYSLFSKY 121
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1846-2435 9.86e-10


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 65.09  E-value: 9.86e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1846 AETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTS--EKSKQRLEAEAG--- 1920
Cdd:PRK03918   186 KRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELEslEGSKRKLEEKIRele 265
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1921 --------RFRELAEEAARLRALAEEAKRQRQLAEEdaVRQRAEAERVLAEKLAAISEATRlktEAEIALKEKEAENERL 1992
Cdd:PRK03918   266 erieelkkEIEELEEKVKELKELKEKAEEYIKLSEF--YEEYLDELREIEKRLSRLEEEIN---GIEERIKELEEKEERL 340
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1993 RRLAEDEAFQRRLLEEQAAQHKA--DIEARLAQLRKASES----ELERQKGLVEDTLRQRRQVEEEILAL---KGSFEKA 2063
Cdd:PRK03918   341 EELKKKLKELEKRLEELEERHELyeEAKAKKEELERLKKRltglTPEKLEKELEELEKAKEEIEEEISKItarIGELKKE 420
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2064 AAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLaaeeerrrrEAEERVQKSLAAEEEAARQRKAALEEVERLKAK 2143
Cdd:PRK03918   421 IKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYT---------AELKRIEKELKEIEEKERKLRKELRELEKVLKK 491
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2144 VEEARRLRERAEQESARQLQLAQEAAQKrlqAEEKAHAF-AVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAeeaeaa 2222
Cdd:PRK03918   492 ESELIKLKELAEQLKELEEKLKKYNLEE---LEKKAEEYeKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELE------ 562
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2223 reraereaaqsrRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQ 2302
Cdd:PRK03918   563 ------------KKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDK 630
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2303 ALRQKAQVEQELTALRLQLEETdhQKSILDEELQRLKAEVTEaarqrgqVEEELFSLRVQMEELGKLKARIEAEnralvL 2382
Cdd:PRK03918   631 AFEELAETEKRLEELRKELEEL--EKKYSEEEYEELREEYLE-------LSRELAGLRAELEELEKRREEIKKT-----L 696
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1920237946 2383 RDkdsaqrlLQEEAEKMKQVAEEAARLSVAAQEAARLRQ--LAEEDLAQQRALAE 2435
Cdd:PRK03918   697 EK-------LKEELEEREKAKKELEKLEKALERVEELREkvKKYKALLKERALSK 744
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
1477-1675 1.15e-09


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 64.06  E-value: 1.15e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1477 AEQQRAEE-RERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARReevavEAQEQKrsiQEELQHLRQSS 1555
Cdd:PRK09510    72 KSAKRAEEqRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQ-----AALKQK---QAEEAAAKAAA 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1556 EAEIQAKARQVEAAERSRlRIEEEIRvVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERlrrqVQDETQR 1635
Cdd:PRK09510   144 AAKAKAEAEAKRAAAAAK-KAAAEAK-KKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKK----AAAEAKK 217
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 1920237946 1636 KRQAEAELAL-RVQAEAEAAREKQRALQALEELRLQAEEAE 1675
Cdd:PRK09510   218 KAAAEAKAAAaKAAAEAKAAAEKAAAAKAAEKAAAAKAAAE 258
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1823-2029 1.29e-09

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 63.63  E-value: 1.29e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1823 QRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEE 1902
Cdd:COG4942     18 QADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRA 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1903 ESRSTSEKSKQRLEA--------------EAGRFRELAEEAARLRALAEEAKRQ-----RQLAEEDAVRQRAEAERvlAE 1963
Cdd:COG4942     98 ELEAQKEELAELLRAlyrlgrqpplalllSPEDFLDAVRRLQYLKYLAPARREQaeelrADLAELAALRAELEAER--AE 175
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 1964 KLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASE 2029
Cdd:COG4942    176 LEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAE 241
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1638-2005 1.42e-09

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 64.76  E-value: 1.42e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1638 QAEAELALRVQAEAEAAREKQRALQALEELrlqAEEAERRLRQAEAERARQvqvaletaqrsaeAELQSEHASFAEktaq 1717
Cdd:pfam17380  279 QHQKAVSERQQQEKFEKMEQERLRQEKEEK---AREVERRRKLEEAEKARQ-------------AEMDRQAAIYAE---- 338
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1718 lertlkEEHVAVVQLREeatrraqqqaeaeraraeaereLERWQLKANEALRLRLQAEEVAQQksLTQAEAEKQKEEAER 1797
Cdd:pfam17380  339 ------QERMAMERERE----------------------LERIRQEERKRELERIRQEEIAME--ISRMRELERLQMERQ 388
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1798 EARRRGKAEEQAVRQRELAEQE-----LEKQRQLAEGTAQQRLAAEQELIRLraETEQGEQQRQLLEEELARLQReaaaa 1872
Cdd:pfam17380  389 QKNERVRQELEAARKVKILEEErqrkiQQQKVEMEQIRAEQEEARQREVRRL--EEERAREMERVRLEEQERQQQ----- 461
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1873 TQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKskqrlEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAvR 1952
Cdd:pfam17380  462 VERLRQQEEERKRKKLELEKEKRDRKRAEEQRRKILEK-----ELEERKQAMIEEERKRKLLEKEMEERQKAIYEEER-R 535
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 1953 QRAEAER---VLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRL 2005
Cdd:pfam17380  536 REAEEERrkqQEMEERRRIQEQMRKATEERSRLEAMEREREMMRQIVESEKARAEY 591
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1619-2100 1.56e-09


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 64.40  E-value: 1.56e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1619 QEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQR 1698
Cdd:COG4717     52 EKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLY 131
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1699 SAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREeatrraqqqaeaeraraeaerelerwqlKANEALRLRLQAEEVA 1778
Cdd:COG4717    132 QELEALEAELAELPERLEELEERLEELRELEEELEE----------------------------LEAELAELQEELEELL 183
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1779 QQKSLTQAeaekqkeeaerearrrgKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLL 1858
Cdd:COG4717    184 EQLSLATE-----------------EELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERLK 246
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1859 EEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRStseKSKQRLEAEAGRFRELAEEAARLRALAEE 1938
Cdd:COG4717    247 EARLLLLIAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLA---REKASLGKEAEELQALPALEELEEEELEE 323
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1939 AKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLE--EQAAQHKAD 2016
Cdd:COG4717    324 LLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAEAGVEDEEELRAALEqaEEYQELKEE 403
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2017 IEARLAQLRKASESELERQKGLVEDTLRQR-RQVEEEILALKGSFEKAAAGKAELELELGRIRGtaEDTLRSKEQAEQEA 2095
Cdd:COG4717    404 LEELEEQLEELLGELEELLEALDEEELEEElEELEEELEELEEELEELREELAELEAELEQLEE--DGELAELLQELEEL 481

                   ....*
gi 1920237946 2096 ARQRQ 2100
Cdd:COG4717    482 KAELR 486
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1325-1703 2.07e-09

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 64.59  E-value: 2.07e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1325 QRELEQLGRQLRYYRESADplgawlrDAKQRQEQIQA-VPLANSQAvREQLRQekaLLEDIERHGEKVEECQRFAKQYIN 1403
Cdd:COG3096    849 ERELAQHRAQEQQLRQQLD-------QLKEQLQLLNKlLPQANLLA-DETLAD---RLEELREELDAAQEAQAFIQQHGK 917
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1404 AIkdyelqlvtykAQLEPVASPAKKPKVQSgsESIIQEYVDLRTRYSELStltsQYIRFISETLRRM------EEEERLA 1477
Cdd:COG3096    918 AL-----------AQLEPLVAVLQSDPEQF--EQLQADYLQAKEQQRRLK----QQIFALSEVVQRRphfsyeDAVGLLG 980
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1478 EQQRAEE--RERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEevarreevaveAQEQKRSIQEELQHLrqss 1555
Cdd:COG3096    981 ENSDLNEklRARLEQAEEARREAREQLRQAQAQYSQYNQVLASLKSSRDA-----------KQQTLQELEQELEEL---- 1045
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1556 eaEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAE-------RLRRQ 1628
Cdd:COG3096   1046 --GVQADAEAEERARIRRDELHEELSQNRSRRSQLEKQLTRCEAEMDSLQKRLRKAERDYKQEREQVVqakagwcAVLRL 1123
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237946 1629 VQDETQRKRQAEAELalrvqaeaeaarekqrALQALEELRLQAEEAERRLRQAEAERArQVQVALETAQRSAEAE 1703
Cdd:COG3096   1124 ARDNDVERRLHRREL----------------AYLSADELRSMSDKALGALRLAVADNE-HLRDALRLSEDPRRPE 1181
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
1815-2046 2.30e-09

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely on conserved operon structure. The Tol-Pal complex is required for maintaining outer membrane integrity. It is also involved in transport (uptake) of colicins and filamentous DNA, and is implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with the outer membrane porins OmpC, PhoE and LamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 62.56  E-value: 2.30e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1815 LAEQELEKQRQLAEGTAQqrlAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAelakvraemevll 1894
Cdd:TIGR02794   47 AVAQQANRIQQQKKPAAK---KEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQA------------- 110
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1895 askARAEEESRSTSEKSKQRLEAEAGRfrelAEEAARLRALAEEAKRQrqlAEEDAVRQRAEAervlAEKLAaisEATRL 1974
Cdd:TIGR02794  111 ---AKQAEEKQKQAEEAKAKQAAEAKA----KAEAEAERKAKEEAAKQ---AEEEAKAKAAAE----AKKKA---EEAKK 173
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 1975 KTEAEiALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIE----ARLAQLRKASESELERQKGLVEDTLRQR 2046
Cdd:TIGR02794  174 KAEAE-AKAKAEAEAKAKAEEAKAKAEAAKAKAAAEAAAKAEAEaaaaAAAEAERKADEAELGDIFGLASGSNAEK 248
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1502-1725 2.94e-09

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 62.47  E-value: 2.94e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1502 AEAHAQAKAQAEREAQGLQRRMQEEVARREEvaveAQEQKRSIQEELQHLRQ---SSEAEIQAKARQVEAAERSRLRIEE 1578
Cdd:COG4942     15 AAAQADAAAEAEAELEQLQQEIAELEKELAA----LKKEEKALLKQLAALERriaALARRIRALEQELAALEAELAELEK 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1579 EIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQ 1658
Cdd:COG4942     91 EIAELRAELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELE 170
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 1659 RALQALEELRLQAEEAERRLRQAEAERARQVQV--ALETAQRSAEAELQSEHASFAEKTAQLERTLKEE 1725
Cdd:COG4942    171 AERAELEALLAELEEERAALEALKAERQKLLARleKELAELAAELAELQQEAEELEALIARLEAEAAAA 239
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
2380-2731 3.03e-09

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 63.60  E-value: 3.03e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2380 LVLRDKDSAQRLLQEEAEKM-----KQVAEEAARlsvaaqEAARLRQLAEEDLAQQRALAEkmlkekmqavqeatrlkaE 2454
Cdd:pfam17380  277 IVQHQKAVSERQQQEKFEKMeqerlRQEKEEKAR------EVERRRKLEEAEKARQAEMDR------------------Q 332
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2455 AELLQQQKELAQEQARRL----QEDKEQMAQQLAQETQGFQKTLETERQR-QLEMSAEAERLRLRVAEMSRAQARAEEDA 2529
Cdd:pfam17380  333 AAIYAEQERMAMERERELerirQEERKRELERIRQEEIAMEISRMRELERlQMERQQKNERVRQELEAARKVKILEEERQ 412
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2530 RRFRKQAEDIGERLYRTELATQEKVmlvQTLETQRQqsdRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQ 2609
Cdd:pfam17380  413 RKIQQQKVEMEQIRAEQEEARQREV---RRLEEERA---REMERVRLEEQERQQQVERLRQQEEERKRKKLELEKEKRDR 486
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2610 LLQETQalqqsflsekdsllqRERCIEQEKAKLEQLFQDEVAKAQALREEqqrqqqqmqqekqqlaasMEEarRRQHEAE 2689
Cdd:pfam17380  487 KRAEEQ---------------RRKILEKELEERKQAMIEEERKRKLLEKE------------------MEE--RQKAIYE 531
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|..
gi 1920237946 2690 EGVRRQQEELQRlaqqqqqqEKLLAEENQRLRERLQHLEEER 2731
Cdd:pfam17380  532 EERRREAEEERR--------KQQEMEERRRIQEQMRKATEER 565
CH_FLNC_rpt2 cd21314
second calponin homology (CH) domain found in filamin-C (FLN-C) and similar proteins; ...
301-403 3.21e-09

second calponin homology (CH) domain found in filamin-C (FLN-C) and similar proteins; Filamin-C (FLN-C), also called FLNc, ABP-280-like protein, ABP-L, actin-binding-like protein, filamin-2, or gamma-filamin, is a muscle-specific filamin that plays a central role in muscle cells, probably by functioning as a large actin-cross-linking protein. It may be involved in reorganizing the actin cytoskeleton in response to signaling events, and may also display structural functions at the Z lines in muscle cells. FLN-C is critical for normal myogenesis and for maintaining the structural integrity of the muscle fibers. FLN-C contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409163  Cd Length: 115  Bit Score: 57.77  E-value: 3.21e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  301 TAKEKLLLWSQRMVEGcqgLRCDNFTTSWRDGRLFNAIIHRHKPTLI-DMNKVYRQTNLENLDQAFSVAERDLGVTRLLD 379
Cdd:cd21314     11 TPKQRLLGWIQNKVPQ---LPITNFNRDWQDGKALGALVDNCAPGLCpDWESWDPNQPVQNAREAMQQADDWLGVPQVIA 87
                           90       100
                   ....*....|....*....|....
gi 1920237946  380 PEDVDVPQPDEKSIITYVSSLYDA 403
Cdd:cd21314     88 PEEIVDPNVDEHSVMTYLSQFPKA 111
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
2290-2740 3.25e-09

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 63.45  E-value: 3.25e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2290 DAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKaevtEAARQRGQVEEELFSLRVQMEELGKL 2369
Cdd:TIGR00618  200 TLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYLTQKR----EAQEEQLKKQQLLKQLRARIEELRAQ 275
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2370 KARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEA-ARLSVAAQEAARLRQLA------EEDLAQQRALAEKMLKEKM 2442
Cdd:TIGR00618  276 EAVLEETQERINRARKAAPLAAHIKAVTQIEQQAQRIhTELQSKMRSRAKLLMKRaahvkqQSSIEEQRRLLQTLHSQEI 355
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2443 QAVQEATRLKAEAELLQQQKELAQeQARRLQEDKEQMAQQLaqetQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQ 2522
Cdd:TIGR00618  356 HIRDAHEVATSIREISCQQHTLTQ-HIHTLQQQKTTLTQKL----QSLCKELDILQREQATIDTRTSAFRDLQGQLAHAK 430
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2523 ARAEEDARRFRKQAEDIGER----------LYRTELATQEKVMLVQTLETQRQQSDR-----DAERLREAIAELEHEKDK 2587
Cdd:TIGR00618  431 KQQELQQRYAELCAAAITCTaqceklekihLQESAQSLKEREQQLQTKEQIHLQETRkkavvLARLLELQEEPCPLCGSC 510
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2588 LKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQM 2667
Cdd:TIGR00618  511 IHPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQ 590
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 2668 QQEKQQLAASMEEARRR------QHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEE 2740
Cdd:TIGR00618  591 NITVRLQDLTEKLSEAEdmlaceQHALLRKLQPEQDLQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRV 669
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
1493-1717 3.52e-09


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 62.52  E-value: 3.52e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1493 AALEKQRQLAEAHAQAKAQAEREAQGLQRrmQEEVarREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAK---ARQVEAA 1569
Cdd:PRK09510    60 VVEQYNRQQQQQKSAKRAEEQRKKKEQQQ--AEEL--QQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKqaaLKQKQAE 135
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1570 ERsrlrieeeirvvrlQLEATERQRGGAEGELQALRARAEEAEAQ-KRQAQEEAerlrrQVQDETQRKRQAEAELALRVQ 1648
Cdd:PRK09510   136 EA--------------AAKAAAAAKAKAEAEAKRAAAAAKKAAAEaKKKAEAEA-----AKKAAAEAKKKAEAEAAAKAA 196
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 1649 AEAEAAREKQRALQAleelrlqAEEAErrlRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQ 1717
Cdd:PRK09510   197 AEAKKKAEAEAKKKA-------AAEAK---KKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKA 255
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1946-2479 3.60e-09


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 63.54  E-value: 3.60e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1946 AEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEdEAFQRRLLEEQAAQHKADIEARLAQLR 2025
Cdd:PRK03918   187 RTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKE-EIEELEKELESLEGSKRKLEEKIRELE 265
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2026 KASESELERQKGLVEDT--LRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAA 2103
Cdd:PRK03918   266 ERIEELKKEIEELEEKVkeLKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKK 345
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2104 EEERrrreaeerVQKSLAAEEEAArqrkaalEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAfA 2183
Cdd:PRK03918   346 KLKE--------LEKRLEELEERH-------ELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEIS-K 409
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2184 VQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEReaaqsRRQVEEAERLKQSAEEQAQAQAQAQAAAEK 2263
Cdd:PRK03918   410 ITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELL-----EEYTAELKRIEKELKEIEEKERKLRKELRE 484
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2264 LRKEAEQEAARraqaeqaaLRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKA--- 2340
Cdd:PRK03918   485 LEKVLKKESEL--------IKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEElkk 556
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2341 EVTEAARQRGQVEEELFSLRVQMEELG---------KLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEA-ARLS 2410
Cdd:PRK03918   557 KLAELEKKLDELEEELAELLKELEELGfesveeleeRLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAfEELA 636
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2411 VAAQEAARLR-QLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQM 2479
Cdd:PRK03918   637 ETEKRLEELRkELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEER 706
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1239-1660 3.81e-09

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 63.22  E-value: 3.81e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1239 KEAQAVPATLPELEA-----------------TKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERdVEVE 1301
Cdd:pfam17380  221 KEVQGMPHTLAPYEKmerrkesfnlaedvttmTPEYTVRYNGQTMTENEFLNQLLHIVQHQKAVSERQQQEKFEK-MEQE 299
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1302 RWR---ERVTLLLERWQAVLAQTDVRQRELEqlgRQLRYYRESAdplgawlRDAKQRQEQIQAVPLANSQAVREQLRQEK 1378
Cdd:pfam17380  300 RLRqekEEKAREVERRRKLEEAEKARQAEMD---RQAAIYAEQE-------RMAMERERELERIRQEERKRELERIRQEE 369
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1379 ALLEDierhgEKVEECQRFakqyinaikdyelqlvtykaQLEPvaspakkpkvQSGSESIIQEYVDLRtrysELSTLTSQ 1458
Cdd:pfam17380  370 IAMEI-----SRMRELERL--------------------QMER----------QQKNERVRQELEAAR----KVKILEEE 410
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1459 YIRFISETLRRMEEEERLAEQQRAEERERLAEveaalEKQRQLAEAHAQakaQAEREAQGLQRRMQEEVARREEVAVEAQ 1538
Cdd:pfam17380  411 RQRKIQQQKVEMEQIRAEQEEARQREVRRLEE-----ERAREMERVRLE---EQERQQQVERLRQQEEERKRKKLELEKE 482
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1539 EQKRSIQEELQhlRQSSEAEIQAKARQVEAAERSRLRIEEEIRvvrlqleatERQRGGAEGElqalRARAEEAEAQKRQA 1618
Cdd:pfam17380  483 KRDRKRAEEQR--RKILEKELEERKQAMIEEERKRKLLEKEME---------ERQKAIYEEE----RRREAEEERRKQQE 547
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|..
gi 1920237946 1619 QEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRA 1660
Cdd:pfam17380  548 MEERRRIQEQMRKATEERSRLEAMEREREMMRQIVESEKARA 589
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
4513-4551 4.42e-09

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 54.64  E-value: 4.42e-09
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 4513 FLEVQYLTGGLIEPDTPGRVALDEALQRGTVDARTAQKL 4551
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
CH_dFLNA-like_rpt2 cd21315
second calponin homology (CH) domain found in Drosophila melanogaster filamin-A (dFLNA) and ...
296-398 4.49e-09

second calponin homology (CH) domain found in Drosophila melanogaster filamin-A (dFLNA) and similar proteins; Drosophila melanogaster filamin-A (dFLNA or dFLN-A), also called actin-binding protein 280 (ABP-280) or filamin-1, is involved in germline ring canal formation. It may tether actin microfilaments within the ovarian ring canal to the cell membrane and contributes to actin microfilament organization. dFLNA contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409164  Cd Length: 118  Bit Score: 57.10  E-value: 4.49e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  296 QSEDMTAKEKLLLWSQRMVEGcqgLRCDNFTTSWRDGRLFNAIIHRHKPTLI-DMNKVYRQTNLENLDQAFSVAERDLGV 374
Cdd:cd21315     11 DGKGPTPKQRLLGWIQSKVPD---LPITNFTNDWNDGKAIGALVDALAPGLCpDWEDWDPKDAVKNAKEAMDLAEDWLDV 87
                           90       100
                   ....*....|....*....|....
gi 1920237946  375 TRLLDPEDVDVPQPDEKSIITYVS 398
Cdd:cd21315     88 PQLIKPEEMVNPKVDELSMMTYLS 111
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1221-2158 4.52e-09

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 63.43  E-value: 4.52e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1221 IRSTQEAEEVLRAHEEQLKEAQAvpatlpeleatkaALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQ--RHGERdv 1298
Cdd:COG3096    284 SERALELRRELFGARRQLAEEQY-------------RLVEMARELEELSARESDLEQDYQAASDHLNLVQTalRQQEK-- 348
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1299 eVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVReqlrqek 1378
Cdd:COG3096    349 -IERYQEDLEELTERLEEQEEVVEEAAEQLAEAEARLEAAEEEVDSLKSQLADYQQALDVQQTRAIQYQQAVQ------- 420
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1379 ALledierhgEKVEECQRFAKQYINAIKDYelqLVTYKAQLEpvaspakkpkvqsgseSIIQEYVDLRTRYSELSTLTSQ 1458
Cdd:COG3096    421 AL--------EKARALCGLPDLTPENAEDY---LAAFRAKEQ----------------QATEEVLELEQKLSVADAARRQ 473
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1459 YIRFIsETLRRMEEE-ERLAEQQRAEERER-------LAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARR 1530
Cdd:COG3096    474 FEKAY-ELVCKIAGEvERSQAWQTARELLRryrsqqaLAQRLQQLRAQLAELEQRLRQQQNAERLLEEFCQRIGQQLDAA 552
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1531 EEVAVEAQEQKRSIQEELQHLRQSSEAEIQakarqveaaersrlrieeeirvvrlqleaTERQRGGAEGELQALRARAEE 1610
Cdd:COG3096    553 EELEELLAELEAQLEELEEQAAEAVEQRSE-----------------------------LRQQLEQLRARIKELAARAPA 603
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1611 AeaqkRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAeeaeRRLRQAE-AERARQV 1689
Cdd:COG3096    604 W----LAAQDALERLREQSGEALADSQEVTAAMQQLLEREREATVERDELAARKQALESQI----ERLSQPGgAEDPRLL 675
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1690 Q----------------VALETAQR-------SAEAELQSEHASFAEKTAQLERTLkeEHVAVVQLREEATRRAQQQAEA 1746
Cdd:COG3096    676 AlaerlggvllseiyddVTLEDAPYfsalygpARHAIVVPDLSAVKEQLAGLEDCP--EDLYLIEGDPDSFDDSVFDAEE 753
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1747 ERARAEAERELERWQL-----------KANE--ALRLRLQAEEVAQQKSltqaeaekqkeeaerearrrgkaeEQAVRQR 1813
Cdd:COG3096    754 LEDAVVVKLSDRQWRYsrfpevplfgrAAREkrLEELRAERDELAEQYA------------------------KASFDVQ 809
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1814 ELaeQELEKQ-RQLAEGTAQQRLAAEQElirlrAETEQGEQQRQLLEEELARL----QREAAAATQKRRELEAeLAKVRA 1888
Cdd:COG3096    810 KL--QRLHQAfSQFVGGHLAVAFAPDPE-----AELAALRQRRSELERELAQHraqeQQLRQQLDQLKEQLQL-LNKLLP 881
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1889 EMEVL----LASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRA--LAEEAKRQRQL---AEEDAVRQRAEAER 1959
Cdd:COG3096    882 QANLLadetLADRLEELREELDAAQEAQAFIQQHGKALAQLEPLVAVLQSdpEQFEQLQADYLqakEQQRRLKQQIFALS 961
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1960 VLAEKLAAISEAtrlktEAEIALKEKEAENERLR---RLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQK 2036
Cdd:COG3096    962 EVVQRRPHFSYE-----DAVGLLGENSDLNEKLRarlEQAEEARREAREQLRQAQAQYSQYNQVLASLKSSRDAKQQTLQ 1036
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2037 GLvedtlrQRRQVEEEILALKGSFEKAAAGKAELELELGRIRG--TAEDTLRSKEQAEQEAARQRqlaaeeerrrreaEE 2114
Cdd:COG3096   1037 EL------EQELEELGVQADAEAEERARIRRDELHEELSQNRSrrSQLEKQLTRCEAEMDSLQKR-------------LR 1097
                          970       980       990      1000
                   ....*....|....*....|....*....|....*....|....*.
gi 1920237946 2115 RVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRL--RERAEQES 2158
Cdd:COG3096   1098 KAERDYKQEREQVVQAKAGWCAVLRLARDNDVERRLhrRELAYLSA 1143
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1467-1687 5.07e-09

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 61.70  E-value: 5.07e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1467 LRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQE 1546
Cdd:COG4942     29 LEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAE 108
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1547 ---ELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAE 1623
Cdd:COG4942    109 llrALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERA 188
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 1624 RLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERAR 1687
Cdd:COG4942    189 ALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAAGFAALK 252
CH_AtFIM_like_rpt3 cd21299
third calponin homology (CH) domain found in the Arabidopsis thaliana fimbrin family; The ...
186-285 5.58e-09

third calponin homology (CH) domain found in the Arabidopsis thaliana fimbrin family; The Arabidopsis thaliana fimbrin (AtFIM) family includes Fimbrin-1, -2, -3, -4, and -5, which cross-link actin filaments (F-actin) in a calcium-independent manner. They stabilize F-actin and prevent its depolymerization mediated by profilin. They act as key regulators of actin cytoarchitecture and are probably involved in cell cycle, cell division, cell elongation and cytoplasmic tractus. AtFIM5 is an actin bundling factor that is required for pollen germination and pollen tube growth. Members of this family contain four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409148  Cd Length: 114  Bit Score: 56.74  E-value: 5.58e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  186 QKKTFTKWVNKhlIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKG-----RMRFHKLQNVQIALDYLRHRQVKLV 260
Cdd:cd21299      5 EERCFRLWINS--LGIDTYVNNVFEDVRDGWVLLEVLDKVSPGSVNWKHAnkppiKMPFKKVENCNQVVKIGKQLKFSLV 82
                           90       100
                   ....*....|....*....|....*
gi 1920237946  261 NIRNDDIADGNPKLTLGLIWTIILH 285
Cdd:cd21299     83 NVAGNDIVQGNKKLILALLWQLMRY 107
PLEC smart00250
Plectin repeat;
4435-4472 5.75e-09

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 54.41  E-value: 5.75e-09
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1920237946  4435 QRLLEAQACTGGIIDPSTGERFPVTDAVNKGLVDKIMV 4472
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
1473-1687 6.23e-09

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely on conserved operon structure. The Tol-Pal complex is required for maintaining outer membrane integrity. It is also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 61.40  E-value: 6.23e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1473 EERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSiQEELQHLR 1552
Cdd:TIGR02794   49 AQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQK-QAEEAKAK 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1553 QSSEAEIQAKA-RQVEAAERSRLRIEEEirvvRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEE----AERLRR 1627
Cdd:TIGR02794  128 QAAEAKAKAEAeAERKAKEEAAKQAEEE----AKAKAAAEAKKKAEEAKKKAEAEAKAKAEAEAKAKAEEakakAEAAKA 203
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1628 QVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERAR 1687
Cdd:TIGR02794  204 KAAAEAAAKAEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEKQGGARGAAAGSEVDK 263
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
2283-2739 7.85e-09

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 62.24  E-value: 7.85e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2283 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEET-----DHQKSILDEELQRLKAEVTEAARQRGQVEEELF 2357
Cdd:COG4913    247 AREQIELLEPIRELAERYAAARERLAELEYLRAALRLWFAQRrlellEAELEELRAELARLEAELERLEARLDALREELD 326
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2358 SLRVQMEELG-----KLKARIEAENRALVLRDKDSAQrlLQEEAEKMK-QVAEEAARLSVAAQEAARLRQLAEEDLAQQR 2431
Cdd:COG4913    327 ELEAQIRGNGgdrleQLEREIERLERELEERERRRAR--LEALLAALGlPLPASAEEFAALRAEAAALLEALEEELEALE 404
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2432 ALAEKMLKEKMQAVQEATRLKAEAELLQQQK---ELAQEQARR-----LQEDKEQM------------------------ 2479
Cdd:COG4913    405 EALAEAEAALRDLRRELRELEAEIASLERRKsniPARLLALRDalaeaLGLDEAELpfvgelievrpeeerwrgaiervl 484
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2480 ---AQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLyRTELATQEKVML 2556
Cdd:COG4913    485 ggfALTLLVPPEHYAAALRWVNRLHLRGRLVYERVRTGLPDPERPRLDPDSLAGKLDFKPHPFRAWL-EAELGRRFDYVC 563
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2557 VQTLE-------------------TQRQQSDRDAERL--------REAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQ-- 2607
Cdd:COG4913    564 VDSPEelrrhpraitragqvkgngTRHEKDDRRRIRSryvlgfdnRAKLAALEAELAELEEELAEAEERLEALEAELDal 643
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2608 EQLLQETQALQQSFLSEKDsLLQRERCIEQEKAKLEQL--FQDEVAKAQALREEqqrqqqqmqqekqqLAASMEEARRRQ 2685
Cdd:COG4913    644 QERREALQRLAEYSWDEID-VASAEREIAELEAELERLdaSSDDLAALEEQLEE--------------LEAELEELEEEL 708
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 2686 HEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSE 2739
Cdd:COG4913    709 DELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDA 762
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1616-2136 9.22e-09

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 62.24  E-value: 9.22e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1616 RQAQEEAERLRRQVQD----ETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARqvqv 1691
Cdd:COG4913    238 ERAHEALEDAREQIELlepiRELAERYAAARERLAELEYLRAALRLWFAQRRLELLEAELEELRAELARLEAELER---- 313
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1692 aLETAQRSAEAELQSEHASFAEKTAQLERTLKEEhvavvqlreeatrraqqqaeaeraraeaereLERWQLKANEALRLR 1771
Cdd:COG4913    314 -LEARLDALREELDELEAQIRGNGGDRLEQLERE-------------------------------IERLERELEERERRR 361
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1772 LQAEEVAQQKSLTQAEAEKQKEeaerearrrgKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQG 1851
Cdd:COG4913    362 ARLEALLAALGLPLPASAEEFA----------ALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASL 431
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1852 EQQRQLLEEELARLQREAAAATQ-KRRELE--AELAKVRAE-------MEVLLASKAR---------------------- 1899
Cdd:COG4913    432 ERRKSNIPARLLALRDALAEALGlDEAELPfvGELIEVRPEeerwrgaIERVLGGFALtllvppehyaaalrwvnrlhlr 511
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1900 ------------AEEESRSTSEKS-KQRLEAEAGRFR-----ELAEEAARLRALAEEA-----------------KRQRQ 1944
Cdd:COG4913    512 grlvyervrtglPDPERPRLDPDSlAGKLDFKPHPFRawleaELGRRFDYVCVDSPEElrrhpraitragqvkgnGTRHE 591
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1945 LAEEDAVRQR----AEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEE-----QAAQHKA 2015
Cdd:COG4913    592 KDDRRRIRSRyvlgFDNRAKLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDeidvaSAEREIA 671
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2016 DIEARLAQLRKASeSELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRirgtAEDTLRSKEQAEQEA 2095
Cdd:COG4913    672 ELEAELERLDASS-DDLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDE----LQDRLEAAEDLARLE 746
                          570       580       590       600
                   ....*....|....*....|....*....|....*....|.
gi 1920237946 2096 ARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEE 2136
Cdd:COG4913    747 LRALLEERFAAALGDAVERELRENLEERIDALRARLNRAEE 787
CH_FLNB_rpt2 cd21313
second calponin homology (CH) domain found in filamin-B (FLN-B) and similar proteins; ...
296-403 1.35e-08

second calponin homology (CH) domain found in filamin-B (FLN-B) and similar proteins; Filamin-B (FLN-B) is also called ABP-278, ABP-280 homolog, actin-binding-like protein, beta-filamin, filamin homolog 1 (Fh1), filamin-3, thyroid autoantigen, truncated actin-binding protein, or truncated ABP. It connects cell membrane constituents to the actin cytoskeleton. It may promote orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It anchors various transmembrane proteins to the actin cytoskeleton. FLN-B contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409162  Cd Length: 110  Bit Score: 55.48  E-value: 1.35e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  296 QSEDMTAKEKLLLWSQRMVEGcqgLRCDNFTTSWRDGRLFNAIIHRHKPTLI-DMNKVYRQTNLENLDQAFSVAERDLGV 374
Cdd:cd21313      3 DAKKQTPKQRLLGWIQNKIPY---LPITNFNQNWQDGKALGALVDSCAPGLCpDWESWDPQKPVDNAREAMQQADDWLGV 79
                           90       100
                   ....*....|....*....|....*....
gi 1920237946  375 TRLLDPEDVDVPQPDEKSIITYVSSLYDA 403
Cdd:cd21313     80 PQVITPEEIIHPDVDEHSVMTYLSQFPKA 108
SPEC cd00176
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ...
648-837 1.37e-08

Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here


Pssm-ID: 238103 [Multi-domain]  Cd Length: 213  Bit Score: 58.23  E-value: 1.37e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  648 LRYLQDLLAWVEENQRRLDSAEWGVDLPSVEAQLGSHRGLHQSVEEFRTKIERARTDEGQLSPATRGAY---RDCLGRLD 724
Cdd:cd00176      6 LRDADELEAWLSEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAeeiQERLEELN 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  725 LQYAKLLSSSKARLRSLE---SLHGFVAAATKELMWLSDREEEEVGFDWSDRNTNMAAKKEGYSALMHELELKEKKIKEI 801
Cdd:cd00176     86 QRWEELRELAEERRQRLEealDLQQFFRDADDLEQWLEEKEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRLKSL 165
                          170       180       190
                   ....*....|....*....|....*....|....*..
gi 1920237946  802 QSTGDRLLREDHP-ARPTAESFQAALQTQWSWMLQLC 837
Cdd:cd00176    166 NELAEELLEEGHPdADEEIEEKLEELNERWEELLELA 202
COG3903 COG3903
Predicted ATPase [General function prediction only];
1504-1963 1.51e-08

Predicted ATPase [General function prediction only];


Pssm-ID: 443109 [Multi-domain]  Cd Length: 933  Bit Score: 61.57  E-value: 1.51e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1504 AHAQAKAQAEREAQGLQRRMQEEVARreeVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVV 1583
Cdd:COG3903    475 EYAAERLAEAGERAAARRRHADYYLA---LAERAAAELRGPDQLAWLARLDAEHDNLRAALRWALAHGDAELALRLAAAL 551
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1584 RLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQA 1663
Cdd:COG3903    552 APFWFLRGLLREGRRWLERALAAAGEAAAALAAAAALAAAAAAARAAAAAAAAAAAAAAAAAAAAAAAAAALLLLAALAA 631
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1664 LEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQ 1743
Cdd:COG3903    632 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAALAAAAAALAAAAAAAALAAAAAAAL 711
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1744 AEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQ 1823
Cdd:COG3903    712 AAAAAAAAAAAAAAALLAAAAAAALAAAAAAAALALAAAAAAAAAAAAAAALAAAAAAAALAALLLALAAAAAALAAAAA 791
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1824 RQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEE 1903
Cdd:COG3903    792 AAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAALAAALAAAAAAAAAAAAAAAAAAALAAALAAAAAAAAAAALA 871
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1904 SRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAE 1963
Cdd:COG3903    872 AAAAAAAAAAAALLAAAAAAAAAAAAAAAAAAALAAAAAAAAAAALAAAAAAAAAAAAAA 931
WEMBL pfam05701
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required ...
1538-2078 1.77e-08

Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required for the chloroplast avoidance response under high-intensity blue light. This avoidance response consists of the relocation of chloroplasts to the anticlinal side of exposed cells. WEMBL acts in association with PMI2 to maintain the velocity of chloroplast photo-relocation movement via the regulation of cp-actin filaments. Thus, several member sequences are described as "myosin heavy chain-like".


Pssm-ID: 461718 [Multi-domain]  Cd Length: 562  Bit Score: 60.81  E-value: 1.77e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1538 QEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEeirvVRLQLE--ATERQRGGAEGELQALRARaeeaEAQK 1615
Cdd:pfam05701   41 ELELEKVQEEIPEYKKQSEAAEAAKAQVLEELESTKRLIEE----LKLNLEraQTEEAQAKQDSELAKLRVE----EMEQ 112
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1616 RQAQEEAERLRRQVQDETQRKRQAEAELALrVQAE--------AEAAREKQRALQALEELRLQAEEAERRLRQAEAERAr 1687
Cdd:pfam05701  113 GIADEASVAAKAQLEVAKARHAAAVAELKS-VKEEleslrkeyASLVSERDIAIKRAEEAVSASKEIEKTVEELTIELI- 190
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1688 QVQVALETAQRS-AEAELQSEHASFA--EKTAQLERTLKEEHVAVVQLREEATRRAQQQAeaeraraeaerelerwQLKA 1764
Cdd:pfam05701  191 ATKESLESAHAAhLEAEEHRIGAALAreQDKLNWEKELKQAEEELQRLNQQLLSAKDLKS----------------KLET 254
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1765 NEALRLRLQAEEVAQQKSltqaeaekqkeEAEREARRRGKAEEQAVRQRE---LAEQELEKQRQLAEgtaqqRLAAEQEL 1841
Cdd:pfam05701  255 ASALLLDLKAELAAYMES-----------KLKEEADGEGNEKKTSTSIQAalaSAKKELEEVKANIE-----KAKDEVNC 318
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1842 IRLRAETEQGEQQRQllEEELARLQREAAAATQKRRELEAELAKVRAEMEVLlasKARAEEESRSTSEKSKQRLEAeagr 1921
Cdd:pfam05701  319 LRVAAASLRSELEKE--KAELASLRQREGMASIAVSSLEAELNRTKSEIALV---QAKEKEAREKMVELPKQLQQA---- 389
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1922 frelAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEA--ENERLRRLAEDE 1999
Cdd:pfam05701  390 ----AQEAEEAKSLAQAAREELRKAKEEAEQAKAAASTVESRLEAVLKEIEAAKASEKLALAAIKAlqESESSAESTNQE 465
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2000 AFQR----------------RLLEEQAaqhKADIEARLAQLRKASESELERQKGLvEDTLRQRRQVEEEILALKGSFEKA 2063
Cdd:pfam05701  466 DSPRgvtlsleeyyelskraHEAEELA---NKRVAEAVSQIEEAKESELRSLEKL-EEVNREMEERKEALKIALEKAEKA 541
                          570
                   ....*....|....*
gi 1920237946 2064 AAGKAELELELGRIR 2078
Cdd:pfam05701  542 KEGKLAAEQELRKWR 556
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
1816-2018 2.30e-08

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 59.82  E-value: 2.30e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1816 AEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAA--AATQKRRELEAELAKVRAEMEVL 1893
Cdd:PRK09510    77 AEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQkqAEEAAAKAAAAAKAKAEAEAKRA 156
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1894 LASKARAEEESrstseksKQRLEAEAgrfRELAEEAARLRALAEEAKrqrQLAEEdaVRQRAEAErvlAEKLAAISEATR 1973
Cdd:PRK09510   157 AAAAKKAAAEA-------KKKAEAEA---AKKAAAEAKKKAEAEAAA---KAAAE--AKKKAEAE---AKKKAAAEAKKK 218
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 1920237946 1974 LKTEAEIALKEKEAENErlrRLAEDEAFQRRLLEEQAAQHKADIE 2018
Cdd:PRK09510   219 AAAEAKAAAAKAAAEAK---AAAEKAAAAKAAEKAAAAKAAAEVD 260
EntF COG1020
EntF, seryl-AMP synthase component of non-ribosomal peptide synthetase [Secondary metabolites ...
1488-1900 2.32e-08

EntF, seryl-AMP synthase component of non-ribosomal peptide synthetase [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 440643 [Multi-domain]  Cd Length: 1329  Bit Score: 61.03  E-value: 2.32e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1488 LAEVEAALEKQRQLAEAHAQAkaqaeREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSE----------- 1556
Cdd:COG1020    885 LGEIEAALLQHPGVREAVVVA-----REDAPGDKRLVAYVVPEAGAAAAAALLRLALALLLPPYMVPAAvvlllplpltg 959
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1557 --------AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQ 1628
Cdd:COG1020    960 ngkldrlaLPAPAAAAAAAAAAPPAEEEEEEAALALLLLLVVVVGDDDFFFFGGGLGLLLLLALARAARLLLLLLLLLLL 1039
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1629 VQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEH 1708
Cdd:COG1020   1040 FLAAAAAAAAAAAAAAAAAAAAPLAAAAAPLPLPPLLLSLLALLLALLLLLALLALLALLLLLLLLLLLLALLLLLALLL 1119
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1709 ASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEA 1788
Cdd:COG1020   1120 ALLAALRARRAVRQEGPRLRLLVALAAALALAALLALLLAAAAAAAELLAAAALLLLLALLLLALLLLLLLLLLLLLLLL 1199
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1789 EKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQRE 1868
Cdd:COG1020   1200 LLLLLLLLLLLLLLLLLLLLLLLLLAAAAAALLALALLLALLALAALLALAALAALAAALLALALALLALALLLLALALL 1279
                          410       420       430
                   ....*....|....*....|....*....|..
gi 1920237946 1869 AAAATQKRRELEAELAKVRAEMEVLLASKARA 1900
Cdd:COG1020   1280 LPALARARAARTARALALLLLLALLLLLALAL 1311
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
1226-1581 2.33e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 60.84  E-value: 2.33e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1226 EAEEVLRAHEEQLKEAQAVPATL-PELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWR 1304
Cdd:TIGR02168  681 ELEEKIEELEEKIAELEKALAELrKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELE 760
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1305 ERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLrDAKQRQEQIQAVPLANSQAVREQLRQEKALLED- 1383
Cdd:TIGR02168  761 AEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREAL-DELRAELTLLNEEAANLRERLESLERRIAATERr 839
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1384 IERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEpvaspakkpKVQSGSESIIQEYVDLRTRYSELstltsqyirfi 1463
Cdd:TIGR02168  840 LEDLEEQIEELSEDIESLAAEIEELEELIEELESELE---------ALLNERASLEEALALLRSELEEL----------- 899
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1464 SETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQ------------------LAEAHAQAKAQAEREAQGLQRRMQE 1525
Cdd:TIGR02168  900 SEELRELESKRSELRRELEELREKLAQLELRLEGLEVridnlqerlseeysltleEAEALENKIEDDEEEARRRLKRLEN 979
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 1526 EVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAErsrlRIEEEIR 1581
Cdd:TIGR02168  980 KIKELGPVNLAAIEEYEELKERYDFLTAQKEDLTEAKETLEEAIE----EIDREAR 1031
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
2292-2740 2.43e-08

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 60.85  E-value: 2.43e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2292 EMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEE--ELFSLRVQME----E 2365
Cdd:PRK03918   232 ELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEyiKLSEFYEEYLdelrE 311
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2366 LGKLKARIEAENRAL--VLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEdlaqqralAEKMLKEKmq 2443
Cdd:PRK03918   312 IEKRLSRLEEEINGIeeRIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEE--------LERLKKRL-- 381
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2444 AVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQ-----RQLEMSAEAERLRLRVAEM 2518
Cdd:PRK03918   382 TGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKcpvcgRELTEEHRKELLEEYTAEL 461
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2519 SRAQ---ARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKL-KQEAQL 2594
Cdd:PRK03918   462 KRIEkelKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLKEKLiKLKGEI 541
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2595 LQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRER-----CIEQEKAKLEQL--FQDEVAKAQALREEQQRQQQQM 2667
Cdd:PRK03918   542 KSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEelgfeSVEELEERLKELepFYNEYLELKDAEKELEREEKEL 621
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237946 2668 QQEKQQLAASMEEARRRQHEAEEgVRRQQEELQRlaqqqqqqeKLLAEENQRLRERLQHLEEERRAALARSEE 2740
Cdd:PRK03918   622 KKLEEELDKAFEELAETEKRLEE-LRKELEELEK---------KYSEEEYEELREEYLELSRELAGLRAELEE 684
CH_NAV3 cd21286
calponin homology (CH) domain found in neuron navigator 3; Neuron navigator 3 (NAV3), also ...
188-282 2.89e-08

calponin homology (CH) domain found in neuron navigator 3; Neuron navigator 3 (NAV3), also called pore membrane and/or filament-interacting-like protein 1 (POMFIL1), Steerin-3 (STEERIN3), or Unc-53 homolog 3 (unc53H3), may regulate IL2 production by T-cells. It may be involved in neuron regeneration. NAV3 contains a single copy of the CH domain at the N-terminus. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409135  Cd Length: 105  Bit Score: 54.65  E-value: 2.89e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  188 KTFTKWVNKHLIKA--QRHISDLYEDLRDGHNLISLLEVLSGD------SLPREKGRMrfhkLQNVQIALDYLRHRQVKL 259
Cdd:cd21286      3 KIYTDWANHYLAKSghKRLIKDLQQDIADGVLLAEIIQIIANEkvedinGCPRSQSQM----IENVDVCLSFLAARGVNV 78
                           90       100
                   ....*....|....*....|...
gi 1920237946  260 VNIRNDDIADGNPKLTLGLIWTI 282
Cdd:cd21286     79 QGLSAEEIRNGNLKAILGLFFSL 101
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
2301-2741 2.89e-08

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 and 1234 amino acids in length. This family contains a P-loop motif, suggesting it is a nucleotide-binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 60.62  E-value: 2.89e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2301 EQALRQKAQVEQELTALRLQL-EETDHQKSILDEELQRLKAEVTEAARQRGQV---EEELFSLRVQMEELGKLKARIEAE 2376
Cdd:pfam12128  404 EARDRQLAVAEDDLQALESELrEQLEAGKLEFNEEEYRLKSRLGELKLRLNQAtatPELLLQLENFDERIERAREEQEAA 483
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2377 NRAlvlrdkdsaQRLLQEEAEKMKQVAEEAARlsvaaqeaaRLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAE 2456
Cdd:pfam12128  484 NAE---------VERLQSELRQARKRRDQASE---------ALRQASRRLEERQSALDELELQLFPQAGTLLHFLRKEAP 545
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2457 LLQQQ--KELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRA-------QARAEE 2527
Cdd:pfam12128  546 DWEQSigKVISPELLHRTDLDPEVWDGSVGGELNLYGVKLDLKRIDVPEWAASEEELRERLDKAEEAlqsarekQAAAEE 625
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2528 DARRFRKQAE--DIGERLYRTELaTQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTV 2605
Cdd:pfam12128  626 QLVQANGELEkaSREETFARTAL-KNARLDLRRLFDEKQSEKDKKNKALAERKDSANERLNSLEAQLKQLDKKHQAWLEE 704
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2606 RQEQLLQETQALQQSFL---SEKDSLLQR-----ERCIEQEKAKLEQLfQDEVAKAQALREEQQRQQQQMQQEKQQLAAS 2677
Cdd:pfam12128  705 QKEQKREARTEKQAYWQvveGALDAQLALlkaaiAARRSGAKAELKAL-ETWYKRDLASLGVDPDVIAKLKREIRTLERK 783
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237946 2678 MEEARRRQHEAEEGVRRQQE----ELQRLAQQQQQQEKLLAEENQRL-------RERLQHLEEERRAALARSEEI 2741
Cdd:pfam12128  784 IERIAVRRQEVLRYFDWYQEtwlqRRPRLATQLSNIERAISELQQQLarliadtKLRRAKLEMERKASEKQQVRL 858
growth_prot_Scy NF041483
polarized growth protein Scy;
2284-2730 3.11e-08

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 60.61  E-value: 3.11e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2284 RQKQAADAEMEKHKqfAEQAL-RQKAQVEQELTALRLQLEE-TDHQksildEELQRLKAEVTEAARQRGQVEEELFSLRV 2361
Cdd:NF041483   195 RQRLGSEAESARAE--AEAILrRARKDAERLLNAASTQAQEaTDHA-----EQLRSSTAAESDQARRQAAELSRAAEQRM 267
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2362 QMEELGKLKARIEAEnRALVLRDKDSAQRLLQEEA---EKMKQVAEEAARL-SVAAQEAARLRQLAEEDLAQQRALAEKM 2437
Cdd:NF041483   268 QEAEEALREARAEAE-KVVAEAKEAAAKQLASAESaneQRTRTAKEEIARLvGEATKEAEALKAEAEQALADARAEAEKL 346
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2438 LKEKMQAVQEATRLKAEAELlqqqkelaqEQARRLQEDKEQMAQQLAQETQGfQKTLETERQRQlEMSAEAERLRLRVAE 2517
Cdd:NF041483   347 VAEAAEKARTVAAEDTAAQL---------AKAARTAEEVLTKASEDAKATTR-AAAEEAERIRR-EAEAEADRLRGEAAD 415
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2518 MS-RAQARAEEDARRFRKQAEDIGE--RLYRTElATQEKVMLVQTLETQRQQSDRDA-ERLREAIAELEHEKDKLKQEA- 2592
Cdd:NF041483   416 QAeQLKGAAKDDTKEYRAKTVELQEeaRRLRGE-AEQLRAEAVAEGERIRGEARREAvQQIEEAARTAEELLTKAKADAd 494
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2593 QLLQLKSEEMQTVRQEQLLQETQALQQSflsekDSLLQRERCiEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQ 2672
Cdd:NF041483   495 ELRSTATAESERVRTEAIERATTLRRQA-----EETLERTRA-EAERLRAEAEEQAEEVRAAAERAARELREETERAIAA 568
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2673 QLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLR----ERLQHLEEE 2730
Cdd:NF041483   569 RQAEAAEELTRLHTEAEERLTAAEEALADARAEAERIRREAAEETERLRteaaERIRTLQAQ 630
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1606-2057 3.43e-08

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 59.78  E-value: 3.43e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1606 ARAEEAEAQKRQAQEEAERLRRQVQD-ETQRKRQAEAELAL----RVQAEAEAAREKQRALQALEELRLQAEEAERRLRQ 1680
Cdd:COG4717     71 KELKELEEELKEAEEKEEEYAELQEElEELEEELEELEAELeelrEELEKLEKLLQLLPLYQELEALEAELAELPERLEE 150
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1681 AEAERARQVQV-----ALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEaer 1755
Cdd:COG4717    151 LEERLEELRELeeeleELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEE--- 227
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1756 elerwQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAvrqrELAEQELEKQRQLAEGTAQQRL 1835
Cdd:COG4717    228 -----ELEQLENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIA----GVLFLVLGLLALLFLLLAREKA 298
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1836 AAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRstsEKSKQRL 1915
Cdd:COG4717    299 SLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEEL---EQEIAAL 375
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1916 EAEAGrfrelAEEAARLRALAEEAKRQRQLAEEdavrqRAEAERVLAEKLAAISEATRLKTEAEiaLKEKEAENERLRRL 1995
Cdd:COG4717    376 LAEAG-----VEDEEELRAALEQAEEYQELKEE-----LEELEEQLEELLGELEELLEALDEEE--LEEELEELEEELEE 443
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 1996 AEDEafqrrllEEQAAQHKADIEARLAQLrkASESELERQKGLVEDTLRQRRQVEEEILALK 2057
Cdd:COG4717    444 LEEE-------LEELREELAELEAELEQL--EEDGELAELLQELEELKAELRELAEEWAALK 496
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixture of two different families of ...
1465-1725 4.80e-08

Trichohyalin-plectin-homology domain; This family is a mixture of two different families of eukaryotic proteins. Trichoplein, or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to a reduction in mitochondrial motility, whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 58.39  E-value: 4.80e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1465 ETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSI 1544
Cdd:pfam13868   46 DEMMEEERERALEEEEEKEEERKEERKRYRQELEEQIEEREQKRQEEYEEKLQEREQMDEIVERIQEEDQAEAEEKLEKQ 125
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1545 QEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAER 1624
Cdd:pfam13868  126 RQLREEIDEFNEEQAEWKELEKEEEREEDERILEYLKEKAEREEEREAEREEIEEEKEREIARLRAQQEKAQDEKAERDE 205
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1625 LR-RQVQDETQRK-RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEA 1702
Cdd:pfam13868  206 LRaKLYQEEQERKeRQKEREEAEKKARQRQELQQAREEQIELKERRLAEEAEREEEEFERMLRKQAEDEEIEQEEAEKRR 285
                          250       260
                   ....*....|....*....|...
gi 1920237946 1703 ELQSEHASFAEKTAQLERTLKEE 1725
Cdd:pfam13868  286 MKRLEHRRELEKQIEEREEQRAA 308
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
1779-1974 4.88e-08

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 58.66  E-value: 4.88e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1779 QQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLL 1858
Cdd:PRK09510    70 QQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKA 149
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1859 EEELARLQREAAAATQKRRELEAELAKVRAEMEvllASKARAEEESRSTSEKSKQRLEAEAgrfRELAEEAARLRALAEE 1938
Cdd:PRK09510   150 EAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAE---AKKKAEAEAAAKAAAEAKKKAEAEA---KKKAAAEAKKKAAAEA 223
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1920237946 1939 AKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRL 1974
Cdd:PRK09510   224 KAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEV 259
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1173-1735 4.90e-08

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 59.69  E-value: 4.90e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1173 AEKVLALPEPSPAAPTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAHEEQLKEaqaVPATLPELE 1252
Cdd:PRK03918   203 EEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEE---LKKEIEELE 279
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1253 ATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLqqrhgerdvevERWRERvtllLERWQAVLAQTDVRQRELEQLG 1332
Cdd:PRK03918   280 EKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRL-----------SRLEEE----INGIEERIKELEEKEERLEELK 344
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1333 RQLRYYRESADPLGAWLR---DAKQRQEQIqavplansqavrEQLRQEKALL--EDIERHGEKVEECQRFAKQYINAIKD 1407
Cdd:PRK03918   345 KKLKELEKRLEELEERHElyeEAKAKKEEL------------ERLKKRLTGLtpEKLEKELEELEKAKEEIEEEISKITA 412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1408 YELQLVTYKAQLEPVASPAKKPKVQS---GSESIIQEYVDLRTRYSElstltsqYIRFISETLRRMEEEERlaeqqraEE 1484
Cdd:PRK03918   413 RIGELKKEIKELKKAIEELKKAKGKCpvcGRELTEEHRKELLEEYTA-------ELKRIEKELKEIEEKER-------KL 478
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1485 RERLAEVEAALEKQRQLAEAHAQAKaqaereaqglQRRMQEEvaRREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKAR 1564
Cdd:PRK03918   479 RKELRELEKVLKKESELIKLKELAE----------QLKELEE--KLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKK 546
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1565 QVEAAERsrlrIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELA 1644
Cdd:PRK03918   547 ELEKLEE----LKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELK 622
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1645 lRVQAEAEAAREK-QRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERtLK 1723
Cdd:PRK03918   623 -KLEEELDKAFEElAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEK-LK 700
                          570
                   ....*....|..
gi 1920237946 1724 EEHVAVVQLREE 1735
Cdd:PRK03918   701 EELEEREKAKKE 712
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
2333-2601 5.41e-08

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 59.54  E-value: 5.41e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2333 EELQRLKAEVTEAARQRGQVE------EELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEkmkQVAEEA 2406
Cdd:COG4913    235 DDLERAHEALEDAREQIELLEpirelaERYAAARERLAELEYLRAALRLWFAQRRLELLEAELEELRAELA---RLEAEL 311
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2407 ARLSVAAQEAARLRQLAEEDLAQQralaekmlkekmqAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQE 2486
Cdd:COG4913    312 ERLEARLDALREELDELEAQIRGN-------------GGDRLEQLEREIERLERELEERERRRARLEALLAALGLPLPAS 378
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2487 TQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQaedigerlyrtelatqekvmlVQTLETQRQQ 2566
Cdd:COG4913    379 AEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAE---------------------IASLERRKSN 437
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1920237946 2567 SDRDAERLREAIAE-LEHEKDKLKQEAQLLQLKSEE 2601
Cdd:COG4913    438 IPARLLALRDALAEaLGLDEAELPFVGELIEVRPEE 473
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1192-1640 5.72e-08

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 59.40  E-value: 5.72e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1192 ELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAHEEQLKEAQAVPATLPELEATKA-------ALKKLRAQ 1264
Cdd:COG4717     75 ELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAelaelpeRLEELEER 154
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1265 AEAQQPVFDALRDELRGAQEVGERLQQrhgERDVEVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADP 1344
Cdd:COG4717    155 LEELRELEEELEELEAELAELQEELEE---LLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQ 231
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1345 LGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLEDIERHGEKVeecqRFAKQYINAIkdyeLQLVTYKAQLEPVAS 1424
Cdd:COG4717    232 LENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGV----LFLVLGLLAL----LFLLLAREKASLGKE 303
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1425 PAKKPKVQSGSESIIQEYVDLRTRYSELSTLTSQYIRfiseTLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEA 1504
Cdd:COG4717    304 AEELQALPALEELEEEELEELLAALGLPPDLSPEELL----ELLDRIEELQELLREAEELEEELQLEELEQEIAALLAEA 379
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1505 HAQAKAQAEREAQGLQRRmQEEVARREEVAVEAQEQKRSIQEELQHLRQSS-EAEIQAKARQVEAAERSRLRIEEEIRVV 1583
Cdd:COG4717    380 GVEDEEELRAALEQAEEY-QELKEELEELEEQLEELLGELEELLEALDEEElEEELEELEEELEELEEELEELREELAEL 458
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237946 1584 RLQLEATERqrggaEGELQALRARAEEAEAQKRQAQEEAERLR------RQVQDETQRKRQAE 1640
Cdd:COG4717    459 EAELEQLEE-----DGELAELLQELEELKAELRELAEEWAALKlalellEEAREEYREERLPP 516
YqiK COG2268
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
1557-1709 5.91e-08

Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];


Pssm-ID: 441869 [Multi-domain]  Cd Length: 439  Bit Score: 58.73  E-value: 5.91e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1557 AEIQAKARQVEA-AERsrlriEEEIRVvrlqleaTERQRGGAEGELQALRARAE----EAEAQKRQAQEEAERlrrqvqd 1631
Cdd:COG2268    195 AEIIRDARIAEAeAER-----ETEIAI-------AQANREAEEAELEQEREIETariaEAEAELAKKKAEERR------- 255
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1632 ETQRKRqAEAELALRVQaEAEAAREKQRALQALE---ELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEH 1708
Cdd:COG2268    256 EAETAR-AEAEAAYEIA-EANAEREVQRQLEIAErerEIELQEKEAEREEAELEADVRKPAEAEKQAAEAEAEAEAEAIR 333

                   .
gi 1920237946 1709 A 1709
Cdd:COG2268    334 A 334
COG3903 COG3903
Predicted ATPase [General function prediction only];
1622-2081 6.30e-08

Predicted ATPase [General function prediction only];


Pssm-ID: 443109 [Multi-domain]  Cd Length: 933  Bit Score: 59.26  E-value: 6.30e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1622 AERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQAleelRLQAEEAERRLRQAEAERARQVQVALETAqrSAE 1701
Cdd:COG3903    478 AERLAEAGERAAARRRHADYYLALAERAAAELRGPDQLAWLA----RLDAEHDNLRAALRWALAHGDAELALRLA--AAL 551
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1702 AELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQK 1781
Cdd:COG3903    552 APFWFLRGLLREGRRWLERALAAAGEAAAALAAAAALAAAAAAARAAAAAAAAAAAAAAAAAAAAAAAAAALLLLAALAA 631
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1782 SLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEE 1861
Cdd:COG3903    632 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAALAAAAAALAAAAAAAALAAAAAAAL 711
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1862 LARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKR 1941
Cdd:COG3903    712 AAAAAAAAAAAAAAALLAAAAAAALAAAAAAAALALAAAAAAAAAAAAAAALAAAAAAAALAALLLALAAAAAALAAAAA 791
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1942 QRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARL 2021
Cdd:COG3903    792 AAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAALAAALAAAAAAAAAAAAAAAAAAALAAALAAAAAAAAAAALA 871
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2022 AQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTA 2081
Cdd:COG3903    872 AAAAAAAAAAAALLAAAAAAAAAAAAAAAAAAALAAAAAAAAAAALAAAAAAAAAAAAAA 931
MutS2 COG1193
dsDNA-specific endonuclease/ATPase MutS2 [Replication, recombination and repair];
1526-1691 7.37e-08

dsDNA-specific endonuclease/ATPase MutS2 [Replication, recombination and repair];


Pssm-ID: 440806 [Multi-domain]  Cd Length: 784  Bit Score: 59.00  E-value: 7.37e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1526 EVARR----EEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRggaegel 1601
Cdd:COG1193    490 EIARRlglpEEIIERARELLGEESIDVEKLIEELERERRELEEEREEAERLREELEKLREELEEKLEELEEEK------- 562
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1602 QALRARA-EEAEAQKRQAQEEAERLRRQVQDEtqrkrqaeaelalrvQAEAEAAREKQRALQALEElRLQAEEAERRLRQ 1680
Cdd:COG1193    563 EEILEKArEEAEEILREARKEAEELIRELREA---------------QAEEEELKEARKKLEELKQ-ELEEKLEKPKKKA 626
                          170
                   ....*....|.
gi 1920237946 1681 AEAERARQVQV 1691
Cdd:COG1193    627 KPAKPPEELKV 637
CH_ASPM_rpt2 cd21224
second calponin homology (CH) domain found in abnormal spindle-like microcephaly-associated ...
305-400 7.91e-08

second calponin homology (CH) domain found in abnormal spindle-like microcephaly-associated protein (ASPM) and similar proteins; ASPM, also called abnormal spindle protein homolog or Asp homolog, is involved in mitotic spindle regulation and coordination of mitotic processes. It may also have a preferential role in regulating neurogenesis. Members of this family contain two copies of the CH domain in the middle region. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409073 [Multi-domain]  Cd Length: 138  Bit Score: 54.23  E-value: 7.91e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  305 KLLL-WSQrMVEGCQGLRCDNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNL-----------------------EN 360
Cdd:cd21224      3 SLLLkWCQ-AVCAHYGVKVENFTVSFADGRALCYLIHHYLPSLLPLDAIRQPTTQtvdraqdeaedfwvaefspstgdSG 81
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1920237946  361 LDQAFSVAER-----------DLG-VTRLLDPEDVDVPQPDEKSIITYVSSL 400
Cdd:cd21224     82 LSSELLANEKrnfklvqqavaELGgVPALLRASDMSNTIPDEKVVILFLSYL 133
PRK10246 PRK10246
exonuclease subunit SbcC; Provisional
1473-2100 8.70e-08

exonuclease subunit SbcC; Provisional


Pssm-ID: 182330 [Multi-domain]  Cd Length: 1047  Bit Score: 59.04  E-value: 8.70e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1473 EERLAEQQRAeeRERLAEVEAALEK-QRQLA------------------EAHAQAKAQAEREAQGLQRRMQEEVARREEV 1533
Cdd:PRK10246   253 DELQQEASRR--QQALQQALAAEEKaQPQLAalslaqparqlrphweriQEQSAALAHTRQQIEEVNTRLQSTMALRARI 330
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1534 AVEAQEQKRSIQEELQHLRQ-SSEAEIQAKARQVEAAERSRL----RIEEEIRVVRLQLEATERQRGGAEGELQALRARa 1608
Cdd:PRK10246   331 RHHAAKQSAELQAQQQSLNTwLAEHDRFRQWNNELAGWRAQFsqqtSDREQLRQWQQQLTHAEQKLNALPAITLTLTAD- 409
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1609 EEAEAQKRQAQEEAER-----LRRQVQDETQRKRQAEAelalrvqAEAEAAREKQRALQALEELRLQAEEAERRLRQAEA 1683
Cdd:PRK10246   410 EVAAALAQHAEQRPLRqrlvaLHGQIVPQQKRLAQLQV-------AIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVKT 482
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1684 ERARQVQVALETAQRsaeAELQS----------EHASFAEKTAqLERTlkeehvaVVQLREEATRRAQQQAeaeraraea 1753
Cdd:PRK10246   483 ICEQEARIKDLEAQR---AQLQAgqpcplcgstSHPAVEAYQA-LEPG-------VNQSRLDALEKEVKKL--------- 542
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1754 erelerwqlkANEALRLRLQAEEVAQQKSltqaeaekqkeeaerearrrgKAEEQAVRQRElAEQELEKQRQLAEGTAQQ 1833
Cdd:PRK10246   543 ----------GEEGAALRGQLDALTKQLQ---------------------RDESEAQSLRQ-EEQALTQQWQAVCASLNI 590
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1834 RLAAEQELIRLRAETEQGEQQRQLLEEELArLQREAAAATQKRRELEAELAKVRAEMEVLLASKA------RAEEESRST 1907
Cdd:PRK10246   591 TLQPQDDIQPWLDAQEEHERQLRLLSQRHE-LQGQIAAHNQQIIQYQQQIEQRQQQLLTALAGYAltlpqeDEEASWLAT 669
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1908 SEKSKQRLEAEAGRFRELAEEAARLRALAEeakrqrQLAEEDAVrqRAEAERVLAEKLAAISEATrLKTEAEIALKEKEA 1987
Cdd:PRK10246   670 RQQEAQSWQQRQNELTALQNRIQQLTPLLE------TLPQSDDL--PHSEETVALDNWRQVHEQC-LSLHSQLQTLQQQD 740
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1988 ENERLR---------------RLAEDEAFQRRLLEEQAAQhkadieaRLAQLRKASESELERQKGLVEdtlrQRRQVEEE 2052
Cdd:PRK10246   741 VLEAQRlqkaqaqfdtalqasVFDDQQAFLAALLDEETLT-------QLEQLKQNLENQRQQAQTLVT----QTAQALAQ 809
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1920237946 2053 ILALKGSFEKAAAGKAELELELGRIRGT-AEDTLRSKE---QAEQEA-ARQRQ 2100
Cdd:PRK10246   810 HQQHRPDGLDLTVTVEQIQQELAQLAQQlRENTTRQGEirqQLKQDAdNRQQQ 862
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
1228-1887 9.23e-08

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains; it is a fundamental contractile protein found in all eukaryotic cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tails from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone of the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 59.03  E-value: 9.23e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1228 EEVLRAHEEQLKE-----AQAVPATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVER 1302
Cdd:pfam01576  337 EEETRSHEAQLQEmrqkhTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQAKQDSEHKRKKLEGQLQE 416
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1303 WRERVT-------LLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLRDAK-QRQEQIQAvPLANSQAVReQL 1374
Cdd:pfam01576  417 LQARLSeserqraELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLESQLQDTQeLLQEETRQ-KLNLSTRLR-QL 494
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1375 RQEKALLEdiERHGEKVEECQRFAKQyinaIKDYELQLVTYKAQLEPVASPAK-----KPKVQSGSESIIQEYVDLRTRY 1449
Cdd:pfam01576  495 EDERNSLQ--EQLEEEEEAKRNVERQ----LSTLQAQLSDMKKKLEEDAGTLEaleegKKRLQRELEALTQQLEEKAAAY 568
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1450 SELS-------------TLTSQYIRFISETLRR-------MEEEERLAEQQRAEERERlAEVEAALEKQRQLAEAHA--- 1506
Cdd:pfam01576  569 DKLEktknrlqqelddlLVDLDHQRQLVSNLEKkqkkfdqMLAEEKAISARYAEERDR-AEAEAREKETRALSLARAlee 647
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1507 --QAKAQAEREAQGLQRRMQEEVARREEV------------AVEAQEQKRSIQEE-------------------LQHLRQ 1553
Cdd:pfam01576  648 alEAKEELERTNKQLRAEMEDLVSSKDDVgknvhelerskrALEQQVEEMKTQLEeledelqatedaklrlevnMQALKA 727
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1554 SSEAEIQAKArqvEAAERSRLRIEEEIRVVRLQLEATERQRGGA-------EGELQALRARAEEAEAQKRQAQEEAERLR 1626
Cdd:pfam01576  728 QFERDLQARD---EQGEEKRRQLVKQVRELEAELEDERKQRAQAvaakkklELDLKELEAQIDAANKGREEAVKQLKKLQ 804
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1627 RQVQDetqrkRQAEAELALRVQAEAEA-AREKQRALQALEELRLQAEE----AERRLRQAEAERAR-QVQVALETAQRSA 1700
Cdd:pfam01576  805 AQMKD-----LQRELEEARASRDEILAqSKESEKKLKNLEAELLQLQEdlaaSERARRQAQQERDElADEIASGASGKSA 879
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1701 eaeLQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAE-----AERARAEAERELERWQL-KANEALRLRLQA 1774
Cdd:pfam01576  880 ---LQDEKRRLEARIAQLEEELEEEQSNTELLNDRLRKSTLQVEQlttelAAERSTSQKSESARQQLeRQNKELKAKLQE 956
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1775 EEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQA-----VRQRELAEQEL----EKQRQLAEGTAQQRLAAEQELIRLR 1845
Cdd:pfam01576  957 MEGTVKSKFKSSIAALEAKIAQLEEQLEQESRERQaanklVRRTEKKLKEVllqvEDERRHADQYKDQAEKGNSRMKQLK 1036
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|..
gi 1920237946 1846 AETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVR 1887
Cdd:pfam01576 1037 RQLEEAEEEASRANAARRKLQRELDDATESNESMNREVSTLK 1078
PLEC smart00250
Plectin repeat;
4166-4202 9.63e-08

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 50.94  E-value: 9.63e-08
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  4166 IRLLEAQIATGGIIDPEESHRLPVDVAYQRGLFDEEM 4202
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1837-2037 1.14e-07

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 57.53  E-value: 1.14e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1837 AEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLE 1916
Cdd:COG3883     14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERAR 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1917 A---------------EAGRFRELAEEAARLRALAEEAKR---QRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEA 1978
Cdd:COG3883     94 AlyrsggsvsyldvllGSESFSDFLDRLSALSKIADADADlleELKADKAELEAKKAELEAKLAELEALKAELEAAKAEL 173
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 1979 EIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKG 2037
Cdd:COG3883    174 EAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAA 232
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1816-2068 1.20e-07

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 57.53  E-value: 1.20e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1816 AEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLA 1895
Cdd:COG3883     14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERAR 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1896 SKARAEE---------ESRSTSE--KSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEK 1964
Cdd:COG3883     94 ALYRSGGsvsyldvllGSESFSDflDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAEL 173
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1965 LAAISEATRLKteAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLR 2044
Cdd:COG3883    174 EAQQAEQEALL--AQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGA 251
                          250       260
                   ....*....|....*....|....
gi 1920237946 2045 QRRQVEEEILALKGSFEKAAAGKA 2068
Cdd:COG3883    252 AGAAGAAAGSAGAAGAAAGAAGAG 275
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
2284-2725 1.81e-07

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains; it is a fundamental contractile protein found in all eukaryotic cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tails from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone of the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 57.88  E-value: 1.81e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2284 RQKQAADAEMEKHKQFAEQALRqkaqvEQELTALRLQLEE--TDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRV 2361
Cdd:pfam01576   92 QQLQNEKKKMQQHIQDLEEQLD-----EEEAARQKLQLEKvtTEAKIKKLEEDILLLEDQNSKLSKERKLLEERISEFTS 166
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2362 QMEE-------LGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMK----------QVAEEAARLS-VAAQEAARLRQLA 2423
Cdd:pfam01576  167 NLAEeeekaksLSKLKNKHEAMISDLEERLKKEEKGRQELEKAKRKlegestdlqeQIAELQAQIAeLRAQLAKKEEELQ 246
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2424 E-----EDLAQQRALAEKMLKEKMQAVQEatrLKAEAELLQQQKELAQEQARRLQEDKEQMAQQL--AQETQGFQKTLET 2496
Cdd:pfam01576  247 AalarlEEETAQKNNALKKIRELEAQISE---LQEDLESERAARNKAEKQRRDLGEELEALKTELedTLDTTAAQQELRS 323
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2497 ERQRQLEMSAEAERLRLRVAEMSRAQARaeedaRRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLRE 2576
Cdd:pfam01576  324 KREQEVTELKKALEEETRSHEAQLQEMR-----QKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQ 398
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2577 AIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQL-------------------------------------LQETQALQQ 2619
Cdd:pfam01576  399 AKQDSEHKRKKLEGQLQELQARLSESERQRAELAeklsklqselesvssllneaegkniklskdvsslesqLQDTQELLQ 478
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2620 SFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQekqqLAASMEEARRRQHEAEEGVRRQQEEL 2699
Cdd:pfam01576  479 EETRQKLNLSTRLRQLEDERNSLQEQLEEEEEAKRNVERQLSTLQAQLSD----MKKKLEEDAGTLEALEEGKKRLQREL 554
                          490       500       510
                   ....*....|....*....|....*....|
gi 1920237946 2700 ----QRLAQQQQQQEKLlaeenQRLRERLQ 2725
Cdd:pfam01576  555 ealtQQLEEKAAAYDKL-----EKTKNRLQ 579
YqiK COG2268
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
1823-2023 2.05e-07

Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];


Pssm-ID: 441869 [Multi-domain]  Cd Length: 439  Bit Score: 56.80  E-value: 2.05e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1823 QRQLAEGTAQQRLAAEQelIRLRAETEQGEQQRqllEEELARLQREAAAATQKRRELEAELAKVraemevllaskaraEE 1902
Cdd:COG2268    191 RRKIAEIIRDARIAEAE--AERETEIAIAQANR---EAEEAELEQEREIETARIAEAEAELAKK--------------KA 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1903 ESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEA-----ERVLAEKLAAISEAtrlKTE 1977
Cdd:COG2268    252 EERREAETARAEAEAAYEIAEANAEREVQRQLEIAEREREIELQEKEAEREEAELeadvrKPAEAEKQAAEAEA---EAE 328
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1920237946 1978 AEIALKEKEAENERLRRLAE-DEAFQRRLLEEQAAQHKADIEARLAQ 2023
Cdd:COG2268    329 AEAIRAKGLAEAEGKRALAEaWNKLGDAAILLMLIEKLPEIAEAAAK 375
EmrA COG1566
Multidrug resistance efflux pump EmrA [Defense mechanisms];
1569-1689 2.08e-07

Multidrug resistance efflux pump EmrA [Defense mechanisms];


Pssm-ID: 441174 [Multi-domain]  Cd Length: 331  Bit Score: 56.21  E-value: 2.08e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1569 AERSRLRIEEEIRVVRLQLEATERQRGgAEGELQALRARAEEAEAQKRQAQEEAERLR---------RQVQDETQRKR-Q 1638
Cdd:COG1566     81 LQAALAQAEAQLAAAEAQLARLEAELG-AEAEIAAAEAQLAAAQAQLDLAQRELERYQalykkgavsQQELDEARAALdA 159
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1920237946 1639 AEAELAlRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQV 1689
Cdd:COG1566    160 AQAQLE-AAQAQLAQAQAGLREEEELAAAQAQVAQAEAALAQAELNLARTT 209
CH_PLS3_rpt3 cd21331
third calponin homology (CH) domain found in plastin-3; Plastin-3, also called T-plastin, is ...
181-288 2.40e-07

third calponin homology (CH) domain found in plastin-3; Plastin-3, also called T-plastin, is an actin-bundling protein found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Plastin-3 contains four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409180  Cd Length: 134  Bit Score: 52.70  E-value: 2.40e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  181 ERDRVQKKTFTKWVNKhlIKAQRHISDLYEDLRDGHNLISLLEVL-------SGDSLPREKGRMRFHKLQNVQIALDYLR 253
Cdd:cd21331     18 EGETREERTFRNWMNS--LGVNPHVNHLYGDLQDALVILQLYEKIkvpvdwnKVNKPPYPKLGANMKKLENCNYAVELGK 95
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 1920237946  254 HR-QVKLVNIRNDDIADGNPKLTLGLIWTIILHFQI 288
Cdd:cd21331     96 HPaKFSLVGIGGQDLNDGNPTLTLALVWQLMRRYTL 131
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1876-2582 2.53e-07

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 57.62  E-value: 2.53e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1876 RRELEAELAKVRAEMEVLLASKARAEEESRStsEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRA 1955
Cdd:COG4913    220 EPDTFEAADALVEHFDDLERAHEALEDAREQ--IELLEPIRELAERYAAARERLAELEYLRAALRLWFAQRRLELLEAEL 297
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1956 EAERvlaeklaaiSEATRLKTEAEIALKEKEAENERLRRLaedeafqrrllEEQAAQHKADIEARLAQLRKASESELERQ 2035
Cdd:COG4913    298 EELR---------AELARLEAELERLEARLDALREELDEL-----------EAQIRGNGGDRLEQLEREIERLERELEER 357
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2036 KGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQlaaeeerrrreaeer 2115
Cdd:COG4913    358 ERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELR--------------- 422
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2116 vqkSLAAEEEAARQRKAAL-EEVERLKAKVEEARRLRE------------RAEQES-------------------ARQLQ 2163
Cdd:COG4913    423 ---ELEAEIASLERRKSNIpARLLALRDALAEALGLDEaelpfvgelievRPEEERwrgaiervlggfaltllvpPEHYA 499
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2164 LAQEAAQkRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAAREraereaaqSRRQVEEAERL 2243
Cdd:COG4913    500 AALRWVN-RLHLRGRLVYERVRTGLPDPERPRLDPDSLAGKLDFKPHPFRAWLEAELGRRF--------DYVCVDSPEEL 570
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2244 KQSAEEQAQAQaqaqaaaekLRKeaeqeaarraqaEQAALRQKQAADAEMEKHkQFAEQALRQKAQVEQELTALRLQLEE 2323
Cdd:COG4913    571 RRHPRAITRAG---------QVK------------GNGTRHEKDDRRRIRSRY-VLGFDNRAKLAALEAELAELEEELAE 628
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2324 TDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQmEELGKLKARIEAenralvLRDKDSAQRLLQEEAEKMKQVA 2403
Cdd:COG4913    629 AEERLEALEAELDALQERREALQRLAEYSWDEIDVASAE-REIAELEAELER------LDASSDDLAALEEQLEELEAEL 701
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2404 EEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQ-QKELAQEQARRLQEDKEQMAQQ 2482
Cdd:COG4913    702 EELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAAlGDAVERELRENLEERIDALRAR 781
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2483 LAQETQGFQKTleterqrqleMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERL------YRTELATQEKVML 2556
Cdd:COG4913    782 LNRAEEELERA----------MRAFNREWPAETADLDADLESLPEYLALLDRLEEDGLPEYeerfkeLLNENSIEFVADL 851
                          730       740
                   ....*....|....*....|....*.
gi 1920237946 2557 VQTLETQRQQSDRDAERLREAIAELE 2582
Cdd:COG4913    852 LSKLRRAIREIKERIDPLNDSLKRIP 877
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
1852-2036 2.57e-07

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 56.35  E-value: 2.57e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1852 EQQRQLLEEELAR-LQREAAAATQKRRELEAELAKVRAEmevllasKARAEEESRSTSEKSKQRLEAEAGrfrelAEEAA 1930
Cdd:PRK09510    78 EEQRKKKEQQQAEeLQQKQAAEQERLKQLEKERLAAQEQ-------KKQAEEAAKQAALKQKQAEEAAAK-----AAAAA 145
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1931 RLRALAEeakrqrQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRrlAEDEAFQRrllEEQA 2010
Cdd:PRK09510   146 KAKAEAE------AKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKK--AEAEAKKK---AAAE 214
                          170       180
                   ....*....|....*....|....*.
gi 1920237946 2011 AQHKADIEARLAQLRKASESELERQK 2036
Cdd:PRK09510   215 AKKKAAAEAKAAAAKAAAEAKAAAEK 240
PRK05035 PRK05035
electron transport complex protein RnfC; Provisional
1458-1719 2.71e-07

electron transport complex protein RnfC; Provisional


Pssm-ID: 235334 [Multi-domain]  Cd Length: 695  Bit Score: 57.27  E-value: 2.71e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1458 QYIRFISETLRRMEEEERLAEQ--QRAEER-ERLAEVEAA-LEKQRQLAEAHAQAKAQA---------EREAQGLQRRMQ 1524
Cdd:PRK05035   429 QYYRQAKAEIRAIEQEKKKAEEakARFEARqARLEREKAArEARHKKAAEARAAKDKDAvaaalarvkAKKAAATQPIVI 508
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1525 EEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAA-ERSRLRIEEeirvvrlQLEATERQRGGAEGELQA 1603
Cdd:PRK05035   509 KAGARPDNSAVIAAREARKAQARARQAEKQAAAAADPKKAAVAAAiARAKAKKAA-------QQAANAEAEEEVDPKKAA 581
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1604 LRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAelalrvqaeAEAAREKQRALQALEELRLQAEEAERRLRQAEA 1683
Cdd:PRK05035   582 VAAAIARAKAKKAAQQAASAEPEEQVAEVDPKKAAVAA---------AIARAKAKKAEQQANAEPEEPVDPRKAAVAAAI 652
                          250       260       270
                   ....*....|....*....|....*....|....*.
gi 1920237946 1684 ERARQVQVALETAQRSAEAELQSEHASFAEKTAQLE 1719
Cdd:PRK05035   653 ARAKARKAAQQQANAEPEEAEDPKKAAVAAAIARAK 688
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
2289-2489 2.71e-07

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 56.38  E-value: 2.71e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2289 ADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEE--- 2365
Cdd:COG3883     14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGErar 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2366 --------LGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKM 2437
Cdd:COG3883     94 alyrsggsVSYLDVLLGSESFSDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAEL 173
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2438 LKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQG 2489
Cdd:COG3883    174 EAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAA 225
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
1804-2021 2.72e-07

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely on conserved operon structure. The Tol-Pal complex is required for maintaining outer membrane integrity. It is also involved in transport (uptake) of colicins and filamentous DNA, and is implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 56.01  E-value: 2.72e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1804 KAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRreleAEL 1883
Cdd:TIGR02794   72 KLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAERKA----KEE 147
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1884 AKVRAEMEvllaSKARAEEESRSTSEKSKQRLEAEAgrfreLAEEAARLRALAEEAKRQRQLAEEDA---VRQRAEAERV 1960
Cdd:TIGR02794  148 AAKQAEEE----AKAKAAAEAKKKAEEAKKKAEAEA-----KAKAEAEAKAKAEEAKAKAEAAKAKAaaeAAAKAEAEAA 218
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 1961 LAEKLAAISEATRLKTEAEIAL-KEKEAENERLRRLAEDEAFQRRLleeqAAQHKADIEARL 2021
Cdd:TIGR02794  219 AAAAAEAERKADEAELGDIFGLaSGSNAEKQGGARGAAAGSEVDKY----AAIIQQAIQQNL 276
COG4995 COG4995
Uncharacterized conserved protein, contains CHAT domain [Function unknown];
2401-2854 2.82e-07

Uncharacterized conserved protein, contains CHAT domain [Function unknown];


Pssm-ID: 444019 [Multi-domain]  Cd Length: 711  Bit Score: 56.90  E-value: 2.82e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2401 QVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMA 2480
Cdd:COG4995      9 LLAALLAALALALLALALLLLLAALAAAALLLLALLALLLALAAAAAAALAAAALALALLAAAALALLLLALALAALALA 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2481 QQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTL 2560
Cdd:COG4995     89 LLAAALALALAAAALAALALLAALLALAAAAALLALLAALALLALLAALAAALAAAAAAALAAALAAAAAAAAAAALLAL 168
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2561 ETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKA 2640
Cdd:COG4995    169 ALALAAAALALLALLLAALAAALAAAAAALALLLALLLLAALAAALAAALAALLLALLALAAALLALLLLALLALAAAAA 248
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2641 KLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRL 2720
Cdd:COG4995    249 ALAAAAAALLALAAALLLLAALAALAAAAAAAALAALALAAALALAAAALALALLLAAAAAAALAALALLLLAALLLLLA 328
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2721 RERLQHLEEERRAALARSEEIAPSRAAAARALPNGQDAADGPAAAAEPEHAFDGLRRKVPAQRLQEVGVLSAEELQQLAQ 2800
Cdd:COG4995    329 ALALLALLLLLAAAALLAAALAAALALAAALALALLAALLLLLAALLALLLEALLLLLLALLAALLLLAAALLALAAAQL 408
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1920237946 2801 GRTTVAELAQREDVRHYLQGRSSIAGLL-LKPADEKLTIYAALRRQLLSPGTALI 2854
Cdd:COG4995    409 LRLLLAALALLLALAAYAAARLALLALIeYIILPDRLYAFVQLYQLLIAPIEAEL 463
mukB PRK04863
chromosome partition protein MukB;
2296-2599 2.99e-07

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 57.27  E-value: 2.99e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2296 HKQFAEQALRQKAQVEQ---ELTALRLQLEEtdhQKSILDEelqrLKAEVTEAARQRGQVEEELFSLRVQME-------- 2364
Cdd:PRK04863   336 HLNLVQTALRQQEKIERyqaDLEELEERLEE---QNEVVEE----ADEQQEENEARAEAAEEEVDELKSQLAdyqqaldv 408
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2365 ---------------------------ELGKLKARIEAenraLVLRDKDSAQRLLQEE-----AEKMKQVAEEAARL--- 2409
Cdd:PRK04863   409 qqtraiqyqqavqalerakqlcglpdlTADNAEDWLEE----FQAKEQEATEELLSLEqklsvAQAAHSQFEQAYQLvrk 484
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2410 ---SVAAQEAARLRQLAEEDLAQQRALAEKM---------LKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKE 2477
Cdd:PRK04863   485 iagEVSRSEAWDVARELLRRLREQRHLAEQLqqlrmrlseLEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEELE 564
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2478 QMAQQLAQEtqgfqktLETERQRQLEMSAEAERLRLRVAE-MSRAQA--RAEEDARRFRKQAEDigerlyrtELATQEKV 2554
Cdd:PRK04863   565 ARLESLSES-------VSEARERRMALRQQLEQLQARIQRlAARAPAwlAAQDALARLREQSGE--------EFEDSQDV 629
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2555 M--LVQTLETQRQQSdRDAERLREAIAELEHEKDKLKQ-----EAQLLQLKS 2599
Cdd:PRK04863   630 TeyMQQLLERERELT-VERDELAARKQALDEEIERLSQpggseDPRLNALAE 680
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixture of two different families of ...
1468-1736 3.34e-07

Trichohyalin-plectin-homology domain; This family is a mixture of two different families of eukaryotic proteins. Trichoplein, or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility, whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 55.70  E-value: 3.34e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1468 RRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVeaQEQKRSIQEE 1547
Cdd:pfam13868   36 AEEKEEERRLDEMMEEERERALEEEEEKEEERKEERKRYRQELEEQIEEREQKRQEEYEEKLQEREQM--DEIVERIQEE 113
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1548 LQHLRQSSEAEIQAKARQVEAAERSRLRI---------EEEIRVVRLQLEATERQRggAEGELQALRARAEEAEAQKRQA 1618
Cdd:pfam13868  114 DQAEAEEKLEKQRQLREEIDEFNEEQAEWkelekeeerEEDERILEYLKEKAEREE--EREAEREEIEEEKEREIARLRA 191
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1619 QEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQR 1698
Cdd:pfam13868  192 QQEKAQDEKAERDELRAKLYQEEQERKERQKEREEAEKKARQRQELQQAREEQIELKERRLAEEAEREEEEFERMLRKQA 271
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1920237946 1699 SAEAELQSEhasfAEKTAQLERTLKEEHVAVVQLREEA 1736
Cdd:pfam13868  272 EDEEIEQEE----AEKRRMKRLEHRRELEKQIEEREEQ 305
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1603-2001 3.92e-07

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 56.88  E-value: 3.92e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1603 ALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAeaelalrvqaeAEAAREKQRALQAL---------EELRLQAEE 1673
Cdd:COG3096    829 AFAPDPEAELAALRQRRSELERELAQHRAQEQQLRQQ-----------LDQLKEQLQLLNKLlpqanlladETLADRLEE 897
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1674 AERRLRQAEAERA--RQVQVALETAQRSAEAeLQSEHASFAEktaqlertLKEEHVAVVQLREEATRRAQQQAEAERARA 1751
Cdd:COG3096    898 LREELDAAQEAQAfiQQHGKALAQLEPLVAV-LQSDPEQFEQ--------LQADYLQAKEQQRRLKQQIFALSEVVQRRP 968
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1752 EAERELERWQLKANEALRLRLQAeevaqqksltqaeaekqkeeaerearrrgkaeeqavrQRELAEQELEKQRQLAEGTA 1831
Cdd:COG3096    969 HFSYEDAVGLLGENSDLNEKLRA-------------------------------------RLEQAEEARREAREQLRQAQ 1011
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1832 QQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREA-----AAATQKRRELEAELAKVRAEmevllaskaraeeesRS 1906
Cdd:COG3096   1012 AQYSQYNQVLASLKSSRDAKQQTLQELEQELEELGVQAdaeaeERARIRRDELHEELSQNRSR---------------RS 1076
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1907 TSEKSKQRLEAE----AGRFRELAEEAARLRALAEEAK----RQRQLAEEDAVRQRAEAERVL---AEKLAAISEatrlk 1975
Cdd:COG3096   1077 QLEKQLTRCEAEmdslQKRLRKAERDYKQEREQVVQAKagwcAVLRLARDNDVERRLHRRELAylsADELRSMSD----- 1151
                          410       420
                   ....*....|....*....|....*....
gi 1920237946 1976 tEAEIALKEKEAENERLR---RLAEDEAF 2001
Cdd:COG3096   1152 -KALGALRLAVADNEHLRdalRLSEDPRR 1179
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1526-2153 3.95e-07

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 56.61  E-value: 3.95e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1526 EVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQ----------LEATERQRG 1595
Cdd:PRK03918   169 EVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEvkeleelkeeIEELEKELE 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1596 GAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEaEAAREKQRALQALEELRLQAEEAE 1675
Cdd:PRK03918   249 SLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYE-EYLDELREIEKRLSRLEEEINGIE 327
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1676 RRLRQAEAERARQVQValetaqRSAEAELQSEHASFaEKTAQLERTLKEEHVAVVQLREEatrraqqqaeaeraraeaer 1755
Cdd:PRK03918   328 ERIKELEEKEERLEEL------KKKLKELEKRLEEL-EERHELYEEAKAKKEELERLKKR-------------------- 380
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1756 elerwqLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREarrrgkaeEQAVRQRELAEQELEKqrqlAEGTAQ--Q 1833
Cdd:PRK03918   381 ------LTGLTPEKLEKELEELEKAKEEIEEEISKITARIGEL--------KKEIKELKKAIEELKK----AKGKCPvcG 442
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1834 RLAAEQELIRLRAEteqgeqqrqlLEEELARLQREAAAATQKRRELEAELAKVraEMEVLLASKARAE----EESRSTSE 1909
Cdd:PRK03918   443 RELTEEHRKELLEE----------YTAELKRIEKELKEIEEKERKLRKELREL--EKVLKKESELIKLkelaEQLKELEE 510
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1910 KSK----QRLEAEAGRFRELAEEAARLRalaeeaKRQRQLAEEdaVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEK 1985
Cdd:PRK03918   511 KLKkynlEELEKKAEEYEKLKEKLIKLK------GEIKSLKKE--LEKLEELKKKLAELEKKLDELEEELAELLKELEEL 582
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1986 --EAENERLRRLAEDEAFQRRLLEEQAAQHkaDIEARLAQLRKAsESELERQKGLVEDTLRQRRQVEEEILALKGSF--- 2060
Cdd:PRK03918   583 gfESVEELEERLKELEPFYNEYLELKDAEK--ELEREEKELKKL-EEELDKAFEELAETEKRLEELRKELEELEKKYsee 659
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2061 --EKAAAGKAELELELGRIRGTAEDTLRSKEQAEqeaarqrqlaaeeerrrreaeervqKSLAAEEEAARQRKAALEEVE 2138
Cdd:PRK03918   660 eyEELREEYLELSRELAGLRAELEELEKRREEIK-------------------------KTLEKLKEELEEREKAKKELE 714
                          650
                   ....*....|....*
gi 1920237946 2139 RLKAKVEEARRLRER 2153
Cdd:PRK03918   715 KLEKALERVEELREK 729
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
2000-2489 4.29e-07

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 56.31  E-value: 4.29e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2000 AFQRRLLEEQAAQHKADIEARLAQLRKASESELERQkglvEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRg 2079
Cdd:COG4717     41 AFIRAMLLERLEKEADELFKPQGRKPELNLKELKEL----EEELKEAEEKEEEYAELQEELEELEEELEELEAELEELR- 115
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2080 tAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESA 2159
Cdd:COG4717    116 -EELEKLEKLLQLLPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEEL 194
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2160 RQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVL-ERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVE 2238
Cdd:COG4717    195 QDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEaAALEERLKEARLLLLIAAALLALLGLGGSLLSLILT 274
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2239 EAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQA------LRQKAQVEQ 2312
Cdd:COG4717    275 IAGVLFLVLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELlelldrIEELQELLR 354
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2313 ELTALRLQLEETDHQKSIlDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEEL-GKLKARIEAENRALVLRDKDSAQRL 2391
Cdd:COG4717    355 EAEELEEELQLEELEQEI-AALLAEAGVEDEEELRAALEQAEEYQELKEELEELeEQLEELLGELEELLEALDEEELEEE 433
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2392 LQEEAEKMKQVAEEAARLSVAAQEA-ARLRQLAEEDLAQQRALAEKMLKEKMQ-AVQEATRLKAEAELLQQQKELAQEqa 2469
Cdd:COG4717    434 LEELEEELEELEEELEELREELAELeAELEQLEEDGELAELLQELEELKAELReLAEEWAALKLALELLEEAREEYRE-- 511
                          490       500
                   ....*....|....*....|
gi 1920237946 2470 RRLQEDKEQMAQQLAQETQG 2489
Cdd:COG4717    512 ERLPPVLERASEYFSRLTDG 531
YqiK COG2268
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
1608-1725 4.67e-07

Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];


Pssm-ID: 441869 [Multi-domain]  Cd Length: 439  Bit Score: 55.65  E-value: 4.67e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1608 AEEAEAQKRQAQEEAErLRRQVQDETQRKRQAEAELAlRVQAEAEAAREKQRAlQALEELRLQAEEAERRLRQA--EAER 1685
Cdd:COG2268    212 TEIAIAQANREAEEAE-LEQEREIETARIAEAEAELA-KKKAEERREAETARA-EAEAAYEIAEANAEREVQRQleIAER 288
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 1920237946 1686 ARQVQVALETAQRsAEAELQSEHASFAEktAQLERTLKEE 1725
Cdd:COG2268    289 EREIELQEKEAER-EEAELEADVRKPAE--AEKQAAEAEA 325
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
2382-2819 5.02e-07

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 56.52  E-value: 5.02e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2382 LRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQ 2461
Cdd:TIGR00618  158 LKAKSKEKKELLMNLFPLDQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQ 237
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2462 KELAQEQARRLQEDKEQMA--QQLAQETQGFQKTLETERQRqLEMSAEAERLRLRVAEMSRAQARAEEdarrFRKQAEDI 2539
Cdd:TIGR00618  238 TQQSHAYLTQKREAQEEQLkkQQLLKQLRARIEELRAQEAV-LEETQERINRARKAAPLAAHIKAVTQ----IEQQAQRI 312
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2540 GERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKS---EEMQTVRQEQLLQETQA 2616
Cdd:TIGR00618  313 HTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCqqhTLTQHIHTLQQQKTTLT 392
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2617 LQQSFLSEKDSLLQRErciEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQ 2696
Cdd:TIGR00618  393 QKLQSLCKELDILQRE---QATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCAAAITCTAQCEKLEKIHLQESAQSLK 469
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2697 EELQRLAQQQQQQEKlLAEENQRLRERLQHLEEERRAALARSEEIAPSRAAAARALPNGQDAADGPAAAAEPEHAFDGLR 2776
Cdd:TIGR00618  470 EREQQLQTKEQIHLQ-ETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVY 548
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1920237946 2777 RKVPA-----QRLQEVGVLSAEELQQLAQGRTTVAELAQR-----EDVRHYLQ 2819
Cdd:TIGR00618  549 HQLTSerkqrASLKEQMQEIQQSFSILTQCDNRSKEDIPNlqnitVRLQDLTE 601
DUF4670 pfam15709
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ...
1605-1735 6.20e-07

Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.


Pssm-ID: 464815 [Multi-domain]  Cd Length: 522  Bit Score: 55.73  E-value: 6.20e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1605 RARAEEAEAQ------KRQAQEEAerlRRQVQDETQRKRQAEAELALRVQAEAEAAREKQralQALEELRLQAEEAER-- 1676
Cdd:pfam15709  337 RLRAERAEMRrleverKRREQEEQ---RRLQQEQLERAEKMREELELEQQRRFEEIRLRK---QRLEEERQRQEEEERkq 410
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 1677 -RLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQ----LERTLKEEHVAVVQLREE 1735
Cdd:pfam15709  411 rLQLQAAQERARQQQEEFRRKLQELQRKKQQEEAERAEAEKQrqkeLEMQLAEEQKRLMEMAEE 474
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
2387-2603 6.21e-07

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 55.16  E-value: 6.21e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2387 SAQRLLQEEAEKMKQVAEEAARLSVAAQEAARlrqlAEEDLAQQRALAEKMLKEKMQAVQEatrLKAEAELLQQQKELAQ 2466
Cdd:COG4942     17 AQADAAAEAEAELEQLQQEIAELEKELAALKK----EEKALLKQLAALERRIAALARRIRA---LEQELAALEAELAELE 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2467 EQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRT 2546
Cdd:COG4942     90 KEIAELRAELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAEL 169
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237946 2547 ELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQ 2603
Cdd:COG4942    170 EAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELE 226
Caldesmon pfam02029
Caldesmon;
1804-2177 7.23e-07

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 55.26  E-value: 7.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1804 KAEEQAVRQRElaEQELEKQRQLAEGTAQQRLAAEQelIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAEL 1883
Cdd:pfam02029   14 RAREERRRQKE--EEEPSGQVTESVEPNEHNSYEED--SELKPSGQGGLDEEEAFLDRTAKREERRQKRLQEALERQKEF 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1884 AKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRElAEEAARLRALAEEAKRQ--RQLAEEDAVRQRAEAERVL 1961
Cdd:pfam02029   90 DPTIADEKESVAERKENNEEEENSSWEKEEKRDSRLGRYKE-EETEIREKEYQENKWSTevRQAEEEGEEEEDKSEEAEE 168
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1962 AEKLAAISEatrlKTEAEIALKEKEAENERLRRLAedeafQRRLLEEQAAQhkadiearlaqlrkASESELERQKGLVED 2041
Cdd:pfam02029  169 VPTENFAKE----EVKDEKIKKEKKVKYESKVFLD-----QKRGHPEVKSQ--------------NGEEEVTKLKVTTKR 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2042 TLRQRRQVEEEilalkgsfEKAAAGKAELELELGRIRGTAEDtlrsKEQAEQEAARQRQLAAEEERRRREAEERVQKSLA 2121
Cdd:pfam02029  226 RQGGLSQSQER--------EEEAEVFLEAEQKLEELRRRRQE----KESEEFEKLRQKQQEAELELEELKKKREERRKLL 293
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 2122 AEEEaaRQRKAalEEVERLKAKVEEARRLRERAEQESArqlqlaqEAAQKRLQAEE 2177
Cdd:pfam02029  294 EEEE--QRRKQ--EEAERKLREEEEKRRMKEEIERRRA-------EAAEKRQKLPE 338
PLEC smart00250
Plectin repeat;
2929-2965 7.46e-07

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 48.25  E-value: 7.46e-07
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  2929 IRLLEAQIATGGIIDPVHSHRLPVDVAYQRGYFDEEM 2965
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
2293-2596 8.43e-07

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 55.41  E-value: 8.43e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2293 MEKHKQFAEQALRQKAQVEQELTAlrlQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKAR 2372
Cdd:TIGR04523  389 LESQINDLESKIQNQEKLNQQKDE---QIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRES 465
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2373 IEAENRALvlrdKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAarlRQLAEE--DLAQQRALaekmLKEKMQavqeatr 2450
Cdd:TIGR04523  466 LETQLKVL----SRSINKIKQNLEQKQKELKSKEKELKKLNEEK---KELEEKvkDLTKKISS----LKEKIE------- 527
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2451 lKAEAELLQQQKELAQeqarrLQEDKEQMAQQLAQETqgfqktLETERQRQLEmsaEAERLRLRVAEMSRAQARAEEDAR 2530
Cdd:TIGR04523  528 -KLESEKKEKESKISD-----LEDELNKDDFELKKEN------LEKEIDEKNK---EIEELKQTQKSLKKKQEEKQELID 592
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 2531 RFRKQAEDIGERLyrtelatQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQ 2596
Cdd:TIGR04523  593 QKEKEKKDLIKEI-------EEKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIK 651
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixture of two different families of ...
2388-2715 8.45e-07

Trichohyalin-plectin-homology domain; This family is a mixture of two different families of eukaryotic proteins. Trichoplein, or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility, whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 54.54  E-value: 8.45e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2388 AQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQE 2467
Cdd:pfam13868   33 RIKAEEKEEERRLDEMMEEERERALEEEEEKEEERKEERKRYRQELEEQIEEREQKRQEEYEEKLQEREQMDEIVERIQE 112
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2468 QARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERlrlRVAEMSRAQARAEEDARRFRKQAEDIGERLYrte 2547
Cdd:pfam13868  113 EDQAEAEEKLEKQRQLREEIDEFNEEQAEWKELEKEEEREEDE---RILEYLKEKAEREEEREAEREEIEEEKEREI--- 186
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2548 latqeKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEaqllqlksEEMQTVRQEQLLQETQALQQSFLSEKDS 2627
Cdd:pfam13868  187 -----ARLRAQQEKAQDEKAERDELRAKLYQEEQERKERQKERE--------EAEKKARQRQELQQAREEQIELKERRLA 253
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2628 LLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQ 2707
Cdd:pfam13868  254 EEAEREEEEFERMLRKQAEDEEIEQEEAEKRRMKRLEHRRELEKQIEEREEQRAAEREEELEEGERLREEEAERRERIEE 333

                   ....*...
gi 1920237946 2708 QQEKLLAE 2715
Cdd:pfam13868  334 ERQKKLKE 341
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
1369-2101 8.58e-07

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 55.50  E-value: 8.58e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1369 AVREQLRQEKALLEDierhGEKVEECQRFAKQyinaikdyELQLVTYKAQLepvaspakkpKVQSGsesiIQEYVDLRTR 1448
Cdd:pfam05483   96 SIEAELKQKENKLQE----NRKIIEAQRKAIQ--------ELQFENEKVSL----------KLEEE----IQENKDLIKE 149
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1449 yselSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKqrqLAEAHAQAKAQAEREAQGLQRRMQEEva 1528
Cdd:pfam05483  150 ----NNATRHLCNLLKETCARSAEKTKKYEYEREETRQVYMDLNNNIEK---MILAFEELRVQAENARLEMHFKLKED-- 220
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1529 rreevaveaqeqkrsiQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARA 1608
Cdd:pfam05483  221 ----------------HEKIQHLEEEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKTKLQDENL 284
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1609 EEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELR----LQAEEAERRLRQAEaE 1684
Cdd:pfam05483  285 KELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKaahsFVVTEFEATTCSLE-E 363
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1685 RARQVQVALEtaqrSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATrraqqqaeaerarAEAERELERWQLKA 1764
Cdd:pfam05483  364 LLRTEQQRLE----KNEDQLKIITMELQKKSSELEEMTKFKNNKEVELEELKK-------------ILAEDEKLLDEKKQ 426
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1765 NEALRLRLQAEEvaQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQR-QLAEGTAQ--------QRL 1835
Cdd:pfam05483  427 FEKIAEELKGKE--QELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEKlKNIELTAHcdklllenKEL 504
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1836 AAEQE--LIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEA---ELAKVRAEMEVLL---ASKARAEEESRST 1907
Cdd:pfam05483  505 TQEASdmTLELKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDELESvreEFIQKGDEVKCKLdksEENARSIEYEVLK 584
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1908 SEKSKQRLEAEAGRFRELAEEAAR-LRALAEEAKRQRQlaEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKE 1986
Cdd:pfam05483  585 KEKQMKILENKCNNLKKQIENKNKnIEELHQENKALKK--KGSAENKQLNAYEIKVNKLELELASAKQKFEEIIDNYQKE 662
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1987 AENERL--RRLAEdEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRrqvEEEILALKGSFEKAA 2064
Cdd:pfam05483  663 IEDKKIseEKLLE-EVEKAKAIADEAVKLQKEIDKRCQHKIAEMVALMEKHKHQYDKIIEER---DSELGLYKNKEQEQS 738
                          730       740       750
                   ....*....|....*....|....*....|....*..
gi 1920237946 2065 AGKAELELELGRIRGtaeDTLRSKEQAEQEAARQRQL 2101
Cdd:pfam05483  739 SAKAALEIELSNIKA---ELLSLKKQLEIEKEEKEKL 772
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
1472-1656 9.51e-07



Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 54.81  E-value: 9.51e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1472 EEERLAEQQRAEERERLAEveAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEvaveaqEQKRSIQEELQhl 1551
Cdd:PRK09510   107 EKERLAAQEQKKQAEEAAK--QAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAA------EAKKKAEAEAA-- 176
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1552 rQSSEAEIQAKARQVEAAersrlrieeeirvvrlQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAerlrrqvqd 1631
Cdd:PRK09510   177 -KKAAAEAKKKAEAEAAA----------------KAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKA--------- 230
                          170       180
                   ....*....|....*....|....*
gi 1920237946 1632 ETQRKRQAEAELALRVQAEAEAARE 1656
Cdd:PRK09510   231 AAEAKAAAEKAAAAKAAEKAAAAKA 255
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
4269-4297 1.07e-06

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 47.71  E-value: 1.07e-06
                           10        20
                   ....*....|....*....|....*....
gi 1920237946 4269 IVDPETGKEMSVYEAYRKGLIDHQTYLEL 4297
Cdd:pfam00681   11 IIDPVTGERLSVEEAVKRGLIDPETAQKL 39
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
2381-2734 1.09e-06



Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 55.43  E-value: 1.09e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2381 VLRDKDSAQRLLQEEAEKmKQVAEEAARLSVAAQEAARLRQLAE------EDLAQQRALAEKMLKEKMQAVQEATRLKAE 2454
Cdd:PRK02224   181 VLSDQRGSLDQLKAQIEE-KEEKDLHERLNGLESELAELDEEIEryeeqrEQARETRDEADEVLEEHEERREELETLEAE 259
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2455 AELLQQQKELAQ---EQARRLQEDKEQMAQQLAQETQGFQKTLETER-------QRQLEMSAEAERLRLRVAEMSRAQAR 2524
Cdd:PRK02224   260 IEDLRETIAETErerEELAEEVRDLRERLEELEEERDDLLAEAGLDDadaeaveARREELEDRDEELRDRLEECRVAAQA 339
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2525 AEEDARRFRKQAEDIGERlyrtelaTQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQT 2604
Cdd:PRK02224   340 HNEEAESLREDADDLEER-------AEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAED 412
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2605 VRQEqllqetqalqqsFLSEKDSLLQRERCIEqekAKLEQLfQDEVAKAQALREE--------------QQRQQQQMQQE 2670
Cdd:PRK02224   413 FLEE------------LREERDELREREAELE---ATLRTA-RERVEEAEALLEAgkcpecgqpvegspHVETIEEDRER 476
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2671 KQQLAASMEEARRRQHEAEEGVRRQQE------ELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAA 2734
Cdd:PRK02224   477 VEELEAELEDLEEEVEEVEERLERAEDlveaedRIERLEERREDLEELIAERRETIEEKRERAEELRERA 546
DUF4659 pfam15558
Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins ...
1468-1729 1.12e-06

Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins in this family are typically between 427 and 674 amino acids in length. There are two completely conserved residues (D and I) that may be functionally important.


Pssm-ID: 464768 [Multi-domain]  Cd Length: 374  Bit Score: 54.27  E-value: 1.12e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1468 RRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVAR--REEVAVEAQEQKRSIQ 1545
Cdd:pfam15558   21 QRMRELQQQAALAWEELRRRDQKRQETLERERRLLLQQSQEQWQAEKEQRKARLGREERRRAdrREKQVIEKESRWREQA 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1546 EELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRvvrlqleaterqRGGAEGELQALRARAEEAEaQKRQAQEEAERL 1625
Cdd:pfam15558  101 EDQENQRQEKLERARQEAEQRKQCQEQRLKEKEEEL------------QALREQNSLQLQERLEEAC-HKRQLKEREEQK 167
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1626 RRQVQDETQRKRQAEAELALRVQAEAEAAREK----QRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAE 1701
Cdd:pfam15558  168 KVQENNLSELLNHQARKVLVDCQAKAEELLRRlsleQSLQRSQENYEQLVEERHRELREKAQKEEEQFQRAKWRAEEKEE 247
                          250       260
                   ....*....|....*....|....*...
gi 1920237946 1702 AELQSEHASFAEKTAQLERTLKEEHVAV 1729
Cdd:pfam15558  248 ERQEHKEALAELADRKIQQARQVAHKTV 275
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
2293-2517 1.29e-06



Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 55.02  E-value: 1.29e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2293 MEKHKQFAEQALR----QKAQVEQELTALRLQLEE--TDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEEL 2366
Cdd:COG3206    166 LELRREEARKALEfleeQLPELRKELEEAEAALEEfrQKNGLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAAL 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2367 GKLKARIEAENRALVlrDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEkmqAVQ 2446
Cdd:COG3206    246 RAQLGSGPDALPELL--QSPVIQQLRAQLAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQQEAQRILAS---LEA 320
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237946 2447 EATRLKAEAELLQQQKELAQEQARRLQEdKEQMAQQLAQETQGFQKTLET--ERQRQLEMSAEAERLRLRVAE 2517
Cdd:COG3206    321 ELEALQAREASLQAQLAQLEARLAELPE-LEAELRRLEREVEVARELYESllQRLEEARLAEALTVGNVRVID 392
DUF4670 pfam15709
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ...
1803-2023 1.31e-06

Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.


Pssm-ID: 464815 [Multi-domain]  Cd Length: 522  Bit Score: 54.57  E-value: 1.31e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1803 GKAEEQAVRQRELAEQELEKQRQlAEGTAQQRLAAEQ-ELIRLRAETEQGEQ--QRQLLEEELARlqreaaaATQKRREL 1879
Cdd:pfam15709  307 GNMESEEERSEEDPSKALLEKRE-QEKASRDRLRAERaEMRRLEVERKRREQeeQRRLQQEQLER-------AEKMREEL 378
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1880 EAELAKVRAEMEvlLASKARAEEESRSTSEKSKQRLEaeagrfrelaEEAARLRALAEEAKRQRQLAEEDAVRQRAEAER 1959
Cdd:pfam15709  379 ELEQQRRFEEIR--LRKQRLEEERQRQEEEERKQRLQ----------LQAAQERARQQQEEFRRKLQELQRKKQQEEAER 446
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237946 1960 VLAEKLAAISEATRLKTEAEIALkeKEAENERLRRLAE-DEAFQRRLLEEQAAQHKADIEARLAQ 2023
Cdd:pfam15709  447 AEAEKQRQKELEMQLAEEQKRLM--EMAEEERLEYQRQkQEAEEKARLEAEERRQKEEEAARLAL 509
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
1809-2551 1.37e-06

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 54.96  E-value: 1.37e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1809 AVRQRELAEQELEKQRQLAeGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEE-------LARLQrEAAAATQKrrelea 1881
Cdd:COG3096    277 ANERRELSERALELRRELF-GARRQLAEEQYRLVEMARELEELSARESDLEQDyqaasdhLNLVQ-TALRQQEK------ 348
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1882 eLAKVRAEMEVLlasKARAEEESRSTSEKSKQRLEAEAgRFRELAEEAARLRAlaEEAKRQRQLaeeDAVRQRAEAERvl 1961
Cdd:COG3096    349 -IERYQEDLEEL---TERLEEQEEVVEEAAEQLAEAEA-RLEAAEEEVDSLKS--QLADYQQAL---DVQQTRAIQYQ-- 416
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1962 aEKLAAISEATRLKTEAEIALKEKEAENERLRRlAEDEAFQRRLleeQAAQHKADIEARLAQLRKAseseLERQKGLVED 2041
Cdd:COG3096    417 -QAVQALEKARALCGLPDLTPENAEDYLAAFRA-KEQQATEEVL---ELEQKLSVADAARRQFEKA----YELVCKIAGE 487
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2042 TLRQRR-QVEEEILALKGSFEKAAAGKAELELELGrirgtaedtlrskeQAEQEAARQRQLAAEEERRRREAEERVQKSL 2120
Cdd:COG3096    488 VERSQAwQTARELLRRYRSQQALAQRLQQLRAQLA--------------ELEQRLRQQQNAERLLEEFCQRIGQQLDAAE 553
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2121 AAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEA-----AQKRLQ--AEEKAHAFAVQQkeqelqQ 2193
Cdd:COG3096    554 ELEELLAELEAQLEELEEQAAEAVEQRSELRQQLEQLRARIKELAARApawlaAQDALErlREQSGEALADSQ------E 627
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2194 TLQQEQSVLERLRseaeaarraaeeaeaareraerEAAQSRRQVEEAERlkqsaeeqaqaqaqaqaaaeklrkeaeqeaa 2273
Cdd:COG3096    628 VTAAMQQLLERER----------------------EATVERDELAARKQ------------------------------- 654
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2274 rraQAEQAALRQKQAADAEMEKHKQFAEQ----------------------AL--------------------------- 2304
Cdd:COG3096    655 ---ALESQIERLSQPGGAEDPRLLALAERlggvllseiyddvtledapyfsALygparhaivvpdlsavkeqlagledcp 731
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2305 ---------------------------------RQ-------------KAQVEQELTALRLQLEETD---HQKSILDEEL 2335
Cdd:COG3096    732 edlyliegdpdsfddsvfdaeeledavvvklsdRQwrysrfpevplfgRAAREKRLEELRAERDELAeqyAKASFDVQKL 811
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2336 QRL--------------------KAEVTEAARQRGQVEEELFSLRVQM----EELGKLKARIEAENRAL----VLRDKDS 2387
Cdd:COG3096    812 QRLhqafsqfvgghlavafapdpEAELAALRQRRSELERELAQHRAQEqqlrQQLDQLKEQLQLLNKLLpqanLLADETL 891
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2388 AQRL--LQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAekmlKEKMQAVQEATRLKAEA---------- 2455
Cdd:COG3096    892 ADRLeeLREELDAAQEAQAFIQQHGKALAQLEPLVAVLQSDPEQFEQLQ----ADYLQAKEQQRRLKQQIfalsevvqrr 967
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2456 -------------------ELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQkTLETERQRQLEMSAEAERlrlRVA 2516
Cdd:COG3096    968 phfsyedavgllgensdlnEKLRARLEQAEEARREAREQLRQAQAQYSQYNQVLA-SLKSSRDAKQQTLQELEQ---ELE 1043
                          890       900       910
                   ....*....|....*....|....*....|....*...
gi 1920237946 2517 EMS-RAQARAEEDA--RRFRKQAEDIGERLYRTELATQ 2551
Cdd:COG3096   1044 ELGvQADAEAEERAriRRDELHEELSQNRSRRSQLEKQ 1081
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1601-1875 1.39e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 54.00  E-value: 1.39e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1601 LQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAlRVQAEAEAAREKQRALQA--------LEELRLQAE 1672
Cdd:COG4942     15 AAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLA-ALERRIAALARRIRALEQelaaleaeLAELEKEIA 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1673 EAERRLRQAEAERARQVQVALETAQRSAEAELQSehasfAEKTAQLERTLKEEHVAVVQLREEATRRAQqqaeaerarae 1752
Cdd:COG4942     94 ELRAELEAQKEELAELLRALYRLGRQPPLALLLS-----PEDFLDAVRRLQYLKYLAPARREQAEELRA----------- 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1753 aerelerwQLKANEALRLRLQAEEVAQQKSLtqaeaekqkeeaerearrrgkaEEQAVRQRELAEQELEKQRQLAEgtaq 1832
Cdd:COG4942    158 --------DLAELAALRAELEAERAELEALL----------------------AELEEERAALEALKAERQKLLAR---- 203
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|...
gi 1920237946 1833 qrlaAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQK 1875
Cdd:COG4942    204 ----LEKELAELAAELAELQQEAEELEALIARLEAEAAAAAER 242
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1903-2744 1.44e-06

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 55.12  E-value: 1.44e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1903 ESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIAL 1982
Cdd:pfam15921   81 EEYSHQVKDLQRRLNESNELHEKQKFYLRQSVIDLQTKLQEMQMERDAMADIRRRESQSQEDLRNQLQNTVHELEAAKCL 160
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1983 KEKEAENERlrrlAEDEAFQRRLLEEQAAQHkaDIEARLAQLRKASESELERQKGLVEDTLRQR--------RQVEEEIL 2054
Cdd:pfam15921  161 KEDMLEDSN----TQIEQLRKMMLSHEGVLQ--EIRSILVDFEEASGKKIYEHDSMSTMHFRSLgsaiskilRELDTEIS 234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2055 ALKGSF----EKAAAGKAELELELGRIRGTAEDTLrskEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQR 2130
Cdd:pfam15921  235 YLKGRIfpveDQLEALKSESQNKIELLLQQHQDRI---EQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQARNQ 311
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2131 KAA----LEEVE----RLKAKVEEARRLRERAEQESARQLQLAQ----EAAQKRLQAEEKAHAFAVQQKEQELQQTLQQE 2198
Cdd:pfam15921  312 NSMymrqLSDLEstvsQLRSELREAKRMYEDKIEELEKQLVLANseltEARTERDQFSQESGNLDDQLQKLLADLHKREK 391
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2199 QSVLERlrseaeaarraaeeaeaareraereaAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQA 2278
Cdd:pfam15921  392 ELSLEK--------------------------EQNKRLWDRDTGNSITIDHLRRELDDRNMEVQRLEALLKAMKSECQGQ 445
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2279 EQAALRQKQAADAEMEKHKQFAEQALRQKA---QVEQELTALRLQLEETDHQKSILDEELQR----LKAEVTEAARQRGQ 2351
Cdd:pfam15921  446 MERQMAAIQGKNESLEKVSSLTAQLESTKEmlrKVVEELTAKKMTLESSERTVSDLTASLQEkeraIEATNAEITKLRSR 525
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2352 VE---EELFSLRVQMEELGKLKAriEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQ-EAArlrQLAEEDL 2427
Cdd:pfam15921  526 VDlklQELQHLKNEGDHLRNVQT--ECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQvEKA---QLEKEIN 600
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2428 AQQRALAE-KMLKEKMQAvqEATRLKAEAELLQQQK-ELAQEQARRLQEDKEqmaqqLAQETQGFQKTLETERQRQLEMS 2505
Cdd:pfam15921  601 DRRLELQEfKILKDKKDA--KIRELEARVSDLELEKvKLVNAGSERLRAVKD-----IKQERDQLLNEVKTSRNELNSLS 673
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2506 AEAERLRlrvaemsraqaraeedaRRFRKQAEDIGERLYRTELATQE-KVMLVQTLETQRQQSDRDAERLREA------I 2578
Cdd:pfam15921  674 EDYEVLK-----------------RNFRNKSEEMETTTNKLKMQLKSaQSELEQTRNTLKSMEGSDGHAMKVAmgmqkqI 736
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2579 AELEHEKDKLKQEAQLLQlksEEMQTVRQEQ-LLQEtqalqqsflsEKDSLLQRERCIEQEKAKLEQlfQDEVAKAQALR 2657
Cdd:pfam15921  737 TAKRGQIDALQSKIQFLE---EAMTNANKEKhFLKE----------EKNKLSQELSTVATEKNKMAG--ELEVLRSQERR 801
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2658 eeqqrqqqqMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLA----EENQRLRERLQhleeeRRA 2733
Cdd:pfam15921  802 ---------LKEKVANMEVALDKASLQFAECQDIIQRQEQESVRLKLQHTLDVKELQgpgyTSNSSMKPRLL-----QPA 867
                          890
                   ....*....|.
gi 1920237946 2734 ALARSEEIAPS 2744
Cdd:pfam15921  868 SFTRTHSNVPS 878
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involved in recombination, ...
1535-2441 1.58e-06

rad50; All proteins in this family for which functions are known are involved in recombination, recombinational repair, and/or non-homologous end joining. They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 55.05  E-value: 1.58e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1535 VEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAersrLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQ 1614
Cdd:TIGR00606  185 IKALETLRQVRQTQGQKVQEHQMELKYLKQYKEKA----CEIRDQITSKEAQLESSREIVKSYENELDPLKNRLKEIEHN 260
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1615 KRQAQEEAERLRRQVQDETQRKRQaEAELALRVQAEAEAAREKqraLQALEELR-LQAEEAERRLRQAEAERARQVQVAL 1693
Cdd:TIGR00606  261 LSKIMKLDNEIKALKSRKKQMEKD-NSELELKMEKVFQGTDEQ---LNDLYHNHqRTVREKERELVDCQRELEKLNKERR 336
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1694 ETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRL--R 1771
Cdd:TIGR00606  337 LLNQEKTELLVEQGRLQLQADRHQEHIRARDSLIQSLATRLELDGFERGPFSERQIKNFHTLVIERQEDEAKTAAQLcaD 416
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1772 LQAEEVAQQKSLTQAEAEKQKEEAEREArrrgKAEEQAVRQRELaeQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQG 1851
Cdd:TIGR00606  417 LQSKERLKQEQADEIRDEKKGLGRTIEL----KKEILEKKQEEL--KFVIKELQQLEGSSDRILELDQELRKAERELSKA 490
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1852 EQQR--QLLEEELARLQREAAAATQKRRELEAELAKV------RAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFR 1923
Cdd:TIGR00606  491 EKNSltETLKKEVKSLQNEKADLDRKLRKLDQEMEQLnhhtttRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLGYFP 570
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1924 ELAEEAARLRALAEEAKRQRQ-LAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEI----ALKEKEAENERLRRLAED 1998
Cdd:TIGR00606  571 NKKQLEDWLHSKSKEINQTRDrLAKLNKELASLEQNKNHINNELESKEEQLSSYEDKLfdvcGSQDEESDLERLKEEIEK 650
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1999 EAFQRRLLEEQAAQHKADIEAR---------LAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKaaagkaE 2069
Cdd:TIGR00606  651 SSKQRAMLAGATAVYSQFITQLtdenqsccpVCQRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEK------R 724
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2070 LELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRR--EAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEA 2147
Cdd:TIGR00606  725 RDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNdiEEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDV 804
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2148 RRLRERAEQESaRQLQLAQEAAQKRLQAEEKAHAF-AVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERA 2226
Cdd:TIGR00606  805 ERKIAQQAAKL-QGSDLDRTVQQVNQEKQEKQHELdTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRR 883
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2227 EREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKL----------RKEAEQEAARRAQAEQAALRQKQAADAEMEKH 2296
Cdd:TIGR00606  884 QQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDqqekeelissKETSNKKAQDKVNDIKEKVKNIHGYMKDIENK 963
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2297 KQfaEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVtEAARQRGQVEEELFSLRVQMEELGKLKarieae 2376
Cdd:TIGR00606  964 IQ--DGKDDYLKQKETELNTVNAQLEECEKHQEKINEDMRLMRQDI-DTQKIQERWLQDNLTLRKRENELKEVE------ 1034
                          890       900       910       920       930       940
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 2377 nRALVLRDKDSAQ-RLLQEEAEKMKqvAEEAARLSVAAQEAARLRQLAEEDlaqQRALAEKMLKEK 2441
Cdd:TIGR00606 1035 -EELKQHLKEMGQmQVLQMKQEHQK--LEENIDLIKRNHVLALGRQKGYEK---EIKHFKKELREP 1094
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
2454-2740 1.58e-06

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 54.74  E-value: 1.58e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2454 EAELLQQQKELAQEQA----RRLQEDKEQMAQQ-LAQETQgfQKTLETERQRQLEMS-----AEAERLRLRVAEMSRAQA 2523
Cdd:pfam17380  267 ENEFLNQLLHIVQHQKavseRQQQEKFEKMEQErLRQEKE--EKAREVERRRKLEEAekarqAEMDRQAAIYAEQERMAM 344
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2524 RAEEDARRFRKQAEDI-GERLYRTELATQ-EKVMLVQTLETQRQQSDrdaERLREAIAELEHEKDKLKQEAQLLQLKSEE 2601
Cdd:pfam17380  345 ERERELERIRQEERKReLERIRQEEIAMEiSRMRELERLQMERQQKN---ERVRQELEAARKVKILEEERQRKIQQQKVE 421
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2602 MQTVRQEQLLQETQALQQsflsekdslLQRERCIEQEKAKLEQLfqdevakaqalreeqqRqqqqMQQEKQQLAASMEEA 2681
Cdd:pfam17380  422 MEQIRAEQEEARQREVRR---------LEEERAREMERVRLEEQ----------------E----RQQQVERLRQQEEER 472
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2682 RRRQHEAEEGVRRQQ--EELQRLAQQQQQQEKLLAE-ENQRLRERLQHLEEERRAALARSEE 2740
Cdd:pfam17380  473 KRKKLELEKEKRDRKraEEQRRKILEKELEERKQAMiEEERKRKLLEKEMEERQKAIYEEER 534
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1803-2085 1.62e-06



Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 53.75  E-value: 1.62e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1803 GKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAE 1882
Cdd:COG4372     23 GILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEE 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1883 LAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQ----LAEEDAVRQRAEAE 1958
Cdd:COG4372    103 LESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEelaaLEQELQALSEAEAE 182
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1959 RVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGL 2038
Cdd:COG4372    183 QALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEE 262
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*..
gi 1920237946 2039 VEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTL 2085
Cdd:COG4372    263 LELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSL 309
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
2332-2537 1.71e-06

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 53.68  E-value: 1.71e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2332 DEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVlRDKDSAQRLLQEEAEKMKQVAEEAARLSV 2411
Cdd:COG3883     15 DPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQ-AEIDKLQAEIAEAEAEIEERREELGERAR 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2412 AAQEAARLRQLAE--------EDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQL 2483
Cdd:COG3883     94 ALYRSGGSVSYLDvllgsesfSDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAEL 173
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 2484 AQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAE 2537
Cdd:COG3883    174 EAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAA 227
PLEC smart00250
Plectin repeat;
3588-3624 1.79e-06

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 47.09  E-value: 1.79e-06
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  3588 IRLLEAQIATGGIIDPVHSHRVPVDVAYQRGYFDEEM 3624
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
2014-2732 1.88e-06

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 and 1234 amino acids in length. This family contains a P-loop motif, suggesting it is a nucleotide-binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 54.46  E-value: 1.88e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2014 KADIEARLAQlRKASESELERQKGLVEDTLRQRrQVEEEILALKGSFEKAAAG-----KAELELELGRIRGTAEDTLRSK 2088
Cdd:pfam12128  199 KSMIVAILED-DGVVPPKSRLNRQQVEHWIRDI-QAIAGIMKIRPEFTKLQQEfntleSAELRLSHLHFGYKSDETLIAS 276
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2089 EQAEQEA--ARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQ 2166
Cdd:pfam12128  277 RQEERQEtsAELNQLLRTLDDQWKEKRDELNGELSAADAAVAKDRSELEALEDQHGAFLDADIETAAADQEQLPSWQSEL 356
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2167 EAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQS 2246
Cdd:pfam12128  357 ENLEERLKALTGKHQDVTAKYNRRRSKIKEQNNRDIAGIKDKLAKIREARDRQLAVAEDDLQALESELREQLEAGKLEFN 436
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2247 AEEQAQaqaqaqaaaeKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDh 2326
Cdd:pfam12128  437 EEEYRL----------KSRLGELKLRLNQATATPELLLQLENFDERIERAREEQEAANAEVERLQSELRQARKRRDQAS- 505
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2327 qksildEELQRLKAEVTEAARQRGQVEEELFS--------LRVQM----EELGKLKARieaenrALVLR-------DKDS 2387
Cdd:pfam12128  506 ------EALRQASRRLEERQSALDELELQLFPqagtllhfLRKEApdweQSIGKVISP------ELLHRtdldpevWDGS 573
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2388 AQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQE 2467
Cdd:pfam12128  574 VGGELNLYGVKLDLKRIDVPEWAASEEELRERLDKAEEALQSAREKQAAAEEQLVQANGELEKASREETFARTALKNARL 653
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2468 QARRLQEDKEQMAQQLAQETQGFQKTLETERQrqlEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQA--EDIGERLYR 2545
Cdd:pfam12128  654 DLRRLFDEKQSEKDKKNKALAERKDSANERLN---SLEAQLKQLDKKHQAWLEEQKEQKREARTEKQAYwqVVEGALDAQ 730
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2546 TELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEqLLQETQALQQSFLSEK 2625
Cdd:pfam12128  731 LALLKAAIAARRSGAKAELKALETWYKRDLASLGVDPDVIAKLKREIRTLERKIERIAVRRQE-VLRYFDWYQETWLQRR 809
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2626 DSLLQRERCIEQEKAKLEQlfqdevakaqalreEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQ 2705
Cdd:pfam12128  810 PRLATQLSNIERAISELQQ--------------QLARLIADTKLRRAKLEMERKASEKQQVRLSENLRGLRCEMSKLATL 875
                          730       740
                   ....*....|....*....|....*..
gi 1920237946 2706 QQQQEKllAEENQRLRERLQHLEEERR 2732
Cdd:pfam12128  876 KEDANS--EQAQGSIGERLAQLEDLKL 900
HCR pfam07111
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...
2305-2736 1.95e-06

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.


Pssm-ID: 284517 [Multi-domain]  Cd Length: 749  Bit Score: 54.37  E-value: 1.95e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2305 RQKAQVEQELTALRLQLEETDHQ---KSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALV 2381
Cdd:pfam07111  190 KQLAEAQKEAELLRKQLSKTQEEleaQVTLVESLRKYVGEQVPPEVHSQTWELERQELLDTMQHLQEDRADLQATVELLQ 269
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2382 LRDKDSAQRLLQEEAEKMKQVAEEAarlSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAE-AELLQQ 2460
Cdd:pfam07111  270 VRVQSLTHMLALQEEELTRKIQPSD---SLEPEFPKKCRSLLNRWREKVFALMVQLKAQDLEHRDSVKQLRGQvAELQEQ 346
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2461 QKELAQEQA--RRLQEDKEQMAQQLAQETQGFQKTL----ETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRK 2534
Cdd:pfam07111  347 VTSQSQEQAilQRALQDKAAEVEVERMSAKGLQMELsraqEARRRQQQQTASAEEQLKFVVNAMSSTQIWLETTMTRVEQ 426
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2535 QAEDIGERLYRTELATQE----------KVMLVQTLETQRQQSDRDAERLREAIAELEH---EKDKLKQEAQL-LQLKSE 2600
Cdd:pfam07111  427 AVARIPSLSNRLSYAVRKvhtikglmarKVALAQLRQESCPPPPPAPPVDADLSLELEQlreERNRLDAELQLsAHLIQQ 506
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2601 EMQTVRQE-------------QLLQETQALQQSFLS---EKDSLLQRERCIEQEKAKLEQ-LFQDEVAKAQALREEQQRQ 2663
Cdd:pfam07111  507 EVGRAREQgeaerqqlsevaqQLEQELQRAQESLASvgqQLEVARQGQQESTEEAASLRQeLTQQQEIYGQALQEKVAEV 586
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237946 2664 QQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQ----EELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALA 2736
Cdd:pfam07111  587 ETRLREQLSDTKRRLNEARREQAKAVVSLRQIQhratQEKERNQELRRLQDEARKEEGQRLARRVQELERDKNLMLA 663
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
1471-1636 2.35e-06

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely on conserved operon structure. The Tol-Pal complex is required for maintaining outer membrane integrity. It is also involved in transport (uptake) of colicins and filamentous DNA, and is implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 53.31  E-value: 2.35e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1471 EEEERLAEQQRAEERERLAEVEAAleKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQH 1550
Cdd:TIGR02794  101 EKAAKQAEQAAKQAEEKQKQAEEA--KAKQAAEAKAKAEAEAERKAKEEAAKQAEEEAKAKAAAEAKKKAEEAKKKAEAE 178
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1551 LRQSSEAEIQAKARQVEA-AERSRLRIEEEI----RVVRLQLEATERQRGGAEGELQALRARAEEAEAQK------RQAQ 1619
Cdd:TIGR02794  179 AKAKAEAEAKAKAEEAKAkAEAAKAKAAAEAaakaEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEKqggargAAAG 258
                          170
                   ....*....|....*..
gi 1920237946 1620 EEAERLRRQVQDETQRK 1636
Cdd:TIGR02794  259 SEVDKYAAIIQQAIQQN 275
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
2311-2740 2.37e-06

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains; it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tails from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone of the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 54.41  E-value: 2.37e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2311 EQELTALRLQLEETDHQKSILDEElQRLKAEvtEAARQRGQVEeelfslRVQMEelGKLKariEAENRALVLRDKDSaqR 2390
Cdd:pfam01576   86 EEEERSQQLQNEKKKMQQHIQDLE-EQLDEE--EAARQKLQLE------KVTTE--AKIK---KLEEDILLLEDQNS--K 149
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2391 LLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLK-EKMQAVQEATRLKAEAELLQqqkelAQEQA 2469
Cdd:pfam01576  150 LSKERKLLEERISEFTSNLAEEEEKAKSLSKLKNKHEAMISDLEERLKKeEKGRQELEKAKRKLEGESTD-----LQEQI 224
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2470 RRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQA--RAEEDAR-RFRKQAEDIGERL--Y 2544
Cdd:pfam01576  225 AELQAQIAELRAQLAKKEEELQAALARLEEETAQKNNALKKIRELEAQISELQEdlESERAARnKAEKQRRDLGEELeaL 304
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2545 RTELA-TQEKVMLVQTLETQRQQsdrDAERLREAIaelehEKDKLKQEAQLLQLKSEEMQTVR--QEQLLQ--------- 2612
Cdd:pfam01576  305 KTELEdTLDTTAAQQELRSKREQ---EVTELKKAL-----EEETRSHEAQLQEMRQKHTQALEelTEQLEQakrnkanle 376
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2613 -ETQALQQSFL---SEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQalreEQQRQQQQMQQEKQQLAASMEEARRRQHEA 2688
Cdd:pfam01576  377 kAKQALESENAelqAELRTLQQAKQDSEHKRKKLEGQLQELQARLS----ESERQRAELAEKLSKLQSELESVSSLLNEA 452
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1920237946 2689 EEGVRRQQEELQRLAQQQQQQEKLLAEENQR---LRERLQHLEEERRAALARSEE 2740
Cdd:pfam01576  453 EGKNIKLSKDVSSLESQLQDTQELLQEETRQklnLSTRLRQLEDERNSLQEQLEE 507
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
2333-2741 2.43e-06

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 54.28  E-value: 2.43e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2333 EELQRLKAEVteAARQRGQVEEELFSLRVQMEELGKLKARIEaENRALVLRDKDSAQRLLQEEAEKMKQ---VAEEAARL 2409
Cdd:PRK02224   187 GSLDQLKAQI--EEKEEKDLHERLNGLESELAELDEEIERYE-EQREQARETRDEADEVLEEHEERREEletLEAEIEDL 263
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2410 SVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLaqetQG 2489
Cdd:PRK02224   264 RETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAA----QA 339
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2490 FQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLyrtelatqekvmlvQTLETQRQQSDR 2569
Cdd:PRK02224   340 HNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEI--------------EELRERFGDAPV 405
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2570 DAERLREAIAELEHEKDKLKQEAQLLQ--LKSEEMQTVRQEQLLQE------TQALQQSflSEKDSLLQRERCIEQEKAK 2641
Cdd:PRK02224   406 DLGNAEDFLEELREERDELREREAELEatLRTARERVEEAEALLEAgkcpecGQPVEGS--PHVETIEEDRERVEELEAE 483
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2642 LEQL------FQDEVAKAQALREEQQR--QQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLL 2713
Cdd:PRK02224   484 LEDLeeeveeVEERLERAEDLVEAEDRieRLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEA 563
                          410       420       430
                   ....*....|....*....|....*....|....
gi 1920237946 2714 AEENQRLRERLQHLEE------ERRAALARSEEI 2741
Cdd:PRK02224   564 EEEAEEAREEVAELNSklaelkERIESLERIRTL 597
DUF4659 pfam15558
Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins ...
1465-1698 2.46e-06

Domain of unknown function (DUF4659); This family of proteins is found in eukaryotes. Proteins in this family are typically between 427 and 674 amino acids in length. There are two completely conserved residues (D and I) that may be functionally important.


Pssm-ID: 464768 [Multi-domain]  Cd Length: 374  Bit Score: 53.12  E-value: 2.46e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1465 ETLRRMEEEERLAEQQRAEERERLA--EVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEvavEAQEQKR 1542
Cdd:pfam15558   51 ERRLLLQQSQEQWQAEKEQRKARLGreERRRADRREKQVIEKESRWREQAEDQENQRQEKLERARQEAEQ---RKQCQEQ 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1543 SIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEA------EAQKR 1616
Cdd:pfam15558  128 RLKEKEEELQALREQNSLQLQERLEEACHKRQLKEREEQKKVQENNLSELLNHQARKVLVDCQAKAEELlrrlslEQSLQ 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1617 QAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQ----RALQALEELRL-QAEEAERRLRQAEAERARQVQV 1691
Cdd:pfam15558  208 RSQENYEQLVEERHRELREKAQKEEEQFQRAKWRAEEKEEERqehkEALAELADRKIqQARQVAHKTVQDKAQRARELNL 287

                   ....*..
gi 1920237946 1692 ALETAQR 1698
Cdd:pfam15558  288 EREKNHH 294
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
2285-2469 2.50e-06

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 53.27  E-value: 2.50e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2285 QKQAADAEMEKHKQFAEQA--LRQKAQVEQEltalRLQLEETDHQKSildeelQRLKAEVTEAARQRGQVEEELFSLRVQ 2362
Cdd:PRK09510    71 QKSAKRAEEQRKKKEQQQAeeLQQKQAAEQE----RLKQLEKERLAA------QEQKKQAEEAAKQAALKQKQAEEAAAK 140
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2363 MEELGKLKARIEAEN-RALVLRDKDSAQRLLQEEAEK-----MKQVAEEAARLSVAAQEAARLRQLAEE---DLAQQRAL 2433
Cdd:PRK09510   141 AAAAAKAKAEAEAKRaAAAAKKAAAEAKKKAEAEAAKkaaaeAKKKAEAEAAAKAAAEAKKKAEAEAKKkaaAEAKKKAA 220
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1920237946 2434 AEKmlKEKMQAVQEATRLKAEAELLQQQKELAQEQA 2469
Cdd:PRK09510   221 AEA--KAAAAKAAAEAKAAAEKAAAAKAAEKAAAAK 254
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1452-1707 2.55e-06

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 53.87  E-value: 2.55e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1452 LSTLTSQYIRFISEtLRRMEEEERLA--EQQRAEERERLAEVEAALEKQRQlaeahAQAKAQAEREAQGLQRRMQEEVAR 1529
Cdd:COG3206    154 ANALAEAYLEQNLE-LRREEARKALEflEEQLPELRKELEEAEAALEEFRQ-----KNGLVDLSEEAKLLLQQLSELESQ 227
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1530 ReevaVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRieEEIRVVRLQLeATERQRGGAEG-ELQALRARA 1608
Cdd:COG3206    228 L----AEARAELAEAEARLAALRAQLGSGPDALPELLQSPVIQQLR--AQLAELEAEL-AELSARYTPNHpDVIALRAQI 300
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1609 EEAEAQKRQaqeEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAeaeRARQ 1688
Cdd:COG3206    301 AALRAQLQQ---EAQRILASLEAELEALQAREASLQAQLAQLEARLAELPELEAELRRLEREVEVARELYESL---LQRL 374
                          250
                   ....*....|....*....
gi 1920237946 1689 VQVALETAQRSAEAELQSE 1707
Cdd:COG3206    375 EEARLAEALTVGNVRVIDP 393
CH_PLS3_rpt1 cd21325
first calponin homology (CH) domain found in plastin-3; Plastin-3, also called T-plastin, is ...
186-294 2.57e-06

first calponin homology (CH) domain found in plastin-3; Plastin-3, also called T-plastin, is an actin-bundling protein found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Plastin-3 contains four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409174  Cd Length: 148  Bit Score: 50.06  E-value: 2.57e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  186 QKKTFTKWVNK---------HLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR----EKGRMRFHKLQNVQIALDYL 252
Cdd:cd21325     25 EKYAFVNWINKalendpdcrHVIPMNPNTDDLFKAVGDGIVLCKMINLSVPDTIDErainKKKLTPFIIQENLNLALNSA 104
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1920237946  253 RHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQVS 294
Cdd:cd21325    105 SAIGCHVVNIGAEDLRAGKPHLVLGLLWQIIKIGLFADIELS 146
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
2283-2729 2.69e-06

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 53.96  E-value: 2.69e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2283 LRQKQAADAEMEKHKQFAEQALRQ-----KAQVEQELTA---LRLQLEETDHQKSI-LDEELQRLKAEVTEAARQRGQVE 2353
Cdd:pfam05483  160 LKETCARSAEKTKKYEYEREETRQvymdlNNNIEKMILAfeeLRVQAENARLEMHFkLKEDHEKIQHLEEEYKKEINDKE 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2354 EELFSLRVQM-EELGKLKARI----EAENRALVLRDKDSAQ-RLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDL 2427
Cdd:pfam05483  240 KQVSLLLIQItEKENKMKDLTflleESRDKANQLEEKTKLQdENLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDL 319
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2428 AQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQ--------KELAQEQARRLQEDKEQMaQQLAQETQgfQKTLETERQ 2499
Cdd:pfam05483  320 QIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEfeattcslEELLRTEQQRLEKNEDQL-KIITMELQ--KKSSELEEM 396
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2500 RQLEMSAEAERLRLRVAeMSRAQARAEEdarrfRKQAEDIGERLYRTElatQEKVMLVQT-------LETQRQQSDRDAE 2572
Cdd:pfam05483  397 TKFKNNKEVELEELKKI-LAEDEKLLDE-----KKQFEKIAEELKGKE---QELIFLLQArekeihdLEIQLTAIKTSEE 467
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2573 RLREAIAEL--EHEKDKLKQeaqlLQLKSE-EMQTVRQEQLLQETQALQQSFLSEKDSLLQrerCIEQEKAKLEQLfqde 2649
Cdd:pfam05483  468 HYLKEVEDLktELEKEKLKN----IELTAHcDKLLLENKELTQEASDMTLELKKHQEDIIN---CKKQEERMLKQI---- 536
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2650 vakaQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEE 2729
Cdd:pfam05483  537 ----ENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKCNNLKKQIENKNKNIEE 612
EmrA COG1566
Multidrug resistance efflux pump EmrA [Defense mechanisms];
1498-1626 2.73e-06

Multidrug resistance efflux pump EmrA [Defense mechanisms];


Pssm-ID: 441174 [Multi-domain]  Cd Length: 331  Bit Score: 52.74  E-value: 2.73e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1498 QRQLAEAHAQ-AKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKArQVEAAERSRLRI 1576
Cdd:COG1566     82 QAALAQAEAQlAAAEAQLARLEAELGAEAEIAAAEAQLAAAQAQLDLAQRELERYQALYKKGAVSQQ-ELDEARAALDAA 160
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1577 EEEIRVVRLQLEATERQRGGAEgELQALRARAEEAEAQKRQAQEEAERLR 1626
Cdd:COG1566    161 QAQLEAAQAQLAQAQAGLREEE-ELAAAQAQVAQAEAALAQAELNLARTT 209
CCDC22 pfam05667
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 ...
2285-2470 2.76e-06

Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 (CCDC22) is involved in regulation of NF-kappa-B signalling; the function may involve association with COMMD8 and a CUL1-dependent E3 ubiquitin ligase complex. It is part of the COMMD/CCDC22/CCDC93 (CCC) complex, which interacts with the multisubunit WASH complex required for endosomal deposition of F-actin and cargo trafficking in conjunction with the retromer. This entry also includes CCDC22 homologs from animals and plants.


Pssm-ID: 461708 [Multi-domain]  Cd Length: 600  Bit Score: 53.88  E-value: 2.76e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2285 QKQAADAEMEKHKQFAEQALRQKAQvEQELTALRLQLEETDHQKSILDEELQRLKAEVTeaarqrgQVEEELFSLRVQME 2364
Cdd:pfam05667  309 TNEAPAATSSPPTKVETEEELQQQR-EEELEELQEQLEDLESSIQELEKEIKKLESSIK-------QVEEELEELKEQNE 380
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2365 ELGK---LKARI-----EAENRALVLrdkdsaQRLLQEEAEKMKQVAE--EAARLSVAAQEAARLRQLAEEDLAQQRALA 2434
Cdd:pfam05667  381 ELEKqykVKKKTldllpDAEENIAKL------QALVDASAQRLVELAGqwEKHRVPLIEEYRALKEAKSNKEDESQRKLE 454
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 1920237946 2435 E-KMLKEKMQAVQEATRLKAE--AELLQQQKELAQEQAR 2470
Cdd:pfam05667  455 EiKELREKIKEVAEEAKQKEElyKQLVAEYERLPKDVSR 493
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
1557-1736 2.98e-06

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely on conserved operon structure. The Tol-Pal complex is required for maintaining outer membrane integrity. It is also involved in transport (uptake) of colicins and filamentous DNA, and is implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 52.93  E-value: 2.98e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1557 AEIQAKARQVEAAERSRLRIEEEirvvrlQLEATERQRGGAEGELQALRARAEEAEAqKRQAQEEAERLRRQVQDETQRK 1636
Cdd:TIGR02794   53 NRIQQQKKPAAKKEQERQKKLEQ------QAEEAEKQRAAEQARQKELEQRAAAEKA-AKQAEQAAKQAEEKQKQAEEAK 125
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1637 RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTA 1716
Cdd:TIGR02794  126 AKQAAEAKAKAEAEAERKAKEEAAKQAEEEAKAKAAAEAKKKAEEAKKKAEAEAKAKAEAEAKAKAEEAKAKAEAAKAKA 205
                          170       180
                   ....*....|....*....|
gi 1920237946 1717 QLERTLKEEHVAVVQLREEA 1736
Cdd:TIGR02794  206 AAEAAAKAEAEAAAAAAAEA 225
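
Each hit on this page carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal Python sketch for pulling those fields out of such a line. The function name parse_hit_summary and the field layout are assumptions inferred from the lines shown on this page, not part of any NCBI tool or API.

```python
import re

# Field layout assumed from the summary lines on this page, e.g.
# "Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 52.93  E-value: 2.98e-06"
# The bracketed tag (e.g. "[Multi-domain]") is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Return the summary fields as a dict, or None if the line does not match.

    Hypothetical helper for illustration only.
    """
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("tag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }
```

Calling parse_hit_summary on every line of a saved results page and discarding the None results would give one record per hit, which can then be sorted or filtered by E-value.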
PRK10929 PRK10929
putative mechanosensitive channel protein; Provisional
2390-2638 3.07e-06

putative mechanosensitive channel protein; Provisional


Pssm-ID: 236798 [Multi-domain]  Cd Length: 1109  Bit Score: 53.90  E-value: 3.07e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2390 RLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQ-EATRLKA---EAELLQQQKELA 2465
Cdd:PRK10929   123 RQAQQEQDRAREISDSLSQLPQQQTEARRQLNEIERRLQTLGTPNTPLAQAQLTALQaESAALKAlvdELELAQLSANNR 202
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2466 QEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAE-AERLRLRVAEMSRAQARA----EEDARRFRKQAEDIG 2540
Cdd:PRK10929   203 QELARLRSELAKKRSQQLDAYLQALRNQLNSQRQREAERALEsTELLAEQSGDLPKSIVAQfkinRELSQALNQQAQRMD 282
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2541 ERLYRTELATQEKVMLVQTLETQRQQ------SDRDAERLREAIAELEhEKDKLKQ----EAQLL--QLKSEEMQTvRQE 2608
Cdd:PRK10929   283 LIASQQRQAASQTLQVRQALNTLREQsqwlgvSNALGEALRAQVARLP-EMPKPQQldteMAQLRvqRLRYEDLLN-KQP 360
                          250       260       270
                   ....*....|....*....|....*....|
gi 1920237946 2609 QLLQETQALQQSFLSEKDSLLQRERCIEQE 2638
Cdd:PRK10929   361 QLRQIRQADGQPLTAEQNRILDAQLRTQRE 390
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
1506-1728 3.28e-06

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely on conserved operon structure. The Tol-Pal complex is required for maintaining outer membrane integrity. It is also involved in transport (uptake) of colicins and filamentous DNA, and is implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with the outer membrane porins OmpC, PhoE and LamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 52.54  E-value: 3.28e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1506 AQAKAQAEREAQGLQ---RRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKArqveaAERSRLRIEEEIRV 1582
Cdd:TIGR02794   46 GAVAQQANRIQQQKKpaaKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQ-----AEQAAKQAEEKQKQ 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1583 vrlQLEATERQRggaegelqALRARAEEAEAqKRQAQEEAerlRRQVQDETQRKRQAEAelalrvQAEAEAAREKQRAL- 1661
Cdd:TIGR02794  121 ---AEEAKAKQA--------AEAKAKAEAEA-ERKAKEEA---AKQAEEEAKAKAAAEA------KKKAEEAKKKAEAEa 179
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 1662 --QALEELRLQAEEAERRLRQAE----------AERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVA 1728
Cdd:TIGR02794  180 kaKAEAEAKAKAEEAKAKAEAAKakaaaeaaakAEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEKQGGARGAAAG 258
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1651-2182 3.64e-06

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 53.53  E-value: 3.64e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1651 AEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQV--ALETAQRSAEAELQS------EHASFAEKTAQLERTL 1722
Cdd:PRK03918   168 GEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREinEISSELPELREELEKlekevkELEELKEEIEELEKEL 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1723 KEEHVAVVQLRE---EATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREA 1799
Cdd:PRK03918   248 ESLEGSKRKLEEkirELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIE 327
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1800 RRRGKAEEQAVRQRELAEQELEKQRQLAE--------GTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELAR--LQREA 1869
Cdd:PRK03918   328 ERIKELEEKEERLEELKKKLKELEKRLEEleerhelyEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKeeIEEEI 407
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1870 AAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGR-FRELAEEAARLRALAEEAKRQ-RQLAE 1947
Cdd:PRK03918   408 SKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAeLKRIEKELKEIEEKERKLRKElRELEK 487
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1948 EDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERlRRLAEDEAFQRRLLEEqaAQHKADIEARLAQLRKA 2027
Cdd:PRK03918   488 VLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLK-EKLIKLKGEIKSLKKE--LEKLEELKKKLAELEKK 564
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2028 SEsELERQKGLVEDTLRQR-----RQVEEEILALKGSFEK---AAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQR 2099
Cdd:PRK03918   565 LD-ELEEELAELLKELEELgfesvEELEERLKELEPFYNEyleLKDAEKELEREEKELKKLEEELDKAFEELAETEKRLE 643
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2100 QLAAEEERRrreaeervqKSLAAEEEAARQRKAALE---EVERLKAKVEEARRLRERAEQ--ESARQLQLAQEAAQKRLQ 2174
Cdd:PRK03918   644 ELRKELEEL---------EKKYSEEEYEELREEYLElsrELAGLRAELEELEKRREEIKKtlEKLKEELEEREKAKKELE 714

                   ....*...
gi 1920237946 2175 AEEKAHAF 2182
Cdd:PRK03918   715 KLEKALER 722
hsdR PRK11448
type I restriction enzyme EcoKI subunit R; Provisional
1488-1582 3.84e-06

type I restriction enzyme EcoKI subunit R; Provisional


Pssm-ID: 236912 [Multi-domain]  Cd Length: 1123  Bit Score: 53.42  E-value: 3.84e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1488 LAEVEAALEKQRQLAEAHAQAKAQAEREAqglqRRMQEEVARREEVAVEAQEQKRSIQEELQHLR-QSSEAEIQAKARQV 1566
Cdd:PRK11448   144 LHALQQEVLTLKQQLELQAREKAQSQALA----EAQQQELVALEGLAAELEEKQQELEAQLEQLQeKAAETSQERKQKRK 219
                           90
                   ....*....|....*....
gi 1920237946 1567 EAAERSRLRI---EEEIRV 1582
Cdd:PRK11448   220 EITDQAAKRLelsEEETRI 238
PLEC smart00250
Plectin repeat;
3847-3882 3.91e-06

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 46.32  E-value: 3.91e-06
                            10        20        30
                    ....*....|....*....|....*....|....*.
gi 1920237946  3847 RQLLEAQAATGFLLDPVKGERLAVDEAVRKGLVGPE 3882
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE 36
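
The two rows of each alignment block (the "gi" query row and the "Cdd:" model row) can be compared column by column. The following is a minimal Python sketch that counts identical residues over the columns where neither row has a gap. The function name alignment_identity is hypothetical, and treating "-" as the only gap character and comparing residues case-insensitively are assumptions based on how the alignments appear on this page.

```python
def alignment_identity(query, subject):
    """Count identical residues between two aligned rows of equal length.

    Columns where either row holds a gap ("-") are skipped. Residues are
    compared case-insensitively, since some query residues on this page
    are printed in lowercase. Returns (identical, aligned_columns).
    Hypothetical helper for illustration only.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            continue  # gap column: not counted as aligned
        aligned += 1
        if q.upper() == s.upper():
            identical += 1
    return identical, aligned


# Rows copied from the Plectin repeat (smart00250) alignment above.
query_row = "RQLLEAQAATGFLLDPVKGERLAVDEAVRKGLVGPE"
model_row = "QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE"
identical, aligned = alignment_identity(query_row, model_row)
```

Dividing identical by aligned gives a fractional identity for the block. For alignments that span several blocks, the sequence strings of each block would need to be concatenated first.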
CH_PLS1_rpt1 cd21323
first calponin homology (CH) domain found in plastin-1; Plastin-1, also called ...
186-293 3.94e-06

first calponin homology (CH) domain found in plastin-1; Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. It contains four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409172  Cd Length: 145  Bit Score: 49.66  E-value: 3.94e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  186 QKKTFTKWVNK---------HLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR----EKGRMRFHKLQNVQIALDYL 252
Cdd:cd21323     25 EKVAFVNWINKalegdpdckHVVPMNPTDESLFKSLADGILLCKMINLSQPDTIDErainKKKLTPFTISENLNLALNSA 104
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 1920237946  253 RHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQV 293
Cdd:cd21323    105 SAIGCTVVNIGSLDLKEGKPHLVLGLLWQIIKVGLFADIEI 145
PRK10246 PRK10246
exonuclease subunit SbcC; Provisional
1480-2184 4.73e-06

exonuclease subunit SbcC; Provisional


Pssm-ID: 182330 [Multi-domain]  Cd Length: 1047  Bit Score: 53.27  E-value: 4.73e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1480 QRAEERERLAEVEAALEKQRQLAEAHAQAKAQAER-EAQGlqrrmqeevarrEEVAVEAQEQKRSIQEELQHLRQsseae 1558
Cdd:PRK10246   168 ERAELLEELTGTEIYGQISAMVFEQHKSARTELEKlQAQA------------SGVALLTPEQVQSLTASLQVLTD----- 230
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1559 iqakarqveaaersrlriEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAerlrrqvQDETQRKRQ 1638
Cdd:PRK10246   231 ------------------EEKQLLTAQQQQQQSLNWLTRLDELQQEASRRQQALQQALAAEEKA-------QPQLAALSL 285
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1639 AEAELALRVQAEaeaarEKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQrsaeaELQSEHASFAEKTAql 1718
Cdd:PRK10246   286 AQPARQLRPHWE-----RIQEQSAALAHTRQQIEEVNTRLQSTMALRARIRHHAAKQSA-----ELQAQQQSLNTWLA-- 353
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1719 ertlkeEHVAVVQLREE-----ATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLtqaeaekqke 1793
Cdd:PRK10246   354 ------EHDRFRQWNNElagwrAQFSQQTSDREQLRQWQQQLTHAEQKLNALPAITLTLTADEVAAALAQ---------- 417
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1794 eaerearrrgKAEEQAVRQR--ELAEQELEKQRQLAEgTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLqREAAA 1871
Cdd:PRK10246   418 ----------HAEQRPLRQRlvALHGQIVPQQKRLAQ-LQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADV-KTICE 485
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1872 ATQKRRELEAELAKVRAEMEVLL--ASKARAEEESRS-TSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEE 1948
Cdd:PRK10246   486 QEARIKDLEAQRAQLQAGQPCPLcgSTSHPAVEAYQAlEPGVNQSRLDALEKEVKKLGEEGAALRGQLDALTKQLQRDES 565
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1949 DAvRQRAEAERVLAEKLAAISEA--TRLKTEAEIA--LKEKEAENERLRRLAEDEAFQRRLLE--EQAAQHKADIEARLA 2022
Cdd:PRK10246   566 EA-QSLRQEEQALTQQWQAVCASlnITLQPQDDIQpwLDAQEEHERQLRLLSQRHELQGQIAAhnQQIIQYQQQIEQRQQ 644
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2023 QLR----------------------KASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGT 2080
Cdd:PRK10246   645 QLLtalagyaltlpqedeeaswlatRQQEAQSWQQRQNELTALQNRIQQLTPLLETLPQSDDLPHSEETVALDNWRQVHE 724
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2081 AEDTLRSK-----EQAEQEAARQRQLAAEEERRRREAEERVQKSLAAE--EEAARQRKAALEevERLKAKVEEARRLRER 2153
Cdd:PRK10246   725 QCLSLHSQlqtlqQQDVLEAQRLQKAQAQFDTALQASVFDDQQAFLAAllDEETLTQLEQLK--QNLENQRQQAQTLVTQ 802
                          730       740       750
                   ....*....|....*....|....*....|.
gi 1920237946 2154 AEQESARQLQLAQEAAQKRLQAEEKAHAFAV 2184
Cdd:PRK10246   803 TAQALAQHQQHRPDGLDLTVTVEQIQQELAQ 833
PLEC smart00250
Plectin repeat;
4052-4089 5.10e-06

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 45.94  E-value: 5.10e-06
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1920237946  4052 QRFLEGTSSIAGVLVDATKERLSVYQAMKKGIIRPGTA 4089
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involved in recombination, ...
1945-2645 5.22e-06

rad50; All proteins in this family for which functions are known are involved in recombination, recombinational repair, and/or non-homologous end joining. They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 53.13  E-value: 5.22e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1945 LAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQhkadiEARLAQL 2024
Cdd:TIGR00606  165 LSEGKALKQKFDEIFSATRYIKALETLRQVRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQ-----LESSREI 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2025 RKASESEL----ERQKGlVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQ 2100
Cdd:TIGR00606  240 VKSYENELdplkNRLKE-IEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLNDLYHNHQRTVREKE 318
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2101 laaeeerrrrEAEERVQKSLAAEEEAAR---QRKAALE-EVERLKAKVE---EARRLRERAEQESARQLQLAQEAAQKRL 2173
Cdd:TIGR00606  319 ----------RELVDCQRELEKLNKERRllnQEKTELLvEQGRLQLQADrhqEHIRARDSLIQSLATRLELDGFERGPFS 388
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2174 QAE-EKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQ 2252
Cdd:TIGR00606  389 ERQiKNFHTLVIERQEDEAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEG 468
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2253 AQAQAQAAAEKLRKEAE---------------QEAARRAQAEQAALRQKQAADAEME----------------KHKQFAE 2301
Cdd:TIGR00606  469 SSDRILELDQELRKAERelskaeknsltetlkKEVKSLQNEKADLDRKLRKLDQEMEqlnhhtttrtqmemltKDKMDKD 548
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2302 QALRQ-KAQVEQELTAL------RLQLEETDHQKS----ILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGklk 2370
Cdd:TIGR00606  549 EQIRKiKSRHSDELTSLlgyfpnKKQLEDWLHSKSkeinQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYE--- 625
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2371 ariEAENRALVLRDKDSAQRLLQEEAEKM-KQVAEEAARLSVAAQeaaRLRQLAEED-----LAQQRALAEKMLKEKMQA 2444
Cdd:TIGR00606  626 ---DKLFDVCGSQDEESDLERLKEEIEKSsKQRAMLAGATAVYSQ---FITQLTDENqsccpVCQRVFQTEAELQEFISD 699
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2445 VQEATRLkAEAELLQQQKELAQEQARRlqEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSR--AQ 2522
Cdd:TIGR00606  700 LQSKLRL-APDKLKSTESELKKKEKRR--DEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQRLKNDIEEQETllGT 776
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2523 ARAEEDARRFRKQAEDIGERLYRtELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEM 2602
Cdd:TIGR00606  777 IMPEEESAKVCLTDVTIMERFQM-ELKDVERKIAQQAAKLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQ 855
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|...
gi 1920237946 2603 QTVRQeQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQL 2645
Cdd:TIGR00606  856 QEQIQ-HLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEV 897
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1430-1644 5.26e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 52.07  E-value: 5.26e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1430 KVQSGSESIIQEYVDLRTRYSELSTL---TSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEA-- 1504
Cdd:COG4942     45 ALKKEEKALLKQLAALERRIAALARRiraLEQELAALEAELAELEKEIAELRAELEAQKEELAELLRALYRLGRQPPLal 124
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1505 --HAQAKAQAEREAQGLQRRMQeevARREEVaveaqEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIrv 1582
Cdd:COG4942    125 llSPEDFLDAVRRLQYLKYLAP---ARREQA-----EELRADLAELAALRAELEAERAELEALLAELEEERAALEALK-- 194
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 1583 vrlqleaTERQRggaegELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELA 1644
Cdd:COG4942    195 -------AERQK-----LLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTP 244
CH_FLNA_rpt2 cd21312
second calponin homology (CH) domain found in filamin-A (FLN-A) and similar proteins; ...
296-403 6.33e-06

second calponin homology (CH) domain found in filamin-A (FLN-A) and similar proteins; Filamin-A (FLN-A) is also called actin-binding protein 280 (ABP-280), alpha-filamin, endothelial actin-binding protein, filamin-1, or non-muscle filamin. It promotes orthogonal branching of actin filaments and links actin filaments to membrane glycoproteins. It also anchors various transmembrane proteins to the actin cytoskeleton and serves as a scaffold for a wide range of cytoplasmic signaling proteins. FLN-A contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409161  Cd Length: 114  Bit Score: 48.26  E-value: 6.33e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  296 QSEDMTAKEKLLLWSQRMVEGcqgLRCDNFTTSWRDGRLFNAIIHRHKPTLI-DMNKVYRQTNLENLDQAFSVAERDLGV 374
Cdd:cd21312      7 EAKKQTPKQRLLGWIQNKLPQ---LPITNFSRDWQSGRALGALVDSCAPGLCpDWDSWDASKPVTNAREAMQQADDWLGI 83
                           90       100
                   ....*....|....*....|....*....
gi 1920237946  375 TRLLDPEDVDVPQPDEKSIITYVSSLYDA 403
Cdd:cd21312     84 PQVITPEEIVDPNVDEHSVMTYLSQFPKA 112
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
1516-2152 6.88e-06

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 52.42  E-value: 6.88e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1516 AQGLQR---RMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATER 1592
Cdd:pfam05483   73 SEGLSRlysKLYKEAEKIKKWKVSIEAELKQKENKLQENRKIIEAQRKAIQELQFENEKVSLKLEEEIQENKDLIKENNA 152
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1593 QRGGAEgELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAElaLRVQAEaeaarekqralQALEELRLQAE 1672
Cdd:pfam05483  153 TRHLCN-LLKETCARSAEKTKKYEYEREETRQVYMDLNNNIEKMILAFEE--LRVQAE-----------NARLEMHFKLK 218
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1673 EAERRLRQAEAERARQV-----QVALETAQrSAEAElqsehasfaEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAE 1747
Cdd:pfam05483  219 EDHEKIQHLEEEYKKEIndkekQVSLLLIQ-ITEKE---------NKMKDLTFLLEESRDKANQLEEKTKLQDENLKELI 288
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1748 RARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQla 1827
Cdd:pfam05483  289 EKKDHLTKELEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLR-- 366
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1828 egTAQQRLaaeqelirlraetEQGEQQRQLLEEELARLQREAAAATQ----KRRELEaELAKVRAEMEVLLASKARAEEE 1903
Cdd:pfam05483  367 --TEQQRL-------------EKNEDQLKIITMELQKKSSELEEMTKfknnKEVELE-ELKKILAEDEKLLDEKKQFEKI 430
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1904 SRSTSEKSKQRLEAEAGRFRELAEEAARLRALaeEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLkteaeialk 1983
Cdd:pfam05483  431 AEELKGKEQELIFLLQAREKEIHDLEIQLTAI--KTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLL--------- 499
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1984 ekeaENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGL--VEDTLRQRR--------QVEEEI 2053
Cdd:pfam05483  500 ----ENKELTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDELesVREEFIQKGdevkckldKSEENA 575
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2054 LALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKsLAAEEEAARQR--- 2130
Cdd:pfam05483  576 RSIEYEVLKKEKQMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNK-LELELASAKQKfee 654
                          650       660       670
                   ....*....|....*....|....*....|....*...
gi 1920237946 2131 ----------------KAALEEVERLKAKVEEARRLRE 2152
Cdd:pfam05483  655 iidnyqkeiedkkiseEKLLEEVEKAKAIADEAVKLQK 692
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
2301-2740 7.04e-06

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 52.76  E-value: 7.04e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2301 EQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAenral 2380
Cdd:PRK03918   161 ENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEE----- 235
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2381 vLRDKDSAQRLLQEEAEKMKQVAEEaarlsvaaqeaaRLRQLAEedlaqQRALAEKMLKEKMQAVQEATRLKAEAELLQQ 2460
Cdd:PRK03918   236 -LKEEIEELEKELESLEGSKRKLEE------------KIRELEE-----RIEELKKEIEELEEKVKELKELKEKAEEYIK 297
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2461 QKELAQEQARRLQEDKEQMA--QQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMsRAQARAEEDARRFRKQAED 2538
Cdd:PRK03918   298 LSEFYEEYLDELREIEKRLSrlEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEEL-EERHELYEEAKAKKEELER 376
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2539 IGERLyrTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKqeAQLLQLKS---------EEMQTVRQEQ 2609
Cdd:PRK03918   377 LKKRL--TGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELK--KAIEELKKakgkcpvcgRELTEEHRKE 452
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2610 LLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEvAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAE 2689
Cdd:PRK03918   453 LLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKE-SELIKLKELAEQLKELEEKLKKYNLEELEKKAEEYEKLK 531
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1920237946 2690 EGVRRQQEELQRLAqQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEE 2740
Cdd:PRK03918   532 EKLIKLKGEIKSLK-KELEKLEELKKKLAELEKKLDELEEELAELLKELEE 581
PRK10929 PRK10929
putative mechanosensitive channel protein; Provisional
1353-1672 7.30e-06

putative mechanosensitive channel protein; Provisional


Pssm-ID: 236798 [Multi-domain]  Cd Length: 1109  Bit Score: 52.75  E-value: 7.30e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1353 KQRQEQIQAVPLANSQAVREQLRQEKALLEDIERHGEKveecqrfAKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQ 1432
Cdd:PRK10929    29 TQELEQAKAAKTPAQAEIVEALQSALNWLEERKGSLER-------AKQYQQVIDNFPKLSAELRQQLNNERDEPRSVPPN 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1433 SGSESIIQEYVDLRTRYSELSTLTSQ---YIRFISETLRRMEeeerlaeQQRAEERERLAEVEAALEKQRQLAEAHAQAK 1509
Cdd:PRK10929   102 MSTDALEQEILQVSSQLLEKSRQAQQeqdRAREISDSLSQLP-------QQQTEARRQLNEIERRLQTLGTPNTPLAQAQ 174
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1510 AQAereaqglqrrMQEEVARReevaveaqeqkRSIQEELQhLRQSSEAEIQAKAR-QVEAAERSRLRIEEEIRVVRLQLE 1588
Cdd:PRK10929   175 LTA----------LQAESAAL-----------KALVDELE-LAQLSANNRQELARlRSELAKKRSQQLDAYLQALRNQLN 232
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1589 aTERQRggaegelqalraRAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQ-------AEAEAAREKQRAL 1661
Cdd:PRK10929   233 -SQRQR------------EAERALESTELLAEQSGDLPKSIVAQFKINRELSQALNQQAQrmdliasQQRQAASQTLQVR 299
                          330
                   ....*....|.
gi 1920237946 1662 QALEELRLQAE 1672
Cdd:PRK10929   300 QALNTLREQSQ 310
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
2400-2616 7.36e-06

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely on conserved operon structure. The Tol-Pal complex is required for maintaining outer membrane integrity. It is also involved in transport (uptake) of colicins and filamentous DNA, and is implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with the outer membrane porins OmpC, PhoE and LamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 51.77  E-value: 7.36e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2400 KQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQM 2479
Cdd:TIGR02794   46 GAVAQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAK 125
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2480 AQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKvmlvQT 2559
Cdd:TIGR02794  126 AKQAAEAKAKAEAEAERKAKEEAAKQAEEEAKAKAAAEAKKKAEEAKKKAEAEAKAKAEAEAKAKAEEAKAKAE----AA 201
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237946 2560 LETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQA 2616
Cdd:TIGR02794  202 KAKAAAEAAAKAEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEKQGGARGAAAG 258
PTZ00491 PTZ00491
major vault protein; Provisional
2442-2587 8.37e-06

major vault protein; Provisional


Pssm-ID: 240439 [Multi-domain]  Cd Length: 850  Bit Score: 52.33  E-value: 8.37e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2442 MQAVQEATRlkaeaELLQQQKELA-------QEQARRLQ-EDKEQMAQ-QLAQetQGFQKTLETERQRQLEMSAEAERLR 2512
Cdd:PTZ00491   638 VEPVDERTR-----DSLQKSVQLAieittksQEAAARHQaELLEQEARgRLER--QKMHDKAKAEEQRTKLLELQAESAA 710
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 2513 LRVAEMSRAQARAEEDARRFRKQAEdigerLYRTEL-ATQEKVMLVQTLETQRQQSDRDAERlREAIAELEHEKDK 2587
Cdd:PTZ00491   711 VESSGQSRAEALAEAEARLIEAEAE-----VEQAELrAKALRIEAEAELEKLRKRQELELEY-EQAQNELEIAKAK 780
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
2283-2730 8.66e-06

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 52.13  E-value: 8.66e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2283 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTalrlQLEETDHQKSILDEELQRLKAEVTEAARQRGQVE-----EELF 2357
Cdd:pfam10174   58 LKEQYRVTQEENQHLQLTIQALQDELRAQRDLN----QLLQQDFTTSPVDGEDKFSTPELTEENFRRLQSEherqaKELF 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2358 SLRVQMEELgklKARIEAENRALVLRDkDSAQRLL---QEEAEKMKQVAEEAARLSVAAQEAARLRQLaeEDLAQQRALA 2434
Cdd:pfam10174  134 LLRKTLEEM---ELRIETQKQTLGARD-ESIKKLLemlQSKGLPKKSGEEDWERTRRIAEAEMQLGHL--EVLLDQKEKE 207
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2435 EKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLqedkEQMAQQLAQETQgfqkTLETErqrqLEMSAEAERLRLR 2514
Cdd:pfam10174  208 NIHLREELHRRNQLQPDPAKTKALQTVIEMKDTKISSL----ERNIRDLEDEVQ----MLKTN----GLLHTEDREEEIK 275
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2515 VAEMSRAQARaeedarrFRK-QAEDIGERLYRTE---LATQEKVmlvQTLETQRQQSDRDAERLREAIAELEHEKDKLKQ 2590
Cdd:pfam10174  276 QMEVYKSHSK-------FMKnKIDQLKQELSKKEselLALQTKL---ETLTNQNSDCKQHIEVLKESLTAKEQRAAILQT 345
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2591 EAQLLQLKSEEMQTV-----RQEQLLQETQALQQSFLSE-KDSLLQRERCIEQEKAKLEQL---FQDEVAKAQALREEQQ 2661
Cdd:pfam10174  346 EVDALRLRLEEKESFlnkktKQLQDLTEEKSTLAGEIRDlKDMLDVKERKINVLQKKIENLqeqLRDKDKQLAGLKERVK 425
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 2662 RQQQQMQQEKQQLAaSMEEARRrqhEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEE 2730
Cdd:pfam10174  426 SLQTDSSNTDTALT-TLEEALS---EKERIIERLKEQREREDRERLEELESLKKENKDLKEKVSALQPE 490
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1529-1882 8.77e-06

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 51.44  E-value: 8.77e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1529 RREEVAVEAQEQKRSIQEELQHLRQsseaEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARA 1608
Cdd:COG4372     21 KTGILIAALSEQLRKALFELDKLQE----ELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAEL 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1609 EEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQ 1688
Cdd:COG4372     97 AQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQAL 176
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1689 VQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEAL 1768
Cdd:COG4372    177 SEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVI 256
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1769 RLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAET 1848
Cdd:COG4372    257 LKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILL 336
                          330       340       350
                   ....*....|....*....|....*....|....
gi 1920237946 1849 EQGEQQRQLLEEELARLQREAAAATQKRRELEAE 1882
Cdd:COG4372    337 AELADLLQLLLVGLLDNDVLELLSKGAEAGVADG 370
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
4054-4092 9.47e-06

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 45.01  E-value: 9.47e-06
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 4054 FLEGTSSIAGVLVDATKERLSVYQAMKKGIIRPGTAFEL 4092
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
COG5281 COG5281
Phage-related minor tail protein [Mobilome: prophages, transposons];
2362-2742 9.66e-06

Phage-related minor tail protein [Mobilome: prophages, transposons];


Pssm-ID: 444092 [Multi-domain]  Cd Length: 603  Bit Score: 51.92  E-value: 9.66e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2362 QMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEK 2441
Cdd:COG5281      1 AAALAAAAALAAAAAAAAASAAAAAAAAALAAAAAAAAAAAGLAAAAAAAAAASLAAAAAAAALAAAAAAAAAAAAADAL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2442 MQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRA 2521
Cdd:COG5281     81 AAALAEDAAAAAAAAEAALAALAAAALALAAAALAEAALAAAAAAAAAAAAAAAAAAAAAAAAAEAAKAAAAAAAAAALA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2522 QARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEE 2601
Cdd:COG5281    161 AAAAAAAAAAAAAAAAAALAAASAAAAAAAAKAAAEAAAEAAAAAEAAAAAAAAAAEAAAAEAQALAAAALAEQAALAAA 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2602 MQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEA 2681
Cdd:COG5281    241 SAAAQALAALAAAAAAAALALAAAAELALTAQAEAAAAAAAAAAAAAQAAEAAAAAAEAQALAAAAAAAAAQLAAAAAAA 320
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2682 R-RRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEEIA 2742
Cdd:COG5281    321 AqALRAAAQALAALAQRALAAAALAAAAQEAALAAAAAALQAALEAAAAAAAAELAAAGDWA 382
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1502-1717 9.66e-06

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 51.37  E-value: 9.66e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1502 AEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLrqssEAEIQAKARQVEAAERsrlRIEEEIR 1581
Cdd:COG3883     14 ADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEAL----QAEIDKLQAEIAEAEA---EIEERRE 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1582 VVRLQLEATERQrGGAEGELQAL--------------------RARAEEAEAQKrQAQEEAERLRRQVQDETQRKRQAEA 1641
Cdd:COG3883     87 ELGERARALYRS-GGSVSYLDVLlgsesfsdfldrlsalskiaDADADLLEELK-ADKAELEAKKAELEAKLAELEALKA 164
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 1642 ELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQ 1717
Cdd:COG3883    165 ELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 240
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1478-1632 9.71e-06

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 50.31  E-value: 9.71e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1478 EQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREaqglQRRMQEEVARREEVAVEAQEQKRSIqeelqhlrqSSEA 1557
Cdd:COG1579     23 EHRLKELPAELAELEDELAALEARLEAAKTELEDLEKE----IKRLELEIEEVEARIKKYEEQLGNV---------RNNK 89
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237946 1558 EIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDE 1632
Cdd:COG1579     90 EYEALQKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAE 164
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
2451-2737 9.73e-06

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 52.26  E-value: 9.73e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2451 LKAEAELLQQQKELAQEQARRlqedkEQMAQQLAQETQGfQKTLETERQrqlemsAEAERLRLRVAEMsraqaRAEEDAR 2530
Cdd:COG3096    288 LELRRELFGARRQLAEEQYRL-----VEMARELEELSAR-ESDLEQDYQ------AASDHLNLVQTAL-----RQQEKIE 350
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2531 RFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEA----QLLQLKsEEMQTVR 2606
Cdd:COG3096    351 RYQEDLEELTERLEEQEEVVEEAAEQLAEAEARLEAAEEEVDSLKSQLADYQQALDVQQTRAiqyqQAVQAL-EKARALC 429
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2607 Q---------EQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQ------DEVAKAQA-------LREEQQRQQ 2664
Cdd:COG3096    430 GlpdltpenaEDYLAAFRAKEQQATEEVLELEQKLSVADAARRQFEKAYElvckiaGEVERSQAwqtarelLRRYRSQQA 509
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237946 2665 QQMQQEKqqLAASMEEARRRQHEAEEgVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALAR 2737
Cdd:COG3096    510 LAQRLQQ--LRAQLAELEQRLRQQQN-AERLLEEFCQRIGQQLDAAEELEELLAELEAQLEELEEQAAEAVEQ 579
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1602-1866 1.01e-05

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 51.94  E-value: 1.01e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1602 QALRARAEEAEAQKRQAQEEAERLRRQVqDETQRKRQA--EAELALRVQAEAEAAREKQRALQA-LEELRLQAEEAERRL 1678
Cdd:COG3206    164 QNLELRREEARKALEFLEEQLPELRKEL-EEAEAALEEfrQKNGLVDLSEEAKLLLQQLSELESqLAEARAELAEAEARL 242
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1679 RQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEAtrraqqqaeaeraraeaerele 1758
Cdd:COG3206    243 AALRAQLGSGPDALPELLQSPVIQQLRAQLAELEAELAELSARYTPNHPDVIALRAQI---------------------- 300
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1759 rwqlkanEALRLRLQAEEVAQQKSLtqaeaekqkeeaerearrrgKAEEQAVRQRelaEQELEKQRQLAEGTAQQRLAAE 1838
Cdd:COG3206    301 -------AALRAQLQQEAQRILASL--------------------EAELEALQAR---EASLQAQLAQLEARLAELPELE 350
                          250       260
                   ....*....|....*....|....*...
gi 1920237946 1839 QELIRLRAETeqgEQQRQLLEEELARLQ 1866
Cdd:COG3206    351 AELRRLEREV---EVARELYESLLQRLE 375
growth_prot_Scy NF041483
polarized growth protein Scy;
1969-2722 1.07e-05

polarized growth protein Scy;


Pssm-ID: 469371 [Multi-domain]  Cd Length: 1293  Bit Score: 52.14  E-value: 1.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1969 SEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQ 2048
Cdd:NF041483    22 AEMDRLKTEREKAVQHAEDLGYQVEVLRAKLHEARRSLASRPAYDGADIGYQAEQLLRNAQIQADQLRADAERELRDARA 101
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2049 VEEEIlaLKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQ----------AEQEAARQRQLAAEEERRRREAEERVQK 2118
Cdd:NF041483   102 QTQRI--LQEHAEHQARLQAELHTEAVQRRQQLDQELAERRQtveshvnenvAWAEQLRARTESQARRLLDESRAEAEQA 179
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2119 SLAAEEEAARqrkAALEEVERLKAKVEEARRLRE----RAEQESARQLQLAQEAAQKRLQAEEKAHAF-------AVQQK 2187
Cdd:NF041483   180 LAAARAEAER---LAEEARQRLGSEAESARAEAEailrRARKDAERLLNAASTQAQEATDHAEQLRSStaaesdqARRQA 256
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2188 EQELQQTLQQEQSVLERLR-----SEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAE 2262
Cdd:NF041483   257 AELSRAAEQRMQEAEEALRearaeAEKVVAEAKEAAAKQLASAESANEQRTRTAKEEIARLVGEATKEAEALKAEAEQAL 336
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2263 KLRKEAEQEAARRAQAEQAALRQKQAAdAEMEKHKQFAEQALRQKAQVEQELTalRLQLEETDHQKSILDEELQRLKAEV 2342
Cdd:NF041483   337 ADARAEAEKLVAEAAEKARTVAAEDTA-AQLAKAARTAEEVLTKASEDAKATT--RAAAEEAERIRREAEAEADRLRGEA 413
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2343 TEAARQ-RGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAAR-----LSVAAQEA 2416
Cdd:NF041483   414 ADQAEQlKGAAKDDTKEYRAKTVELQEEARRLRGEAEQLRAEAVAEGERIRGEARREAVQQIEEAARtaeelLTKAKADA 493
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2417 ARLRQLAEEDLAQQRALA-EKMLKEKMQAVQEATRLKAEAELLQQQkelAQEQARRLQEDKEQMAQQLAQETQGFQKTLE 2495
Cdd:NF041483   494 DELRSTATAESERVRTEAiERATTLRRQAEETLERTRAEAERLRAE---AEEQAEEVRAAAERAARELREETERAIAARQ 570
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2496 TERQRQLE-MSAEAERlRLRVAEMSRAQARAEedARRFRKQAEDIGERLyRTELAtqekvmlvQTLETQRQQSDRDAERL 2574
Cdd:NF041483   571 AEAAEELTrLHTEAEE-RLTAAEEALADARAE--AERIRREAAEETERL-RTEAA--------ERIRTLQAQAEQEAERL 638
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2575 R-EAIAELEHEkdKLKQEAQLLQLKSEEMQtvRQEQLLQETQALQQSFLSEKDSLLQRercIEQEKAKleqlfqdevaka 2653
Cdd:NF041483   639 RtEAAADASAA--RAEGENVAVRLRSEAAA--EAERLKSEAQESADRVRAEAAAAAER---VGTEAAE------------ 699
                          730       740       750       760       770       780       790
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2654 qalreeqqrqqqqmqqekqQLAASMEEARRRQHEAEEGVRRQQEEL-QRLAQQQQQQEKLLAEENQRLRE 2722
Cdd:NF041483   700 -------------------ALAAAQEEAARRRREAEETLGSARAEAdQERERAREQSEELLASARKRVEE 750
ATAD3_N pfam12037
ATPase family AAA domain-containing protein 3, N-terminal; This is the conserved N-terminal ...
1446-1587 1.07e-05

ATPase family AAA domain-containing protein 3, N-terminal; This is the conserved N-terminal domain of ATPase family AAA domain-containing protein 3 (ATAD3), which is involved in dimerization and interacts with the inner surface of the outer mitochondrial membrane. This domain is found associated with the AAA ATPase domain (pfam00004). ATAD3 is essential for mitochondrial network organization, mitochondrial metabolism and cell growth at the organism and cellular levels. It may also play an important role in mitochondrial protein synthesis.


Pssm-ID: 463442 [Multi-domain]  Cd Length: 264  Bit Score: 50.37  E-value: 1.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1446 RTRYSELSTLTSQYIRFI----SETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRqlaeahaqakAQAEREAQglqR 1521
Cdd:pfam12037   56 QTRQAELQAKIKEYEAAQeqlkIERQRVEYEERRKTLQEETKQKQQRAQYQDELARKR----------YQDQLEAQ---R 122
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 1522 RMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQL 1587
Cdd:pfam12037  123 RRNEELLRKQEESVAKQEAMRIQAQRRQTEEHEAELRRETERAKAEAEAEARAKEERENEDLNLEQ 188
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
988-1638 1.09e-05

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 51.89  E-value: 1.09e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  988 NQEAQEAIARlEAQHQALVALWhQLHTEMKSLLAWQSLGRDMQlirsWSLATFRTLKPEE------------QRQALRsL 1055
Cdd:TIGR00618  223 VLEKELKHLR-EALQQTQQSHA-YLTQKREAQEEQLKKQQLLK----QLRARIEELRAQEavleetqerinrARKAAP-L 295
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1056 ELHYQAFLRDSQDAGGFGPEDRLQaEREYGSCSRHYQQLLQSLEQGEQEESRCQRCISELKDIRLQLEACETRTVHRLRL 1135
Cdd:TIGR00618  296 AAHIKAVTQIEQQAQRIHTELQSK-MRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQ 374
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1136 PLDKEPARECAQRIT----EQQKAQAEVDGLGKGVARLSAEAEKVLALPEPSPAAptlRSELELTLGKLEQVRSLSAIYL 1211
Cdd:TIGR00618  375 HTLTQHIHTLQQQKTtltqKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHA---KKQQELQQRYAELCAAAITCTA 451
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1212 EKLKtisLVIRSTQEAEEVLRAHEEQLKEAQAVpaTLPELEATKAALKKLRAQAEAQQPVFDALR---------DELRGA 1282
Cdd:TIGR00618  452 QCEK---LEKIHLQESAQSLKEREQQLQTKEQI--HLQETRKKAVVLARLLELQEEPCPLCGSCIhpnparqdiDNPGPL 526
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1283 QEVGERLQQRHGERDVEVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLgawlrdakqRQEQIQAV 1362
Cdd:TIGR00618  527 TRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNL---------QNITVRLQ 597
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1363 PLansqaVREQLRQEKALLEDIERHGEKVEECQRFAK--QYINAIKDYELQLVTYKAQLEpvASPAKKPKVQSGSESIIQ 1440
Cdd:TIGR00618  598 DL-----TEKLSEAEDMLACEQHALLRKLQPEQDLQDvrLHLQQCSQELALKLTALHALQ--LTLTQERVREHALSIRVL 670
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1441 EYVDLRTRYSELSTLTSQY--IRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQG 1518
Cdd:TIGR00618  671 PKELLASRQLALQKMQSEKeqLTYWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELMH 750
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1519 LQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAE 1598
Cdd:TIGR00618  751 QARTVLKARTEAHFNNNEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETLVQEE 830
                          650       660       670       680
                   ....*....|....*....|....*....|....*....|
gi 1920237946 1599 GELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQ 1638
Cdd:TIGR00618  831 EQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQAK 870
CH_NAV2 cd21285
calponin homology (CH) domain found in neuron navigator 2; Neuron navigator 2 (NAV2), also ...
183-282 1.10e-05

calponin homology (CH) domain found in neuron navigator 2; Neuron navigator 2 (NAV2), also called helicase APC down-regulated 1 (HELAD1), pore membrane and/or filament-interacting-like protein 2 (POMFIL2), retinoic acid inducible in neuroblastoma 1 (RAINB1), Steerin-2 (STEERIN2), or Unc-53 homolog 2 (unc53H2), possesses 3' to 5' helicase activity and exonuclease activity. It is involved in neuronal development, specifically in the development of different sensory organs. NAV2 contains a single copy of the CH domain at the N-terminus. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409134  Cd Length: 121  Bit Score: 47.65  E-value: 1.10e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  183 DRVQKKTFTKWVNKHLIKA--QRHISDLYEDLRDGHNLISLLEVLSGDSLPREKG--RMRFHKLQNVQIALDYLRHRQVK 258
Cdd:cd21285      8 NGFDKQIYTDWANHYLAKSghKRLIKDLQQDVTDGVLLAEIIQVVANEKIEDINGcpKNRSQMIENIDACLSFLAAKGIN 87
                           90       100
                   ....*....|....*....|....
gi 1920237946  259 LVNIRNDDIADGNPKLTLGLIWTI 282
Cdd:cd21285     88 IQGLSAEEIRNGNLKAILGLFFSL 111
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
2563-2752 1.11e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 51.30  E-value: 1.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2563 QRQQSDRDAERLREAIAELEHEKDKLKQEAQLL--QLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKA 2640
Cdd:COG4942     21 AAAEAEAELEQLQQEIAELEKELAALKKEEKALlkQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELE 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2641 KLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASME------EARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLA 2714
Cdd:COG4942    101 AQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQylkylaPARREQAEELRADLAELAALRAELEAERAELEALL 180
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1920237946 2715 EENQRLRERLQHLEEERRAALARSEEIAPSRAAAARAL 2752
Cdd:COG4942    181 AELEEERAALEALKAERQKLLARLEKELAELAAELAEL 218
Caldesmon pfam02029
Caldesmon;
1466-1682 1.15e-05

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 51.41  E-value: 1.15e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1466 TLRRMEEEERLAEQQRAEERERLAEVEAALEkqrqlaEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQ 1545
Cdd:pfam02029  134 EIREKEYQENKWSTEVRQAEEEGEEEEDKSE------EAEEVPTENFAKEEVKDEKIKKEKKVKYESKVFLDQKRGHPEV 207
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1546 EELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRV-VRLQLEATERQRGGAEG-ELQALRARAEEAEAQKRQAQEEAE 1623
Cdd:pfam02029  208 KSQNGEEEVTKLKVTTKRRQGGLSQSQEREEEAEVFLeAEQKLEELRRRRQEKESeEFEKLRQKQQEAELELEELKKKRE 287
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 1624 RLRRQVQDETQRKRQAEAELALRvqaEAEaarEKQRALQALEelRLQAEEAERRLRQAE 1682
Cdd:pfam02029  288 ERRKLLEEEEQRRKQEEAERKLR---EEE---EKRRMKEEIE--RRRAEAAEKRQKLPE 338
Crescentin pfam19220
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament ...
2283-2585 1.20e-05

Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament proteins, named crescentin, whose cytoskeletal function is required for the vibrioid and helical shapes of Caulobacter crescentus. Without crescentin, the cells adopt a straight-rod morphology. Crescentin has characteristic features of IF proteins, including the ability to assemble into filaments in vitro without energy or cofactor requirements. In vivo, crescentin forms a helical structure that colocalizes with the inner cell curvatures beneath the cytoplasmic membrane.


Pssm-ID: 437057 [Multi-domain]  Cd Length: 401  Bit Score: 51.22  E-value: 1.20e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2283 LRQKQAADAEMEKHkqfAEQALRQKAQVEQELTALRLQLEETDHQksildeeLQRLKAEVTEAARQRGQVEEELFSLRVQ 2362
Cdd:pfam19220  113 LRDKTAQAEALERQ---LAAETEQNRALEEENKALREEAQAAEKA-------LQRAEGELATARERLALLEQENRRLQAL 182
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2363 MEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEkmkqvaeeaarlsVAAQEAARLRQLAEEDLAQQRALAEKM-LKEK 2441
Cdd:pfam19220  183 SEEQAAELAELTRRLAELETQLDATRARLRALEGQ-------------LAAEQAERERAEAQLEEAVEAHRAERAsLRMK 249
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2442 MQAVQeaTRLKAEAELLQQQKELAQEQARRLQEdKEQMAQQLAQETQGFQKTLEterqrqlEMSAEAERLRLRVAEMSRA 2521
Cdd:pfam19220  250 LEALT--ARAAATEQLLAEARNQLRDRDEAIRA-AERRLKEASIERDTLERRLA-------GLEADLERRTQQFQEMQRA 319
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237946 2522 QARAEEDARRFRKQAEDIGERLYRTE---LATQEKV-MLVQTLETQRQQSDRDAERLREaiaELEHEK 2585
Cdd:pfam19220  320 RAELEERAEMLTKALAAKDAALERAEeriASLSDRIaELTKRFEVERAALEQANRRLKE---ELQRER 384
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1866-2187 1.27e-05

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 51.66  E-value: 1.27e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1866 QREAAAATQKRRELEAELAKVRAEMEvllaSKARAEEESRSTSEKSKQRlEAEAGRFRELAEEAARLralaeEAKRQRQL 1945
Cdd:pfam17380  281 QKAVSERQQQEKFEKMEQERLRQEKE----EKAREVERRRKLEEAEKAR-QAEMDRQAAIYAEQERM-----AMEREREL 350
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1946 AEEDAVRQRAEAERVLAEKLAAisEATRLKtEAEIALKEKEAENERLRRLAEdEAFQRRLLEEQAAQHKADIEARLAQLR 2025
Cdd:pfam17380  351 ERIRQEERKRELERIRQEEIAM--EISRMR-ELERLQMERQQKNERVRQELE-AARKVKILEEERQRKIQQQKVEMEQIR 426
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2026 KASESELERQ-KGLVEDTLRQRRQVEEEilalkgsfekaaagKAELELELGRIRGTAEDTLRSKEQAEQEaaRQRQLAAE 2104
Cdd:pfam17380  427 AEQEEARQREvRRLEEERAREMERVRLE--------------EQERQQQVERLRQQEEERKRKKLELEKE--KRDRKRAE 490
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2105 EERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARrlRERAEQESARQLQLAQE---AAQKRLQAEEKAHA 2181
Cdd:pfam17380  491 EQRRKILEKELEERKQAMIEEERKRKLLEKEMEERQKAIYEEER--RREAEEERRKQQEMEERrriQEQMRKATEERSRL 568

                   ....*.
gi 1920237946 2182 FAVQQK 2187
Cdd:pfam17380  569 EAMERE 574
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1076-1646 1.35e-05

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 51.66  E-value: 1.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1076 DRLQAEREYGSCSRHYQQLLQSLEQGEQEESrcqrcISELKDIRL-QLEACETRTVHRLRlpldkepaRECAQRITEQQK 1154
Cdd:pfam15921  364 ERDQFSQESGNLDDQLQKLLADLHKREKELS-----LEKEQNKRLwDRDTGNSITIDHLR--------RELDDRNMEVQR 430
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1155 AQAEVDGL-----GKGVARLSAEAEKVLALPEPSPAAPTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEE 1229
Cdd:pfam15921  431 LEALLKAMksecqGQMERQMAAIQGKNESLEKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVSDLTASLQEKER 510
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1230 VlraheeqlkeaqavpatlpeLEATKAALKKLRAQAEAQQPVFDALRDE---LRGAQEVGERLQQRHGERDVEVERWRER 1306
Cdd:pfam15921  511 A--------------------IEATNAEITKLRSRVDLKLQELQHLKNEgdhLRNVQTECEALKLQMAEKDKVIEILRQQ 570
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1307 VTLLLE-------RWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAWLR--DAKQRQEQIQAVPLANSQAvrEQLRQE 1377
Cdd:pfam15921  571 IENMTQlvgqhgrTAGAMQVEKAQLEKEINDRRLELQEFKILKDKKDAKIRelEARVSDLELEKVKLVNAGS--ERLRAV 648
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1378 KALLEDIERHGEKVEECQrfaKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQsgsesiiqeyvdLRTRYSELSTLTS 1457
Cdd:pfam15921  649 KDIKQERDQLLNEVKTSR---NELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQ------------LKSAQSELEQTRN 713
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1458 qyirfiseTLRRMEEEERLAeqqraeererlaeVEAALEKQRQLAEAHAQAKAqaereaqglqrrMQEEVARREEVAVEA 1537
Cdd:pfam15921  714 --------TLKSMEGSDGHA-------------MKVAMGMQKQITAKRGQIDA------------LQSKIQFLEEAMTNA 760
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1538 QEQKRSIQEELQHLRQsseaeiqakarqveaaersrlrieeeirvvRLQLEATERQRggAEGELQALRA---RAEEAEAQ 1614
Cdd:pfam15921  761 NKEKHFLKEEKNKLSQ------------------------------ELSTVATEKNK--MAGELEVLRSqerRLKEKVAN 808
                          570       580       590
                   ....*....|....*....|....*....|..
gi 1920237946 1615 KRQAQEEAERLRRQVQDETQRKRQAEAELALR 1646
Cdd:pfam15921  809 MEVALDKASLQFAECQDIIQRQEQESVRLKLQ 840
mukB PRK04863
chromosome partition protein MukB;
1081-1724 1.45e-05

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 51.88  E-value: 1.45e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1081 EREYGSCSRHYQQLLQSLEQGEQEEsrcqRCISELKDIRLQLEAcetrtvhrlRLPLDKEPARECAQRITEQQKAQAEVD 1160
Cdd:PRK04863   327 EQDYQAASDHLNLVQTALRQQEKIE----RYQADLEELEERLEE---------QNEVVEEADEQQEENEARAEAAEEEVD 393
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1161 GLGKGVA--------------------RLSAEAEKVLALPEPSPA-----APTLRSEL-ELTLGKLE---QVRSLSAIYL 1211
Cdd:PRK04863   394 ELKSQLAdyqqaldvqqtraiqyqqavQALERAKQLCGLPDLTADnaedwLEEFQAKEqEATEELLSleqKLSVAQAAHS 473
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1212 EKLKTISLVIR-----STQEAEEVLRAHEEQLKEAQAVPATLPELEATKAALKK-LRAQAEAQQpvfdaLRDELRGAQEV 1285
Cdd:PRK04863   474 QFEQAYQLVRKiagevSRSEAWDVARELLRRLREQRHLAEQLQQLRMRLSELEQrLRQQQRAER-----LLAEFCKRLGK 548
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1286 G-------ERLQQRHGER----DVEVERWRERVTLLlerwQAVLAQTDVRQRELEQLGRQLRYYRESADPL----GAWLR 1350
Cdd:PRK04863   549 NlddedelEQLQEELEARleslSESVSEARERRMAL----RQQLEQLQARIQRLAARAPAWLAAQDALARLreqsGEEFE 624
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1351 DAKQRQEQIQ--AVPLANSQAVREQLRQEK-ALLEDIER----HGEKVEECQRFAKQ-------------------YINA 1404
Cdd:PRK04863   625 DSQDVTEYMQqlLERERELTVERDELAARKqALDEEIERlsqpGGSEDPRLNALAERfggvllseiyddvsledapYFSA 704
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1405 -------------IKDYELQLVT--------YKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSelstltsqyiRFI 1463
Cdd:PRK04863   705 lygparhaivvpdLSDAAEQLAGledcpedlYLIEGDPDSFDDSVFSVEELEKAVVVKIADRQWRYS----------RFP 774
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1464 SETL-RRMEEEERLAEQQRaeERERLAEveaalekqrqlaeahaqAKAQAEREAQGLQRRMQE---------EVARREEV 1533
Cdd:PRK04863   775 EVPLfGRAAREKRIEQLRA--EREELAE-----------------RYATLSFDVQKLQRLHQAfsrfigshlAVAFEADP 835
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1534 AVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRL--------------RIEEEIRVVRLQLEATE------RQ 1593
Cdd:PRK04863   836 EAELRQLNRRRVELERALADHESQEQQQRSQLEQAKEGLSAlnrllprlnlladeTLADRVEEIREQLDEAEeakrfvQQ 915
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1594 RGGA----EGELQALRARAEEAEAQKR---QAQEEAERLRRQVQDET---QRKRQAEAELALRVQAEAEAAREKQRalQA 1663
Cdd:PRK04863   916 HGNAlaqlEPIVSVLQSDPEQFEQLKQdyqQAQQTQRDAKQQAFALTevvQRRAHFSYEDAAEMLAKNSDLNEKLR--QR 993
                          730       740       750       760       770       780
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237946 1664 LEELRLQAEEAERRLRQAEAERARQVQValetaqrsaEAELQSEHASFAEKTAQLERTLKE 1724
Cdd:PRK04863   994 LEQAEQERTRAREQLRQAQAQLAQYNQV---------LASLKSSYDAKRQMLQELKQELQD 1045
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1079-1517 1.46e-05

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 51.31  E-value: 1.46e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1079 QAEREYGSCSRHYQQLLQSLEQGEQEESRCQRCISELKDIRLQLEACE-----TRTVHRLRLPLDKEPAR--ECAQRITE 1151
Cdd:COG4717     78 EELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLqllplYQELEALEAELAELPERleELEERLEE 157
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1152 QQKAQAEVDGLGKGVARLSAEAEKVLALPEPSpaaptLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVL 1231
Cdd:COG4717    158 LRELEEELEELEAELAELQEELEELLEQLSLA-----TEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQL 232
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1232 R------AHEEQLKEAQAVPATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRE 1305
Cdd:COG4717    233 EneleaaALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGKEAEELQALPA 312
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1306 RVTLLLERWQAVLAQTDVRQREleqlgrQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEkALLEDIE 1385
Cdd:COG4717    313 LEELEEEELEELLAALGLPPDL------SPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALLAE-AGVEDEE 385
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1386 RHGEKVEECQRFaKQYINAIKDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEyvdLRTRYSELSTLTSQYIRFISE 1465
Cdd:COG4717    386 ELRAALEQAEEY-QELKEELEELEEQLEELLGELEELLEALDEEELEEELEELEEE---LEELEEELEELREELAELEAE 461
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 1466 tLRRMEEEERLAE--QQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQ 1517
Cdd:COG4717    462 -LEQLEEDGELAEllQELEELKAELRELAEEWAALKLALELLEEAREEYREERL 514
hsdR PRK11448
type I restriction enzyme EcoKI subunit R; Provisional
1836-1931 1.61e-05

type I restriction enzyme EcoKI subunit R; Provisional


Pssm-ID: 236912 [Multi-domain]  Cd Length: 1123  Bit Score: 51.49  E-value: 1.61e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1836 AAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLlasKARAEEESRSTSEKSKQRL 1915
Cdd:PRK11448   146 ALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQL---QEKAAETSQERKQKRKEIT 222
                           90
                   ....*....|....*.
gi 1920237946 1916 EAEAGRFrELAEEAAR 1931
Cdd:PRK11448   223 DQAAKRL-ELSEEETR 237
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
1211-1679 1.64e-05

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (mitotic arrest deficient, or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S. cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 51.28  E-value: 1.64e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1211 LEKLKTISLVIRSTQEAEEVLRAHEEQLKEAQAVPATLPELEATKAALkklraQAEAQQpVFDALRDELRGAQEVGERLQ 1290
Cdd:pfam05557   51 QELQKRIRLLEKREAEAEEALREQAELNRLKKKYLEALNKKLNEKESQ-----LADARE-VISCLKNELSELRRQIQRAE 124
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1291 QRHGERDVEVERWRERVTLLLERWQAV---LAQTDVRQREL---EQLGRQLRYYRESADPLGAWLRDAKQRQEQIqavpl 1364
Cdd:pfam05557  125 LELQSTNSELEELQERLDLLKAKASEAeqlRQNLEKQQSSLaeaEQRIKELEFEIQSQEQDSEIVKNSKSELARI----- 199
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1365 ANSQAVREQLRQEKALL----EDIERHGEKVEECQRFAKQYinaiKDYELQLVTYKAQLEPVASPAKK------------ 1428
Cdd:pfam05557  200 PELEKELERLREHNKHLneniENKLLLKEEVEDLKRKLERE----EKYREEAATLELEKEKLEQELQSwvklaqdtglnl 275
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1429 PKVQSGSESIIQEYVDLRTRYSELSTLTSQyIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEK-QRQL------ 1501
Cdd:pfam05557  276 RSPEDLSRRIEQLQQREIVLKEENSSLTSS-ARQLEKARRELEQELAQYLKKIEDLNKKLKRHKALVRRlQRRVllltke 354
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1502 ----------------AEAHAQAKAQAEREAQGLQRRMQ---EEVARREEVAVEA----QEQKRSIQEELQHLRQ----- 1553
Cdd:pfam05557  355 rdgyrailesydkeltMSNYSPQLLERIEEAEDMTQKMQahnEEMEAQLSVAEEElggyKQQAQTLERELQALRQqesla 434
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1554 ---SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQ 1630
Cdd:pfam05557  435 dpsYSKEEVDSLRRKLETLELERQRLREQKNELEMELERRCLQGDYDPKKTKVLHLSMNPAAEAYQQRKNQLEKLQAEIE 514
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1920237946 1631 --DETQRKRQAEAELALRVQAEAEAAREKQralqaLEELRLQAEEAERRLR 1679
Cdd:pfam05557  515 rlKRLLKKLEDDLEQVLRLPETTSTMNFKE-----VLDLRKELESAELKNQ 560
PspA COG1842
Phage shock protein A [Transcription, Signal transduction mechanisms];
1804-2026 1.83e-05

Phage shock protein A [Transcription, Signal transduction mechanisms];


Pssm-ID: 441447 [Multi-domain]  Cd Length: 217  Bit Score: 49.05  E-value: 1.83e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1804 KAEEQAVRQRELAEQELEK-QRQLAEgtaqqrlaAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRE-LEA 1881
Cdd:COG1842     16 ALLDKAEDPEKMLDQAIRDmEEDLVE--------ARQALAQVIANQKRLERQLEELEAEAEKWEEKARLALEKGREdLAR 87
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1882 ELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVL 1961
Cdd:COG1842     88 EALERKAELEAQAEALEAQLAQLEEQVEKLKEALRQLESKLEELKAKKDTLKARAKAAKAQEKVNEALSGIDSDDATSAL 167
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237946 1962 AEklaaiseatrlkteAEIALKEKEAENERLRRLAEDEAFQRRLLEeqaAQHKADIEARLAQLRK 2026
Cdd:COG1842    168 ER--------------MEEKIEEMEARAEAAAELAAGDSLDDELAE---LEADSEVEDELAALKA 215
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1536-1673 1.93e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 49.15  E-value: 1.93e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1536 EAQEQKRSIQEELQHLRQ---SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGA---------EGELQA 1603
Cdd:COG1579     21 RLEHRLKELPAELAELEDelaALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVrnnkeyealQKEIES 100
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1604 LRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEE 1673
Cdd:COG1579    101 LKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAEREELAA 170
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
1395-1726 1.97e-05

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 51.17  E-value: 1.97e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1395 QRFAKQYINAIKDYelqlvtYKAQLEPVASPAKKPKvQSGSESIIQEYVDLRTRY-SELSTLTSQyirfiSETLRRMEEE 1473
Cdd:NF033838    53 NESQKEHAKEVESH------LEKILSEIQKSLDKRK-HTQNVALNKKLSDIKTEYlYELNVLKEK-----SEAELTSKTK 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1474 ERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQ----RRMQEEVArreEVAVEAQEQKRSIQEELQ 1549
Cdd:NF033838   121 KELDAAFEQFKKDTLEPGKKVAEATKKVEEAEKKAKDQKEEDRRNYPtntyKTLELEIA---ESDVEVKKAELELVKEEA 197
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1550 HLRQSSEAEIQAKAR-QVEAAERSRLrieEEIRVVRLQLEATERQRGGAE-GELQALRARAEEAEAQKRQA--------- 1618
Cdd:NF033838   198 KEPRDEEKIKQAKAKvESKKAEATRL---EKIKTDREKAEEEAKRRADAKlKEAVEKNVATSEQDKPKRRAkrgvlgepa 274
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1619 -----QEEAERLRRQVQDET-------QRKRQAEAE-LALRVQAEAEAAREKQR---ALQALEELRLQAEEAERRLRQAE 1682
Cdd:NF033838   275 tpdkkENDAKSSDSSVGEETlpspslkPEKKVAEAEkKVEEAKKKAKDQKEEDRrnyPTNTYKTLELEIAESDVKVKEAE 354
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 1920237946 1683 A----ERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEH 1726
Cdd:NF033838   355 LelvkEEAKEPRNEEKIKQAKAKVESKKAEATRLEKIKTDRKKAEEEA 402
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
1813-2525 2.13e-05

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 51.27  E-value: 2.13e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1813 RELAEQELEKQRQLAEGTAQQRLAAE---QELIRLRAETEQGEQQRQLLEEELARLQ-REAAAATQK-RRELEAELA--- 1884
Cdd:pfam15921  158 KCLKEDMLEDSNTQIEQLRKMMLSHEgvlQEIRSILVDFEEASGKKIYEHDSMSTMHfRSLGSAISKiLRELDTEISylk 237
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1885 ----KVRAEMEVLLA-SKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAER 1959
Cdd:pfam15921  238 grifPVEDQLEALKSeSQNKIELLLQQHQDRIEQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQEQARNQNSMYMR 317
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1960 VLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQL-----RKASESELER 2034
Cdd:pfam15921  318 QLSDLESTVSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTERDQFSQESGNLDDQLQKLladlhKREKELSLEK 397
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2035 QK-----------GLVEDTLRQR---RQVEEEIL-ALKGSFEKAAAGkaELELELGRIRGtaedtlrSKEQAEQEAARQR 2099
Cdd:pfam15921  398 EQnkrlwdrdtgnSITIDHLRRElddRNMEVQRLeALLKAMKSECQG--QMERQMAAIQG-------KNESLEKVSSLTA 468
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2100 QLAAEEERRRREAEERVQKSLAAE--EEAARQRKAALEEVER-LKAKVEEARRLRERAEQesarQLQLAQEaaqkrLQAE 2176
Cdd:pfam15921  469 QLESTKEMLRKVVEELTAKKMTLEssERTVSDLTASLQEKERaIEATNAEITKLRSRVDL----KLQELQH-----LKNE 539
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2177 EKaHAFAVQQKEQELQQTLQQEQSVLERLRseaeaarraaeeaeaareraereaaqsrRQVEEAERLkqsaeeqaqaqaq 2256
Cdd:pfam15921  540 GD-HLRNVQTECEALKLQMAEKDKVIEILR----------------------------QQIENMTQL------------- 577
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2257 aqaaaeklrkeaeqeaarraqaeqaalrqkqaadaeMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQ 2336
Cdd:pfam15921  578 ------------------------------------VGQHGRTAGAMQVEKAQLEKEINDRRLELQEFKILKDKKDAKIR 621
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2337 RLKAEVTEAARQRGQV----EEELFSLRVQMEELGKLKARIEAENRAL--VLRDKDSAQRLLQEEAEKMKQVAEE-AARL 2409
Cdd:pfam15921  622 ELEARVSDLELEKVKLvnagSERLRAVKDIKQERDQLLNEVKTSRNELnsLSEDYEVLKRNFRNKSEEMETTTNKlKMQL 701
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2410 SVAAQEAARLR---QLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAqe 2486
Cdd:pfam15921  702 KSAQSELEQTRntlKSMEGSDGHAMKVAMGMQKQITAKRGQIDALQSKIQFLEEAMTNANKEKHFLKEEKNKLSQELS-- 779
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|...
gi 1920237946 2487 tqgfqkTLETERQR---QLE-MSAEAERLRLRVAEMSRAQARA 2525
Cdd:pfam15921  780 ------TVATEKNKmagELEvLRSQERRLKEKVANMEVALDKA 816
PLEC smart00250
Plectin repeat;
3923-3959 2.18e-05

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 44.01  E-value: 2.18e-05
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  3923 LRLLDAQLATGGIVDPRLGFHLPLDVAYQRGYLDKDT 3959
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
PRK11637 PRK11637
AmiB activator; Provisional
2299-2530 2.18e-05

AmiB activator; Provisional


Pssm-ID: 236942 [Multi-domain]  Cd Length: 428  Bit Score: 50.46  E-value: 2.18e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2299 FAEQALRQKAQ---VEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQrgqveeelfsLRVQMEELGKLKARIEA 2375
Cdd:PRK11637    38 FSAHASDNRDQlksIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAISQASRK----------LRETQNTLNQLNKQIDE 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2376 ENRalvlrdkdSAQRLLQEEAEKMKqvaeeaarlSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQA----VQEAtRL 2451
Cdd:PRK11637   108 LNA--------SIAKLEQQQAAQER---------LLAAQLDAAFRQGEHTGLQLILSGEESQRGERILAyfgyLNQA-RQ 169
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2452 KAEAELLQQQKELAQEqaRRLQEDKEQMAQQLAQETQGFQKTLETER-----------------QRQL-EMSAEAERLRL 2513
Cdd:PRK11637   170 ETIAELKQTREELAAQ--KAELEEKQSQQKTLLYEQQAQQQKLEQARnerkktltglesslqkdQQQLsELRANESRLRD 247
                          250
                   ....*....|....*...
gi 1920237946 2514 RVAEMSR-AQARAEEDAR 2530
Cdd:PRK11637   248 SIARAEReAKARAEREAR 265
CHASE3 COG5278
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
1477-1905 2.27e-05

Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];


Pssm-ID: 444089 [Multi-domain]  Cd Length: 530  Bit Score: 50.68  E-value: 2.27e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1477 AEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSE 1556
Cdd:COG5278     84 ARAEIDELLAELRSLTADNPEQQARLDELEALIDQWLAELEQVIALRRAGGLEAALALVRSGEGKALMDEIRARLLLLAL 163
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1557 AEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRK 1636
Cdd:COG5278    164 ALAALLLAAAALLLLLLALAALLALAELLLLALARALAALLLLLLLEAELAAAAALLAAAAALAALAALELLAALALALA 243
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1637 RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTA 1716
Cdd:COG5278    244 LLLAALLLALLAALALAALLAAALLALAALLLALAAAAALAAAAALELAAAEALALAELELELLLAAAAAAAAAAAAAAA 323
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1717 QLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAE 1796
Cdd:COG5278    324 ALAALLALALATALAAAAAALALLAALLAEAAAAAAEEAEAAAEAAAAALAGLAEVEAEGAAEAVELEVLAIAAAAAAAA 403
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1797 REARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKR 1876
Cdd:COG5278    404 AEAAAAAAAAAAASAAEALELAEALAEALALAEEEALALAAASSELAEAGAALALAAAEALAEELAAVAALAALAAAAAA 483
                          410       420
                   ....*....|....*....|....*....
gi 1920237946 1877 RELEAELAKVRAEMEVLLASKARAEEESR 1905
Cdd:COG5278    484 LAEAEAAAALAAAAALSLALALAALLLAA 512
CH_PLS2_rpt1 cd21324
first calponin homology (CH) domain found in plastin-2; Plastin-2, also called L-plastin, or ...
186-293 2.35e-05

first calponin homology (CH) domain found in plastin-2; Plastin-2, also called L-plastin, LC64P, or lymphocyte cytosolic protein 1 (LCP-1), is an actin-binding protein that plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-2 contains four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409173  Cd Length: 145  Bit Score: 47.31  E-value: 2.35e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  186 QKKTFTKWVNK---------HLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR----EKGRMRFHKLQNVQIALDYL 252
Cdd:cd21324     25 EKYAFVNWINKalendpdckHVIPMNPNTDDLFKAVGDGIVLCKMINFSVPDTIDErtinKKKLTPFTIQENLNLALNSA 104
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 1920237946  253 RHRQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQISDIQV 293
Cdd:cd21324    105 SAIGCHVVNIGAEDLKEGKPYLVLGLLWQVIKIGLFADIEL 145
MARTX_Nterm NF012221
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model ...
2283-2488 2.37e-05

MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model describes the N-terminal 1900 amino acids of MARTX family multifunctional-autoprocessing repeats-in-toxin holotoxins, which contain both repeat regions that facilitate their entry into eukaryotic target cells, and multiple effector domains.


Pssm-ID: 467957 [Multi-domain]  Cd Length: 1848  Bit Score: 50.99  E-value: 2.37e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2283 LRQKQAADA-----EMEKHKQFAEQALRQKaqveqeltalrlQLEETDH---------QKSILDEELQRLKAEVTEAARQ 2348
Cdd:NF012221  1561 LADKERAEAdrqrlEQEKQQQLAAISGSQS------------QLESTDQnaletngqaQRDAILEESRAVTKELTTLAQG 1628
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2349 RGQVEEElfslRVQMEELGKlKARIEAENRAL--VLRDKDSAQRLLQEEAEKMKQ--------VAEEAARLSVAAQEAAR 2418
Cdd:NF012221  1629 LDALDSQ----ATYAGESGD-QWRNPFAGGLLdrVQEQLDDAKKISGKQLADAKQrhvdnqqkVKDAVAKSEAGVAQGEQ 1703
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2419 LRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQ 2488
Cdd:NF012221  1704 NQANAEQDIDDAKADAEKRKDDALAKQNEAQQAESDANAAANDAQSRGEQDASAAENKANQAQADAKGAK 1773
COG3903 COG3903
Predicted ATPase [General function prediction only];
1663-2101 2.44e-05

Predicted ATPase [General function prediction only];


Pssm-ID: 443109 [Multi-domain]  Cd Length: 933  Bit Score: 50.79  E-value: 2.44e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1663 ALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQ---------LR 1733
Cdd:COG3903    477 AAERLAEAGERAAARRRHADYYLALAERAAAELRGPDQLAWLARLDAEHDNLRAALRWALAHGDAELALrlaaalapfWF 556
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1734 EEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQR 1813
Cdd:COG3903    557 LRGLLREGRRWLERALAAAGEAAAALAAAAALAAAAAAARAAAAAAAAAAAAAAAAAAAAAAAAAALLLLAALAAAAAAA 636
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1814 ELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVL 1893
Cdd:COG3903    637 AAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAALAAAAAALAAAAAAAALAAAAAAALAAAAA 716
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1894 LASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATR 1973
Cdd:COG3903    717 AAAAAAAAAALLAAAAAAALAAAAAAAALALAAAAAAAAAAAAAAALAAAAAAAALAALLLALAAAAAALAAAAAAAAAA 796
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1974 LKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEI 2053
Cdd:COG3903    797 AAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAALAAALAAAAAAAAAAAAAAAAAAALAAALAAAAAAAAAAALAAAAAA 876
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*...
gi 1920237946 2054 LALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQL 2101
Cdd:COG3903    877 AAAAAAALLAAAAAAAAAAAAAAAAAAALAAAAAAAAAAALAAAAAAA 924
CH_PLS1_rpt3 cd21329
third calponin homology (CH) domain found in plastin-1; Plastin-1, also called ...
181-289 2.54e-05

third calponin homology (CH) domain found in plastin-1; Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. It contains four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409178  Cd Length: 118  Bit Score: 46.52  E-value: 2.54e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  181 ERDRVQKKTFTKWVNKhlIKAQRHISDLYEDLRDGHNLISLLEV---------LSGDSLPREKGRMRfhKLQNVQIALDY 251
Cdd:cd21329      2 EGESSEERTFRNWMNS--LGVNPYVNHLYSDLCDALVIFQLYEMtrvpvdwghVNKPPYPALGGNMK--KIENCNYAVEL 77
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1920237946  252 LRHR-QVKLVNIRNDDIADGNPKLTLGLIWTIILHFQIS 289
Cdd:cd21329     78 GKNKaKFSLVGIAGSDLNEGNKTLTLALIWQLMRRYTLN 116
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
1807-2517 2.60e-05

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 50.88  E-value: 2.60e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1807 EQAVRQRELAEQELEKQRQlAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELAR-----LQREAAAATQKRRELEA 1881
Cdd:pfam05483   98 EAELKQKENKLQENRKIIE-AQRKAIQELQFENEKVSLKLEEEIQENKDLIKENNATRhlcnlLKETCARSAEKTKKYEY 176
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1882 ELAKVR---AEMEVLLASKARAEEESRSTSEKSkqRLEAEAgrfrELAEEAARLRALAEEAKRQRQlaeedavrqraEAE 1958
Cdd:pfam05483  177 EREETRqvyMDLNNNIEKMILAFEELRVQAENA--RLEMHF----KLKEDHEKIQHLEEEYKKEIN-----------DKE 239
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1959 RVLAEKLAAISEATRLKTEAEIALKEKEaenERLRRLAEDEAFQRRLLEeQAAQHKADIEARLAQLRKASESELERQKGL 2038
Cdd:pfam05483  240 KQVSLLLIQITEKENKMKDLTFLLEESR---DKANQLEEKTKLQDENLK-ELIEKKDHLTKELEDIKMSLQRSMSTQKAL 315
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2039 VED---TLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEerrrreaeer 2115
Cdd:pfam05483  316 EEDlqiATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITME---------- 385
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2116 VQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQ--ESARQLQLAQEAAQKRLQAEEK-AHAFAVQqkeqeLQ 2192
Cdd:pfam05483  386 LQKKSSELEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQfeKIAEELKGKEQELIFLLQAREKeIHDLEIQ-----LT 460
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2193 QTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLkqsaeeqaqaqaqaqaaaeklrkeaeqea 2272
Cdd:pfam05483  461 AIKTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENKELTQEASDM----------------------------- 511
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2273 arraqaeQAALRQKQAadaEMEKHKQFAEQALRQKAQVEQELTALRLQLE--------ETDHQKSILD---EELQRLKAE 2341
Cdd:pfam05483  512 -------TLELKKHQE---DIINCKKQEERMLKQIENLEEKEMNLRDELEsvreefiqKGDEVKCKLDkseENARSIEYE 581
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2342 VTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRAlvLRDKDSAQRlLQEEAEKMKqVAEEAARLSVAAQEAARLRQ 2421
Cdd:pfam05483  582 VLKKEKQMKILENKCNNLKKQIENKNKNIEELHQENKA--LKKKGSAEN-KQLNAYEIK-VNKLELELASAKQKFEEIID 657
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2422 LAEEDLAQQRALAEKMLKEKMQA---VQEATRLKAEAELLQQQK--------ELAQEQARRLQEDKEQ---MAQQLAQET 2487
Cdd:pfam05483  658 NYQKEIEDKKISEEKLLEEVEKAkaiADEAVKLQKEIDKRCQHKiaemvalmEKHKHQYDKIIEERDSelgLYKNKEQEQ 737
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|.
gi 1920237946 2488 QGFQKTLETER----------QRQLEMS-AEAERLRLRVAE 2517
Cdd:pfam05483  738 SSAKAALEIELsnikaellslKKQLEIEkEEKEKLKMEAKE 778
PspA COG1842
Phage shock protein A [Transcription, Signal transduction mechanisms];
1463-1668 2.62e-05

Phage shock protein A [Transcription, Signal transduction mechanisms];


Pssm-ID: 441447 [Multi-domain]  Cd Length: 217  Bit Score: 48.67  E-value: 2.62e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1463 ISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVAR-REEVAVEAQEQK 1541
Cdd:COG1842     14 INALLDKAEDPEKMLDQAIRDMEEDLVEARQALAQVIANQKRLERQLEELEAEAEKWEEKARLALEKgREDLAREALERK 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1542 RSIQEELQHLRQsseaeiqakarQVEAAERSRLRIEEEIRVVRLQLEATERQRggaegelQALRARAEEAEAQKR----- 1616
Cdd:COG1842     94 AELEAQAEALEA-----------QLAQLEEQVEKLKEALRQLESKLEELKAKK-------DTLKARAKAAKAQEKvneal 155
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1617 ------QAQEEAERLRRQVqDETQRKRQAEAELALR--VQAEAEAAREKQRALQALEELR 1668
Cdd:COG1842    156 sgidsdDATSALERMEEKI-EEMEARAEAAAELAAGdsLDDELAELEADSEVEDELAALK 214
CHASE3 COG5278
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
1761-2179 2.72e-05

Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];


Pssm-ID: 444089 [Multi-domain]  Cd Length: 530  Bit Score: 50.29  E-value: 2.72e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1761 QLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQE 1840
Cdd:COG5278    105 QQARLDELEALIDQWLAELEQVIALRRAGGLEAALALVRSGEGKALMDEIRARLLLLALALAALLLAAAALLLLLLALAA 184
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1841 LIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAG 1920
Cdd:COG5278    185 LLALAELLLLALARALAALLLLLLLEAELAAAAALLAAAAALAALAALELLAALALALALLLAALLLALLAALALAALLA 264
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1921 RFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEA 2000
Cdd:COG5278    265 AALLALAALLLALAAAAALAAAAALELAAAEALALAELELELLLAAAAAAAAAAAAAAAALAALLALALATALAAAAAAL 344
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2001 FQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGT 2080
Cdd:COG5278    345 ALLAALLAEAAAAAAEEAEAAAEAAAAALAGLAEVEAEGAAEAVELEVLAIAAAAAAAAAEAAAAAAAAAAASAAEALEL 424
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2081 AEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESAR 2160
Cdd:COG5278    425 AEALAEALALAEEEALALAAASSELAEAGAALALAAAEALAEELAAVAALAALAAAAAALAEAEAAAALAAAAALSLALA 504
                          410
                   ....*....|....*....
gi 1920237946 2161 QLQLAQEAAQKRLQAEEKA 2179
Cdd:COG5278    505 LAALLLAAAEAALAAALAA 523
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
2283-2733 2.74e-05

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 and 1234 amino acids in length. This family contains a P-loop motif, suggesting it is a nucleotide-binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 50.61  E-value: 2.74e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2283 LRQKQAADAEMEKHKQFA--EQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQ-RGQVEEELFSL 2359
Cdd:pfam12128  227 IRDIQAIAGIMKIRPEFTklQQEFNTLESAELRLSHLHFGYKSDETLIASRQEERQETSAELNQLLRTlDDQWKEKRDEL 306
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2360 R----VQMEELGKLKARIEA-ENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAA---------RLRQLAEE 2425
Cdd:pfam12128  307 NgelsAADAAVAKDRSELEAlEDQHGAFLDADIETAAADQEQLPSWQSELENLEERLKALTGKhqdvtakynRRRSKIKE 386
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2426 DLA------QQRALAEKMLKEKMQAVQEATRLKAEAEL---LQQQKELAQEQARRLQEDKEQMAQQLAQETQGfQKTLET 2496
Cdd:pfam12128  387 QNNrdiagiKDKLAKIREARDRQLAVAEDDLQALESELreqLEAGKLEFNEEEYRLKSRLGELKLRLNQATAT-PELLLQ 465
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2497 ERQRQLEMSAEAERLRLRVAEMSRAQaraeEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQ-RQQSDRDAERLR 2575
Cdd:pfam12128  466 LENFDERIERAREEQEAANAEVERLQ----SELRQARKRRDQASEALRQASRRLEERQSALDELELQlFPQAGTLLHFLR 541
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2576 EAIAELEHEKDKLKQEAQL----LQLKSEEMQTVRQEQLLQETQALQQSflsEKDSLLQRERCIEQEKAKLEQLFQDEVA 2651
Cdd:pfam12128  542 KEAPDWEQSIGKVISPELLhrtdLDPEVWDGSVGGELNLYGVKLDLKRI---DVPEWAASEEELRERLDKAEEALQSARE 618
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2652 KAQALreeqqrqqqqmQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLA----QQQQQQEKLLAEENQRLRERLQHL 2727
Cdd:pfam12128  619 KQAAA-----------EEQLVQANGELEKASREETFARTALKNARLDLRRLFdekqSEKDKKNKALAERKDSANERLNSL 687

                   ....*.
gi 1920237946 2728 EEERRA 2733
Cdd:pfam12128  688 EAQLKQ 693
Caldesmon pfam02029
Caldesmon;
1463-1687 2.80e-05

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 50.25  E-value: 2.80e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1463 ISETLRRMEEEERLAEQQRAEERERLAEVEAAlEKQRQLAEAHAQAKAQAEREAQglqRRMQEEVARREEVavEAQEQKR 1542
Cdd:pfam02029  100 VAERKENNEEEENSSWEKEEKRDSRLGRYKEE-ETEIREKEYQENKWSTEVRQAE---EEGEEEEDKSEEA--EEVPTEN 173
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1543 SIQEELQHLRQSSEAEIQAKARQVEaaERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKrQAQEEA 1622
Cdd:pfam02029  174 FAKEEVKDEKIKKEKKVKYESKVFL--DQKRGHPEVKSQNGEEEVTKLKVTTKRRQGGLSQSQEREEEAEVFL-EAEQKL 250
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 1623 ERLRRQVQD------ETQRKRQAEAELALrvqAEAEAAREKQRAL--------QALEELRLQAEEAERRLRQAEAERAR 1687
Cdd:pfam02029  251 EELRRRRQEkeseefEKLRQKQQEAELEL---EELKKKREERRKLleeeeqrrKQEEAERKLREEEEKRRMKEEIERRR 326
Crescentin pfam19220
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament ...
1434-1736 3.18e-05

Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament proteins, named crescentin, whose cytoskeletal function is required for the vibrioid and helical shapes of Caulobacter crescentus. Without crescentin, the cells adopt a straight-rod morphology. Crescentin has characteristic features of IF proteins including the ability to assemble into filaments in vitro without energy or cofactor requirements. In vivo, crescentin forms a helical structure that colocalizes with the inner cell curvatures beneath the cytoplasmic membrane.


Pssm-ID: 437057 [Multi-domain]  Cd Length: 401  Bit Score: 49.68  E-value: 3.18e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1434 GSESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAaLEKQ--------RQLAEAH 1505
Cdd:pfam19220   63 AYGKLRRELAGLTRRLSAAEGELEELVARLAKLEAALREAEAAKEELRIELRDKTAQAEA-LERQlaaeteqnRALEEEN 141
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1506 AQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRsiqeeLQHLRQSSEAEIQAKARQVE------AAERSRLRIEEe 1579
Cdd:pfam19220  142 KALREEAQAAEKALQRAEGELATARERLALLEQENRR-----LQALSEEQAAELAELTRRLAeletqlDATRARLRALE- 215
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1580 irvVRLQLEATERQRGGAEGELQALRARAEEAEaqkrqaqeeaerLRRQVQDETQRKRQAE---AELALRVQAEAEAARE 1656
Cdd:pfam19220  216 ---GQLAAEQAERERAEAQLEEAVEAHRAERAS------------LRMKLEALTARAAATEqllAEARNQLRDRDEAIRA 280
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1657 KQRalqALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSE--HASFAEKTAQLERTlkEEHVAVVQLRE 1734
Cdd:pfam19220  281 AER---RLKEASIERDTLERRLAGLEADLERRTQQFQEMQRARAELEERAEmlTKALAAKDAALERA--EERIASLSDRI 355

                   ..
gi 1920237946 1735 EA 1736
Cdd:pfam19220  356 AE 357
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1554-1767 3.37e-05

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 49.83  E-value: 3.37e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1554 SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQD-- 1631
Cdd:COG3883     13 FADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGEra 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1632 -ETQRKRQAEAELAL------------RVQAEAEAAREKQRALQALEELRLQAEEAerrlrQAEAERARQVQVALETAQR 1698
Cdd:COG3883     93 rALYRSGGSVSYLDVllgsesfsdfldRLSALSKIADADADLLEELKADKAELEAK-----KAELEAKLAELEALKAELE 167
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 1699 SAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEA 1767
Cdd:COG3883    168 AAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAA 236
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
1773-1949 3.47e-05

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 49.80  E-value: 3.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1773 QAEEVAQQKsltQAEAEKQKEEAEREArrrgKAEEQAVRQRELAEQELEKQRQLAEGTAQQ----RLAAEQELIRLRAET 1848
Cdd:PRK09510    88 QAEELQQKQ---AAEQERLKQLEKERL----AAQEQKKQAEEAAKQAALKQKQAEEAAAKAaaaaKAKAEAEAKRAAAAA 160
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1849 EQGEQQRQLLEEELArlQREAAAATQKRRELEA-----ELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfr 1923
Cdd:PRK09510   161 KKAAAEAKKKAEAEA--AKKAAAEAKKKAEAEAaakaaAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEA---- 234
                          170       180
                   ....*....|....*....|....*.
gi 1920237946 1924 ELAEEAARLRALAEEAKRQRQLAEED 1949
Cdd:PRK09510   235 KAAAEKAAAAKAAEKAAAAKAAAEVD 260
PTZ00491 PTZ00491
major vault protein; Provisional
1513-1707 3.49e-05

major vault protein; Provisional


Pssm-ID: 240439 [Multi-domain]  Cd Length: 850  Bit Score: 50.40  E-value: 3.49e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1513 EREAQGLQRRMQEEVarreEVAVEAQEQKRSIQEELQhlrqsseaEIQAKARqveaAERSRL--RIE-EEIRVVRLQLEA 1589
Cdd:PTZ00491   643 ERTRDSLQKSVQLAI----EITTKSQEAAARHQAELL--------EQEARGR----LERQKMhdKAKaEEQRTKLLELQA 706
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1590 TerqrgGAEGELQAlRARAE-EAEAQKRQAQEEAErlrrqVQDETQRKRqaeaelALRVQAEAEAAREKQRALQALEELR 1668
Cdd:PTZ00491   707 E-----SAAVESSG-QSRAEaLAEAEARLIEAEAE-----VEQAELRAK------ALRIEAEAELEKLRKRQELELEYEQ 769
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1920237946 1669 LQAE---EAERRLRQAEAERARQVQVAL--ETAQRSAEA--ELQSE 1707
Cdd:PTZ00491   770 AQNEleiAKAKELADIEATKFERIVEALgrETLIAIARAgpELQAK 815
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1948-2426 3.53e-05

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 50.15  E-value: 3.53e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1948 EDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLleEQAAQHKADIEARLAQLRKA 2027
Cdd:COG4717     77 EEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQEL--EALEAELAELPERLEELEER 154
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2028 seselerqkglvedtLRQRRQVEEEILALKgsfEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEER 2107
Cdd:COG4717    155 ---------------LEELRELEEELEELE---AELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELE 216
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2108 RRREAEervqKSLAAEEEAARQRKAALEEVERLKakveearRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQK 2187
Cdd:COG4717    217 EAQEEL----EELEEELEQLENELEAAALEERLK-------EARLLLLIAAALLALLGLGGSLLSLILTIAGVLFLVLGL 285
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2188 EQELQQTLQQEQSVLERLRseaeaarraaeeaEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKE 2267
Cdd:COG4717    286 LALLFLLLAREKASLGKEA-------------EELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQEL 352
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2268 AEQEAARRAQAEQAALRQKQAADaeMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQ-KSILDEELQRLKAEVTEAA 2346
Cdd:COG4717    353 LREAEELEEELQLEELEQEIAAL--LAEAGVEDEEELRAALEQAEEYQELKEELEELEEQlEELLGELEELLEALDEEEL 430
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2347 RQR-GQVEEELFSLRVQMEELGKLKARIEAENRAlvLRDKDSAQRLLQEEAE---KMKQVAEEAARLSVAAQEAARLRQL 2422
Cdd:COG4717    431 EEElEELEEELEELEEELEELREELAELEAELEQ--LEEDGELAELLQELEElkaELRELAEEWAALKLALELLEEAREE 508

                   ....
gi 1920237946 2423 AEED 2426
Cdd:COG4717    509 YREE 512
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1618-1912 3.53e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 49.76  E-value: 3.53e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1618 AQEEAERLRRQVQDETQRKRQAEAELAlrvqaeaEAAREKQRALQALEELRLQAEEAERRLRQAEAERArqvqvALETAQ 1697
Cdd:COG4942     18 QADAAAEAEAELEQLQQEIAELEKELA-------ALKKEEKALLKQLAALERRIAALARRIRALEQELA-----ALEAEL 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1698 RSAEAELQSEHASFAEKTAQLERTLKeehvavvqlreeatrraqqqaeaeraraeaerelERWQLKANEALRLRLQAEEV 1777
Cdd:COG4942     86 AELEKEIAELRAELEAQKEELAELLR----------------------------------ALYRLGRQPPLALLLSPEDF 131
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1778 AQqksltqaeaekqKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQL 1857
Cdd:COG4942    132 LD------------AVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKAERQK 199
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1920237946 1858 LEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSK 1912
Cdd:COG4942    200 LLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAAGFAALKGK 254
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
2302-2529 4.01e-05

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 49.44  E-value: 4.01e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2302 QALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKL--KARIEAENRA 2379
Cdd:COG3883     13 FADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEieERREELGERA 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2380 LVLRDKDSAQRLLQE--EAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAEL 2457
Cdd:COG3883     93 RALYRSGGSVSYLDVllGSESFSDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAE 172
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2458 LQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDA 2529
Cdd:COG3883    173 LEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 244
PLEC smart00250
Plectin repeat;
3476-3511 4.24e-05

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 43.24  E-value: 4.24e-05
                            10        20        30
                    ....*....|....*....|....*....|....*.
gi 1920237946  3476 LLQGSGCLAGIYLEDSKEKVTIYEAMRRGLLRPSTA 3511
Cdd:smart00250    3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3476-3514 4.34e-05

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 43.47  E-value: 4.34e-05
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3476 LLQGSGCLAGIYLEDSKEKVTIYEAMRRGLLRPSTATLL 3514
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
PLEC smart00250
Plectin repeat;
4262-4290 4.68e-05

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 43.24  E-value: 4.68e-05
                            10        20
                    ....*....|....*....|....*....
gi 1920237946  4262 VRKRRVVIVDPETGKEMSVYEAYRKGLID 4290
Cdd:smart00250    6 AQSAIGGIIDPETGQKLSVEEALRRGLID 34
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1950-2161 4.83e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 48.99  E-value: 4.83e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1950 AVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIeARLAQLRKASE 2029
Cdd:COG4942     18 QADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAEL-AELEKEIAELR 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2030 SELERQKGLVEDTL----RQRRQVEEEILALKGSFEKAAAGKA-------ELELELGRIRGTAEDTLRSKEQAEQEAARQ 2098
Cdd:COG4942     97 AELEAQKEELAELLralyRLGRQPPLALLLSPEDFLDAVRRLQylkylapARREQAEELRADLAELAALRAELEAERAEL 176
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237946 2099 RQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQ 2161
Cdd:COG4942    177 EALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAA 239
HCR pfam07111
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...
1530-1690 5.00e-05

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.


Pssm-ID: 284517 [Multi-domain]  Cd Length: 749  Bit Score: 49.75  E-value: 5.00e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1530 REEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAE 1609
Cdd:pfam07111  490 RNRLDAELQLSAHLIQQEVGRAREQGEAERQQLSEVAQQLEQELQRAQESLASVGQQLEVARQGQQESTEEAASLRQELT 569
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1610 EAEAQKRQAQEEA-----ERLRRQVQDETQ-----RKRQAEAELALRvQAEAEAAREKQRAlqalEELRLQAEEAerrlR 1679
Cdd:pfam07111  570 QQQEIYGQALQEKvaeveTRLREQLSDTKRrlneaRREQAKAVVSLR-QIQHRATQEKERN----QELRRLQDEA----R 640
                          170
                   ....*....|..
gi 1920237946 1680 QAEAER-ARQVQ 1690
Cdd:pfam07111  641 KEEGQRlARRVQ 652
TolA COG3064
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
1477-1874 5.19e-05

Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442298 [Multi-domain]  Cd Length: 485  Bit Score: 49.27  E-value: 5.19e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1477 AEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEElqhlrqsse 1556
Cdd:COG3064      4 ALEEKAAEAAAQERLEQAEAEKRAAAEAEQKAKEEAEEERLAELEAKRQAEEEAREAKAEAEQRAAELAAE--------- 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1557 aeiqaKARQVEAAERSRLRIEEEIRvvrlqlEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRK 1636
Cdd:COG3064     75 -----AAKKLAEAEKAAAEAEKKAA------AEKAKAAKEAEAAAAAEKAAAAAEKEKAEEAKRKAEEEAKRKAEEERKA 143
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1637 RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTA 1716
Cdd:COG3064    144 AEAEAAAKAEAEAARAAAAAAAAAAAAAARAAAGAAAALVAAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAA 223
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1717 QLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAE 1796
Cdd:COG3064    224 RAAAASREAALAAVEATEEAALGGAEEAADLAAVGVLGAALAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAAL 303
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237946 1797 REARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQ 1874
Cdd:COG3064    304 AAELLGAVAAEEAVLAAAAAAGALVVRGGGAASLEAALSLLAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLA 381
Golgin_A5 pfam09787
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ...
1598-1732 5.21e-05

Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.


Pssm-ID: 462900 [Multi-domain]  Cd Length: 305  Bit Score: 48.60  E-value: 5.21e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1598 EGELQALRARAEEAEAQkrqAQEEAERLRRQVQDetqRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERR 1677
Cdd:pfam09787   67 RGQIQQLRTELQELEAQ---QQEEAESSREQLQE---LEEQLATERSARREAEAELERLQEELRYLEEELRRSKATLQSR 140
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1678 LRQAEAERARQ-VQVALETAQRSAEAELQSE-HA---SFAEKTAQLERTLKEEHVAVVQL 1732
Cdd:pfam09787  141 IKDREAEIEKLrNQLTSKSQSSSSQSELENRlHQlteTLIQKQTMLEALSTEKNSLVLQL 200
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1953-2343 5.61e-05

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 49.74  E-value: 5.61e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1953 QRAEAERVLAEKLAAIsEATRLKTEAEialkEKEAENERLRRLAEDEAFQRRLLEEQAAqhkadIEARLAQLRKASESEL 2032
Cdd:pfam17380  281 QKAVSERQQQEKFEKM-EQERLRQEKE----EKAREVERRRKLEEAEKARQAEMDRQAA-----IYAEQERMAMEREREL 350
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2033 ERQKglVEDTLRQRRQVEEEilalkgsfekaaagkaELELELGRIRGTaeDTLRSKEQAEQEAARQRqlaaeeerrrrea 2112
Cdd:pfam17380  351 ERIR--QEERKRELERIRQE----------------EIAMEISRMREL--ERLQMERQQKNERVRQE------------- 397
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2113 EERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLR-ERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKeqel 2191
Cdd:pfam17380  398 LEAARKVKILEEERQRKIQQQKVEMEQIRAEQEEARQREvRRLEEERAREMERVRLEEQERQQQVERLRQQEEERK---- 473
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2192 qqtlqqeqsvleRLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQE 2271
Cdd:pfam17380  474 ------------RKKLELEKEKRDRKRAEEQRRKILEKELEERKQAMIEEERKRKLLEKEMEERQKAIYEEERRREAEEE 541
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2272 aarraqaeqaalRQKQaadAEMEKHKQFAEQALRqkaqVEQELTALRLQLEETDHQKSILDEELQRLKAEVT 2343
Cdd:pfam17380  542 ------------RRKQ---QEMEERRRIQEQMRK----ATEERSRLEAMEREREMMRQIVESEKARAEYEAT 594
YqiK COG2268
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
2395-2530 5.68e-05

Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];


Pssm-ID: 441869 [Multi-domain]  Cd Length: 439  Bit Score: 49.10  E-value: 5.68e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2395 EAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEAtrlKAEAELLQQQKELAQEQARRLQE 2474
Cdd:COG2268    196 EIIRDARIAEAEAERETEIAIAQANREAEEAELEQEREIETARIAEAEAELAKK---KAEERREAETARAEAEAAYEIAE 272
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 2475 DKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDAR 2530
Cdd:COG2268    273 ANAEREVQRQLEIAEREREIELQEKEAEREEAELEADVRKPAEAEKQAAEAEAEAE 328
CH_PARVA_B_rpt2 cd21306
second calponin homology (CH) domain found in the alpha/beta parvin subfamily; The alpha/beta ...
185-286 5.87e-05

second calponin homology (CH) domain found in the alpha/beta parvin subfamily; The alpha/beta parvin subfamily includes alpha-parvin and beta-parvin. Alpha-parvin, also called actopaxin, calponin-like integrin-linked kinase-binding protein (CH-ILKBP), or matrix-remodeling-associated protein 2, plays a role in sarcomere organization and in smooth muscle cell contraction. It is required for normal development of the embryonic cardiovascular system, and for normal septation of the heart outflow tract. Beta-parvin, also called affixin, is an adapter protein that plays a role in integrin signaling via ILK and in activation of the GTPases Cdc42 and Rac1 by guanine exchange factors, such as ARHGEF6. Both alpha-parvin and beta-parvin are involved in the reorganization of the actin cytoskeleton and the formation of lamellipodia, and both play roles in cell adhesion, cell spreading, establishment or maintenance of cell polarity, and cell migration. Members of this subfamily contain two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409155  Cd Length: 121  Bit Score: 45.49  E-value: 5.87e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  185 VQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMRF----HKLQNVQIALDYLRHRQVKLV 260
Cdd:cd21306     16 VVKKSLITFVNKHLNKLNLEVTDLDTQFHDGVYLVLLMGLLEGYFVPLHSFHLTPtsfeQKVHNVQFAFELMQDAGLPKP 95
                           90       100
                   ....*....|....*....|....*.
gi 1920237946  261 NIRNDDIADGNPKLTLGLIWTIILHF 286
Cdd:cd21306     96 KARPEDIVNLDLKSTLRVLYNLFTKY 121
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involved in recombination, ...
1211-1688 5.88e-05

rad50; All proteins in this family for which functions are known are involved in recombination, recombinational repair, and/or non-homologous end joining. They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 49.66  E-value: 5.88e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1211 LEKLKTISLVIRSTQEA-EEVLRAHEEQLKEAQAVPATLPELEATKAALKKL---RAQAEAQQPVFDALRDELRGA---- 1282
Cdd:TIGR00606  600 LASLEQNKNHINNELESkEEQLSSYEDKLFDVCGSQDEESDLERLKEEIEKSskqRAMLAGATAVYSQFITQLTDEnqsc 679
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1283 --------------QEVGERLQQRHGERDVEVERWRERVTLLLERWQAVLAQTDVRQRELEQLGRQLRYYRESADPLGAW 1348
Cdd:TIGR00606  680 cpvcqrvfqteaelQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRD 759
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1349 LRDAKQRQEQIQAVplanSQAVREQLRQEKALLED---IERHGEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVASP 1425
Cdd:TIGR00606  760 IQRLKNDIEEQETL----LGTIMPEEESAKVCLTDvtiMERFQMELKDVERKIAQQAAKLQGSDLDRTVQQVNQEKQEKQ 835
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1426 AKKPKVQSGSE---SIIQEYVD-LRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQR---------AEERERLAEVE 1492
Cdd:TIGR00606  836 HELDTVVSKIElnrKLIQDQQEqIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTevqslireiKDAKEQDSPLE 915
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1493 AALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVarrEEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERS 1572
Cdd:TIGR00606  916 TFLEKDQQEKEELISSKETSNKKAQDKVNDIKEKV---KNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLEECEKH 992
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1573 RLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALrvqaeae 1652
Cdd:TIGR00606  993 QEKINEDMRLMRQDIDTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQMQVLQMKQEHQKLEENIDL------- 1065
                          490       500       510
                   ....*....|....*....|....*....|....*.
gi 1920237946 1653 AAREKQRALQALEELRLQAEEAERRLRQAEAERARQ 1688
Cdd:TIGR00606 1066 IKRNHVLALGRQKGYEKEIKHFKKELREPQFRDAEE 1101
PLEC smart00250
Plectin repeat;
3257-3293 5.98e-05

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 42.85  E-value: 5.98e-05
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  3257 LRLLDAQLSTGGIVDPSKSHRVPLDVACARGYLDKET 3293
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
COG3899 COG3899
Predicted ATPase [General function prediction only];
1827-2323 6.07e-05

Predicted ATPase [General function prediction only];


Pssm-ID: 443106 [Multi-domain]  Cd Length: 1244  Bit Score: 49.47  E-value: 6.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1827 AEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV----------RAEMEVLLAS 1896
Cdd:COG3899    737 PDPEEEYRLALLLELAEALYLAGRFEEAEALLERALAARALAALAALRHGNPPASARAYAnlgllllgdyEEAYEFGELA 816
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1897 KARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKT 1976
Cdd:COG3899    817 LALAERLGDRRLEARALFNLGFILHWLGPLREALELLREALEAGLETGDAALALLALAAAAAAAAAAAALAAAAAAAARL 896
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1977 EAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILAL 2056
Cdd:COG3899    897 LAAAAAALAAAAAAAALAAAELARLAAAAAAAAALALAAAAAAAAAAALAAAAAAAALAAALALAAAAAAAAAAALAAAA 976
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2057 KGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEE 2136
Cdd:COG3899    977 AAAAAAAAAAAAAALEAAAAALLALLAAAAAAAAAAAALAAALLAAALAALAAAAAAAALLAAAAALALLAALAAAAAAA 1056
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2137 VERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAA 2216
Cdd:COG3899   1057 AAAAALAAAAALLAAAAAAAAAAAAAAAAAALAAALAAAALAAAAAAALALAAALAALALAAALAALALAAAARAAAALL 1136
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2217 EEAEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKH 2296
Cdd:COG3899   1137 LLAAALALALAALLLLAALLLALALLLLALAALALAAALAALAAALLAAAAAAAAAAALLAALLALAARLAALLALALLA 1216
                          490       500
                   ....*....|....*....|....*..
gi 1920237946 2297 KQFAEQALRQKAQVEQELTALRLQLEE 2323
Cdd:COG3899   1217 LEAAALLLLLLLAALALAAALLALRLL 1243
CH_jitterbug-like_rpt3 cd21185
third calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and ...
323-400 6.25e-05

third calponin homology (CH) domain found in Drosophila melanogaster protein jitterbug and similar proteins; Protein jitterbug (Jbug) is an actin-meshwork organizing protein. It is required to maintain the shape and cell orientation of the Drosophila notum epithelium during flight muscle attachment to tendon cells. Jbug contains three copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409034  Cd Length: 98  Bit Score: 44.60  E-value: 6.25e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237946  323 DNFTTSWRDGRLFNAIIHRHKPTLIDMNKVYRQTNLENLDQAFSVAERdLGVTRLLDPEDVDVPQPDEKSIITYVSSL 400
Cdd:cd21185     20 NNFTTDWNDGRLLCGLVNALGGSVPGWPNLDPEESENNIQRGLEAGKS-LGVEPVLTAEEMADPEVEHLGIMAYAAQL 96
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
2300-2689 7.08e-05

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 49.00  E-value: 7.08e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2300 AEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVT--EAARQRGQVEEELFSLRVQMEELGKLKARIEAEN 2377
Cdd:COG4717     76 LEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEklEKLLQLLPLYQELEALEAELAELPERLEELEERL 155
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2378 RALVlRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKekmQAVQEATRLKAEAEL 2457
Cdd:COG4717    156 EELR-ELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELE---EAQEELEELEEELEQ 231
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2458 LQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLET----------------------ERQRQLEMSAEAERLRLRV 2515
Cdd:COG4717    232 LENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSliltiagvlflvlgllallfllLAREKASLGKEAEELQALP 311
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2516 AEMSRAQARAEEDARRFRKQAEDIGER---LYRTELATQEKVMLVQTLETQRQQSDRDAER---LREAIAELEHEKDKLK 2589
Cdd:COG4717    312 ALEELEEEELEELLAALGLPPDLSPEElleLLDRIEELQELLREAEELEEELQLEELEQEIaalLAEAGVEDEEELRAAL 391
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2590 QEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQ 2669
Cdd:COG4717    392 EQAEEYQELKEELEELEEQLEELLGELEELLEALDEEELEEELEELEEELEELEEELEELREELAELEAELEQLEEDGEL 471
                          410       420
                   ....*....|....*....|
gi 1920237946 2670 EKQQLAASMEEARRRQHEAE 2689
Cdd:COG4717    472 AELLQELEELKAELRELAEE 491
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
1546-1920 7.47e-05

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 48.74  E-value: 7.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1546 EELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEAT----ERQRGGAEGELQALRARAEEAEAQKRQAQEE 1621
Cdd:pfam07888   30 ELLQNRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWERQrrelESRVAELKEELRQSREKHEELEEKYKELSAS 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1622 AERL---RRQVQDETQRKRQAEAELALRVQAEAEAAREKQralQALEELRLQAEEAERRLRQAEAERaRQVQVALETAQ- 1697
Cdd:pfam07888  110 SEELseeKDALLAQRAAHEARIRELEEDIKTLTQRVLERE---TELERMKERAKKAGAQRKEEEAER-KQLQAKLQQTEe 185
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1698 --RSAEAELQSEHASFAEKTAQLERtLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKAnEALRLRLqaE 1775
Cdd:pfam07888  186 elRSLSKEFQELRNSLAQRDTQVLQ-LQDTITTLTQKLTTAHRKEAENEALLEELRSLQERLNASERKV-EGLGEEL--S 261
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1776 EVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQR 1855
Cdd:pfam07888  262 SMAAQRDRTQAELHQARLQAAQLTLQLADASLALREGRARWAQERETLQQSAEADKDRIEKLSAELQRLEERLQEERMER 341
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237946 1856 QLLEEELArlqREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSE---KSKQRLEAEAG 1920
Cdd:pfam07888  342 EKLEVELG---REKDCNRVQLSESRRELQELKASLRVAQKEKEQLQAEKQELLEyirQLEQRLETVAD 406
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
1611-1736 8.04e-05

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 48.65  E-value: 8.04e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1611 AEAQKRQAQEEAERLRRQVQDETQRKRQAEaELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLR----QAEAERA 1686
Cdd:PRK09510    61 VEQYNRQQQQQKSAKRAEEQRKKKEQQQAE-ELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAAlkqkQAEEAAA 139
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1687 RQVQVA----------LETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEA 1736
Cdd:PRK09510   140 KAAAAAkakaeaeakrAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEA 199
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
2124-2737 8.46e-05

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 49.14  E-value: 8.46e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2124 EEAARQRKAALEEVERLKAKVEEARRlreraeqeSARQLQLAQEAAQKRLQAEEKAhafavqqkeqelqqtlqqeqSVLE 2203
Cdd:COG4913    224 FEAADALVEHFDDLERAHEALEDARE--------QIELLEPIRELAERYAAARERL--------------------AELE 275
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2204 RLRSEAEAARRAAEEAEAARERAEREAAQSRRQvEEAERLKQsaeeqaqaqaqaqaaaeKLRKEAEQEAARRAQAEQAAL 2283
Cdd:COG4913    276 YLRAALRLWFAQRRLELLEAELEELRAELARLE-AELERLEA-----------------RLDALREELDELEAQIRGNGG 337
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2284 RQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETdhqKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQM 2363
Cdd:COG4913    338 DRLEQLEREIERLERELEERERRRARLEALLAALGLPLPAS---AEEFAALRAEAAALLEALEEELEALEEALAEAEAAL 414
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2364 EELGKLKARIEAENRALVLRDKDSAQRLLQeeaekMKQVAEEAARLS-VAAQEAARLRQLAEEDLAQQRAlAEKML---- 2438
Cdd:COG4913    415 RDLRRELRELEAEIASLERRKSNIPARLLA-----LRDALAEALGLDeAELPFVGELIEVRPEEERWRGA-IERVLggfa 488
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2439 ------KEKMQAVQEAT-RLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETE--RQRQLEMSAEAE 2509
Cdd:COG4913    489 ltllvpPEHYAAALRWVnRLHLRGRLVYERVRTGLPDPERPRLDPDSLAGKLDFKPHPFRAWLEAElgRRFDYVCVDSPE 568
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2510 RLRLRVAEMSRA-QARAeeDARRFRKQAEDIGERLYRTELATQEKvmlVQTLETQRQQSDRDAERLREAIAELEHEKDKL 2588
Cdd:COG4913    569 ELRRHPRAITRAgQVKG--NGTRHEKDDRRRIRSRYVLGFDNRAK---LAALEAELAELEEELAEAEERLEALEAELDAL 643
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2589 KQEAQLLQ-LKSEEMQTVRQEQLLQETQALQQsflsekdsllQRERcIEQEKAKLEQLfQDEVAKAQALREEQQRQQQQM 2667
Cdd:COG4913    644 QERREALQrLAEYSWDEIDVASAEREIAELEA----------ELER-LDASSDDLAAL-EEQLEELEAELEELEEELDEL 711
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237946 2668 QQEKQQLAASMEEARRRQHEAEEGVRRqQEELQRLAQQQQQQEKLLAEENQRLRERL-QHLEEERRAALAR 2737
Cdd:COG4913    712 KGEIGRLEKELEQAEEELDELQDRLEA-AEDLARLELRALLEERFAAALGDAVERELrENLEERIDALRAR 781
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
2306-2549 8.53e-05

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely on conserved operon structure. The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 48.30  E-value: 8.53e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2306 QKAQVEQELTALRLQleetdhQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDK 2385
Cdd:TIGR02794   44 DPGAVAQQANRIQQQ------KKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEK 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2386 DSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARlrQLAEEDLAQQRALAEKMLKE-KMQAVQEA-----TRLKAEAELLQ 2459
Cdd:TIGR02794  118 QKQAEEAKAKQAAEAKAKAEAEAERKAKEEAAK--QAEEEAKAKAAAEAKKKAEEaKKKAEAEAkakaeAEAKAKAEEAK 195
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2460 QQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQaedI 2539
Cdd:TIGR02794  196 AKAEAAKAKAAAEAAAKAEAEAAAAAAAEAERKADEAELGDIFGLASGSNAEKQGGARGAAAGSEVDKYAAIIQQA---I 272
                          250
                   ....*....|
gi 1920237946 2540 GERLYRTELA 2549
Cdd:TIGR02794  273 QQNLYDDPSF 282
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1838-1988 8.64e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 47.23  E-value: 8.64e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1838 EQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRS-TSEKSKQRLE 1916
Cdd:COG1579     16 DSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNvRNNKEYEALQ 95
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 1917 AE----AGRFRELAEEAARLRALAEEAKRQRQLAEEdavrQRAEAERVLAEKLAAISEATRlKTEAEIALKEKEAE 1988
Cdd:COG1579     96 KEieslKRRISDLEDEILELMERIEELEEELAELEA----ELAELEAELEEKKAELDEELA-ELEAELEELEAERE 166
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
1075-1736 9.07e-05

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 49.20  E-value: 9.07e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1075 EDRLQAEREYGSCSRHYQQLLQSLEQGEQEESRCQRCISELKDIRLQLEACETRTVHRLRLPLDKEPARECAQRITEQQK 1154
Cdd:pfam02463  293 KEEEELKSELLKLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLE 372
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1155 AQAEVDGLGKGVARLSAEAEKVLALPEPSPAAPTLRSELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAH 1234
Cdd:pfam02463  373 EELLAKKKLESERLSSAAKLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEE 452
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1235 EEQLKEAQAVPATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGAQE-------VGERLQQRHGERDVEVERWRERV 1307
Cdd:pfam02463  453 LEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKEskarsglKVLLALIKDGVGGRIISAHGRLG 532
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1308 TLLLERWQAVLAQTDVRQRELEQLG-----RQLRYYRESADPLGAWLRDAKQRQEQIQAVPLA---NSQAVREQLRQEKA 1379
Cdd:pfam02463  533 DLGVAVENYKVAISTAVIVEVSATAdeveeRQKLVRALTELPLGARKLRLLIPKLKLPLKSIAvleIDPILNLAQLDKAT 612
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1380 LLEDIERHGEKV----EECQRFAKQYINAiKDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTL 1455
Cdd:pfam02463  613 LEADEDDKRAKVvegiLKDTELTKLKESA-KAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAK 691
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1456 TSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGL-QRRMQEEVARREEVA 1534
Cdd:pfam02463  692 EEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSrLKKEEKEEEKSELSL 771
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1535 VEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRvvrLQLEATERQRGGAEGELQALRARAEEAEAQ 1614
Cdd:pfam02463  772 KEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLE---EEQLLIEQEEKIKEEELEELALELKEEQKL 848
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1615 KRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALE 1694
Cdd:pfam02463  849 EKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAE 928
                          650       660       670       680
                   ....*....|....*....|....*....|....*....|..
gi 1920237946 1695 TAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEA 1736
Cdd:pfam02463  929 ILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLLLAKE 970
PRK01156 PRK01156
chromosome segregation protein; Provisional
2304-2744 9.11e-05

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 49.13  E-value: 9.11e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2304 LRQKAQVEQ-ELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIeaenralvl 2382
Cdd:PRK01156   188 LEEKLKSSNlELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRYESEI--------- 258
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2383 RDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQK 2462
Cdd:PRK01156   259 KTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILSNIDAEINKYHAIIKKLSVLQKDY 338
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2463 ELAQEQARRLQEDKEQMAQQLAQET--QGFQKTLETERQRQLEMSAEAERLRlrvAEMSRAQARAEEDARRFRKQAEDIG 2540
Cdd:PRK01156   339 NDYIKKKSRYDDLNNQILELEGYEMdyNSYLKSIESLKKKIEEYSKNIERMS---AFISEILKIQEIDPDAIKKELNEIN 415
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2541 ERL--YRTELAT--QEKVMLVQTLETQRQQSD----------------------------RDAERLREAIAELEHEKDKL 2588
Cdd:PRK01156   416 VKLqdISSKVSSlnQRIRALRENLDELSRNMEmlngqsvcpvcgttlgeeksnhiinhynEKKSRLEEKIREIEIEVKDI 495
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2589 KQEAQLLQ-----LKSEEM-QTVRQEQLLQETQALQQSFLSE----KDSLLQRERCIEQEKA-KLEQLFQDEVAKAQALR 2657
Cdd:PRK01156   496 DEKIVDLKkrkeyLESEEInKSINEYNKIESARADLEDIKIKinelKDKHDKYEEIKNRYKSlKLEDLDSKRTSWLNALA 575
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2658 EEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEG-----------VRRQQEELQRLaqqqqQQEKLLAEENQRLRERLQH 2726
Cdd:PRK01156   576 VISLIDIETNRSRSNEIKKQLNDLESRLQEIEIGfpddksyidksIREIENEANNL-----NNKYNEIQENKILIEKLRG 650
                          490
                   ....*....|....*...
gi 1920237946 2727 LEEERRAALARSEEIAPS 2744
Cdd:PRK01156   651 KIDNYKKQIAEIDSIIPD 668
GBP_C cd16269
Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal ...
2393-2491 9.15e-05

Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal domain. Guanylate-binding proteins (GBPs) are synthesized after activation of the cell by interferons. The biochemical properties of GBPs are clearly different from those of Ras-like and heterotrimeric GTP-binding proteins. They bind guanine nucleotides with low affinity (micromolar range), are stable in their absence, and have a high turnover GTPase. In addition to binding GDP/GTP, they have the unique ability to bind GMP with equal affinity and hydrolyze GTP not only to GDP, but also to GMP. This C-terminal domain has been shown to mediate inhibition of endothelial cell proliferation by inflammatory cytokines.


Pssm-ID: 293879 [Multi-domain]  Cd Length: 291  Bit Score: 47.96  E-value: 9.15e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2393 QEEAEKMKQVAEEAARLSVAAQEAARL----RQLAEEDLAQQRALAE--KMLKEKMQavQEATRLKAEAELLQQQKElaQ 2466
Cdd:cd16269    191 QALTEKEKEIEAERAKAEAAEQERKLLeeqqRELEQKLEDQERSYEEhlRQLKEKME--EERENLLKEQERALESKL--K 266
                           90       100
                   ....*....|....*....|....*
gi 1920237946 2467 EQARRLQEDKEQMAQQLAQETQGFQ 2491
Cdd:cd16269    267 EQEALLEEGFKEQAELLQEEIRSLK 291
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
2300-2734 9.17e-05

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 49.18  E-value: 9.17e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2300 AEQALRQKAQVEQELTALRLQL-------EETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLK-A 2371
Cdd:COG3096    524 LEQRLRQQQNAERLLEEFCQRIgqqldaaEELEELLAELEAQLEELEEQAAEAVEQRSELRQQLEQLRARIKELAARApA 603
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2372 RIEAENRALVLRD------KDSA------QRLLQEE----------AEKMKQVAEEAARLSVAA-QEAARLRQLAE---- 2424
Cdd:COG3096    604 WLAAQDALERLREqsgealADSQevtaamQQLLEREreatverdelAARKQALESQIERLSQPGgAEDPRLLALAErlgg 683
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2425 -------EDLAQQRA-LAEKMLKEKMQA--VQEATRLKAEAE----------LLQQQKELAQEQARRLQEDKEQMAQQLA 2484
Cdd:COG3096    684 vllseiyDDVTLEDApYFSALYGPARHAivVPDLSAVKEQLAgledcpedlyLIEGDPDSFDDSVFDAEELEDAVVVKLS 763
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2485 QETQGFQKTLE------TERQRQLE-MSAEAERLRLRVAEMS---RAQARAEEDARRFrkqaedIGERLYRTELATQEKV 2554
Cdd:COG3096    764 DRQWRYSRFPEvplfgrAAREKRLEeLRAERDELAEQYAKASfdvQKLQRLHQAFSQF------VGGHLAVAFAPDPEAE 837
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2555 MlvQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQsflsEKDSLLQRERC 2634
Cdd:COG3096    838 L--AALRQRRSELERELAQHRAQEQQLRQQLDQLKEQLQLLNKLLPQANLLADETLADRLEELRE----ELDAAQEAQAF 911
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2635 IEQEKAKLEQLfqdeVAKAQALReeqqrqqqQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLA 2714
Cdd:COG3096    912 IQQHGKALAQL----EPLVAVLQ--------SDPEQFEQLQADYLQAKEQQRRLKQQIFALSEVVQRRPHFSYEDAVGLL 979
                          490       500
                   ....*....|....*....|....
gi 1920237946 2715 EENQ----RLRERLQHLEEERRAA 2734
Cdd:COG3096    980 GENSdlneKLRARLEQAEEARREA 1003
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
2125-2430 9.64e-05

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 48.97  E-value: 9.64e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2125 EAARQRKAALEEVERLKAKVEEARRLRER----AEQESARQLQLAQEAA----QKRLQAEEKAHAFAVQQKEQELQqtlq 2196
Cdd:pfam17380  286 ERQQQEKFEKMEQERLRQEKEEKAREVERrrklEEAEKARQAEMDRQAAiyaeQERMAMERERELERIRQEERKRE---- 361
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2197 qeqsvLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAERlKQSAEEQAQAQAQAQAAAEKLRKEAEQEAArra 2276
Cdd:pfam17380  362 -----LERIRQEEIAMEISRMRELERLQMERQQKNERVRQELEAAR-KVKILEEERQRKIQQQKVEMEQIRAEQEEA--- 432
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2277 qaeqaalRQKQAADAEMEKHKQFaEQALRQKAQVEQELTALRLQLEETDHQKSILD---------EELQRLKAEVTEAAR 2347
Cdd:pfam17380  433 -------RQREVRRLEEERAREM-ERVRLEEQERQQQVERLRQQEEERKRKKLELEkekrdrkraEEQRRKILEKELEER 504
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2348 QRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEE--AEKMKQVAEEAARLSVAAQEAARLRQLAEE 2425
Cdd:pfam17380  505 KQAMIEEERKRKLLEKEMEERQKAIYEEERRREAEEERRKQQEMEERRriQEQMRKATEERSRLEAMEREREMMRQIVES 584

                   ....*
gi 1920237946 2426 DLAQQ 2430
Cdd:pfam17380  585 EKARA 589
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
1436-1780 9.77e-05

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 48.61  E-value: 9.77e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1436 ESIIQEYVDLRTRYSELST---LTSQYIRFISETLRRMEEEERLAE-QQRAEE-RERLAEVEAALEKQRQLAEAHAQAKA 1510
Cdd:COG4717     98 EELEEELEELEAELEELREeleKLEKLLQLLPLYQELEALEAELAElPERLEElEERLEELRELEEELEELEAELAELQE 177
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1511 QAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQsseaEIQAKARQVEAAERSRLRIEEEIRVVRLQ---- 1586
Cdd:COG4717    178 ELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQE----ELEELEEELEQLENELEAAALEERLKEARllll 253
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1587 --------------LEATERQRGGA----EGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQ 1648
Cdd:COG4717    254 iaaallallglggsLLSLILTIAGVlflvLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPD 333
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1649 AEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQV----------------------QVALETAQRSAEAELQS 1706
Cdd:COG4717    334 LSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAAllaeagvedeeelraaleqaeeYQELKEELEELEEQLEE 413
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 1707 -----EHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKAnEALRLRLQAEEVAQQ 1780
Cdd:COG4717    414 llgelEELLEALDEEELEEELEELEEELEELEEELEELREELAELEAELEQLEEDGELAELLQ-ELEELKAELRELAEE 491
PRK07735 PRK07735
NADH-quinone oxidoreductase subunit C;
1473-1702 1.04e-04

NADH-quinone oxidoreductase subunit C;


Pssm-ID: 236081 [Multi-domain]  Cd Length: 430  Bit Score: 48.44  E-value: 1.04e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1473 EERLAEQQRAEERERLAEVEAAlEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELqhlr 1552
Cdd:PRK07735    11 KKEAARRAKEEARKRLVAKHGA-EISKLEEENREKEKALPKNDDMTIEEAKRRAAAAAKAKAAALAKQKREGTEEV---- 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1553 qsSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDE 1632
Cdd:PRK07735    86 --TEEEKAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKE 163
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1633 TQRKRQAEAELALrvqaeaEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEA 1702
Cdd:PRK07735   164 KAKAKAAAAAKAK------AAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKASQGN 227
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1807-2002 1.06e-04

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 48.47  E-value: 1.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1807 EQAVRQRELAEQELEKQRQLAEgTAQQRLAA---EQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAEL 1883
Cdd:COG3206    171 EEARKALEFLEEQLPELRKELE-EAEAALEEfrqKNGLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQL 249
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1884 AKVRAEMEVLLASKA--------------RAEEESRSTSEKSK-QRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEE 1948
Cdd:COG3206    250 GSGPDALPELLQSPViqqlraqlaeleaeLAELSARYTPNHPDvIALRAQIAALRAQLQQEAQRILASLEAELEALQARE 329
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1920237946 1949 DAVRQRAEAERVLAEKLAAIS-EATRLKTEAEIALKEKEAENERLRRLAEDEAFQ 2002
Cdd:COG3206    330 ASLQAQLAQLEARLAELPELEaELRRLEREVEVARELYESLLQRLEEARLAEALT 384
hsdR PRK11448
type I restriction enzyme EcoKI subunit R; Provisional
1609-1688 1.10e-04

type I restriction enzyme EcoKI subunit R; Provisional


Pssm-ID: 236912 [Multi-domain]  Cd Length: 1123  Bit Score: 48.79  E-value: 1.10e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1609 EEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAE------AEAAREKQRALQA-LEELRLQAEEAERRLRQA 1681
Cdd:PRK11448   138 EDPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELvaleglAAELEEKQQELEAqLEQLQEKAAETSQERKQK 217

                   ....*..
gi 1920237946 1682 EAERARQ 1688
Cdd:PRK11448   218 RKEITDQ 224
KpsE COG3524
Capsule polysaccharide export protein KpsE/RkpR [Cell wall/membrane/envelope biogenesis];
2299-2465 1.17e-04

Capsule polysaccharide export protein KpsE/RkpR [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442746 [Multi-domain]  Cd Length: 370  Bit Score: 47.92  E-value: 1.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2299 FAEQALrqkAQVEQELTALRLQLEETDHQKSILDEElqrlkAEVTEAARQRGQVEEELFSLRVQMEELgklkarieaenR 2378
Cdd:COG3524    181 FAEEEV---ERAEERLRDAREALLAFRNRNGILDPE-----ATAEALLQLIATLEGQLAELEAELAAL-----------R 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2379 AlVLRDKDSAQRLLQEEAEKM-KQVAEEAARLSVAAQEAARLRQLAE-EDLAQQRALAEKMLKEKMQAVQEAtrlKAEAE 2456
Cdd:COG3524    242 S-YLSPNSPQVRQLRRRIAALeKQIAAERARLTGASGGDSLASLLAEyERLELEREFAEKAYTSALAALEQA---RIEAA 317

                   ....*....
gi 1920237946 2457 llQQQKELA 2465
Cdd:COG3524    318 --RQQRYLA 324
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
1436-1647 1.18e-04

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 48.47  E-value: 1.18e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1436 ESIIQEYVDLRTRYSelSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERL------AEVEAALEKQRQLAEAHAQAK 1509
Cdd:COG3206    155 NALAEAYLEQNLELR--REEARKALEFLEEQLPELRKELEEAEAALEEFRQKNglvdlsEEAKLLLQQLSELESQLAEAR 232
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1510 AQAeREAQGLQRRMQEEVARREEVAVEA-------------QEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRI 1576
Cdd:COG3206    233 AEL-AEAEARLAALRAQLGSGPDALPELlqspviqqlraqlAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQQEA 311
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 1577 EEEIRVVRLQLEATERQRGGAEGELQALRARAE---EAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRV 1647
Cdd:COG3206    312 QRILASLEAELEALQAREASLQAQLAQLEARLAelpELEAELRRLEREVEVARELYESLLQRLEEARLAEALTV 385
PksD COG3321
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ...
2000-2525 1.18e-04

Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 442550 [Multi-domain]  Cd Length: 1386  Bit Score: 48.72  E-value: 1.18e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2000 AFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEkAAAGKAELELELGRIRG 2079
Cdd:COG3321    867 PFQREDAAAALLAAALAAALAAAAALGALLLAALAAALAAALLALAAAAAAALALAAAALAA-LLALVALAAAAAALLAL 945
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2080 TAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESA 2159
Cdd:COG3321    946 AAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAALALLAAAALLLAAAAAAAALLALA 1025
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2160 RQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEE 2239
Cdd:COG3321   1026 ALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELALAAAALALAAALAAAALALALAALAAA 1105
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2240 AERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRL 2319
Cdd:COG3321   1106 LLLLALLAALALAAAAAALLALAALLAAAAAAAALAAAAAAAAALALAAAAAALAAALAAALLAAAALLLALALALAAAL 1185
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2320 QLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEElgklkARIEAENRALVLRDKDSAQRLLQEEAEKM 2399
Cdd:COG3321   1186 AAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAA-----ALALLALAAAAAAVAALAAAAAALLAALA 1260
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2400 KQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQM 2479
Cdd:COG3321   1261 ALALLAAAAGLAALAAAAAAAAAALALAAAAAAAAAALAALLAAAAAAAAAAAAAAAAAALAAALLAAALAALAAAVAAA 1340
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*.
gi 1920237946 2480 AQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARA 2525
Cdd:COG3321   1341 LALAAAAAAAAAAAAAAAAAAALAAAAGAAAAAAALALAALAAAVA 1386
PRK05035 PRK05035
electron transport complex protein RnfC; Provisional
1637-1921 1.18e-04

electron transport complex protein RnfC; Provisional


Pssm-ID: 235334 [Multi-domain]  Cd Length: 695  Bit Score: 48.41  E-value: 1.18e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1637 RQAEAELALRVQAEAEAAREKQR--ALQAleelRLQAEEAER--RLRQAEAERARQVQVALETAQRSAEAELQSEHASFA 1712
Cdd:PRK05035   432 RQAKAEIRAIEQEKKKAEEAKARfeARQA----RLEREKAAReaRHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIV 507
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1713 EKTAQlertlKEEHVAVVQLREEATRRAQQQAEAERARAEAERelerwQLKANEALRLRLQAEEVAQQKSLTqaeAEKQK 1792
Cdd:PRK05035   508 IKAGA-----RPDNSAVIAAREARKAQARARQAEKQAAAAADP-----KKAAVAAAIARAKAKKAAQQAANA---EAEEE 574
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1793 EEAEREARRRGKAEEQAvrqRELAEQELEKQRQLAEGTAQQRLAAEQELIrLRAETEQGEQQRQLLEEElarlqreaaAA 1872
Cdd:PRK05035   575 VDPKKAAVAAAIARAKA---KKAAQQAASAEPEEQVAEVDPKKAAVAAAI-ARAKAKKAEQQANAEPEE---------PV 641
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*....
gi 1920237946 1873 TQKRRELEAELAKVRAEmevlLASKARAEEESRSTSEKSKQRLEAEAGR 1921
Cdd:PRK05035   642 DPRKAAVAAAIARAKAR----KAAQQQANAEPEEAEDPKKAAVAAAIAR 686
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
1920-2166 1.22e-04

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 48.35  E-value: 1.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1920 GRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDE 1999
Cdd:pfam07888   34 NRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWERQRRELESRVAELKEELRQSREKHEELEEKYKELSASSEEL 113
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2000 AFQRRLLEEQAAQHKADIE------ARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELE 2073
Cdd:pfam07888  114 SEEKDALLAQRAAHEARIReleediKTLTQRVLERETELERMKERAKKAGAQRKEEEAERKQLQAKLQQTEEELRSLSKE 193
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2074 LGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAeervqKSLAAEEEAARQRKAALEE-VERLKAKVEEARRLRE 2152
Cdd:pfam07888  194 FQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKEAEN-----EALLEELRSLQERLNASERkVEGLGEELSSMAAQRD 268
                          250
                   ....*....|....*
gi 1920237946 2153 RAEQESAR-QLQLAQ 2166
Cdd:pfam07888  269 RTQAELHQaRLQAAQ 283
CH_PLS_rpt1 cd21292
first calponin homology (CH) domain found in the plastin family; The plastin family includes ...
186-283 1.24e-04

first calponin homology (CH) domain found in the plastin family; The plastin family includes plastin-1, -2, and -3, which are all actin-bundling proteins. Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. Plastin-2, also called L-plastin, LC64P, or lymphocyte cytosolic protein 1 (LCP-1), plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-3, also called T-plastin, is found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Members of this family contain four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409141  Cd Length: 145  Bit Score: 45.35  E-value: 1.24e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  186 QKKTFTKWVN---------KHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPR----EKGRMRFHKLQNVQIALDYL 252
Cdd:cd21292     25 EKVAFVNWINknlgddpdcKHLLPMDPNTDDLFEKVKDGILLCKMINLSVPDTIDErainKKKLTVFTIHENLTLALNSA 104
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1920237946  253 RHRQVKLVNIRNDDIADGNPKLTLGLIWTII 283
Cdd:cd21292    105 SAIGCNVVNIGAEDLKEGKPHLVLGLLWQII 135
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixture of two different families of ...
1804-1997 1.28e-04

Trichohyalin-plectin-homology domain; This family is a mixture of two different families of eukaryotic proteins. Trichoplein, or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 47.61  E-value: 1.28e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1804 KAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAEL 1883
Cdd:pfam13868  149 EEREEDERILEYLKEKAEREEEREAEREEIEEEKEREIARLRAQQEKAQDEKAERDELRAKLYQEEQERKERQKEREEAE 228
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1884 AKVRAEMEVLLASKARAEEESRsTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAE 1963
Cdd:pfam13868  229 KKARQRQELQQAREEQIELKER-RLAEEAEREEEEFERMLRKQAEDEEIEQEEAEKRRMKRLEHRRELEKQIEEREEQRA 307
                          170       180       190
                   ....*....|....*....|....*....|....
gi 1920237946 1964 KLAAISEATRLKTEAEIALKEKEAENERLRRLAE 1997
Cdd:pfam13868  308 AEREEELEEGERLREEEAERRERIEEERQKKLKE 341
PLEC smart00250
Plectin repeat;
3220-3256 1.31e-04

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 42.08  E-value: 1.31e-04
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  3220 KLLSAEKAVTGYKDPYSGQSVSLFQALKKGLIPREQG 3256
Cdd:smart00250    2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
PRK10246 PRK10246
exonuclease subunit SbcC; Provisional
1811-2527 1.36e-04

exonuclease subunit SbcC; Provisional


Pssm-ID: 182330 [Multi-domain]  Cd Length: 1047  Bit Score: 48.26  E-value: 1.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1811 RQRELAEQEleKQRQLAEGTAQQRLAAEQ-ELIRLRAeTEQGEQQRQLLEeelaRLQREAAAATQKRR---ELEAELAKV 1886
Cdd:PRK10246   251 RLDELQQEA--SRRQQALQQALAAEEKAQpQLAALSL-AQPARQLRPHWE----RIQEQSAALAHTRQqieEVNTRLQST 323
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1887 RAEMEVLLASKARAEEESRSTSEKSKQRLeAEAGRFRELAEEAARLRALAEEAKRQRQlaEEDAVRQRAEAERvlaEKLA 1966
Cdd:PRK10246   324 MALRARIRHHAAKQSAELQAQQQSLNTWL-AEHDRFRQWNNELAGWRAQFSQQTSDRE--QLRQWQQQLTHAE---QKLN 397
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1967 AISEATRLKTEAEIAlkekeaenERLRRLAEDEAFQRRLLEEQaAQHkADIEARLAQLrKASESELERQKGLVEDTLRQR 2046
Cdd:PRK10246   398 ALPAITLTLTADEVA--------AALAQHAEQRPLRQRLVALH-GQI-VPQQKRLAQL-QVAIQNVTQEQTQRNAALNEM 466
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2047 RQVEEEilalkgsfekaaagKAElelELGRIRGTAEDTLRSKEQAEQEAarQRQLAAEEERRRREAEERVQKSLAAEEEA 2126
Cdd:PRK10246   467 RQRYKE--------------KTQ---QLADVKTICEQEARIKDLEAQRA--QLQAGQPCPLCGSTSHPAVEAYQALEPGV 527
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2127 ARQRKAALE-EVERLKakvEEARRLRERAEQeSARQLQLAQEAAQkRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERL 2205
Cdd:PRK10246   528 NQSRLDALEkEVKKLG---EEGAALRGQLDA-LTKQLQRDESEAQ-SLRQEEQALTQQWQAVCASLNITLQPQDDIQPWL 602
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2206 rseaeaarraaeeaeaareraereaaqsrrqvEEAERLKQsaeeqaqaqaqaqaaaeklrkeaeqeaarraqaEQAALRQ 2285
Cdd:PRK10246   603 --------------------------------DAQEEHER---------------------------------QLRLLSQ 617
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2286 KQAADAEMEKH----KQFAEQALRQKAQVEQELTALRLQLEETDHQKSIL---DEELQRLKAEVTEAARQRGQVE--EEL 2356
Cdd:PRK10246   618 RHELQGQIAAHnqqiIQYQQQIEQRQQQLLTALAGYALTLPQEDEEASWLatrQQEAQSWQQRQNELTALQNRIQqlTPL 697
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2357 FSLRVQMEELGKLKARIEAEN------RALVLRDKDSA--QRLLQEEAEKMKQVAEEAARL--SVAAQEAARLRQLAEED 2426
Cdd:PRK10246   698 LETLPQSDDLPHSEETVALDNwrqvheQCLSLHSQLQTlqQQDVLEAQRLQKAQAQFDTALqaSVFDDQQAFLAALLDEE 777
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2427 LAQQralAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQK--TLETERQRQLEM 2504
Cdd:PRK10246   778 TLTQ---LEQLKQNLENQRQQAQTLVTQTAQALAQHQQHRPDGLDLTVTVEQIQQELAQLAQQLREntTRQGEIRQQLKQ 854
                          730       740
                   ....*....|....*....|....
gi 1920237946 2505 SAEA-ERLRLRVAEMSRAQARAEE 2527
Cdd:PRK10246   855 DADNrQQQQALMQQIAQATQQVED 878
PLEC smart00250
Plectin repeat;
4092-4126 1.45e-04

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 41.70  E-value: 1.45e-04
                            10        20        30
                    ....*....|....*....|....*....|....*
gi 1920237946  4092 LLEAQAATGYVIDPIKGLKLTVEEAVRMGIVGPEF 4126
Cdd:smart00250    3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPET 37
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1545-1903 1.57e-04

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 47.59  E-value: 1.57e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1545 QEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAER 1624
Cdd:COG4372     12 RLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQA 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1625 LRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAEL 1704
Cdd:COG4372     92 AQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQ 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1705 QSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLT 1784
Cdd:COG4372    172 ELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEEL 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1785 QAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELAR 1864
Cdd:COG4372    252 LEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELA 331
                          330       340       350
                   ....*....|....*....|....*....|....*....
gi 1920237946 1865 LQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEE 1903
Cdd:COG4372    332 LAILLAELADLLQLLLVGLLDNDVLELLSKGAEAGVADG 370
Crescentin pfam19220
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament ...
1807-2054 1.64e-04

Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament proteins, named crescentin, whose cytoskeletal function is required for the vibrioid and helical shapes of Caulobacter crescentus. Without crescentin, the cells adopt a straight-rod morphology. Crescentin has characteristic features of IF proteins including the ability to assemble into filaments in vitro without energy or cofactor requirements. In vivo, crescentin forms a helical structure that colocalizes with the inner cell curvatures beneath the cytoplasmic membrane.


Pssm-ID: 437057 [Multi-domain]  Cd Length: 401  Bit Score: 47.37  E-value: 1.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1807 EQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRaeteqgeQQRQLLEEELARLQR-------EAAAATQKRREL 1879
Cdd:pfam19220  128 AAETEQNRALEEENKALREEAQAAEKALQRAEGELATAR-------ERLALLEQENRRLQAlseeqaaELAELTRRLAEL 200
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1880 EAELAKVRAEMEVLLASKAraeeESRSTSEKSKQRLEAEAGRFR-ELAEEAARLRALAEEAKRQRQLAEE--DAVRQRAE 1956
Cdd:pfam19220  201 ETQLDATRARLRALEGQLA----AEQAERERAEAQLEEAVEAHRaERASLRMKLEALTARAAATEQLLAEarNQLRDRDE 276
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1957 AERVLAEKLaaiSEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQK 2036
Cdd:pfam19220  277 AIRAAERRL---KEASIERDTLERRLAGLEADLERRTQQFQEMQRARAELEERAEMLTKALAAKDAALERAEERIASLSD 353
                          250
                   ....*....|....*...
gi 1920237946 2037 GLveDTLRQRRQVEEEIL 2054
Cdd:pfam19220  354 RI--AELTKRFEVERAAL 369
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
1599-1821 1.78e-04

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 47.49  E-value: 1.78e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1599 GELQALRARAEEAEAQ-KRQAQEEAERLRRQVQDETQRKRQAEAElalRVQAEAEAAREKQRALQALEELRlQAEEAERR 1677
Cdd:PRK09510    65 NRQQQQQKSAKRAEEQrKKKEQQQAEELQQKQAAEQERLKQLEKE---RLAAQEQKKQAEEAAKQAALKQK-QAEEAAAK 140
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1678 LRQAEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEAtrraqqqaeaeraraeaerel 1757
Cdd:PRK09510   141 AAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEA--------------------- 199
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 1758 erwQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELE 1821
Cdd:PRK09510   200 ---KKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVD 260
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
2503-2737 1.81e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 47.99  E-value: 1.81e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2503 EMSAEAERLRLRVAEMSRAQARAEeDARRFRKQAEDIgERLYRTELATQEKVMLVQTLETQRQ--QSDRDAERLREAIAE 2580
Cdd:COG4913    222 DTFEAADALVEHFDDLERAHEALE-DAREQIELLEPI-RELAERYAAARERLAELEYLRAALRlwFAQRRLELLEAELEE 299
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2581 LEHEKDKLKQEAQLLQLKSEEMQtvrqEQLLQETQALQQSFLSEKDSLlqrERCIEQEKAKLEQLFQdevaKAQALREEQ 2660
Cdd:COG4913    300 LRAELARLEAELERLEARLDALR----EELDELEAQIRGNGGDRLEQL---EREIERLERELEERER----RRARLEALL 368
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237946 2661 QRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQqqqekllaeenQRLRERLQHLEEERRAALAR 2737
Cdd:COG4913    369 AALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAAL-----------RDLRRELRELEAEIASLERR 434
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
990-1947 1.86e-04

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains; it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone of the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 47.86  E-value: 1.86e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  990 EAQEAIARLEAQHQALVAlwhQLHTEMKSLLAWQSLGRDMQLIRSWSLATFRTLKpEEQRQALRSLELHYQAFLRDSQDA 1069
Cdd:pfam01576  219 DLQEQIAELQAQIAELRA---QLAKKEEELQAALARLEEETAQKNNALKKIRELE-AQISELQEDLESERAARNKAEKQR 294
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1070 GGFGPE-DRLQAEREYGSCSRHYQQLLQSleQGEQEESRCQRCISElkdirlqleacETRtVHRLRLPLDKEPARECAQR 1148
Cdd:pfam01576  295 RDLGEElEALKTELEDTLDTTAAQQELRS--KREQEVTELKKALEE-----------ETR-SHEAQLQEMRQKHTQALEE 360
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1149 ITEQQKaQAEVDGLGKGVARLSAEAEkVLALPEPSPAAPTLRSELELTLGKLE-QVRSLSAIYLEKLKtislvirstQEA 1227
Cdd:pfam01576  361 LTEQLE-QAKRNKANLEKAKQALESE-NAELQAELRTLQQAKQDSEHKRKKLEgQLQELQARLSESER---------QRA 429
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1228 EEVLRAHEEQLkEAQAVPATLPELEATKAALKKLRAQAEAQ-QPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRER 1306
Cdd:pfam01576  430 ELAEKLSKLQS-ELESVSSLLNEAEGKNIKLSKDVSSLESQlQDTQELLQEETRQKLNLSTRLRQLEDERNSLQEQLEEE 508
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1307 VtlllerwqavlaqtdVRQRELEqlgRQLRyyresadPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLEDIEr 1386
Cdd:pfam01576  509 E---------------EAKRNVE---RQLS-------TLQAQLSDMKKKLEEDAGTLEALEEGKKRLQRELEALTQQLE- 562
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1387 hgEKVEECQRFAKQYINAIKDYELQLVTYKAQLEPVASPAKKPK----VQSGSESIIQEYVDLRTRYS----ELSTLTSQ 1458
Cdd:pfam01576  563 --EKAAAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSNLEKKQKkfdqMLAEEKAISARYAEERDRAEaearEKETRALS 640
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1459 YIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEreaqglqrRMQEEVARREEVAVEAQ 1538
Cdd:pfam01576  641 LARALEEALEAKEELERTNKQLRAEMEDLVSSKDDVGKNVHELERSKRALEQQVE--------EMKTQLEELEDELQATE 712
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1539 EQKRSIQEELQHLRQSSEAEIQAKArqvEAAERSRLRIEEEIRVVRLQLEATERQRGGA-------EGELQALRARAEEA 1611
Cdd:pfam01576  713 DAKLRLEVNMQALKAQFERDLQARD---EQGEEKRRQLVKQVRELEAELEDERKQRAQAvaakkklELDLKELEAQIDAA 789
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1612 EAQKRQAQEEAERLRRQVQDetqrkRQAEAELALRVQAEAEA-AREKQRALQALEELRLQAEE----AERRLRQAEAERA 1686
Cdd:pfam01576  790 NKGREEAVKQLKKLQAQMKD-----LQRELEEARASRDEILAqSKESEKKLKNLEAELLQLQEdlaaSERARRQAQQERD 864
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1687 R-QVQVALETAQRSAeaeLQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQqaeaeraraeaerelerwqlkaN 1765
Cdd:pfam01576  865 ElADEIASGASGKSA---LQDEKRRLEARIAQLEEELEEEQSNTELLNDRLRKSTLQ----------------------V 919
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1766 EALRLRLQAEEVAQQKSltqaeaekqkeeaerearrrgkaeEQAVRQRELAEQELEKQRQLAEGTAQQRLAAeqELIRLR 1845
Cdd:pfam01576  920 EQLTTELAAERSTSQKS------------------------ESARQQLERQNKELKAKLQEMEGTVKSKFKS--SIAALE 973
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1846 AETEQgeqqrqlLEEELARLQREAAAATQKRRELEAELAKVRAEMEvllaSKARAEEESRSTSEKSKQRLEAEAGRFREL 1925
Cdd:pfam01576  974 AKIAQ-------LEEQLEQESRERQAANKLVRRTEKKLKEVLLQVE----DERRHADQYKDQAEKGNSRMKQLKRQLEEA 1042
                          970       980
                   ....*....|....*....|..
gi 1920237946 1926 AEEAArlRALAEEAKRQRQLAE 1947
Cdd:pfam01576 1043 EEEAS--RANAARRKLQRELDD 1062
PLEC smart00250
Plectin repeat;
4401-4434 1.96e-04

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 41.31  E-value: 1.96e-04
                            10        20        30
                    ....*....|....*....|....*....|....
gi 1920237946  4401 EETGPVAGILDTETLEKVSITEAMHRNLVDNITG 4434
Cdd:smart00250    5 EAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
2283-2457 2.08e-04

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 47.11  E-value: 2.08e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2283 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEE---LFSL 2359
Cdd:PRK09510    80 QRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEakrAAAA 159
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2360 RVQMEELGKLKARIEAENRALVLRDK----DSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAE 2435
Cdd:PRK09510   160 AKKAAAEAKKKAEAEAAKKAAAEAKKkaeaEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAE 239
                          170       180
                   ....*....|....*....|..
gi 1920237946 2436 KmlKEKMQAVQEATRLKAEAEL 2457
Cdd:PRK09510   240 K--AAAAKAAEKAAAAKAAAEV 259
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
2505-2753 2.19e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 47.07  E-value: 2.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2505 SAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHE 2584
Cdd:COG4942     19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAE 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2585 KDKLKQEAQLLQLKSEEMQTVRQEQLLqetqaLQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQ 2664
Cdd:COG4942     99 LEAQKEELAELLRALYRLGRQPPLALL-----LSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAER 173
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2665 QqmqqEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEEIAPS 2744
Cdd:COG4942    174 A----ELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAAGFA 249

                   ....*....
gi 1920237946 2745 RAAAARALP 2753
Cdd:COG4942    250 ALKGKLPWP 258
PRK12704 PRK12704
phosphodiesterase; Provisional
1498-1677 2.22e-04

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 47.47  E-value: 2.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1498 QRQLAEAHAQAK---AQAEREAQglqrrmqeevARREEVAVEAqeqkrsiQEELQHLRQSSEAEIQAKARQVEAAERsRL 1574
Cdd:PRK12704    30 EAKIKEAEEEAKrilEEAKKEAE----------AIKKEALLEA-------KEEIHKLRNEFEKELRERRNELQKLEK-RL 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1575 RIEEEIrvVRLQLEATERQRGGAEGELQALRARAEEAEAQKrqaqEEAERLRRQVQDETQR-----KRQAEAELALRVqa 1649
Cdd:PRK12704    92 LQKEEN--LDRKLELLEKREEELEKKEKELEQKQQELEKKE----EELEELIEEQLQELERisgltAEEAKEILLEKV-- 163
                          170       180
                   ....*....|....*....|....*....
gi 1920237946 1650 EAEAAREKQRALQALEElrlQA-EEAERR 1677
Cdd:PRK12704   164 EEEARHEAAVLIKEIEE---EAkEEADKK 189
MAP7 pfam05672
MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is ...
1496-1627 2.23e-04

MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is presumably controlled by tissue-specific microtubule-associated proteins (MAPs). The 115-kDa epithelial MAP (E-MAP-115/MAP7) has been identified as a microtubule-stabilising protein predominantly expressed in cell lines of epithelial origin. The binding of this microtubule associated protein is nucleotide independent.


Pssm-ID: 461709 [Multi-domain]  Cd Length: 153  Bit Score: 44.65  E-value: 2.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1496 EKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHL--RQSSEAEIQAKARQVEAAERSR 1573
Cdd:pfam05672   11 EAARILAEKRRQAREQREREEQERLEKEEEERLRKEELRRRAEEERARREEEARRLeeERRREEEERQRKAEEEAEEREQ 90
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 1574 LRIEEEIRVVRLQLEATERQRGGAEGELQalraraeeaEAQKRQAQEEAERLRR 1627
Cdd:pfam05672   91 REQEEQERLQKQKEEAEAKAREEAERQRQ---------EREKIMQQEEQERLER 135
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
1895-2092 2.27e-04

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 47.11  E-value: 2.27e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1895 ASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQrqlaEEDAVRQRAEAERVLAEKLAAISEATRL 1974
Cdd:PRK09510    72 KSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQ----AEEAAKQAALKQKQAEEAAAKAAAAAKA 147
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1975 KTEAEIALKEKEAENerlrrlAEDEAfQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEIL 2054
Cdd:PRK09510   148 KAEAEAKRAAAAAKK------AAAEA-KKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAA 220
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1920237946 2055 ALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAE 2092
Cdd:PRK09510   221 AEAKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAE 258
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
1444-1724 2.31e-04

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 47.20  E-value: 2.31e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1444 DLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQrAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRM 1523
Cdd:pfam07888   77 ELESRVAELKEELRQSREKHEELEEKYKELSASSEEL-SEEKDALLAQRAAHEARIRELEEDIKTLTQRVLERETELERM 155
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1524 QEEVARREEVAVEAQEQKRSIQEELQHLRQ---SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGE 1600
Cdd:pfam07888  156 KERAKKAGAQRKEEEAERKQLQAKLQQTEEelrSLSKEFQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKEAENEAL 235
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1601 LQALRARAEEAEAQKRQAQ------EEAERLRRQVQDETQRKRQAEAELALRVQAEAEAARE-KQRALQALEELRLQAE- 1672
Cdd:pfam07888  236 LEELRSLQERLNASERKVEglgeelSSMAAQRDRTQAELHQARLQAAQLTLQLADASLALREgRARWAQERETLQQSAEa 315
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1920237946 1673 EAERRLRQAEAERARQVQVALETAQR-SAEAELQSEHASFAEKTAQLERTLKE 1724
Cdd:pfam07888  316 DKDRIEKLSAELQRLEERLQEERMEReKLEVELGREKDCNRVQLSESRRELQE 368
CHASE3 COG5278
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
2298-2744 2.37e-04

Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];


Pssm-ID: 444089 [Multi-domain]  Cd Length: 530  Bit Score: 47.21  E-value: 2.37e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2298 QFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAEN 2377
Cdd:COG5278     76 SFLEPYEEARAEIDELLAELRSLTADNPEQQARLDELEALIDQWLAELEQVIALRRAGGLEAALALVRSGEGKALMDEIR 155
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2378 RALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAEL 2457
Cdd:COG5278    156 ARLLLLALALAALLLAAAALLLLLLALAALLALAELLLLALARALAALLLLLLLEAELAAAAALLAAAAALAALAALELL 235
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2458 LQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAE 2537
Cdd:COG5278    236 AALALALALLLAALLLALLAALALAALLAAALLALAALLLALAAAAALAAAAALELAAAEALALAELELELLLAAAAAAA 315
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2538 DIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQAL 2617
Cdd:COG5278    316 AAAAAAAAALAALLALALATALAAAAAALALLAALLAEAAAAAAEEAEAAAEAAAAALAGLAEVEAEGAAEAVELEVLAI 395
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2618 QQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQE 2697
Cdd:COG5278    396 AAAAAAAAAEAAAAAAAAAAASAAEALELAEALAEALALAEEEALALAAASSELAEAGAALALAAAEALAEELAAVAALA 475
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*..
gi 1920237946 2698 ELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALARSEEIAPS 2744
Cdd:COG5278    476 ALAAAAAALAEAEAAAALAAAAALSLALALAALLLAAAEAALAAALA 522
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
1558-1725 2.37e-04

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 46.07  E-value: 2.37e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1558 EIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQvQDETQRKR 1637
Cdd:COG1579     11 DLQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQ-LGNVRNNK 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1638 QAEAelalrVQAEAEAAREKQRAL-QALEELRLQAEEAERRLRQAEAERARQVQ--VALETAQRSAEAELQSEHASFAEK 1714
Cdd:COG1579     90 EYEA-----LQKEIESLKRRISDLeDEILELMERIEELEEELAELEAELAELEAelEEKKAELDEELAELEAELEELEAE 164
                          170
                   ....*....|.
gi 1920237946 1715 TAQLERTLKEE 1725
Cdd:COG1579    165 REELAAKIPPE 175
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
1218-1873 2.38e-04

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 47.41  E-value: 2.38e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1218 SLVIRSTQEAEEVLRAHEEQLKEAQAVPATLPE-LEATKAALKKLRAQAEAQqpvfdalRDELR-GAQEVGERLQQRHGE 1295
Cdd:pfam05483  158 NLLKETCARSAEKTKKYEYEREETRQVYMDLNNnIEKMILAFEELRVQAENA-------RLEMHfKLKEDHEKIQHLEEE 230
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1296 RDVEVERWRERVTLLLerwqavlAQTDVRQRELEQLGRQLRYYRESADPLgawlrdakQRQEQIQAVPLANSQAVREQLR 1375
Cdd:pfam05483  231 YKKEINDKEKQVSLLL-------IQITEKENKMKDLTFLLEESRDKANQL--------EEKTKLQDENLKELIEKKDHLT 295
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1376 QEkalLEDIERHGEKVEECQRFAKQYINAIKDYELQLVTYK-AQLEPVASPAK---------KPKVQSGSESIIQEYVDL 1445
Cdd:pfam05483  296 KE---LEDIKMSLQRSMSTQKALEEDLQIATKTICQLTEEKeAQMEELNKAKAahsfvvtefEATTCSLEELLRTEQQRL 372
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1446 RTRYSELSTLTSQYIRFISEtLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLaEAHAQAKAQAEREAQGLQRRMQE 1525
Cdd:pfam05483  373 EKNEDQLKIITMELQKKSSE-LEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQF-EKIAEELKGKEQELIFLLQAREK 450
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1526 EVARREEVAVEAQEQKRSIQEELQHLRQSSEAEiqaKARQVEAAERSRLRIEEEIRVVR------LQLEATERQRGGAEG 1599
Cdd:pfam05483  451 EIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKE---KLKNIELTAHCDKLLLENKELTQeasdmtLELKKHQEDIINCKK 527
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1600 ELQALRARAEEAEAQKRQAQEEAERLRR---QVQDETQRKRQAEAELALRVQAEAEAAREKQRALQ-ALEELRLQAEEAE 1675
Cdd:pfam05483  528 QEERMLKQIENLEEKEMNLRDELESVREefiQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILEnKCNNLKKQIENKN 607
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1676 RRLRQAEAE-RARQVQVALETAQRSA--------EAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRaqqqaea 1746
Cdd:pfam05483  608 KNIEELHQEnKALKKKGSAENKQLNAyeikvnklELELASAKQKFEEIIDNYQKEIEDKKISEEKLLEEVEKA------- 680
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1747 eraraeaerelerwQLKANEALRLRLQAEEVAQQKsltqaeaekqkEEAEREARRRGKAEEQAVRQRELAEQELEKQRQL 1826
Cdd:pfam05483  681 --------------KAIADEAVKLQKEIDKRCQHK-----------IAEMVALMEKHKHQYDKIIEERDSELGLYKNKEQ 735
                          650       660       670       680
                   ....*....|....*....|....*....|....*....|....*..
gi 1920237946 1827 AEGTAqqRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAAT 1873
Cdd:pfam05483  736 EQSSA--KAALEIELSNIKAELLSLKKQLEIEKEEKEKLKMEAKENT 780
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixture of two different families of ...
2287-2610 2.39e-04

Trichohyalin-plectin-homology domain; This family is a mixture of two different families of eukaryotic proteins. Trichoplein, or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 46.84  E-value: 2.39e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2287 QAADAEMEKHKQFAEQALRQKAQVEQELtalRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEEL 2366
Cdd:pfam13868   16 LAAKCNKERDAQIAEKKRIKAEEKEEER---RLDEMMEEERERALEEEEEKEEERKEERKRYRQELEEQIEEREQKRQEE 92
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2367 GKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARlSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQ 2446
Cdd:pfam13868   93 YEEKLQEREQMDEIVERIQEEDQAEAEEKLEKQRQLREEIDE-FNEEQAEWKELEKEEEREEDERILEYLKEKAEREEER 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2447 EATRLKAEAELLQQQKELA--QEQARRLQEDKEQMAQQLAQETQGF---QKTLETERQRQLEMSAEAERLRLRVAEMSRA 2521
Cdd:pfam13868  172 EAEREEIEEEKEREIARLRaqQEKAQDEKAERDELRAKLYQEEQERkerQKEREEAEKKARQRQELQQAREEQIELKERR 251
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2522 QARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQtLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEE 2601
Cdd:pfam13868  252 LAEEAEREEEEFERMLRKQAEDEEIEQEEAEKRRMKR-LEHRRELEKQIEEREEQRAAEREEELEEGERLREEEAERRER 330

                   ....*....
gi 1920237946 2602 MQTVRQEQL 2610
Cdd:pfam13868  331 IEEERQKKL 339
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
2306-2645 2.79e-04

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D, E, N, Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 47.32  E-value: 2.79e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2306 QKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVT-------EAARQRGQVEEELFSLRVQMEELGKLKARIEAENR 2378
Cdd:TIGR04523  118 QKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEklnnkynDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKLL 197
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2379 AL-----VLRDKDSAQRLLQEEAEKMK----QVAEEAARLSVAAQEAARLRQLAEE---DLAQQRALAEKMLKEKMQAVQ 2446
Cdd:TIGR04523  198 KLelllsNLKKKIQKNKSLESQISELKkqnnQLKDNIEKKQQEINEKTTEISNTQTqlnQLKDEQNKIKKQLSEKQKELE 277
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2447 EATR-----------LKAEAELLQQQKE---------LAQEQARRLQEDKEQMAQ------QLAQETQGFQKTLETERQR 2500
Cdd:TIGR04523  278 QNNKkikelekqlnqLKSEISDLNNQKEqdwnkelksELKNQEKKLEEIQNQISQnnkiisQLNEQISQLKKELTNSESE 357
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2501 QLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAE 2580
Cdd:TIGR04523  358 NSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIK 437
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237946 2581 LEHEKDKLKQEAQLLQLKSEEMQTVRqEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQL 2645
Cdd:TIGR04523  438 NNSEIKDLTNQDSVKELIIKNLDNTR-ESLETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKL 501
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1807-2088 2.81e-04

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 46.82  E-value: 2.81e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1807 EQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV 1886
Cdd:COG4372     69 EQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAER 148
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1887 RAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLA 1966
Cdd:COG4372    149 EEELKELEEQLESLQEELAALEQELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLE 228
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1967 AISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQR 2046
Cdd:COG4372    229 AKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALS 308
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|..
gi 1920237946 2047 RQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSK 2088
Cdd:COG4372    309 LIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVGL 350
Filament pfam00038
Intermediate filament protein;
1460-1724 3.09e-04

Intermediate filament protein;


Pssm-ID: 459643 [Multi-domain]  Cd Length: 313  Bit Score: 46.45  E-value: 3.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1460 IRFISETLRRMEEEERLAEQQRAEERERLAEV-EAALEKQRQLAEAHAQAKAQAEREaqglqrrmqeevarREEVAVEAQ 1538
Cdd:pfam00038   20 VRFLEQQNKLLETKISELRQKKGAEPSRLYSLyEKEIEDLRRQLDTLTVERARLQLE--------------LDNLRLAAE 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1539 EQKRSIQEELQhLRQSSEAEIQAKARQVEAAERSRLRIEEEIrvvrlqleaterqrggaegelQALRaraEEAEAQKRQA 1618
Cdd:pfam00038   86 DFRQKYEDELN-LRTSAENDLVGLRKDLDEATLARVDLEAKI---------------------ESLK---EELAFLKKNH 140
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1619 QEEAERLRRQVQDETQ-------RKRQAEAELA-LRVQAEAEAAREKQRA----LQALEELRLQAEEAERRLRQAEAERA 1686
Cdd:pfam00038  141 EEEVRELQAQVSDTQVnvemdaaRKLDLTSALAeIRAQYEEIAAKNREEAeewyQSKLEELQQAAARNGDALRSAKEEIT 220
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 1687 ---RQVQ-------------VALETAQRSAEAELQSEHASFAEKTAQLERTLKE 1724
Cdd:pfam00038  221 elrRTIQsleielqslkkqkASLERQLAETEERYELQLADYQELISELEAELQE 274
tolA_full TIGR02794
TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the ...
2383-2553 3.10e-04

TolA protein; TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cutoffs are based largely on conserved operon structure. The Tol-Pal complex is required for maintaining outer membrane integrity. Also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins ompC, phoE and lamB. [Transport and binding proteins, Other, Cellular processes, Pathogenesis]


Pssm-ID: 274303 [Multi-domain]  Cd Length: 346  Bit Score: 46.38  E-value: 3.10e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2383 RDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATR-LKAEAELLQQQ 2461
Cdd:TIGR02794   65 KEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAkAKAEAEAERKA 144
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2462 KELAQEQARrlqEDKEQMAQQLAQetqgfQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGE 2541
Cdd:TIGR02794  145 KEEAAKQAE---EEAKAKAAAEAK-----KKAEEAKKKAEAEAKAKAEAEAKAKAEEAKAKAEAAKAKAAAEAAAKAEAE 216
                          170
                   ....*....|..
gi 1920237946 2542 RLYRTELATQEK 2553
Cdd:TIGR02794  217 AAAAAAAEAERK 228
MARTX_Nterm NF012221
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model ...
1804-2016 3.11e-04

MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model describes the N-terminal 1900 amino acids of MARTX family multifunctional-autoprocessing repeats-in-toxin holotoxins, which contain both repeat regions that facilitate their entry into eukaryotic target cells, and multiple effector domains.


Pssm-ID: 467957 [Multi-domain]  Cd Length: 1848  Bit Score: 47.52  E-value: 3.11e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1804 KAEEQAVRQRELAEQ-----ELEKQRQLAEGTAQQrlaAEQELIRLRAETEQGEQQRQLLEEElarlqreaaaatqkRRE 1878
Cdd:NF012221  1555 DAAQNALADKERAEAdrqrlEQEKQQQLAAISGSQ---SQLESTDQNALETNGQAQRDAILEE--------------SRA 1617
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1879 LEAELAKVRAEMEVLLAS-------------------KARAEEESRSTSEKSKQRLEAEAGRF----RELAEEAARLRAL 1935
Cdd:NF012221  1618 VTKELTTLAQGLDALDSQatyagesgdqwrnpfagglLDRVQEQLDDAKKISGKQLADAKQRHvdnqQKVKDAVAKSEAG 1697
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1936 AEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRlLEEQAAQHKA 2015
Cdd:NF012221  1698 VAQGEQNQANAEQDIDDAKADAEKRKDDALAKQNEAQQAESDANAAANDAQSRGEQDASAAENKANQAQ-ADAKGAKQDE 1776

                   .
gi 1920237946 2016 D 2016
Cdd:NF012221  1777 S 1777
EmrA COG1566
Multidrug resistance efflux pump EmrA [Defense mechanisms];
1536-1671 3.12e-04

Multidrug resistance efflux pump EmrA [Defense mechanisms];


Pssm-ID: 441174 [Multi-domain]  Cd Length: 331  Bit Score: 46.19  E-value: 3.12e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1536 EAQEQKRSIQEELQHLRQSS--EAEIQAKARQVEAAERSRLRIEEEI-RVVRLQleateRQRGGAEGELQALRARAEEAE 1612
Cdd:COG1566     87 QAEAQLAAAEAQLARLEAELgaEAEIAAAEAQLAAAQAQLDLAQRELeRYQALY-----KKGAVSQQELDEARAALDAAQ 161
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 1613 AQKRQAQEEAERLRRQVQDETQrKRQAEAELAlrvQAEAEAAREKQRalqaLEELRLQA 1671
Cdd:COG1566    162 AQLEAAQAQLAQAQAGLREEEE-LAAAQAQVA---QAEAALAQAELN----LARTTIRA 212
PRK11281 PRK11281
mechanosensitive channel MscK;
2290-2486 3.17e-04

mechanosensitive channel MscK;


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 47.21  E-value: 3.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2290 DAEMEKHKQFAEQALR---QKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQR------GQVEEELFSLR 2360
Cdd:PRK11281    55 EAEDKLVQQDLEQTLAlldKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDDNDEETRETlstlslRQLESRLAQTL 134
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2361 VQMEELGklKARIEAENRALVLRDK--------DSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAeedLAQQRA 2432
Cdd:PRK11281   135 DQLQNAQ--NDLAEYNSQLVSLQTQperaqaalYANSQRLQQIRNLLKGGKVGGKALRPSQRVLLQAEQAL---LNAQND 209
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2433 LAEKMLK--EKMQAVQEATR--LKAEAELLQQQKELAQE--QARRLQEDKEQMAQQLAQE 2486
Cdd:PRK11281   210 LQRKSLEgnTQLQDLLQKQRdyLTARIQRLEHQLQLLQEaiNSKRLTLSEKTVQEAQSQD 269
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
2128-2617 3.43e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 47.07  E-value: 3.43e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2128 RQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRS 2207
Cdd:COG4717     64 RKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEALEAELAE 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2208 EAeaarraaeeaeaareraereaaqsrRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAArraqaeqaalRQKQ 2287
Cdd:COG4717    144 LP-------------------------ERLEELEERLEELRELEEELEELEAELAELQEELEELLE----------QLSL 188
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2288 AADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKA-EVTEAARQRGQVEEELFSLRVQMEEL 2366
Cdd:COG4717    189 ATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALeERLKEARLLLLIAAALLALLGLGGSL 268
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2367 GKLKARI------EAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKE 2440
Cdd:COG4717    269 LSLILTIagvlflVLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEE 348
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2441 KMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKtleTERQRQLEMSAEAERLRLRVAEMSR 2520
Cdd:COG4717    349 LQELLREAEELEEELQLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQEL---KEELEELEEQLEELLGELEELLEAL 425
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2521 AQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAER--LREAIAELEHEKDKLKQEAQLLQLK 2598
Cdd:COG4717    426 DEEELEEELEELEEELEELEEELEELREELAELEAELEQLEEDGELAELLQELeeLKAELRELAEEWAALKLALELLEEA 505
                          490
                   ....*....|....*....
gi 1920237946 2599 SEEMQTVRQEQLLQETQAL 2617
Cdd:COG4717    506 REEYREERLPPVLERASEY 524
PTZ00491 PTZ00491
major vault protein; Provisional
1804-1975 3.50e-04

major vault protein; Provisional


Pssm-ID: 240439 [Multi-domain]  Cd Length: 850  Bit Score: 46.93  E-value: 3.50e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1804 KAEEQAVRQR-ELAEQElekqrqlaegtAQQRLaaEQELIRLRAETEqgEQQRQLLeeelarlqreaaaatqkrrELEAE 1882
Cdd:PTZ00491   662 KSQEAAARHQaELLEQE-----------ARGRL--ERQKMHDKAKAE--EQRTKLL-------------------ELQAE 707
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1883 LAKVRAemevllASKARAEEESRSTSEKSKQRLEAEAGRFRelaEEAARLRALAE-EAKRQRQLAEEDAVRQRAEAERVL 1961
Cdd:PTZ00491   708 SAAVES------SGQSRAEALAEAEARLIEAEAEVEQAELR---AKALRIEAEAElEKLRKRQELELEYEQAQNELEIAK 778
                          170
                   ....*....|....
gi 1920237946 1962 AEKLAAIsEATRLK 1975
Cdd:PTZ00491   779 AKELADI-EATKFE 791
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
1803-1984 3.53e-04

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 46.77  E-value: 3.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1803 GKAEEQAVRQRELAEQELEKQRQLAEGTAQQR-LAAEQELIRLRAEteqgeqQRQLLEEELARLQREAAAATQKRRELEA 1881
Cdd:COG2433    375 GLSIEEALEELIEKELPEEEPEAEREKEHEEReLTEEEEEIRRLEE------QVERLEAEVEELEAELEEKDERIERLER 448
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1882 ELAKVRAEMEvllaSKARAEEESRstsekskqRLEAEAGRF-RELAEEAARLRALAEEAKRQRQLAEEDavrqrAEAERV 1960
Cdd:COG2433    449 ELSEARSEER----REIRKDREIS--------RLDREIERLeRELEEERERIEELKRKLERLKELWKLE-----HSGELV 511
                          170       180
                   ....*....|....*....|....
gi 1920237946 1961 LAEKLAAISEATRLKTEAEIALKE 1984
Cdd:COG2433    512 PVKVVEKFTKEAIRRLEEEYGLKE 535
PLEC smart00250
Plectin repeat;
3143-3180 3.58e-04

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 40.54  E-value: 3.58e-04
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1920237946  3143 RRALRGSGVIAGVWLEEAGQKLSIYEALRKDLLQPEAA 3180
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1529-1889 3.65e-04

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 46.43  E-value: 3.65e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1529 RREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARA 1608
Cdd:COG4372      3 RLGEKVGKARLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEEL 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1609 EEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERA-- 1686
Cdd:COG4372     83 EELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLEsl 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1687 -RQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKAN 1765
Cdd:COG4372    163 qEELAALEQELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDAL 242
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1766 EALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLR 1845
Cdd:COG4372    243 ELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALL 322
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 1920237946 1846 AETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAE 1889
Cdd:COG4372    323 ELAKKLELALAILLAELADLLQLLLVGLLDNDVLELLSKGAEAG 366
PLEC smart00250
Plectin repeat;
3551-3587 3.91e-04

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 40.54  E-value: 3.91e-04
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  3551 KLLSAEKAVTGYRDPYSGSTISLFQAMKKGLVLREHG 3587
Cdd:smart00250    2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
2283-2740 3.98e-04

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 46.66  E-value: 3.98e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2283 LRQKQAADAEMEKHKQFAEQALRQKAQV----EQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFS 2358
Cdd:pfam05557   78 NRLKKKYLEALNKKLNEKESQLADAREVisclKNELSELRRQIQRAELELQSTNSELEELQERLDLLKAKASEAEQLRQN 157
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2359 LRVQMEELGKLKARIEAENRALVLRDKDSaqrllqEEAEKMKqvaEEAARLSVAAQEAARLRqlaeEDLAQQRALAEKML 2438
Cdd:pfam05557  158 LEKQQSSLAEAEQRIKELEFEIQSQEQDS------EIVKNSK---SELARIPELEKELERLR----EHNKHLNENIENKL 224
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2439 KEKMQAVQEATRLkaeaellqQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKT-------------LETERQRQLEMS 2505
Cdd:pfam05557  225 LLKEEVEDLKRKL--------EREEKYREEAATLELEKEKLEQELQSWVKLAQDTglnlrspedlsrrIEQLQQREIVLK 296
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2506 AEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTE-----------LATQEKVMLVQTLE------TQRQQSD 2568
Cdd:pfam05557  297 EENSSLTSSARQLEKARRELEQELAQYLKKIEDLNKKLKRHKalvrrlqrrvlLLTKERDGYRAILEsydkelTMSNYSP 376
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2569 RDAERLREA----------IAELEHEKDKLKQEA----QLLQLKSEEMQTVRQeqllQETQALQQSFLSEKDSLLQRERC 2634
Cdd:pfam05557  377 QLLERIEEAedmtqkmqahNEEMEAQLSVAEEELggykQQAQTLERELQALRQ----QESLADPSYSKEEVDSLRRKLET 452
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2635 IEQEKAKLEQlfQDEVAKAQALREEQQRQQQQMQQEKQQLaasmeearrRQHEAEEGVRRQQEELQRLAqqqqqqeklla 2714
Cdd:pfam05557  453 LELERQRLRE--QKNELEMELERRCLQGDYDPKKTKVLHL---------SMNPAAEAYQQRKNQLEKLQ----------- 510
                          490       500
                   ....*....|....*....|....*.
gi 1920237946 2715 EENQRLRERLQHLEEERRAALARSEE 2740
Cdd:pfam05557  511 AEIERLKRLLKKLEDDLEQVLRLPET 536
CEP63 pfam17045
Centrosomal protein of 63 kDa; CEP63 is a family of eukaryotic proteins involved in centriole ...
1803-1925 4.00e-04

Centrosomal protein of 63 kDa; CEP63 is a family of eukaryotic proteins involved in centriole activity.


Pssm-ID: 465338 [Multi-domain]  Cd Length: 264  Bit Score: 45.58  E-value: 4.00e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1803 GKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQ-ELIRLRAETEQGEQQRQLLEEELARLQR-----EAAAATQKR 1876
Cdd:pfam17045  127 GKLEEFRQKSLEWEQQRLQYQQQVASLEAQRKALAEQsSLIQSAAYQVQLEGRKQCLEASQSEIQRlrsklERAQDSLCA 206
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 1877 RELEAELAKVRAE-----MEVLLASKARAEEESRStSEKSKQRLEAEAGRFREL 1925
Cdd:pfam17045  207 QELELERLRMRVSelgdsNRKLLEEQQRLLEELRM-SQRQLQVLQNELMELKAT 259
YqiK COG2268
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
1794-1957 4.24e-04

Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];


Pssm-ID: 441869 [Multi-domain]  Cd Length: 439  Bit Score: 46.40  E-value: 4.24e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1794 EAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEgtaQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREaaaat 1873
Cdd:COG2268    229 EQEREIETARIAEAEAELAKKKAEERREAETARAE---AEAAYEIAEANAEREVQRQLEIAEREREIELQEKEAE----- 300
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1874 QKRRELEAELaKVRAEmevllASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEedAVRQ 1953
Cdd:COG2268    301 REEAELEADV-RKPAE-----AEKQAAEAEAEAEAEAIRAKGLAEAEGKRALAEAWNKLGDAAILLMLIEKLPE--IAEA 372

                   ....
gi 1920237946 1954 RAEA 1957
Cdd:COG2268    373 AAKP 376
PRK11637 PRK11637
AmiB activator; Provisional
1815-2100 4.36e-04

AmiB activator; Provisional


Pssm-ID: 236942 [Multi-domain]  Cd Length: 428  Bit Score: 46.22  E-value: 4.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1815 LAEQELEKQRQLAegTAQQRLAAEQELIRlraeteQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLL 1894
Cdd:PRK11637    38 FSAHASDNRDQLK--SIQQDIAAKEKSVR------QQQQQRASLLAQLKKQEEAISQASRKLRETQNTLNQLNKQIDELN 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1895 ASKARAEEESRSTSEKSKQRLEAEagrFRELAEEAARLRALAEEAKRqrqlaeedavrqraeAERVLAeKLAAISEAtRL 1974
Cdd:PRK11637   110 ASIAKLEQQQAAQERLLAAQLDAA---FRQGEHTGLQLILSGEESQR---------------GERILA-YFGYLNQA-RQ 169
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1975 KTEAEialkekeaenerLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVedtlrqrrqveeeil 2054
Cdd:PRK11637   170 ETIAE------------LKQTREELAAQKAELEEKQSQQKTLLYEQQAQQQKLEQARNERKKTLT--------------- 222
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 2055 ALKGSFEKAAAGKAELELELGRIRGT-AEDTLRSKEQAEQEA-------ARQRQ 2100
Cdd:PRK11637   223 GLESSLQKDQQQLSELRANESRLRDSiARAEREAKARAEREAreaarvrDKQKQ 276
PRK00409 PRK00409
recombination and DNA strand exchange inhibitor protein; Reviewed
2570-2700 4.50e-04

recombination and DNA strand exchange inhibitor protein; Reviewed


Pssm-ID: 234750 [Multi-domain]  Cd Length: 782  Bit Score: 46.74  E-value: 4.50e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2570 DAERLREAIAELEHEKDKLKQEAQLLqlkseemqtvrqEQLLQETQALQQSFLSEKDSLLQRErciEQEKAKLEQLFQDE 2649
Cdd:PRK00409   514 DKEKLNELIASLEELERELEQKAEEA------------EALLKEAEKLKEELEEKKEKLQEEE---DKLLEEAEKEAQQA 578
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2650 V--AKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGV-------RRQQEELQ 2700
Cdd:PRK00409   579 IkeAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKekkkkkqKEKQEELK 638
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
2335-2486 4.54e-04

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 45.95  E-value: 4.54e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2335 LQRLKAEVTEAARQR----GQVEEELfsLRVQMEELGKLKaRIEAEnRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLS 2410
Cdd:PRK09510    67 QQQQQKSAKRAEEQRkkkeQQQAEEL--QQKQAAEQERLK-QLEKE-RLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAA 142
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 2411 VAAQEAARLRQLAEEDLAQQrALAEKMLKEKMQAVQEA---TRLKAEAELLQQQKELAQEQArrlQEDKEQMAQQLAQE 2486
Cdd:PRK09510   143 AAAKAKAEAEAKRAAAAAKK-AAAEAKKKAEAEAAKKAaaeAKKKAEAEAAAKAAAEAKKKA---EAEAKKKAAAEAKK 217
PRK05035 PRK05035
electron transport complex protein RnfC; Provisional
1804-2033 4.63e-04

electron transport complex protein RnfC; Provisional


Pssm-ID: 235334 [Multi-domain]  Cd Length: 695  Bit Score: 46.48  E-value: 4.63e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1804 KAEE--------QAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEEL--------ARLQR 1867
Cdd:PRK05035   447 KAEEakarfearQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARpdnsaviaAREAR 526
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1868 EAAAATQKRRELEAE-----LAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAgrfrelaeeAARLRALAeeAKRQ 1942
Cdd:PRK05035   527 KAQARARQAEKQAAAaadpkKAAVAAAIARAKAKKAAQQAANAEAEEEVDPKKAAVA---------AAIARAKA--KKAA 595
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1943 RQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAedeafqrrlleeqAAQHKAdiEARLA 2022
Cdd:PRK05035   596 QQAASAEPEEQVAEVDPKKAAVAAAIARAKAKKAEQQANAEPEEPVDPRKAAVA-------------AAIARA--KARKA 660
                          250
                   ....*....|.
gi 1920237946 2023 QLRKASESELE 2033
Cdd:PRK05035   661 AQQQANAEPEE 671
PRK12678 PRK12678
transcription termination factor Rho; Provisional
1554-1688 5.01e-04

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 46.44  E-value: 5.01e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1554 SSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERlRRQVQDET 1633
Cdd:PRK12678    69 TPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARR-GAARKAGE 147
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1920237946 1634 QRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQ 1688
Cdd:PRK12678   148 GGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRREERGRDGDD 202
PLEC smart00250
Plectin repeat;
3512-3547 5.04e-04

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 40.16  E-value: 5.04e-04
                            10        20        30
                    ....*....|....*....|....*....|....*.
gi 1920237946  3512 TLLLEAQAATGFLVDPVRNQRLYVHEAVKAGVVGPE 3547
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE 36
COG3903 COG3903
Predicted ATPase [General function prediction only];
1728-2183 5.13e-04

Predicted ATPase [General function prediction only];


Pssm-ID: 443109 [Multi-domain]  Cd Length: 933  Bit Score: 46.55  E-value: 5.13e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1728 AVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEE 1807
Cdd:COG3903    477 AAERLAEAGERAAARRRHADYYLALAERAAAELRGPDQLAWLARLDAEHDNLRAALRWALAHGDAELALRLAAALAPFWF 556
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1808 QAVRQRElAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVR 1887
Cdd:COG3903    557 LRGLLRE-GRRWLERALAAAGEAAAALAAAAALAAAAAAARAAAAAAAAAAAAAAAAAAAAAAAAAALLLLAALAAAAAA 635
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1888 AEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAA 1967
Cdd:COG3903    636 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAAAAAAAAALAAAAAALAAAAAAAALAAAAAAALAAAA 715
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1968 ISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRR 2047
Cdd:COG3903    716 AAAAAAAAAAALLAAAAAAALAAAAAAAALALAAAAAAAAAAAAAAALAAAAAAAALAALLLALAAAAAALAAAAAAAAA 795
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2048 QVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAA 2127
Cdd:COG3903    796 AAAAAAAAAAAAAAAAAAAALAAAAAAAAAAAAALAAALAAAAAAAAAAAAAAAAAAALAAALAAAAAAAAAAALAAAAA 875
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 2128 RQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFA 2183
Cdd:COG3903    876 AAAAAAAALLAAAAAAAAAAAAAAAAAAALAAAAAAAAAAALAAAAAAAAAAAAAA 931
rne PRK10811
ribonuclease E; Reviewed
1471-1736 5.32e-04

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 46.57  E-value: 5.32e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1471 EEEERLAEQQRAEERER----------LAEVEAALEKQR---QLAEAHAQAKAQAEREAQGLQRR----MQEEVARREEV 1533
Cdd:PRK10811   507 EEAMALPSEEEFAERKRpeqpalatfaMPDVPPAPTPAEpaaPVVAAAPKAAAATPPAQPGLLSRffgaLKALFSGGEET 586
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1534 AVEAQEQKRSIQEElqhlRQSSEaeiQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEA 1613
Cdd:PRK10811   587 KPQEQPAPKAEAKP----ERQQD---RRKPRQNNRRDRNERRDTRDNRTRREGRENREENRRNRRQAQQQTAETRESQQA 659
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1614 QKRQAQEEAERLRRQVQDETQRKRQAEAELAlrvQAEAEAarekqralQALEELRLQAEEAERRLRQAEAERAR------ 1687
Cdd:PRK10811   660 EVTEKARTQDEQQQAPRRERQRRRNDEKRQA---QQEAKA--------LNVEEQSVQETEQEERVQQVQPRRKQrqlnqk 728
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 1688 ---QVQVALETAQRSAEAELQSEHASfAEKTAQLERTLKEEHVAVVQLREEA 1736
Cdd:PRK10811   729 vriEQSVAEEAVAPVVEETVAAEPVV-QEVPAPRTELVKVPLPVVAQTAPEQ 779
PLEC smart00250
Plectin repeat;
3183-3216 5.35e-04

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 40.16  E-value: 5.35e-04
                            10        20        30
                    ....*....|....*....|....*....|....
gi 1920237946  3183 LLEAQAGTGHIIDPTTSARLTVDEAVRAGLVGPE 3216
Cdd:smart00250    3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE 36
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixture of two different families of ...
1485-1736 5.51e-04

Trichohyalin-plectin-homology domain; This family is a mixture of two different families of eukaryotic proteins. Trichoplein, or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 45.68  E-value: 5.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1485 RERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKAR 1564
Cdd:pfam13868   12 NSKLLAAKCNKERDAQIAEKKRIKAEEKEEERRLDEMMEEERERALEEEEEKEEERKEERKRYRQELEEQIEEREQKRQE 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1565 QVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELA 1644
Cdd:pfam13868   92 EYEEKLQEREQMDEIVERIQEEDQAEAEEKLEKQRQLREEIDEFNEEQAEWKELEKEEEREEDERILEYLKEKAEREEER 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1645 LRVQAEAEAAREKQRAlqaleELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELqsehasfAEKTAQLERTLKE 1724
Cdd:pfam13868  172 EAEREEIEEEKEREIA-----RLRAQQEKAQDEKAERDELRAKLYQEEQERKERQKEREE-------AEKKARQRQELQQ 239
                          250
                   ....*....|..
gi 1920237946 1725 EHVAVVQLREEA 1736
Cdd:pfam13868  240 AREEQIELKERR 251
MARTX_Nterm NF012221
MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model ...
2410-2688 6.35e-04

MARTX multifunctional-autoprocessing repeats-in-toxin holotoxin N-terminal region; This model describes the N-terminal 1900 amino acids of MARTX family multifunctional-autoprocessing repeats-in-toxin holotoxins, which contain both repeat regions that facilitate their entry into eukaryotic target cells, and multiple effector domains.


Pssm-ID: 467957 [Multi-domain]  Cd Length: 1848  Bit Score: 46.37  E-value: 6.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2410 SVAAQEAARLRQLAEEDLAQQRALAEKmlkekmqavqeatrlkaeaellqqqkELAQEQARRLQEDKeqmAQQLAqETQG 2489
Cdd:NF012221  1538 SESSQQADAVSKHAKQDDAAQNALADK--------------------------ERAEADRQRLEQEK---QQQLA-AISG 1587
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2490 FQKTLETERQRQLEMSAEAERlrlrvaemsraqARAEEDARRFRKQAEDIGERLyrtelatqekvmlvQTLETQRQQSDR 2569
Cdd:NF012221  1588 SQSQLESTDQNALETNGQAQR------------DAILEESRAVTKELTTLAQGL--------------DALDSQATYAGE 1641
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2570 DAERLREAIAE--LEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSllqrerciEQEKAKLEQLFQ 2647
Cdd:NF012221  1642 SGDQWRNPFAGglLDRVQEQLDDAKKISGKQLADAKQRHVDNQQKVKDAVAKSEAGVAQG--------EQNQANAEQDID 1713
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|..
gi 1920237946 2648 DEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRR-QHEA 2688
Cdd:NF012221  1714 DAKADAEKRKDDALAKQNEAQQAESDANAAANDAQSRgEQDA 1755
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
2286-2574 6.61e-04

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 46.10  E-value: 6.61e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2286 KQAADAEMEKHKQFaEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAaRQRGQVEEELfslRVQMEE 2365
Cdd:COG3096    402 QQALDVQQTRAIQY-QQAVQALEKARALCGLPDLTPENAEDYLAAFRAKEQQATEEVLEL-EQKLSVADAA---RRQFEK 476
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2366 LGKLKARIEAEnralVLRDK--DSAQRLLQEEAEKMKQvaeeAARLSVAAQEAARLRQLAEEdlaQQRA--LAEKMLKEK 2441
Cdd:COG3096    477 AYELVCKIAGE----VERSQawQTARELLRRYRSQQAL----AQRLQQLRAQLAELEQRLRQ---QQNAerLLEEFCQRI 545
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2442 MQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLaQETQGFQKTLETERQRQLEMSAEAERLRlrvaEMSRA 2521
Cdd:COG3096    546 GQQLDAAEELEELLAELEAQLEELEEQAAEAVEQRSELRQQL-EQLRARIKELAARAPAWLAAQDALERLR----EQSGE 620
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1920237946 2522 QARAEEDARRFRKQAedigerLYRTELATQEKvmlvQTLETQRQQSDRDAERL 2574
Cdd:COG3096    621 ALADSQEVTAAMQQL------LEREREATVER----DELAARKQALESQIERL 663
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
2497-2742 6.62e-04

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 46.20  E-value: 6.62e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2497 ERQRQLEMSAEA-ERLRLRVAEMSRAQARAEEDARRFRKQAEdigerlYRTELATQEKVMLVQTLETQRQQsdrdAERLR 2575
Cdd:TIGR02168  176 ETERKLERTRENlDRLEDILNELERQLKSLERQAEKAERYKE------LKAELRELELALLVLRLEELREE----LEELQ 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2576 EAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEqlLQETQALQQSFLSEKDSLLQR-ERCIEQEKAKLEQLFQDEVAKAQ 2654
Cdd:TIGR02168  246 EELKEAEEELEELTAELQELEEKLEELRLEVSE--LEEEIEELQKELYALANEISRlEQQKQILRERLANLERQLEELEA 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2655 ALREeqqrqqqqmqqekqqLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAA 2734
Cdd:TIGR02168  324 QLEE---------------LESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKV 388

                   ....*...
gi 1920237946 2735 LARSEEIA 2742
Cdd:TIGR02168  389 AQLELQIA 396
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1460-1703 6.76e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 45.59  E-value: 6.76e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1460 IRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQE 1539
Cdd:COG3883     18 IQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARALYR 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1540 QKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALR-------AraeEAE 1612
Cdd:COG3883     98 SGGSVSYLDVLLGSESFSDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKaeleaakA---ELE 174
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1613 AQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVA 1692
Cdd:COG3883    175 AQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGAAGA 254
                          250
                   ....*....|.
gi 1920237946 1693 LETAQRSAEAE 1703
Cdd:COG3883    255 AGAAAGSAGAA 265
COG4995 COG4995
Uncharacterized conserved protein, contains CHAT domain [Function unknown];
2146-2597 6.91e-04

Uncharacterized conserved protein, contains CHAT domain [Function unknown];


Pssm-ID: 444019 [Multi-domain]  Cd Length: 711  Bit Score: 46.12  E-value: 6.91e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2146 EARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARER 2225
Cdd:COG4995      3 ALALLALLAALLAALALALLALALLLLLAALAAAALLLLALLALLLALAAAAAAALAAAALALALLAAAALALLLLALAL 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2226 AEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALR 2305
Cdd:COG4995     83 AALALALLAAALALALAAAALAALALLAALLALAAAAALLALLAALALLALLAALAAALAAAAAAALAAALAAAAAAAAA 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2306 QKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDK 2385
Cdd:COG4995    163 AALLALALALAAAALALLALLLAALAAALAAAAAALALLLALLLLAALAAALAAALAALLLALLALAAALLALLLLALLA 242
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2386 DSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELA 2465
Cdd:COG4995    243 LAAAAAALAAAAAALLALAAALLLLAALAALAAAAAAAALAALALAAALALAAAALALALLLAAAAAAALAALALLLLAA 322
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2466 QEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYR 2545
Cdd:COG4995    323 LLLLLAALALLALLLLLAAAALLAAALAAALALAAALALALLAALLLLLAALLALLLEALLLLLLALLAALLLLAAALLA 402
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2546 TELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQL 2597
Cdd:COG4995    403 LAAAQLLRLLLAALALLLALAAYAAARLALLALIEYIILPDRLYAFVQLYQL 454
AtpF COG0711
FoF1-type ATP synthase, membrane subunit b or b' [Energy production and conversion]; FoF1-type ...
1601-1725 7.29e-04

FoF1-type ATP synthase, membrane subunit b or b' [Energy production and conversion]; FoF1-type ATP synthase, membrane subunit b or b' is part of the Pathway/BioSystem: FoF1-type ATP synthase


Pssm-ID: 440475 [Multi-domain]  Cd Length: 152  Bit Score: 43.24  E-value: 7.29e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1601 LQALRARAEEAE---AQKRQAQEEAERLRRQVQDEtQRKRQAEAElALRVQAEAEAAREKQRALQALEelrlqaEEAERR 1677
Cdd:COG0711     26 LKALDERQEKIAdglAEAERAKEEAEAALAEYEEK-LAEARAEAA-EIIAEARKEAEAIAEEAKAEAE------AEAERI 97
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1920237946 1678 LRQAEAErarqvqvaLETAQRSAEAELQSEHASFAEKTAqlERTLKEE 1725
Cdd:COG0711     98 IAQAEAE--------IEQERAKALAELRAEVADLAVAIA--EKILGKE 135
SPEC smart00150
Spectrin repeats;
745-837 7.39e-04

Spectrin repeats;


Pssm-ID: 197544 [Multi-domain]  Cd Length: 101  Bit Score: 41.93  E-value: 7.39e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946   745 HGFVAAATKELMWLSDREEEEVGFDWSDRNTNMAAKKEGYSALMHELELKEKKIKEIQSTGDRLLREDHPARPTAESFQA 824
Cdd:smart00150    1 QQFLRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLIEEGHPDAEEIEERLE 80
                            90
                    ....*....|...
gi 1920237946   825 ALQTQWSWMLQLC 837
Cdd:smart00150   81 ELNERWEELKELA 93
PRK00409 PRK00409
recombination and DNA strand exchange inhibitor protein; Reviewed
1469-1585 7.73e-04

recombination and DNA strand exchange inhibitor protein; Reviewed


Pssm-ID: 234750 [Multi-domain]  Cd Length: 782  Bit Score: 45.97  E-value: 7.73e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1469 RMEEEERLAEQQRAEERERLAEVEA---ALEKQ-RQLAEAHAQAKAQAEREAQGLQRRMQEEVAR-----REEVAVEAQE 1539
Cdd:PRK00409   524 SLEELERELEQKAEEAEALLKEAEKlkeELEEKkEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEiikelRQLQKGGYAS 603
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 1920237946 1540 QKRS-IQEELQHLRQSSEAEIQAKARQVEAAErsRLRIEEEIRVVRL 1585
Cdd:PRK00409   604 VKAHeLIEARKRLNKANEKKEKKKKKQKEKQE--ELKVGDEVKYLSL 648
GBP_C cd16269
Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal ...
1478-1605 7.82e-04

Guanylate-binding protein, C-terminal domain; Guanylate-binding protein (GBP), C-terminal domain. Guanylate-binding proteins (GBPs) are synthesized after activation of the cell by interferons. The biochemical properties of GBPs are clearly different from those of Ras-like and heterotrimeric GTP-binding proteins. They bind guanine nucleotides with low affinity (micromolar range), are stable in their absence, and have a high turnover GTPase. In addition to binding GDP/GTP, they have the unique ability to bind GMP with equal affinity and hydrolyze GTP not only to GDP, but also to GMP. This C-terminal domain has been shown to mediate inhibition of endothelial cell proliferation by inflammatory cytokines.


Pssm-ID: 293879 [Multi-domain]  Cd Length: 291  Bit Score: 44.88  E-value: 7.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1478 EQQRAEERERLAEVEAALEKQRQLAEAHAQAKAqAEREaqglqRRMQEEVARREEVavEAQEQKRSIQEELQHLRQSSEA 1557
Cdd:cd16269    177 QSKEAEAEAILQADQALTEKEKEIEAERAKAEA-AEQE-----RKLLEEQQRELEQ--KLEDQERSYEEHLRQLKEKMEE 248
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*...
gi 1920237946 1558 EIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRggaegELQALR 1605
Cdd:cd16269    249 ERENLLKEQERALESKLKEQEALLEEGFKEQAELLQE-----EIRSLK 291
PRK12678 PRK12678
transcription termination factor Rho; Provisional
1468-1656 8.54e-04

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 45.67  E-value: 8.54e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1468 RRMEEEERLAEQQRAEERERLAEVEAALEKQRQlAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEE 1547
Cdd:PRK12678    78 RRAARAAAAARQAEQPAAEAAAAKAEAAPAARA-AAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEA 156
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1548 LQHLRQSSEAEIQAKARQVEAAERSRLRieeeiRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRR 1627
Cdd:PRK12678   157 RADAAERTEEEERDERRRRGDREDRQAE-----AERGERGRREERGRDGDDRDRRDRREQGDRREERGRRDGGDRRGRRR 231
                          170       180
                   ....*....|....*....|....*....
gi 1920237946 1628 QVQDETQRKRQAEAELALRVQAEAEAARE 1656
Cdd:PRK12678   232 RRDRRDARGDDNREDRGDRDGDDGEGRGG 260
TolA COG3064
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
1804-2185 8.80e-04

Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442298 [Multi-domain]  Cd Length: 485  Bit Score: 45.42  E-value: 8.80e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1804 KAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAEL 1883
Cdd:COG3064     20 QAEAEKRAAAEAEQKAKEEAEEERLAELEAKRQAEEEAREAKAEAEQRAAELAAEAAKKLAEAEKAAAEAEKKAAAEKAK 99
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1884 AKVRAEMEVLLASKARAEEESRSTSEKSK--QRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVL 1961
Cdd:COG3064    100 AAKEAEAAAAAEKAAAAAEKEKAEEAKRKaeEEAKRKAEEERKAAEAEAAAKAEAEAARAAAAAAAAAAAAAARAAAGAA 179
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1962 AEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVED 2041
Cdd:COG3064    180 AALVAAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAARAAAASREAALAAVEATEEAALGGAEEAADLAAVGV 259
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2042 TLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLA 2121
Cdd:COG3064    260 LGAALAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAALAAELLGAVAAEEAVLAAAAAAGALVVRGGGAASLEA 339
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 2122 AEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQ 2185
Cdd:COG3064    340 ALSLLAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLADVEEAAGAGILAAAGGGGLLGL 403
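Each hit in this listing opens with the same three-line pattern: a name and accession, an interval with its E-value, and a Pssm-ID summary line. The following is a minimal Python sketch for pulling those fields out of the text. The function name parse_hit_header and the dictionary keys are illustrative assumptions, not part of any NCBI tool, and the regular expressions are fitted only to the lines as they appear on this page.

```python
import re

# Hypothetical helper: names and keys are illustrative, not an NCBI API.
# Interval line, e.g. "1804-2185 8.80e-04"
INTERVAL_RE = re.compile(r"^(\d+)-(\d+)\s+(\S+)$")
# Summary line; the "[Multi-domain]" flag is optional (single-domain hits omit it).
PSSM_RE = re.compile(
    r"^Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s+Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)$"
)

def parse_hit_header(name_line, interval_line, pssm_line):
    """Turn the three header lines of one hit into a dictionary."""
    name, accession = name_line.split()
    m = INTERVAL_RE.match(interval_line.strip())
    p = PSSM_RE.match(pssm_line.strip())
    if m is None or p is None:
        raise ValueError("line does not match the layout seen on this page")
    start, end = int(m.group(1)), int(m.group(2))
    return {
        "name": name,
        "accession": accession,
        "start": start,
        "end": end,
        "span_length": end - start + 1,  # residues covered on the query
        "pssm_id": int(p.group(1)),
        "multi_domain": p.group(2) is not None,
        "cd_length": int(p.group(3)),
        "bit_score": float(p.group(4)),
        "evalue": float(p.group(5)),
    }

# Usage with the TolA hit above.
hit = parse_hit_header(
    "TolA COG3064",
    "1804-2185 8.80e-04",
    "Pssm-ID: 442298 [Multi-domain]  Cd Length: 485  Bit Score: 45.42  E-value: 8.80e-04",
)
print(hit["accession"], hit["span_length"], hit["bit_score"])
```

The span length is computed with inclusive coordinates, which matches how the interval column counts residues.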
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
1473-1718 9.16e-04

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 45.27  E-value: 9.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1473 EERLAEQQRaEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQ----------EEVARREEVAVEAQEQKR 1542
Cdd:pfam07888   33 QNRLEECLQ-ERAELLQAQEAANRQREKEKERYKRDREQWERQRRELESRVAelkeelrqsrEKHEELEEKYKELSASSE 111
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1543 SIQEE---LQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQ 1619
Cdd:pfam07888  112 ELSEEkdaLLAQRAAHEARIRELEEDIKTLTQRVLERETELERMKERAKKAGAQRKEEEAERKQLQAKLQQTEEELRSLS 191
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1620 EEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVAleTAQRS 1699
Cdd:pfam07888  192 KEFQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKEAENEALLEELRSLQERLNASERKVEGLGEELSSMA--AQRDR 269
                          250
                   ....*....|....*....
gi 1920237946 1700 AEAELQSEHASFAEKTAQL 1718
Cdd:pfam07888  270 TQAELHQARLQAAQLTLQL 288
FAM184 pfam15665
Family with sequence similarity 184, A and B; The function of FAM184 is not known.
1545-1735 9.20e-04


Pssm-ID: 464788 [Multi-domain]  Cd Length: 211  Bit Score: 43.88  E-value: 9.20e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1545 QEELQHLRQSSEAEIQakarqveaaersrlRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAER 1624
Cdd:pfam15665   13 EAEIQALKEAHEEEIQ--------------QILAETREKILQYKSKIGEELDLKRRIQTLEESLEQHERMKRQALTEFEQ 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1625 LRRQVQDetqRKRQAEAELALRVQAEA----EAAREKQRALQALEELRLQAE-EAERRLRQAEAERARQVQVALETaqrs 1699
Cdd:pfam15665   79 YKRRVEE---RELKAEAEHRQRVVELSreveEAKRAFEEKLESFEQLQAQFEqEKRKALEELRAKHRQEIQELLTT---- 151
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1920237946 1700 aeaeLQSEHASFAEKTAQLERTLKEEhvaVVQLREE 1735
Cdd:pfam15665  152 ----QRAQSASSLAEQEKLEELHKAE---LESLRKE 180
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
1803-2185 9.24e-04

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 45.39  E-value: 9.24e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1803 GKAEEQAVRQRELAEQELEK-----QRQLAEGTAQQRLAAEQELIRLRAE--TEQGEQQRQLLEEELARLQREAAAATQK 1875
Cdd:NF033838    50 SSGNESQKEHAKEVESHLEKilseiQKSLDKRKHTQNVALNKKLSDIKTEylYELNVLKEKSEAELTSKTKKELDAAFEQ 129
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1876 RRELEAELAKVRAEMEVLLA-----SKARAEEESRSTSEKSKQRLEAEAGRFrELAEEAARLRALAEEAKRQRqlaEEDA 1950
Cdd:NF033838   130 FKKDTLEPGKKVAEATKKVEeaekkAKDQKEEDRRNYPTNTYKTLELEIAES-DVEVKKAELELVKEEAKEPR---DEEK 205
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1951 VRQrAEAErVLAEKlaaiSEATRLKteaEIALKEKEAENERLRRLaedEAFQRRLLEEQAAQHKADIEARLAqlRKASES 2030
Cdd:NF033838   206 IKQ-AKAK-VESKK----AEATRLE---KIKTDREKAEEEAKRRA---DAKLKEAVEKNVATSEQDKPKRRA--KRGVLG 271
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2031 ELERQKGLVEDTLRQRRQVEEEIL---ALKGSFEKAAAGKAELELElGRIRGTAEDTLR-----SKEQAEQEAARqrqla 2102
Cdd:NF033838   272 EPATPDKKENDAKSSDSSVGEETLpspSLKPEKKVAEAEKKVEEAK-KKAKDQKEEDRRnyptnTYKTLELEIAE----- 345
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2103 aeEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKA---KVEEARRLRERAEQESARQLQLAQEAAQKrlQAEEKA 2179
Cdd:NF033838   346 --SDVKVKEAELELVKEEAKEPRNEEKIKQAKAKVESKKAeatRLEKIKTDRKKAEEEAKRKAAEEDKVKEK--PAEQPQ 421

                   ....*.
gi 1920237946 2180 HAFAVQ 2185
Cdd:NF033838   422 PAPAPQ 427
PLEC smart00250
Plectin repeat;
4511-4548 9.46e-04


Pssm-ID: 197605  Cd Length: 38  Bit Score: 39.39  E-value: 9.46e-04
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1920237946  4511 QRFLEVQYLTGGLIEPDTPGRVALDEALQRGTVDARTA 4548
Cdd:smart00250    1 QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
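A pairwise block such as the plectin repeat alignment above can be summarised as a residue identity count. The sketch below is one way to do that in Python, under two stated assumptions: that "-" marks a gap, and that lowercase letters mark residues left unaligned, which is how the longer blocks on this page appear to use them. The function name pairwise_identity is illustrative only.

```python
def pairwise_identity(query_row, subject_row):
    """Return (identical, aligned) over columns where both rows hold an aligned residue."""
    if len(query_row) != len(subject_row):
        raise ValueError("rows must have equal length")
    aligned = identical = 0
    for q, s in zip(query_row, subject_row):
        # Skip gap columns and lowercase residues (assumed to be unaligned).
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned

# The two sequence strings from the plectin repeat block above.
query = "QRFLEVQYLTGGLIEPDTPGRVALDEALQRGTVDARTA"
subject = "QRLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG"
identical, aligned = pairwise_identity(query, subject)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
```

For hits that span several blocks, the sequence strings of each row would need to be concatenated in order before calling the function.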
PRK12678 PRK12678
transcription termination factor Rho; Provisional
1477-1638 9.95e-04


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 45.28  E-value: 9.95e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1477 AEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSE 1556
Cdd:PRK12678    65 AAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARK 144
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1557 AEIQAKARQVEAAERSRLRIEEEirvvrlqlEATERQRGGAEGELQALRARAEEAEAQKRQAQEEaERLRRQVQDETQRK 1636
Cdd:PRK12678   145 AGEGGEQPATEARADAAERTEEE--------ERDERRRRGDREDRQAEAERGERGRREERGRDGD-DRDRRDRREQGDRR 215

                   ..
gi 1920237946 1637 RQ 1638
Cdd:PRK12678   216 EE 217
DUF4670 pfam15709
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ...
2336-2530 1.01e-03

Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.


Pssm-ID: 464815 [Multi-domain]  Cd Length: 522  Bit Score: 45.33  E-value: 1.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2336 QRLKAEVTEAARQRGQVEEElfslrvqmeelgklkarieaenralvlrdkdsaQRLLQEEAEKMKQVAEEAARLSVAAQE 2415
Cdd:pfam15709  341 ERAEMRRLEVERKRREQEEQ---------------------------------RRLQQEQLERAEKMREELELEQQRRFE 387
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2416 AARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEA---ELLQQQKELAQEQARRLQEDKE---QMAQQLAQETQG 2489
Cdd:pfam15709  388 EIRLRKQRLEEERQRQEEEERKQRLQLQAAQERARQQQEEfrrKLQELQRKKQQEEAERAEAEKQrqkELEMQLAEEQKR 467
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|.
gi 1920237946 2490 FQKTLETERQRQLEMSAEAERLRLRVAEMSRaqARAEEDAR 2530
Cdd:pfam15709  468 LMEMAEEERLEYQRQKQEAEEKARLEAEERR--QKEEEAAR 506
PRK10929 PRK10929
putative mechanosensitive channel protein; Provisional
1608-1992 1.03e-03


Pssm-ID: 236798 [Multi-domain]  Cd Length: 1109  Bit Score: 45.43  E-value: 1.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1608 AEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAR---EKQRALQALEELRLQAEEAERRLRQAEAE 1684
Cdd:PRK10929   104 TDALEQEILQVSSQLLEKSRQAQQEQDRAREISDSLSQLPQQQTEARRqlnEIERRLQTLGTPNTPLAQAQLTALQAESA 183
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1685 RARQVQVALETAQRSAE-----AELQSEhaSFAEKTAQLERTLkeehvavvqlreeatrraqqqaeaeraraeaereler 1759
Cdd:PRK10929   184 ALKALVDELELAQLSANnrqelARLRSE--LAKKRSQQLDAYL------------------------------------- 224
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1760 wqlkanEALRLRLQAEevaqqksltqaeaekqkeeaerearrrgkaeeqavRQRElAEQELEKQRQLAEGTAQQRLAAEQ 1839
Cdd:PRK10929   225 ------QALRNQLNSQ-----------------------------------RQRE-AERALESTELLAEQSGDLPKSIVA 262
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1840 ELIRLRAETEQGEQQRQLLeEELARLQREAAAATQKRREleaELAKVRAEMEVLLASKARAE----EESRSTSEKSKQRL 1915
Cdd:PRK10929   263 QFKINRELSQALNQQAQRM-DLIASQQRQAASQTLQVRQ---ALNTLREQSQWLGVSNALGEalraQVARLPEMPKPQQL 338
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1916 EAEAGRFRelaeeAARLR--ALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAA---------------ISEATRLK--- 1975
Cdd:PRK10929   339 DTEMAQLR-----VQRLRyeDLLNKQPQLRQIRQADGQPLTAEQNRILDAQLRTqrellnsllsggdtlILELTKLKvan 413
                          410
                   ....*....|....*...
gi 1920237946 1976 TEAEIALKE-KEAENERL 1992
Cdd:PRK10929   414 SQLEDALKEvNEATHRYL 431
Caldesmon pfam02029
Caldesmon;
1471-1735 1.07e-03


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 45.24  E-value: 1.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1471 EEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREA-----------QGL-----------QRRMQEEVA 1528
Cdd:pfam02029    5 EEAARERRRRAREERRRQKEEEEPSGQVTESVEPNEHNSYEEDSELkpsgqggldeeEAFldrtakreerrQKRLQEALE 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1529 RREEVAVEAQEQKRSIQEELQHlRQSSEAEIQAKARQVEaAERSRLRIEEEIRVVRL---QLEATERQRGGAEGELQALR 1605
Cdd:pfam02029   85 RQKEFDPTIADEKESVAERKEN-NEEEENSSWEKEEKRD-SRLGRYKEEETEIREKEyqeNKWSTEVRQAEEEGEEEEDK 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1606 ARAEEAEAQKRQAQEEAERLRRQVQDE---------------TQRKRQA--EAELALRVQAEAEAAREKQRALQALE-EL 1667
Cdd:pfam02029  163 SEEAEEVPTENFAKEEVKDEKIKKEKKvkyeskvfldqkrghPEVKSQNgeEEVTKLKVTTKRRQGGLSQSQEREEEaEV 242
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1668 RLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHASFAE--KTAQLERTLKEEHVAVVQLREE 1735
Cdd:pfam02029  243 FLEAEQKLEELRRRRQEKESEEFEKLRQKQQEAELELEELKKKREErrKLLEEEEQRRKQEEAERKLREE 312
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
1840-2187 1.09e-03

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 45.27  E-value: 1.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1840 ELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAElaKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEA 1919
Cdd:pfam07888   30 ELLQNRLEECLQERAELLQAQEAANRQREKEKERYKRDREQWE--RQRRELESRVAELKEELRQSREKHEELEEKYKELS 107
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1920 GRFRELAEEAARLRAlAEEAKRQRQLAEEDAVrqRAEAERVLaeklaaiseatrlkteaeialkEKEAENERLRRLAEDE 1999
Cdd:pfam07888  108 ASSEELSEEKDALLA-QRAAHEARIRELEEDI--KTLTQRVL----------------------ERETELERMKERAKKA 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2000 AFQRRLLEEQAAQHKADIEARLAQLRKASeSELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRG 2079
Cdd:pfam07888  163 GAQRKEEEAERKQLQAKLQQTEEELRSLS-KEFQELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRKEAENEALLEELRS 241
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2080 TAE-------------------DTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALE----- 2135
Cdd:pfam07888  242 LQErlnaserkveglgeelssmAAQRDRTQAELHQARLQAAQLTLQLADASLALREGRARWAQERETLQQSAEADkdrie 321
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237946 2136 ----EVERLKAKVEEARRLRERAEQESARQ--LQLAQEAAQKRLQAEEKAhAFAVQQK 2187
Cdd:pfam07888  322 klsaELQRLEERLQEERMEREKLEVELGREkdCNRVQLSESRRELQELKA-SLRVAQK 378
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
1474-1735 1.21e-03

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 45.20  E-value: 1.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1474 ERLAEQQRAEERERLAEVEaALEKQRQLAEAHAQAkaqaereaqgLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRq 1553
Cdd:pfam10174  453 ERLKEQREREDRERLEELE-SLKKENKDLKEKVSA----------LQPELTEKESSLIDLKEHASSLASSGLKKDSKLK- 520
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1554 SSEAEIQA-------------KARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQK----- 1615
Cdd:pfam10174  521 SLEIAVEQkkeecsklenqlkKAHNAEEAVRTNPEINDRIRLLEQEVARYKEESGKAQAEVERLLGILREVENEKndkdk 600
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1616 ----------RQAQEEAERLR--RQVQDETQRKRQAEAELALRVQ-AEAEAAREKQRA--LQALEELRLQAEEAERRLRQ 1680
Cdd:pfam10174  601 kiaelesltlRQMKEQNKKVAniKHGQQEMKKKGAQLLEEARRREdNLADNSQQLQLEelMGALEKTRQELDATKARLSS 680
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237946 1681 AE--------------AERARQVQVALETAQRSAEAELQSEHASFA--EKTAQLERTLKEEhvaVVQLREE 1735
Cdd:pfam10174  681 TQqslaekdghltnlrAERRKQLEEILEMKQEALLAAISEKDANIAllELSSSKKKKTQEE---VMALKRE 748
PRK12704 PRK12704
phosphodiesterase; Provisional
1905-2053 1.24e-03


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 44.77  E-value: 1.24e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1905 RSTSEKSKQRLEAEAGRFRELAEEAArlralaEEAKRQRQL-AEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALK 1983
Cdd:PRK12704    26 KKIAEAKIKEAEEEAKRILEEAKKEA------EAIKKEALLeAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLD 99
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 1984 EKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKasesELERQKGLVEDTLRQR--RQVEEEI 2053
Cdd:PRK12704   100 RKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQ----ELERISGLTAEEAKEIllEKVEEEA 167
PLEC smart00250
Plectin repeat;
2892-2928 1.32e-03


Pssm-ID: 197605  Cd Length: 38  Bit Score: 39.00  E-value: 1.32e-03
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  2892 KLLSAERAVTGYKDPYTGEQISLFQAMKKDLIVREHG 2928
Cdd:smart00250    2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
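The numbers at either end of a "Cdd:" row give the positions on the domain model, so together with the Cd Length they indicate how much of the model a hit covers. A small Python sketch of that arithmetic follows. The names parse_alignment_row and model_coverage are hypothetical, and the row pattern is fitted only to the "gi ..." and "Cdd:..." rows as printed on this page.

```python
import re

# Row layout assumed from this page: label, start position, residues, end position.
ROW_RE = re.compile(r"^(gi \d+|Cdd:\S+)\s+(\d+)\s+(\S+)\s+(\d+)$")

def parse_alignment_row(line):
    """Split one alignment row into (label, start, residues, end)."""
    m = ROW_RE.match(line.strip())
    if m is None:
        raise ValueError("not an alignment row in the layout seen on this page")
    label, start, residues, end = m.groups()
    return label, int(start), residues, int(end)

def model_coverage(model_start, model_end, cd_length):
    """Fraction of the domain model spanned by the hit (inclusive coordinates)."""
    return (model_end - model_start + 1) / cd_length

# The model row of the plectin repeat hit above; its Cd Length is 38.
row = "Cdd:smart00250    2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38"
label, start, residues, end = parse_alignment_row(row)
coverage = model_coverage(start, end, 38)
print(f"{label}: model positions {start}-{end}, coverage {coverage:.3f}")
```

For a hit printed across several blocks, the start would come from the first "Cdd:" row and the end from the last.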
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
2303-2736 1.35e-03

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 45.20  E-value: 1.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2303 ALRQKAQVEQ-ELTALRLQLEEtdhQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEaeNRALV 2381
Cdd:pfam10174  335 AKEQRAAILQtEVDALRLRLEE---KESFLNKKTKQLQDLTEEKSTLAGEIRDLKDMLDVKERKINVLQKKIE--NLQEQ 409
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2382 LRDKDsaqRLLQEEAEKMKQVAEEAARLSVA---AQEAARLRQLAEEDLAQQRALAEKMLKEKM-QAVQEATRLKAEAEL 2457
Cdd:pfam10174  410 LRDKD---KQLAGLKERVKSLQTDSSNTDTAlttLEEALSEKERIIERLKEQREREDRERLEELeSLKKENKDLKEKVSA 486
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2458 LQQQKelaQEQARRLQEDKEQmAQQLAQ---ETQGFQKTLETERQRQLEmsaEAERLrlrVAEMSRAQaRAEEDARrfrk 2534
Cdd:pfam10174  487 LQPEL---TEKESSLIDLKEH-ASSLASsglKKDSKLKSLEIAVEQKKE---ECSKL---ENQLKKAH-NAEEAVR---- 551
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2535 QAEDIGERLYRTELATQEKVMlvqtlETQRQQSDrdAERLREAIAELEHEK-DKLKQEAQLLQLKSEEMQTvrQEQLLQE 2613
Cdd:pfam10174  552 TNPEINDRIRLLEQEVARYKE-----ESGKAQAE--VERLLGILREVENEKnDKDKKIAELESLTLRQMKE--QNKKVAN 622
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2614 TQALQQsflsekdsllqrercieQEKAKLEQLFQDevakaqALREEQQRQQQQMQQEKQQLAASMEEARRrqhEAEEGVR 2693
Cdd:pfam10174  623 IKHGQQ-----------------EMKKKGAQLLEE------ARRREDNLADNSQQLQLEELMGALEKTRQ---ELDATKA 676
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|...
gi 1920237946 2694 RQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRAALA 2736
Cdd:pfam10174  677 RLSSTQQSLAEKDGHLTNLRAERRKQLEEILEMKQEALLAAIS 719
PRK05035 PRK05035
electron transport complex protein RnfC; Provisional
1549-1837 1.36e-03

electron transport complex protein RnfC; Provisional


Pssm-ID: 235334 [Multi-domain]  Cd Length: 695  Bit Score: 44.94  E-value: 1.36e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1549 QHLRQSsEAEIQAKARQVEAAERSRLRIEEeirvvrlqleateRQrggaegelqalrARAEeaeaqkRQAQEEAERLRRQ 1628
Cdd:PRK05035   429 QYYRQA-KAEIRAIEQEKKKAEEAKARFEA-------------RQ------------ARLE------REKAAREARHKKA 476
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1629 VQDETQRKRQAEAELALRVQAEAEAAREK----------QRALQALEELR-LQAEEAERRLRQAEAERARQVQVALETAQ 1697
Cdd:PRK05035   477 AEARAAKDKDAVAAALARVKAKKAAATQPivikagarpdNSAVIAAREARkAQARARQAEKQAAAAADPKKAAVAAAIAR 556
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1698 RSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERElerwqlkANEALRLRLQAEEV 1777
Cdd:PRK05035   557 AKAKKAAQQAANAEAEEEVDPKKAAVAAAIARAKAKKAAQQAASAEPEEQVAEVDPKKA-------AVAAAIARAKAKKA 629
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1778 AQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAA 1837
Cdd:PRK05035   630 EQQANAEPEEPVDPRKAAVAAAIARAKARKAAQQQANAEPEEAEDPKKAAVAAAIARAKA 689
Rabaptin pfam03528
Rabaptin;
2402-2645 1.52e-03


Pssm-ID: 367545 [Multi-domain]  Cd Length: 486  Bit Score: 44.71  E-value: 1.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2402 VAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKML--KEKMQAVQEATRLKAEAELLQQQKELAQEQAR--------- 2470
Cdd:pfam03528    6 LQQRVAELEKENAEFYRLKQQLEAEFNQKRAKFKELYlaKEEDLKRQNAVLQEAQVELDALQNQLALARAEmenikavat 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2471 ----RLQEDKEQMAQQLAQETQGFQKTL-ETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRfrkqaedigerlyR 2545
Cdd:pfam03528   86 vsenTKQEAIDEVKSQWQEEVASLQAIMkETVREYEVQFHRRLEQERAQWNQYRESAEREIADLRR-------------R 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2546 TELATQEkvmlvQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQA--------- 2616
Cdd:pfam03528  153 LSEGQEE-----ENLEDEMKKAQEDAEKLRSVVMPMEKEIAALKAKLTEAEDKIKELEASKMKELNHYLEAekscrtdle 227
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 1920237946 2617 -------LQQSFLSEKDSLLQRE-----RCIEQEKAKLEQL 2645
Cdd:pfam03528  228 myvavlnTQKSVLQEDAEKLRKElhevcHLLEQERQQHNQL 268
PRK12705 PRK12705
hypothetical protein; Provisional
1495-1701 1.56e-03


Pssm-ID: 237178 [Multi-domain]  Cd Length: 508  Bit Score: 44.70  E-value: 1.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1495 LEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEvaRREEVAVEAQEQKRSIQEELQHLrQSSEAEIQAKARQVEAaersrl 1574
Cdd:PRK12705    25 LKKRQRLAKEAERILQEAQKEAEEKLEAALLE--AKELLLRERNQQRQEARREREEL-QREEERLVQKEEQLDA------ 95
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1575 RIEEeirvvrlqLEATERQRGGAEgelQALRARAEEAEAQKRQAQEEAERLrrqvqdETQRKRQAEAELALRVQAEAEaa 1654
Cdd:PRK12705    96 RAEK--------LDNLENQLEERE---KALSARELELEELEKQLDNELYRV------AGLTPEQARKLLLKLLDAELE-- 156
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*..
gi 1920237946 1655 REKQRALQALEelrlqaEEAerrlrQAEAERARQVQVAlETAQRSAE 1701
Cdd:PRK12705   157 EEKAQRVKKIE------EEA-----DLEAERKAQNILA-QAMQRIAS 191
COG4995 COG4995
Uncharacterized conserved protein, contains CHAT domain [Function unknown];
2162-2597 1.58e-03


Pssm-ID: 444019 [Multi-domain]  Cd Length: 711  Bit Score: 44.96  E-value: 1.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2162 LQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEAE 2241
Cdd:COG4995      1 LLALALLALLAALLAALALALLALALLLLLAALAAAALLLLALLALLLALAAAAAAALAAAALALALLAAAALALLLLAL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2242 RLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQL 2321
Cdd:COG4995     81 ALAALALALLAAALALALAAAALAALALLAALLALAAAAALLALLAALALLALLAALAAALAAAAAAALAAALAAAAAAA 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2322 EETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQ 2401
Cdd:COG4995    161 AAAALLALALALAAAALALLALLLAALAAALAAAAAALALLLALLLLAALAAALAAALAALLLALLALAAALLALLLLAL 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2402 VAEEAARLSvAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQ 2481
Cdd:COG4995    241 LALAAAAAA-LAAAAAALLALAAALLLLAALAALAAAAAAAALAALALAAALALAAAALALALLLAAAAAAALAALALLL 319
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2482 QLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEM-SRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTL 2560
Cdd:COG4995    320 LAALLLLLAALALLALLLLLAAAALLAAALAAALALAaALALALLAALLLLLAALLALLLEALLLLLLALLAALLLLAAA 399
                          410       420       430
                   ....*....|....*....|....*....|....*..
gi 1920237946 2561 ETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQL 2597
Cdd:COG4995    400 LLALAAAQLLRLLLAALALLLALAAYAAARLALLALI 436
CH_FIMB_rpt1 cd21294
first calponin homology (CH) domain found in Saccharomyces cerevisiae fimbrin and similar ...
180-283 1.59e-03

first calponin homology (CH) domain found in Saccharomyces cerevisiae fimbrin and similar proteins; Fimbrin binds to actin, and functionally associates with actin structures involved in the development and maintenance of cell polarity. Members of this family contain four copies of the CH domain. This model corresponds to the first CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409143  Cd Length: 125  Bit Score: 41.66  E-value: 1.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  180 DERDRVQkktFTKWVNK---------HLIKAQRHISDLYEDLRDGHNLISLLEvlsgDSLP-------------REKGRM 237
Cdd:cd21294      4 NEDERRE---FTKHINAvlagdpdvgSRLPFPTDTFQLFDECKDGLVLSKLIN----DSVPdtidervlnkpprKNKPLN 76
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 1920237946  238 RFHKLQNVQIALDYLRHRQVKLVNIRNDDIADGNPKLTLGLIWTII 283
Cdd:cd21294     77 NFQMIENNNIVINSAKAIGCSVVNIGAGDIIEGREHLILGLIWQII 122
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
2283-2447 1.60e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 43.37  E-value: 1.60e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2283 LRQKQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQ 2362
Cdd:COG1579      9 LLDLQELDSELDRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNN 88
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2363 mEELGKLKARIEAENRALVLRDkDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKM 2442
Cdd:COG1579     89 -KEYEALQKEIESLKRRISDLE-DEILELMERIEELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAERE 166

                   ....*
gi 1920237946 2443 QAVQE 2447
Cdd:COG1579    167 ELAAK 171
HCR pfam07111
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...
1519-2184 1.61e-03

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.


Pssm-ID: 284517 [Multi-domain]  Cd Length: 749  Bit Score: 44.74  E-value: 1.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1519 LQRRMQeevARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRL---RIEEEIRVVR-LQLEATERQR 1594
Cdd:pfam07111   21 LERRLD---TQRPTVTMWEQDVSGDGQGPGRRGRSLELEGSQALSQQAELISRQLQelrRLEEEVRLLReTSLQQKMRLE 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1595 GGA-EGELQALRARAEEAEAQK-RQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKqralqALEELRLQAE 1672
Cdd:pfam07111   98 AQAmELDALAVAEKAGQAEAEGlRAALAGAEMVRKNLEEGSQRELEEIQRLHQEQLSSLTQAHEE-----ALSSLTSKAE 172
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1673 EAERRLRQAEAERARQVQvALETAQRsaEAELQSEHASFAEKTAQLERTLKEEHVAVV--QLREEATRRAQQQAEAERAR 1750
Cdd:pfam07111  173 GLEKSLNSLETKRAGEAK-QLAEAQK--EAELLRKQLSKTQEELEAQVTLVESLRKYVgeQVPPEVHSQTWELERQELLD 249
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1751 AEAERELERWQLKAN-EALRLRLQaeevaqqkSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEG 1829
Cdd:pfam07111  250 TMQHLQEDRADLQATvELLQVRVQ--------SLTHMLALQEEELTRKIQPSDSLEPEFPKKCRSLLNRWREKVFALMVQ 321
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1830 TAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQReaaAATQKRRELEAELAKVRA-EMEVLLASKARAEEESRSTS 1908
Cdd:pfam07111  322 LKAQDLEHRDSVKQLRGQVAELQEQVTSQSQEQAILQR---ALQDKAAEVEVERMSAKGlQMELSRAQEARRRQQQQTAS 398
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1909 EKSKQRLEAEAGRFRELAEEAARLRaLAEEAKRQRQLAEE--DAVRQRAEAERVLAEKLAAIS---EATRLKTEAEIALK 1983
Cdd:pfam07111  399 AEEQLKFVVNAMSSTQIWLETTMTR-VEQAVARIPSLSNRlsYAVRKVHTIKGLMARKVALAQlrqESCPPPPPAPPVDA 477
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1984 EKEAENERLRRlaedeafQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKA 2063
Cdd:pfam07111  478 DLSLELEQLRE-------ERNRLDAELQLSAHLIQQEVGRAREQGEAERQQLSEVAQQLEQELQRAQESLASVGQQLEVA 550
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2064 AAGKAELELELGRIRGTAEDTLRSKEQAEQEA-----ARQRQLAAEEERRRREAEERVQK---SLAAEEEAARQRKAALE 2135
Cdd:pfam07111  551 RQGQQESTEEAASLRQELTQQQEIYGQALQEKvaeveTRLREQLSDTKRRLNEARREQAKavvSLRQIQHRATQEKERNQ 630
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2136 EVERLK--AKVEEARRLRERAEQ-ESARQLQLAQEAAQKRLQAEEKAHAFAV 2184
Cdd:pfam07111  631 ELRRLQdeARKEEGQRLARRVQElERDKNLMLATLQQEGLLSRYKQQRLLAV 682
Rabaptin pfam03528
Rabaptin;
1465-1737 1.62e-03

Rabaptin;


Pssm-ID: 367545 [Multi-domain]  Cd Length: 486  Bit Score: 44.71  E-value: 1.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1465 ETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEaHAQAKAQAEREAQGLQRRMQEEvarREEVAVEAQEQKrsI 1544
Cdd:pfam03528   96 DEVKSQWQEEVASLQAIMKETVREYEVQFHRRLEQERAQ-WNQYRESAEREIADLRRRLSEG---QEEENLEDEMKK--A 169
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1545 QEELQHLRQ---SSEAEIQA-KARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALraraeeaEAQKRQAQE 1620
Cdd:pfam03528  170 QEDAEKLRSvvmPMEKEIAAlKAKLTEAEDKIKELEASKMKELNHYLEAEKSCRTDLEMYVAVL-------NTQKSVLQE 242
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1621 EAERLRRQVQDETQRkrqaeaeLALRVQAEAEAAREKQRAL-QALEELRLQAEEAERRLRQAEAERARQVqvalETAQRS 1699
Cdd:pfam03528  243 DAEKLRKELHEVCHL-------LEQERQQHNQLKHTWQKANdQFLESQRLLMRDMQRMESVLTSEQLRQV----EEIKKK 311
                          250       260       270
                   ....*....|....*....|....*....|....*...
gi 1920237946 1700 AEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEAT 1737
Cdd:pfam03528  312 DQEEHKRARTHKEKETLKSDREHTVSIHAVFSPAGVET 349
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1855-2167 1.66e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 44.12  E-value: 1.66e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1855 RQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESrstsEKSKQRLEAEAGRFRELAEEAARLRA 1934
Cdd:COG4372     19 RPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEEL----EQARSELEQLEEELEELNEQLQAAQA 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1935 LAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHK 2014
Cdd:COG4372     95 ELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQ 174
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2015 ADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQE 2094
Cdd:COG4372    175 ALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEE 254
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237946 2095 AARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQE 2167
Cdd:COG4372    255 VILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKK 327
PRK10920 PRK10920
putative uroporphyrinogen III C-methyltransferase; Provisional
2446-2524 1.66e-03

putative uroporphyrinogen III C-methyltransferase; Provisional


Pssm-ID: 236795  Cd Length: 390  Bit Score: 44.32  E-value: 1.66e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 2446 QEATRLKAEAELLQQQKELAQEQArrlQEDKEQMAQQLAQETqgfqKTLETERQRQLEMSAEAERLRLRVAEMSRAQAR 2524
Cdd:PRK10920    60 QQAQNQTATNDALANQLTALQKAQ---ESQKQELEGILKQQA----KALDQANRQQAALAKQLDELQQKVATISGSDAK 131
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
2313-2656 1.67e-03

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 44.63  E-value: 1.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2313 ELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELG----KLKARIEA-ENralvlrDKDS 2387
Cdd:TIGR04523  104 DLSKINSEIKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNnkynDLKKQKEElEN------ELNL 177
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2388 AQRLLQEEAEKMKQVAEEAAR----LSVAAQEAARLRQLAEE--DLAQQRALAEKMLKEKMQAVQEatrLKAEAELLQQQ 2461
Cdd:TIGR04523  178 LEKEKLNIQKNIDKIKNKLLKlellLSNLKKKIQKNKSLESQisELKKQNNQLKDNIEKKQQEINE---KTTEISNTQTQ 254
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2462 -KELAQEQarrlQEDKEQMAQQLAQETQGFQKTLETERQRQlEMSAEAERLRlrvaemsraqaraeedarrfRKQAEDIG 2540
Cdd:TIGR04523  255 lNQLKDEQ----NKIKKQLSEKQKELEQNNKKIKELEKQLN-QLKSEISDLN--------------------NQKEQDWN 309
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2541 ERLyRTELATQEKVmlVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQlkseemqtvrqEQLLQETQALQQs 2620
Cdd:TIGR04523  310 KEL-KSELKNQEKK--LEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQ-----------RELEEKQNEIEK- 374
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 1920237946 2621 FLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQAL 2656
Cdd:TIGR04523  375 LKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQK 410
SPEC smart00150
Spectrin repeats;
648-742 1.69e-03

Spectrin repeats;


Pssm-ID: 197544 [Multi-domain]  Cd Length: 101  Bit Score: 40.78  E-value: 1.69e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946   648 LRYLQDLLAWVEENQRRLDSAEWGVDLPSVEAQLGSHRGLHQSVEEFRTKIERARTDEGQL---SPATRGAYRDCLGRLD 724
Cdd:smart00150    4 LRDADELEAWLEEKEQLLASEDLGKDLESVEALLKKHEAFEAELEAHEERVEALNELGEQLieeGHPDAEEIEERLEELN 83
                            90
                    ....*....|....*...
gi 1920237946   725 LQYAKLLSSSKARLRSLE 742
Cdd:smart00150   84 ERWEELKELAEERRQKLE 101
Tektin pfam03148
Tektin family; Tektins are cytoskeletal proteins. They have been demonstrated in such cellular ...
1821-2094 1.70e-03

Tektin family; Tektins are cytoskeletal proteins. They have been demonstrated in such cellular sites as centrioles, basal bodies, and along ciliary and flagellar doublet microtubules. Tektins form unique protofilaments, organized as longitudinal polymers of tektin heterodimers with axial periodicity matching tubulin. Tektin polypeptides consist of several alpha-helical regions that are predicted to form coiled coils. Indeed, tektins share considerable structural similarities with intermediate filament proteins. Possible functional roles for tektins are: stabilization of tubulin protofilaments; attachment of A and B-tubules in ciliary/flagellar microtubule doublets and C-tubules in centrioles; binding of axonemal components.


Pssm-ID: 460827 [Multi-domain]  Cd Length: 383  Bit Score: 44.08  E-value: 1.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1821 EKQRQLAEgtaQQRLAAE---QELIRLRAETEQGEQQRQllEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASK 1897
Cdd:pfam03148    6 QELYREAE---AQRNDAErlrQESRRLRNETDAKTKWDQ--YDSNRRLGERIQDITFWKSELEKELEELDEEIELLLEEK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1898 ARAEEESRSTSEK---SKQRLEAEAGRF----------RELAEEA-------ARLRALAEEAkrQRQLAEEDAVRQRAEA 1957
Cdd:pfam03148   81 RRLEKALEALEEPlhiAQECLTLREKRQgidlvhdeveKELLKEVeliegiqELLQRTLEQA--WEQLRLLRAARHKLEK 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1958 ErvLAEKLAAI---SEATRLK-TEAEIALKEKEAENERLRRLAED-EAFQRRLLE--EQAAQHKADIEARLAQLRKASES 2030
Cdd:pfam03148  159 D--LSDKKEALeidEKCLSLNnTSPNISYKPGPTRIPPNSSTPEEwEKFTQDNIEraEKERAASAQLRELIDSILEQTAN 236
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 2031 ELERQKGLVEDTLRQRrqVEEeilalkgsFEKAaagKAELELELGRIRgtaedtlrsKEQAEQE 2094
Cdd:pfam03148  237 DLRAQADAVNFALRKR--IEE--------TEDA---KNKLEWQLKKTL---------QEIAELE 278
TolA COG3064
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
1586-2049 2.00e-03

Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442298 [Multi-domain]  Cd Length: 485  Bit Score: 44.26  E-value: 2.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1586 QLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALE 1665
Cdd:COG3064     17 RLEQAEAEKRAAAEAEQKAKEEAEEERLAELEAKRQAEEEAREAKAEAEQRAAELAAEAAKKLAEAEKAAAEAEKKAAAE 96
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1666 ELRLQAEEAERRLRQAEAERARQVQValETAQRSAE--AELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQ 1743
Cdd:COG3064     97 KAKAAKEAEAAAAAEKAAAAAEKEKA--EEAKRKAEeeAKRKAEEERKAAEAEAAAKAEAEAARAAAAAAAAAAAAAARA 174
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1744 AEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQ 1823
Cdd:COG3064    175 AAGAAAALVAAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAARAAAASREAALAAVEATEEAALGGAEEAADL 254
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1824 RQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEE 1903
Cdd:COG3064    255 AAVGVLGAALAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAALAAELLGAVAAEEAVLAAAAAAGALVVRGGGA 334
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1904 SRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALK 1983
Cdd:COG3064    335 ASLEAALSLLAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLADVEEAAGAGILAAAGGGGLLGLRLDLGAALLEA 414
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 1984 EKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQV 2049
Cdd:COG3064    415 ASAVELRVLLALAGAAGAVVALLVKLVADLAGGLVGIGKALTGDADALLGILKAVALDGGAVLADL 480
PRK09039 PRK09039
peptidoglycan-binding protein;
1586-1701 2.05e-03

peptidoglycan-binding protein;


Pssm-ID: 181619 [Multi-domain]  Cd Length: 343  Bit Score: 43.80  E-value: 2.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1586 QLEATERQR-GGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQAL 1664
Cdd:PRK09039    67 DLLSLERQGnQDLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQELDSEKQVSARALAQVELLNQQI 146
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 1920237946 1665 EELR---------LQAEEAERRLRQAE-AERARQVQVALetAQRSAE 1701
Cdd:PRK09039   147 AALRrqlaaleaaLDASEKRDRESQAKiADLGRRLNVAL--AQRVQE 191
PRK00409 PRK00409
recombination and DNA strand exchange inhibitor protein; Reviewed
1601-1725 2.06e-03

recombination and DNA strand exchange inhibitor protein; Reviewed


Pssm-ID: 234750 [Multi-domain]  Cd Length: 782  Bit Score: 44.43  E-value: 2.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1601 LQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEaarekqralQALEELRLQAEEAERRLRQ 1680
Cdd:PRK00409   525 LEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQ---------QAIKEAKKEADEIIKELRQ 595
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 1920237946 1681 AEAERARQVqvaletaqrsAEAELQSEHASFAEKTAQLERTLKEE 1725
Cdd:PRK00409   596 LQKGGYASV----------KAHELIEARKRLNKANEKKEKKKKKQ 630
Crescentin pfam19220
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament ...
1246-1656 2.08e-03

Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament proteins, named crescentin, whose cytoskeletal function is required for the vibrioid and helical shapes of Caulobacter crescentus. Without crescentin, the cells adopt a straight-rod morphology. Crescentin has characteristic features of IF proteins including the ability to assemble into filaments in vitro without energy or cofactor requirements. In vivo, crescentin forms a helical structure that colocalizes with the inner cell curvatures beneath the cytoplasmic membrane.


Pssm-ID: 437057 [Multi-domain]  Cd Length: 401  Bit Score: 43.90  E-value: 2.08e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1246 ATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGaqevgerLQQRHGERDVEVERWRERvtllLERWQAVLAQTDVRQ 1325
Cdd:pfam19220   38 AILRELPQAKSRLLELEALLAQERAAYGKLRRELAG-------LTRRLSAAEGELEELVAR----LAKLEAALREAEAAK 106
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1326 RELEQLGRQLRYYRESADplgawlRDAKQRQEQIQAVPLANsQAVREQLRQEKALLEDIERHGEKVEECQRFAKQyinai 1405
Cdd:pfam19220  107 EELRIELRDKTAQAEALE------RQLAAETEQNRALEEEN-KALREEAQAAEKALQRAEGELATARERLALLEQ----- 174
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1406 kdyelqlvtykaqlepvaspaKKPKVQSGSESIIQEYVDLRTRYSELSTL---TSQYIRFISETLRRMEEEERLAEQQRA 1482
Cdd:pfam19220  175 ---------------------ENRRLQALSEEQAAELAELTRRLAELETQldaTRARLRALEGQLAAEQAERERAEAQLE 233
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1483 EERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLrqssEAEIQAK 1562
Cdd:pfam19220  234 EAVEAHRAERASLRMKLEALTARAAATEQLLAEARNQLRDRDEAIRAAERRLKEASIERDTLERRLAGL----EADLERR 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1563 ARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRqaeAE 1642
Cdd:pfam19220  310 TQQFQEMQRARAELEERAEMLTKALAAKDAALERAEERIASLSDRIAELTKRFEVERAALEQANRRLKEELQRER---AE 386
                          410
                   ....*....|....
gi 1920237946 1643 LALrVQAEAEAARE 1656
Cdd:pfam19220  387 RAL-AQGALEIARE 399
Borrelia_P83 pfam05262
Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins.
1807-1951 2.18e-03

Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins.


Pssm-ID: 114011 [Multi-domain]  Cd Length: 489  Bit Score: 44.22  E-value: 2.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1807 EQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV 1886
Cdd:pfam05262  209 QEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQI 288
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237946 1887 RAEM---EVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAV 1951
Cdd:pfam05262  289 EIKKndeEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPTSLNEDAI 356
mukB PRK04863
chromosome partition protein MukB;
2400-2741 2.31e-03

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 44.56  E-value: 2.31e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2400 KQVAEEAARLSVAA--QEAARLRQLAEEDLAQQRALAEKmlkekmqavqEATRLKAEAELLQQQKELAQEQARR--LQED 2475
Cdd:PRK04863   260 KHLITESTNYVAADymRHANERRVHLEEALELRRELYTS----------RRQLAAEQYRLVEMARELAELNEAEsdLEQD 329
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2476 KEQMA--QQLAQETQGFQKTLE------TERQRQLEMSAEAERLRLRVAEMSRAQAR-AEEDARRFRKQAEDIGERL--- 2543
Cdd:PRK04863   330 YQAASdhLNLVQTALRQQEKIEryqadlEELEERLEEQNEVVEEADEQQEENEARAEaAEEEVDELKSQLADYQQALdvq 409
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2544 YRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEaqLLQLksEEMQTVRQEQLLQETQALQ--QSF 2621
Cdd:PRK04863   410 QTRAIQYQQAVQALERAKQLCGLPDLTADNAEDWLEEFQAKEQEATEE--LLSL--EQKLSVAQAAHSQFEQAYQlvRKI 485
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2622 LSEKDsllqRERCIEQEKAKLEQL--FQDEVAKAQALReeqqrqqqqmqqekqqlaASMEEARRRQHEAEEGVRRQQEEL 2699
Cdd:PRK04863   486 AGEVS----RSEAWDVARELLRRLreQRHLAEQLQQLR------------------MRLSELEQRLRQQQRAERLLAEFC 543
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|..
gi 1920237946 2700 QRLAQQQQQQEkLLAEENQRLRERLQHLEEERRAALARSEEI 2741
Cdd:PRK04863   544 KRLGKNLDDED-ELEQLQEELEARLESLSESVSEARERRMAL 584
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixtrue of two different families of ...
1458-1652 2.32e-03

Trichohyalin-plectin-homology domain; This family is a mixture of two different families of eukaryotic proteins. Trichoplein, or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to reduction in mitochondrial motility whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 43.75  E-value: 2.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1458 QYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQlaeahaqakAQAEREAQgLQRRMQEEVARREEVAVEA 1537
Cdd:pfam13868  156 RILEYLKEKAEREEEREAEREEIEEEKEREIARLRAQQEKAQD---------EKAERDEL-RAKLYQEEQERKERQKERE 225
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1538 QEQKRsiQEELQHLRQSSEAEIQAKARQvEAAERSRLRIEEEiRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQ 1617
Cdd:pfam13868  226 EAEKK--ARQRQELQQAREEQIELKERR-LAEEAEREEEEFE-RMLRKQAEDEEIEQEEAEKRRMKRLEHRRELEKQIEE 301
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 1920237946 1618 AQEEAERLRRQVQDETQRKRQAEAELALRVQAEAE 1652
Cdd:pfam13868  302 REEQRAAEREEELEEGERLREEEAERRERIEEERQ 336
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
2444-2698 2.37e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 43.67  E-value: 2.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2444 AVQEATRLKAEAELLQQQKEL--AQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRA 2521
Cdd:COG3883      5 ALAAPTPAFADPQIQAKQKELseLQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEER 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2522 QARAEEDARRFRKQ------------AEDIGE---RLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKD 2586
Cdd:COG3883     85 REELGERARALYRSggsvsyldvllgSESFSDfldRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKA 164
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2587 KLKQEAQLLQLKSEEmqtvrQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQ 2666
Cdd:COG3883    165 ELEAAKAELEAQQAE-----QEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 239
                          250       260       270
                   ....*....|....*....|....*....|..
gi 1920237946 2667 MQQEKQQLAASMEEARRRQHEAEEGVRRQQEE 2698
Cdd:COG3883    240 AAAAASAAGAGAAGAAGAAAGSAGAAGAAAGA 271
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
2558-2741 2.43e-03

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 44.34  E-value: 2.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2558 QTLETQRQQSDR----DAERLREAIAELEHEKDKLKqeaqllqlKSEEMQTVRQEQLLQETQ--ALQQSFLSEKDSLLQR 2631
Cdd:pfam17380  281 QKAVSERQQQEKfekmEQERLRQEKEEKAREVERRR--------KLEEAEKARQAEMDRQAAiyAEQERMAMERERELER 352
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2632 ERcIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEK 2711
Cdd:pfam17380  353 IR-QEERKRELERIRQEEIAMEISRMRELERLQMERQQKNERVRQELEAARKVKILEEERQRKIQQQKVEMEQIRAEQEE 431
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1920237946 2712 LLAEENQRL-RERLQHLEEERRAALARSEEI 2741
Cdd:pfam17380  432 ARQREVRRLeEERAREMERVRLEEQERQQQV 462
Caldesmon pfam02029
Caldesmon;
2443-2740 2.47e-03

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 44.09  E-value: 2.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2443 QAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRvaemsrAQ 2522
Cdd:pfam02029    6 EAARERRRRAREERRRQKEEEEPSGQVTESVEPNEHNSYEEDSELKPSGQGGLDEEEAFLDRTAKREERRQK------RL 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2523 ARAEEDARRFRKQAEDIGErlyrtELATQEKVMLVQTLETQRQQSDRDAERLREAIAELE------------------HE 2584
Cdd:pfam02029   80 QEALERQKEFDPTIADEKE-----SVAERKENNEEEENSSWEKEEKRDSRLGRYKEEETEirekeyqenkwstevrqaEE 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2585 KDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQ 2664
Cdd:pfam02029  155 EGEEEEDKSEEAEEVPTENFAKEEVKDEKIKKEKKVKYESKVFLDQKRGHPEVKSQNGEEEVTKLKVTTKRRQGGLSQSQ 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2665 QQMQQEKQQLAA--SMEEARRRQHEAEEgvrrqqEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEERRA-------AL 2735
Cdd:pfam02029  235 EREEEAEVFLEAeqKLEELRRRRQEKES------EEFEKLRQKQQEAELELEELKKKREERRKLLEEEEQRrkqeeaeRK 308

                   ....*
gi 1920237946 2736 ARSEE 2740
Cdd:pfam02029  309 LREEE 313
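Alignment blocks such as the one above can be summarized programmatically. The following is a minimal sketch, not part of the original report: the helper names `parse_row` and `count_identities` are hypothetical, and it assumes that each alignment row ends with "start sequence end" and that lowercase letters and `-` mark positions that are not aligned to the model.

```python
def parse_row(row):
    """Split one alignment row into (start, sequence, end).

    Assumes the last three whitespace-separated tokens are the start
    coordinate, the aligned sequence, and the end coordinate.
    """
    tokens = row.split()
    return int(tokens[-3]), tokens[-2], int(tokens[-1])


def count_identities(query_row, subject_row):
    """Return (identical, aligned) column counts for one alignment block.

    Only columns where both rows carry an uppercase residue are counted
    as aligned; gaps ('-') and lowercase residues are skipped.
    """
    _, query_seq, _ = parse_row(query_row)
    _, subject_seq, _ = parse_row(subject_row)
    if len(query_seq) != len(subject_seq):
        raise ValueError("alignment rows differ in length")
    aligned = [
        (q, s)
        for q, s in zip(query_seq, subject_seq)
        if q.isupper() and s.isupper()
    ]
    identical = sum(1 for q, s in aligned if q == s)
    return identical, len(aligned)


if __name__ == "__main__":
    # Final block of the Caldesmon (pfam02029) alignment.
    print(count_identities(
        "gi 1920237946 2736 ARSEE 2740",
        "Cdd:pfam02029  309 LREEE 313",
    ))  # (3, 5)
```

Summing the two counts over all blocks of a hit gives a rough percent identity for the aligned region; this is only an illustration and is not how the reported bit scores are computed.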
CH_PLS2_rpt3 cd21330
third calponin homology (CH) domain found in plastin-2; Plastin-2, also called L-plastin, or ...
181-289 2.48e-03

third calponin homology (CH) domain found in plastin-2; Plastin-2, also called L-plastin, LC64P, or lymphocyte cytosolic protein 1 (LCP-1), is an actin-binding protein that plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-2 contains four copies of the CH domain. This model corresponds to the third CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409179  Cd Length: 125  Bit Score: 41.13  E-value: 2.48e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  181 ERDRVQKKTFTKWVNKhlIKAQRHISDLYEDLRDGHNLISLLEVL---------SGDSLPREKGRMRfhKLQNVQIALDY 251
Cdd:cd21330      9 EGETREERTFRNWMNS--LGVNPRVNHLYSDLSDALVIFQLYEKIkvpvdwnrvNKPPYPKLGENMK--KLENCNYAVEL 84
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 1920237946  252 LRHR-QVKLVNIRNDDIADGNPKLTLGLIWTIILHFQIS 289
Cdd:cd21330     85 GKNKaKFSLVGIAGQDLNEGNRTLTLALIWQLMRRYTLN 123
PspA COG1842
Phage shock protein A [Transcription, Signal transduction mechanisms];
1575-1705 2.48e-03


Pssm-ID: 441447 [Multi-domain]  Cd Length: 217  Bit Score: 42.50  E-value: 2.48e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1575 RIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAA 1654
Cdd:COG1842     20 KAEDPEKMLDQAIRDMEEDLVEARQALAQVIANQKRLERQLEELEAEAEKWEEKARLALEKGREDLAREALERKAELEAQ 99
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 1655 REKQRAL-----QALEELRLQAEEAERRLRQAEAERArqvqvALETAQRSAEAELQ 1705
Cdd:COG1842    100 AEALEAQlaqleEQVEKLKEALRQLESKLEELKAKKD-----TLKARAKAAKAQEK 150
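Each hit record carries a statistics line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", with an optional bracketed tag such as [Multi-domain]. A minimal sketch for extracting those fields is shown below; the function name `parse_stat_line` and the dictionary keys are hypothetical and are not part of any NCBI tool.

```python
import re

# Pattern for the per-hit statistics line; the bracketed tag is optional.
STAT_LINE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<hit_type>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_stat_line(line):
    """Return the statistics fields as a dict, or None if the line does not match."""
    match = STAT_LINE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match["pssm_id"]),
        "hit_type": match["hit_type"],  # None when no bracketed tag is present
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


if __name__ == "__main__":
    line = "Pssm-ID: 441447 [Multi-domain]  Cd Length: 217  Bit Score: 42.50  E-value: 2.48e-03"
    print(parse_stat_line(line))
```

Parsed this way, hits can be filtered or sorted by E-value or bit score without re-reading the alignment blocks.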
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
2301-2742 2.54e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. SMC is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukaryotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc.); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 44.29  E-value: 2.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2301 EQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQ-RGQVEEELFSLRVQMEELGKLKARIEAENRA 2379
Cdd:TIGR02169  233 EALERQKEAIERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKiKDLGEEEQLRVKEKIGELEAEIASLERSIAE 312
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2380 LVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEE------DLAQQRALAEKMLKEKMQAVQEATRLKA 2453
Cdd:TIGR02169  313 KERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEyaelkeELEDLRAELEEVDKEFAETRDELKDYRE 392
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2454 EAELLQQQKELAQEQARRLQEDKEQMAQQLAQetqgFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFR 2533
Cdd:TIGR02169  393 KLEKLKREINELKRELDRLQEELQRLSEELAD----LNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYE 468
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2534 KQAEDIGERLYRTELATQEKVMLVQTLETQRQQSdRDAERLREAIAELEHEKDK--LKQEAQLLQLKSE----------- 2600
Cdd:TIGR02169  469 QELYDLKEEYDRVEKELSKLQRELAEAEAQARAS-EERVRGGRAVEEVLKASIQgvHGTVAQLGSVGERyataievaagn 547
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2601 -------EMQTVRQE--QLLQETQALQQSFL-------SEKD-SLLQRERCI----------EQEKAKLEQLFQDEV--- 2650
Cdd:TIGR02169  548 rlnnvvvEDDAVAKEaiELLKRRKAGRATFLplnkmrdERRDlSILSEDGVIgfavdlvefdPKYEPAFKYVFGDTLvve 627
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2651 ----------------------------------------------AKAQALREEQQRQQQQMQQEKQQLA---ASMEEA 2681
Cdd:TIGR02169  628 dieaarrlmgkyrmvtlegelfeksgamtggsraprggilfsrsepAELQRLRERLEGLKRELSSLQSELRrieNRLDEL 707
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237946 2682 RRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEenqrLRERLQHLEEERRAALARSEEIA 2742
Cdd:TIGR02169  708 SQELSDASRKIGEIEKEIEQLEQEEEKLKERLEE----LEEDLSSLEQEIENVKSELKELE 764
PRK10246 PRK10246
exonuclease subunit SbcC; Provisional
1140-1737 2.63e-03


Pssm-ID: 182330 [Multi-domain]  Cd Length: 1047  Bit Score: 44.02  E-value: 2.63e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1140 EPARECAQRITEQQKAQAEVDGLGKGVARLSaeaekvLALPepspaAPTLRSELEltlGKLEQVRSLSAIYlEKLKTISL 1219
Cdd:PRK10246   254 ELQQEASRRQQALQQALAAEEKAQPQLAALS------LAQP-----ARQLRPHWE---RIQEQSAALAHTR-QQIEEVNT 318
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1220 VIRSTQEAEEVLRAHeeQLKEAQAVPATLPELeatkaalkklrAQAEAQQPVFDALRDELRG-----AQEVGERLQQRhg 1294
Cdd:PRK10246   319 RLQSTMALRARIRHH--AAKQSAELQAQQQSL-----------NTWLAEHDRFRQWNNELAGwraqfSQQTSDREQLR-- 383
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1295 erdveveRWRERVTLLLERWQAVLAQT-----DVRQRELEQLGRQlRYYRESADPLGAWLRDAKQRQEQIQAvplANSQA 1369
Cdd:PRK10246   384 -------QWQQQLTHAEQKLNALPAITltltaDEVAAALAQHAEQ-RPLRQRLVALHGQIVPQQKRLAQLQV---AIQNV 452
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1370 VREQLRQEKALLEDIERHGEKVEE-------CQRFAKqyinaIKDYELQ--------------------LVTYKAqLEPV 1422
Cdd:PRK10246   453 TQEQTQRNAALNEMRQRYKEKTQQladvktiCEQEAR-----IKDLEAQraqlqagqpcplcgstshpaVEAYQA-LEPG 526
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1423 ASPAKKPKVQSGSESIIQEYVDLRtrySELSTLTSQYIRFISETLRRMEEEERLAEQQRaeererlaEVEAALEKQRQLA 1502
Cdd:PRK10246   527 VNQSRLDALEKEVKKLGEEGAALR---GQLDALTKQLQRDESEAQSLRQEEQALTQQWQ--------AVCASLNITLQPQ 595
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1503 EAHAQ-AKAQAEREAQGLQRRMQEEVarrEEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQV--EAAERSRL--RIE 1577
Cdd:PRK10246   596 DDIQPwLDAQEEHERQLRLLSQRHEL---QGQIAAHNQQIIQYQQQIEQRQQQLLTALAGYALTLpqEDEEASWLatRQQ 672
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1578 EEIRVVRLQLEATERQRGGAE--------GELQALRARAEEAEAQK-RQAQEEAERLRRQVQ-------DETQRKRQAEA 1641
Cdd:PRK10246   673 EAQSWQQRQNELTALQNRIQQltplletlPQSDDLPHSEETVALDNwRQVHEQCLSLHSQLQtlqqqdvLEAQRLQKAQA 752
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1642 ELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLrqaeaERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERT 1721
Cdd:PRK10246   753 QFDTALQASVFDDQQAFLAALLDEETLTQLEQLKQNL-----ENQRQQAQTLVTQTAQALAQHQQHRPDGLDLTVTVEQI 827
                          650
                   ....*....|....*.
gi 1920237946 1722 LKEEHVAVVQLREEAT 1737
Cdd:PRK10246   828 QQELAQLAQQLRENTT 843
CH_PLS_FIM_rpt2 cd21218
second calponin homology (CH) domain found in the plastin/fimbrin family; This family includes ...
192-282 2.65e-03

second calponin homology (CH) domain found in the plastin/fimbrin family; This family includes plastin and fimbrin. Plastin has three isoforms, plastin-1, -2, and -3, which are all actin-bundling proteins. Plastin-1, also called intestine-specific plastin, or I-plastin, is an actin-bundling protein in the absence of calcium. Plastin-2, also called L-plastin, LC64P, or lymphocyte cytosolic protein 1 (LCP-1), plays a role in the activation of T-cells in response to costimulation through TCR/CD3 and CD2 or CD28. It modulates the cell surface expression of IL2RA/CD25 and CD69. Plastin-3, also called T-plastin, is found in intestinal microvilli, hair cell stereocilia, and fibroblast filopodia. It may play a role in the regulation of bone development. Fimbrin has been found in plants and fungi. Arabidopsis thaliana fimbrin (AtFIM) includes fimbrin-1, -2, -3, -4, and -5; they cross-link actin filaments (F-actin) in a calcium-independent manner. They stabilize F-actin and prevent its depolymerization mediated by profilin. They act as key regulators of actin cytoarchitecture and are probably involved in the cell cycle, cell division, cell elongation and cytoplasmic tractus. AtFIM5 is an actin-bundling factor that is required for pollen germination and pollen tube growth. Fungal fimbrin binds to actin, and functionally associates with actin structures involved in the development and maintenance of cell polarity. Members of this family contain four copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409067  Cd Length: 114  Bit Score: 40.75  E-value: 2.65e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  192 KWVNKHLIKAQR---HISDLYEDLRDGHNLISLLEVLSGDSLPREKGRM---RFHKLQNVQIALDYLRhrQVKLVN-IRN 264
Cdd:cd21218     17 RWVNYHLKKAGPtkkRVTNFSSDLKDGEVYALLLHSLAPELCDKELVLEvlsEEDLEKRAEKVLQAAE--KLGCKYfLTP 94
                           90
                   ....*....|....*...
gi 1920237946  265 DDIADGNPKLTLGLIWTI 282
Cdd:cd21218     95 EDIVSGNPRLNLAFVATL 112
TolC COG1538
Outer membrane protein TolC [Cell wall/membrane/envelope biogenesis];
1444-1705 2.67e-03


Pssm-ID: 441147 [Multi-domain]  Cd Length: 367  Bit Score: 43.49  E-value: 2.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1444 DLRTRYSELSTLTSQyIRFISETLRRMEEEERLAEQQRAEERERLAEVEAAlekQRQLAEAHAQaKAQAEREAQGLQRRM 1523
Cdd:COG1538     77 EVAQAYFDLLAAQEQ-LALAEENLALAEELLELARARYEAGLASRLDVLQA---EAQLAQARAQ-LAQAEAQLAQARNAL 151
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1524 QEEVARREEVAVEAQEQKRSIQEELQHLRQSSEA------EIQAKARQVEAAERsRLRIEEEIRVVRLQLEATERQRGGA 1597
Cdd:COG1538    152 ALLLGLPPPAPLDLPDPLPPLPPLPPSLPGLPSEalerrpDLRAAEAQLEAAEA-EIGVARAAFLPSLSLSASYGYSSSD 230
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1598 EGELQ-------------------ALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQ 1658
Cdd:COG1538    231 DLFSGgsdtwsvglslslplfdggRNRARVRAAKAQLEQAEAQYEQTVLQALQEVEDALAALRAAREQLEALEEALEAAE 310
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 1659 RALQALEEL-------RLQAEEAERRLRQAEAERarqvqVALETAQRSAEAELQ 1705
Cdd:COG1538    311 EALELARARyraglasLLDVLDAQRELLQAQLNL-----IQARYDYLLALVQLY 359
DUF4670 pfam15709
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ...
2232-2418 2.67e-03

Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.


Pssm-ID: 464815 [Multi-domain]  Cd Length: 522  Bit Score: 43.79  E-value: 2.67e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2232 QSRRQVEEAERLKQsAEEQAQAQAQAQAAAEKLRKeaeqeaarraqaeqAALRQKQAADAEMEKHKQFAEQALRQKAQVE 2311
Cdd:pfam15709  360 QRRLQQEQLERAEK-MREELELEQQRRFEEIRLRK--------------QRLEEERQRQEEEERKQRLQLQAAQERARQQ 424
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2312 QEltALRLQLEETDHQKsildeelQRLKAEVTEAARQRGQVEEElfslrvQMEELGKLKARIEAENRALVLRDKDSAQRL 2391
Cdd:pfam15709  425 QE--EFRRKLQELQRKK-------QQEEAERAEAEKQRQKELEM------QLAEEQKRLMEMAEEERLEYQRQKQEAEEK 489
                          170       180       190
                   ....*....|....*....|....*....|..
gi 1920237946 2392 LQEEAEKMKQVAEEAARLSVA-----AQEAAR 2418
Cdd:pfam15709  490 ARLEAEERRQKEEEAARLALEeamkqAQEQAR 521
PRK11281 PRK11281
mechanosensitive channel MscK;
1311-1584 2.76e-03


Pssm-ID: 236892 [Multi-domain]  Cd Length: 1113  Bit Score: 44.13  E-value: 2.76e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1311 LERWQAVLAQTDVRQRELEQLGRQLryyrESADplgawlRDAKQRQEQIQAVPLANSQAVREQLrqEKALLEDIERhgeK 1390
Cdd:PRK11281    65 LEQTLALLDKIDRQKEETEQLKQQL----AQAP------AKLRQAQAELEALKDDNDEETRETL--STLSLRQLES---R 129
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1391 VEECQRFAKQYINAIKDYELQLVTYKAQLEpvaspakkpKVQSGSESIIQEYVDLRTRYSelSTLTSQyiRFISETLR-R 1469
Cdd:PRK11281   130 LAQTLDQLQNAQNDLAEYNSQLVSLQTQPE---------RAQAALYANSQRLQQIRNLLK--GGKVGG--KALRPSQRvL 196
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1470 MEEEERLAEQQRAEERERLA---EVEAALEKQRQLAEAHAQakaQAEREAQGLQRRM-QEEVARREEVAVEAQEQKRS-- 1543
Cdd:PRK11281   197 LQAEQALLNAQNDLQRKSLEgntQLQDLLQKQRDYLTARIQ---RLEHQLQLLQEAInSKRLTLSEKTVQEAQSQDEAar 273
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1544 ------IQEELQHLRQSSEAEIQAKAR-------------QVEAAERSRLRIEEEIRVVR 1584
Cdd:PRK11281   274 iqanplVAQELEINLQLSQRLLKATEKlntltqqnlrvknWLDRLTQSERNIKEQISVLK 333
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
1192-1497 2.77e-03


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 44.14  E-value: 2.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1192 ELELTLGKLEQVRSLSAIYLEKLKTISLVIRSTQEAEEVLRAHEEQLKEAQAVPATLPELEATKAALkklraqaEAQQPV 1271
Cdd:COG4913    614 ALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAEREIAELEAELERL-------DASSDD 686
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1272 FDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTLLLERWQAVLAQTDvrqrELEQLGRQLRYYResadpLGAWLRD 1351
Cdd:COG4913    687 LAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLE----AAEDLARLELRAL-----LEERFAA 757
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1352 AKQRqeqiqavplANSQAVREQLRQE-KALLEDIERHGEKVEEC-QRFAKQYINAIKDYELQLVTYkaqlepvaspakkp 1429
Cdd:COG4913    758 ALGD---------AVERELRENLEERiDALRARLNRAEEELERAmRAFNREWPAETADLDADLESL-------------- 814
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 1430 kvqsgsESIIQEYVDLRT----RYSE--LSTLTSQYIRFISETLRRMEEEERLAeqqraeeRERLAEVEAALEK 1497
Cdd:COG4913    815 ------PEYLALLDRLEEdglpEYEErfKELLNENSIEFVADLLSKLRRAIREI-------KERIDPLNDSLKR 875
PRK12704 PRK12704
phosphodiesterase; Provisional
2408-2526 2.77e-03


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 44.00  E-value: 2.77e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2408 RLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEA---TRLKAEAELLQQQKELaQEQARRLQEDKEQMAQQLA 2484
Cdd:PRK12704    25 RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEihkLRNEFEKELRERRNEL-QKLEKRLLQKEENLDRKLE 103
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1920237946 2485 ------QETQGFQKTLETERQRQLEMSAEAERLR-------LRVAEMSRAQARAE 2526
Cdd:PRK12704   104 llekreEELEKKEKELEQKQQELEKKEEELEELIeeqlqelERISGLTAEEAKEI 158
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1469-1823 2.79e-03


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 43.35  E-value: 2.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1469 RMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEEL 1548
Cdd:COG4372      3 RLGEKVGKARLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEEL 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1549 QHLRQsseaEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQ 1628
Cdd:COG4372     83 EELNE----QLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQ 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1629 VQDETQRKRQAEAELALRVQAEAEaaREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEH 1708
Cdd:COG4372    159 LESLQEELAALEQELQALSEAEAE--QALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALS 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1709 ASFaEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQAEA 1788
Cdd:COG4372    237 ALL-DALELEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALED 315
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 1920237946 1789 EKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQ 1823
Cdd:COG4372    316 ALLAALLELAKKLELALAILLAELADLLQLLLVGL 350
PRK07735 PRK07735
NADH-quinone oxidoreductase subunit C;
1937-2186 2.82e-03


Pssm-ID: 236081 [Multi-domain]  Cd Length: 430  Bit Score: 43.82  E-value: 2.82e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1937 EEAKRQRQLAEEDAVRQRAEAERVLAEKLAA-ISEATRLKTEAEIALKEKEAENerlrrlaedeafqrrlLEEQAAQHKA 2015
Cdd:PRK07735     2 DPEKDLEDLKKEAARRAKEEARKRLVAKHGAeISKLEEENREKEKALPKNDDMT----------------IEEAKRRAAA 65
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2016 DIEARLAQLRKASESELERQkglvedtlrqrrqVEEEILALKGSFEKAAAGKAElELELGRIRGTAEDTLRSKEQAEQEA 2095
Cdd:PRK07735    66 AAKAKAAALAKQKREGTEEV-------------TEEEKAKAKAKAAAAAKAKAA-ALAKQKREGTEEVTEEEKAAAKAKA 131
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2096 ARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQA 2175
Cdd:PRK07735   132 AAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEEKAKAKAKAAAAA 211
                          250
                   ....*....|.
gi 1920237946 2176 EEKAHAFAVQQ 2186
Cdd:PRK07735   212 KAKAAALAKQK 222
CusB_dom_1 pfam00529
Cation efflux system protein CusB domain 1; The cation efflux system protein CusB from E. coli ...
1552-1705 2.87e-03

Cation efflux system protein CusB domain 1; The cation efflux system protein CusB from E. coli can be divided into four different domains: the first three domains of the protein are mostly beta-strands, and the fourth forms an all alpha-helical domain. This entry represents the first beta-domain (domain 1) of CusB, which is formed by the N- and C-terminal ends of the polypeptide (residues 89-102 and 324-385). CusB is part of the copper-transporting efflux system CusCFBA. This domain can also be found in other membrane-fusion proteins, such as HlyD, MdtN, MdtE and AaeA. HlyD is a component of the prototypical alpha-haemolysin (HlyA) bacterial type I secretion system, along with the other components HlyB and TolC. HlyD is anchored in the cytoplasmic membrane by a single transmembrane domain and has a large periplasmic domain within the carboxy-terminal 100 amino acids. HlyB and HlyD form a stable complex that binds the recombinant protein bearing a C-terminal HlyA signal sequence and ATP in the cytoplasm. HlyD, HlyB and TolC combine to form the three-component ABC transporter complex that forms a trans-membrane channel or pore through which HlyA can be transferred directly to the extracellular medium. Cutinase has been shown to be transported effectively through this pore.


Pssm-ID: 425733 [Multi-domain]  Cd Length: 322  Bit Score: 43.18  E-value: 2.87e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1552 RQSSEAEIQAKARQVEA-AERSRLRIE-EEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERlRRQV 1629
Cdd:pfam00529   54 TDYQAALDSAEAQLAKAqAQVARLQAElDRLQALESELAISRQDYDGATAQLRAAQAAVKAAQAQLAQAQIDLAR-RRVL 132
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 1630 QDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQ 1705
Cdd:pfam00529  133 APIGGISRESLVTAGALVAQAQANLLATVAQLDQIYVQITQSAAENQAEVRSELSGAQLQIAEAEAELKLAKLDLE 208
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
2061-2543 2.96e-03


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 43.99  E-value: 2.96e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2061 EKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERvQKSLAAEEEAARQRKAALEEVERL 2140
Cdd:COG4717     49 ERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEE-LEELEAELEELREELEKLEKLLQL 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2141 KAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRseaeaARRAAEEAE 2220
Cdd:COG4717    128 LPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEE-----LQDLAEELE 202
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2221 AARERAEREAAQSRRQVEEAERLKQsAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFA 2300
Cdd:COG4717    203 ELQQRLAELEEELEEAQEELEELEE-ELEQLENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSLILTIAGVLFL 281
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2301 EQAL---------RQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAArqrgqveEELFSLRVQMEELGKLKA 2371
Cdd:COG4717    282 VLGLlallflllaREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSP-------EELLELLDRIEELQELLR 354
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2372 RIE-AENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLkekmqAVQEATR 2450
Cdd:COG4717    355 EAEeLEEELQLEELEQEIAALLAEAGVEDEEELRAALEQAEEYQELKEELEELEEQLEELLGELEELL-----EALDEEE 429
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2451 LKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQ-ETQGFQKTLETERQRQLEMSAEAER--LRLRVAE--MSRAQARA 2525
Cdd:COG4717    430 LEEELEELEEELEELEEELEELREELAELEAELEQlEEDGELAELLQELEELKAELRELAEewAALKLALelLEEAREEY 509
                          490
                   ....*....|....*....
gi 1920237946 2526 EEDAR-RFRKQAEDIGERL 2543
Cdd:COG4717    510 REERLpPVLERASEYFSRL 528
PRK07735 PRK07735
NADH-quinone oxidoreductase subunit C;
1847-2069 2.97e-03


Pssm-ID: 236081 [Multi-domain]  Cd Length: 430  Bit Score: 43.43  E-value: 2.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1847 ETEQGEQQRQLLEEELARLQ-----------REAAAATQKRRELEAELAKVRAEMevllASKARAEEESRSTSEKSKQRL 1915
Cdd:PRK07735    11 KKEAARRAKEEARKRLVAKHgaeiskleeenREKEKALPKNDDMTIEEAKRRAAA----AAKAKAAALAKQKREGTEEVT 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1916 EAEAGRFRELAEEAARLRAlAEEAKRQRQLAEEDAVRQRAEAERVLAE------------KLAAISEATRLKTEAEIAL- 1982
Cdd:PRK07735    87 EEEKAKAKAKAAAAAKAKA-AALAKQKREGTEEVTEEEKAAAKAKAAAaakakaaalakqKREGTEEVTEEEEETDKEKa 165
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1983 --KEKEAENERLRRLAEDEAFQRRLLEEQAAQH-KADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEE-ILALKG 2058
Cdd:PRK07735   166 kaKAAAAAKAKAAALAKQKAAEAGEGTEEVTEEeKAKAKAKAAAAAKAKAAALAKQKASQGNGDSGDEDAKAKaIAAAKA 245
                          250
                   ....*....|.
gi 1920237946 2059 SFEKAAAGKAE 2069
Cdd:PRK07735   246 KAAAAARAKTK 256
CCCAP pfam15964
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in ...
2361-2656 3.02e-03

Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in eukaryotes. CCCAP is also known as SDCCAG8, serologically defined colon cancer antigen 8. It is associated with the centrosome.


Pssm-ID: 435040 [Multi-domain]  Cd Length: 703  Bit Score: 43.74  E-value: 3.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2361 VQMEE---LGKLKARIEAEN-RALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLA-EEDLAQQRALAE 2435
Cdd:pfam15964  341 VQMTEeanFEKTKALIQCEQlKSELERQKERLEKELASQQEKRAQEKEALRKEMKKEREELGATMLAlSQNVAQLEAQVE 420
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2436 KMLKEKMQAVQEATrlKAEAELLQQQKELAQEQAR-RLQEDKEQMAQQLAQETQgfqKTLETERQRQLEMS-AEAERLRL 2513
Cdd:pfam15964  421 KVTREKNSLVSQLE--EAQKQLASQEMDVTKVCGEmRYQLNQTKMKKDEAEKEH---REYRTKTGRQLEIKdQEIEKLGL 495
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2514 RVAEMSRAQARAEEDARRFRKQAEDIGERLYRTE----LATQEKVMLVQTL----ETQRQQSDRDAERLREAIAELE--H 2583
Cdd:pfam15964  496 ELSESKQRLEQAQQDAARAREECLKLTELLGESEhqlhLTRLEKESIQQSFsneaKAQALQAQQREQELTQKMQQMEaqH 575
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2584 EKD----------------KLKQEAQLLQLKSEEM-QTVRQE--QLLQETQALQQSFLSEK-------DSLLQRERCIEQ 2637
Cdd:pfam15964  576 DKTvneqyslltsqntfiaKLKEECCTLAKKLEEItQKSRSEveQLSQEKEYLQDRLEKLQkrneeleEQCVQHGRMHER 655
                          330
                   ....*....|....*....
gi 1920237946 2638 EKAKLEQLFQDEVAKAQAL 2656
Cdd:pfam15964  656 MKQRLRQLDKHCQATAQQL 674
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
2311-2741 3.04e-03

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 44.04  E-value: 3.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2311 EQELTALRLQLEETDHQKSILDEELQRLKAEVTeAARQRGQV-EEELFSLRVQMEElgklKARIEAEnRALVLRDKDSAQ 2389
Cdd:pfam10174  302 ESELLALQTKLETLTNQNSDCKQHIEVLKESLT-AKEQRAAIlQTEVDALRLRLEE----KESFLNK-KTKQLQDLTEEK 375
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2390 RLLQEEAEKMKQVAEEAARLSVAAQEaaRLRQLAEEDLAQQRALAEkmLKEKMQAVQEATRLKAEAelLQQQKELAQEQA 2469
Cdd:pfam10174  376 STLAGEIRDLKDMLDVKERKINVLQK--KIENLQEQLRDKDKQLAG--LKERVKSLQTDSSNTDTA--LTTLEEALSEKE 449
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2470 R---RLQEDKEQMAQQLAQEtqgfqktLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRT 2546
Cdd:pfam10174  450 RiieRLKEQREREDRERLEE-------LESLKKENKDLKEKVSALQPELTEKESSLIDLKEHASSLASSGLKKDSKLKSL 522
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2547 ELATQEKVMLVQTLETQRQ--QSDRDAERLREAIAelehekDKLKQEAQLLQLKSEEMQTVRqeqllqetqalqqsflSE 2624
Cdd:pfam10174  523 EIAVEQKKEECSKLENQLKkaHNAEEAVRTNPEIN------DRIRLLEQEVARYKEESGKAQ----------------AE 580
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2625 KDSLLQRERCIEQEK-------AKLEQLFQDEVaKAQALREEQQRQQQQMQQEKQqlAASMEEARRRQHEAEEGVRRQQ- 2696
Cdd:pfam10174  581 VERLLGILREVENEKndkdkkiAELESLTLRQM-KEQNKKVANIKHGQQEMKKKG--AQLLEEARRREDNLADNSQQLQl 657
                          410       420       430       440       450
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2697 ----EELQRLAQQQQQQEKLLAEENQRLRERLQHLEE---ERRAALarsEEI 2741
Cdd:pfam10174  658 eelmGALEKTRQELDATKARLSSTQQSLAEKDGHLTNlraERRKQL---EEI 706
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
2235-2616 3.05e-03

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 43.73  E-value: 3.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2235 RQVEEAERlkqsaeeqaqaqaqaqaaaeKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQFAeQALRQKAQVEQEL 2314
Cdd:pfam07888   73 RQRRELES--------------------RVAELKEELRQSREKHEELEEKYKELSASSEELSEEKD-ALLAQRAAHEARI 131
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2315 TALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRAL--VLRDKDSAQRLL 2392
Cdd:pfam07888  132 RELEEDIKTLTQRVLERETELERMKERAKKAGAQRKEEEAERKQLQAKLQQTEEELRSLSKEFQELrnSLAQRDTQVLQL 211
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2393 QEEAEKMKQVAEEAARlSVAAQEAAR--LRQLAEEDLAQQRALAekMLKEKMQAVQeATRLKAEAELLQQQKELAQ---- 2466
Cdd:pfam07888  212 QDTITTLTQKLTTAHR-KEAENEALLeeLRSLQERLNASERKVE--GLGEELSSMA-AQRDRTQAELHQARLQAAQltlq 287
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2467 --EQARRLQEDKEQMaqqlAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEmsraqaraeedARRFRKQAEdigerly 2544
Cdd:pfam07888  288 laDASLALREGRARW----AQERETLQQSAEADKDRIEKLSAELQRLEERLQE-----------ERMEREKLE------- 345
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2545 rTELATQEKVMLVQTLETQRQQSDrdaerLREAIAELEHEKDKLKQEAQllqlksEEMQTVRQEQLLQETQA 2616
Cdd:pfam07888  346 -VELGREKDCNRVQLSESRRELQE-----LKASLRVAQKEKEQLQAEKQ------ELLEYIRQLEQRLETVA 405
GBP_C pfam02841
Guanylate-binding protein, C-terminal domain; Transcription of the anti-viral ...
2397-2491 3.06e-03

Guanylate-binding protein, C-terminal domain; Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GBP2, both GTPases capable of binding GTP, GDP and GMP.


Pssm-ID: 460721 [Multi-domain]  Cd Length: 297  Bit Score: 43.04  E-value: 3.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2397 EKMKQVAEEAARLSVAAQEAARLRQLAEEDL----AQQRALAE--KMLKEKMQAVQEatRLKAEAELLQQQKElaQEQAR 2470
Cdd:pfam02841  201 AKEKAIEAERAKAEAAEAEQELLREKQKEEEqmmeAQERSYQEhvKQLIEKMEAERE--QLLAEQERMLEHKL--QEQEE 276
                           90       100
                   ....*....|....*....|.
gi 1920237946 2471 RLQEDKEQMAQQLAQETQGFQ 2491
Cdd:pfam02841  277 LLKEGFKTEAESLQKEIQDLK 297
TolA COG3064
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
1549-2038 3.09e-03

Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442298 [Multi-domain]  Cd Length: 485  Bit Score: 43.49  E-value: 3.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1549 QHLRQSSEAEIQAKARqVEAAERSRLRIEEEIRVVRLQLEATERQRGGA--EGELQALRARAEEAEAQKRQAQEEAERLR 1626
Cdd:COG3064      2 QEALEEKAAEAAAQER-LEQAEAEKRAAAEAEQKAKEEAEEERLAELEAkrQAEEEAREAKAEAEQRAAELAAEAAKKLA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1627 RQVQDETQRKRQAEAELAlRVQAEAEAAREKQRALQALEELRLQAEEaerrlRQAEAERARQVQVALETAQRSAEAELQS 1706
Cdd:COG3064     81 EAEKAAAEAEKKAAAEKA-KAAKEAEAAAAAEKAAAAAEKEKAEEAK-----RKAEEEAKRKAEEERKAAEAEAAAKAEA 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1707 EHASFAEKTAQLERTLKEEhVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQA 1786
Cdd:COG3064    155 EAARAAAAAAAAAAAAAAR-AAAGAAAALVAAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAARAAAASREAA 233
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1787 EAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQ 1866
Cdd:COG3064    234 LAAVEATEEAALGGAEEAADLAAVGVLGAALAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAALAAELLGAVAA 313
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1867 REAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLA 1946
Cdd:COG3064    314 EEAVLAAAAAAGALVVRGGGAASLEAALSLLAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLADVEEAAGAGILA 393
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1947 EEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRK 2026
Cdd:COG3064    394 AAGGGGLLGLRLDLGAALLEAASAVELRVLLALAGAAGAVVALLVKLVADLAGGLVGIGKALTGDADALLGILKAVALDG 473
                          490
                   ....*....|..
gi 1920237946 2027 ASESELERQKGL 2038
Cdd:COG3064    474 GAVLADLLLLGG 485
RecN COG0497
DNA repair ATPase RecN [Replication, recombination and repair];
1600-1734 3.17e-03

DNA repair ATPase RecN [Replication, recombination and repair];


Pssm-ID: 440263 [Multi-domain]  Cd Length: 555  Bit Score: 43.53  E-value: 3.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1600 ELQALRARAEEAEAQKRQAQEEAERLRRQVQdetqrkrqaE-AELALRVQAEAEAAREKQRaLQALEELRLQAEEAERRL 1678
Cdd:COG0497    166 AWRALKKELEELRADEAERARELDLLRFQLE---------ElEAAALQPGEEEELEEERRR-LSNAEKLREALQEALEAL 235
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 1679 RQAEAerarQVQVALETAQRSAEaELQSEHASFAEKTAQLERtlkeehvAVVQLRE 1734
Cdd:COG0497    236 SGGEG----GALDLLGQALRALE-RLAEYDPSLAELAERLES-------ALIELEE 279
PLEC smart00250
Plectin repeat;
2855-2888 3.30e-03

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 37.85  E-value: 3.30e-03
                            10        20        30
                    ....*....|....*....|....*....|....
gi 1920237946  2855 LLEAQAASGFLLDPVRNRRLAVNEAVKEGIVGPE 2888
Cdd:smart00250    3 LLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPE 36
PLEC smart00250
Plectin repeat;
3886-3917 3.30e-03

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 37.85  E-value: 3.30e-03
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1920237946  3886 RLLSAERAVTGYRDPYTEQTISLFQAMKKDLI 3917
Cdd:smart00250    2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLI 33
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
1832-2094 3.33e-03

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 42.98  E-value: 3.33e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1832 QQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRstseKS 1911
Cdd:COG1340      1 SKTDELSSSLEELEEKIEELREEIEELKEKRDELNEELKELAEKRDELNAQVKELREEAQELREKRDELNEKVK----EL 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1912 KQRLEAEAGRFRELAEEAARLRALAEEAKRQRQlaEEDAVRQR--------------AEAERVLAEKLAaiseatRLKTE 1977
Cdd:COG1340     77 KEERDELNEKLNELREELDELRKELAELNKAGG--SIDKLRKEierlewrqqtevlsPEEEKELVEKIK------ELEKE 148
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1978 AEIALKEKEAENERLRRLAEDEAFQrrlleEQAAQHKADIEARLAQLRKASES------ELERQKGLVEDTLRQRRQVEE 2051
Cdd:COG1340    149 LEKAKKALEKNEKLKELRAELKELR-----KEAEEIHKKIKELAEEAQELHEEmielykEADELRKEADELHKEIVEAQE 223
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|...
gi 1920237946 2052 EILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQE 2094
Cdd:COG1340    224 KADELHEEIIELQKELRELRKELKKLRKKQRALKREKEKEELE 266
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
2010-2186 3.39e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 43.21  E-value: 3.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2010 AAQHKADIEARLAQLRKasesELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTLRSKE 2089
Cdd:COG4942     18 QADAAAEAEAELEQLQQ----EIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIA 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2090 QAEQEAARQRQLAAEEERRRREAEER--------------VQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRErAE 2155
Cdd:COG4942     94 ELRAELEAQKEELAELLRALYRLGRQpplalllspedfldAVRRLQYLKYLAPARREQAEELRADLAELAALRAELE-AE 172
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1920237946 2156 QESARQLQLAQEAAQKRLQAEEKAHAFAVQQ 2186
Cdd:COG4942    173 RAELEALLAELEEERAALEALKAERQKLLAR 203
PHA03247 PHA03247
large tegument protein UL36; Provisional
1489-1696 3.47e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 3.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1489 AEVEAALEKQRQLAEAHAQAKAQAEREAQGLQ--RRMQEEVARREEVAVEAQEQKRSIQE-----------ELQHLRQSS 1555
Cdd:PHA03247  1150 STVDAAVRAHGVLADAVAALSPAVRDPACPLAflVALADSAAGYVKATRLALDARRAIARlgalgaaaadlAVAVRRENP 1229
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1556 EAE------IQAKARQVEAAERSRLRIEEEIRVVrLQLEATERQRGGAEGELQAL-------RARAEEAEAQkrqAQEEA 1622
Cdd:PHA03247  1230 QAEgdraalLEAAARAVTAAREGLAACEGEFGGL-LHAEGSAGDPSPSGRALQELgkvvgatRRRADELEAA---AADLA 1305
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 1623 ERLRRQVQDETQRKRQAEAELAL-RVQAEAEAAREKQRALQALE-ELRLQAEEAERRLRQAEAERARQVQVALETA 1696
Cdd:PHA03247  1306 EKMAARRARASRERWAADVEAALdRVENRAEFDAVELRRLQALAaTHGYNPRDFRKRAEQALAANAKTATLALEAA 1381
PTZ00491 PTZ00491
major vault protein; Provisional
2388-2521 3.48e-03

major vault protein; Provisional


Pssm-ID: 240439 [Multi-domain]  Cd Length: 850  Bit Score: 43.85  E-value: 3.48e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2388 AQRLLQE-----EAEKMK-QVAEEAAR---LSVAAQEAARlrQLAEEDLAQQRALAEKMLKEkMQAVQEATRLKAEAELL 2458
Cdd:PTZ00491   672 AELLEQEargrlERQKMHdKAKAEEQRtklLELQAESAAV--ESSGQSRAEALAEAEARLIE-AEAEVEQAELRAKALRI 748
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237946 2459 QQQKELAQEQARRLQEdkeqmaqqLAQETQgfQKTLETERQRQLeMSAEAERL--------RLRVAEMSRA 2521
Cdd:PTZ00491   749 EAEAELEKLRKRQELE--------LEYEQA--QNELEIAKAKEL-ADIEATKFerivealgRETLIAIARA 808
Borrelia_P83 pfam05262
Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins.
1469-1656 3.48e-03

Borrelia P83/100 protein; This family consists of several Borrelia P83/P100 antigen proteins.


Pssm-ID: 114011 [Multi-domain]  Cd Length: 489  Bit Score: 43.45  E-value: 3.48e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1469 RMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQglqrrmQEEVARREEVAVEAQEQKRSIQEEL 1548
Cdd:pfam05262  206 RESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQ------QEAKNLPKPADTSSPKEDKQVAENQ 279
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1549 QHLRQSSEAEIQAKARQVeaaersrlrieeeirvvrlqLEATERQRGGAEGElqalrARAEEAEAQKRqaQEEAERLRRQ 1628
Cdd:pfam05262  280 KREIEKAQIEIKKNDEEA--------------------LKAKDHKAFDLKQE-----SKASEKEAEDK--ELEAQKKREP 332
                          170       180
                   ....*....|....*....|....*....
gi 1920237946 1629 VQDETQR-KRQAEAElalrVQAEAEAARE 1656
Cdd:pfam05262  333 VAEDLQKtKPQVEAQ----PTSLNEDAID 357
PHA03247 PHA03247
large tegument protein UL36; Provisional
1477-1730 3.50e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.16  E-value: 3.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1477 AEQQRAEERERlaeVEAALEkqrqlaEAHAQAKAQAEREAQGL---------------------QRRMQEEVARREEVAV 1535
Cdd:PHA03247  1586 AKQQRAEATDR---VTAALR------EALAAHERRAQSEAESLanlktllrvaaipataaktldQARSVAEIVDQIELLL 1656
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1536 EAQEQKRSIQEE----LQHLRQSSEAEIQAKARQVEAAERSRLRieeeirvVRLQLEATERQRggaegeLQALRARAEEA 1611
Cdd:PHA03247  1657 EQTEKAAELDVAavdwLEHARRVFEAHPLTAARGGGPDPLARLH-------ARLDALGETRRR------TEALRRSLEAA 1723
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1612 EAQKRQAQEEAERLRRQvqdetqrkrqaeaelALRVQAEAEAAREKQRALQALEE--LRLQAEEAERRLrqaeAERARQV 1689
Cdd:PHA03247  1724 EAEWDEVWGRFGRVRGG---------------AWKSPEALRAAREQLRALQTATNtvLGLRADAHYERL----PAKYQGA 1784
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 1920237946 1690 qVALETAQRSAEAElqsEHASFAEKTAQLERTLKEEHVAVV 1730
Cdd:PHA03247  1785 -LGAKSAERAGAVE---ELGAAVARHDGLLARLREEVVARV 1821
KpsE COG3524
Capsule polysaccharide export protein KpsE/RkpR [Cell wall/membrane/envelope biogenesis];
1501-1688 3.55e-03

Capsule polysaccharide export protein KpsE/RkpR [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442746 [Multi-domain]  Cd Length: 370  Bit Score: 43.30  E-value: 3.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1501 LAEAHAQAKAQAEREAQGLQRRMQEEVARreevaveAQEQKRSIQEELQHLRQ-----SSEAEIQAKARQVeaaersrlr 1575
Cdd:COG3524    160 LAESEELVNQLSERAREDAVRFAEEEVER-------AEERLRDAREALLAFRNrngilDPEATAEALLQLI--------- 223
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1576 ieeeirvvrLQLEAterQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAL-RVQAEaeaa 1654
Cdd:COG3524    224 ---------ATLEG---QLAELEAELAALRSYLSPNSPQVRQLRRRIAALEKQIAAERARLTGASGGDSLaSLLAE---- 287
                          170       180       190
                   ....*....|....*....|....*....|....*.
gi 1920237946 1655 rekqralqaLEELRLQAEEAERRLRQAEA--ERARQ 1688
Cdd:COG3524    288 ---------YERLELEREFAEKAYTSALAalEQARI 314
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
2338-2632 3.61e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 43.35  E-value: 3.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2338 LKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRAlvlrdkdsAQRLLQEEAEKMKQVAEEAARLSVAAQEAA 2417
Cdd:COG4372     29 LSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQ--------ARSELEQLEEELEELNEQLQAAQAELAQAQ 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2418 RLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETE 2497
Cdd:COG4372    101 EELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAE 180
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2498 RQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREA 2577
Cdd:COG4372    181 AEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEI 260
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1920237946 2578 IAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRE 2632
Cdd:COG4372    261 EELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALED 315
YqiK COG2268
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
1924-2065 3.69e-03

Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];


Pssm-ID: 441869 [Multi-domain]  Cd Length: 439  Bit Score: 43.32  E-value: 3.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1924 ELAEEAARLRALAE-EAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQ 2002
Cdd:COG2268    196 EIIRDARIAEAEAErETEIAIAQANREAEEAELEQEREIETARIAEAEAELAKKKAEERREAETARAEAEAAYEIAEANA 275
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 2003 RRLLEEQAAQHKADIEARLAQLRKA-SESELERQKGLVEDTLRQRRQVEEEiLALKGSFEKAAA 2065
Cdd:COG2268    276 EREVQRQLEIAEREREIELQEKEAErEEAELEADVRKPAEAEKQAAEAEAE-AEAEAIRAKGLA 338
PRK12705 PRK12705
hypothetical protein; Provisional
1595-1735 4.03e-03

hypothetical protein; Provisional


Pssm-ID: 237178 [Multi-domain]  Cd Length: 508  Bit Score: 43.16  E-value: 4.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1595 GGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQrkrqaEAELALRVQAEAEAAREKQralqaleelRLQAEEa 1674
Cdd:PRK12705    19 GVLVVLLKKRQRLAKEAERILQEAQKEAEEKLEAALLEAK-----ELLLRERNQQRQEARRERE---------ELQREE- 83
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 1675 ERRLRQAEAERARQVQVALETAQRS-AEAELQSEHASFAEKTAQLERTLKEehvaVVQLREE 1735
Cdd:PRK12705    84 ERLVQKEEQLDARAEKLDNLENQLEeREKALSARELELEELEKQLDNELYR----VAGLTPE 141
ClpA COG0542
ATP-dependent Clp protease, ATP-binding subunit ClpA [Posttranslational modification, protein ...
1567-1684 4.05e-03

ATP-dependent Clp protease, ATP-binding subunit ClpA [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 440308 [Multi-domain]  Cd Length: 836  Bit Score: 43.53  E-value: 4.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1567 EAAerSRLRIE-----EEIRVVRLQLEATERqrggaegELQALRaraeeaEAQKRQAQEEAERLRRQVQDETQRKRQAEA 1641
Cdd:COG0542    397 EAA--ARVRMEidskpEELDELERRLEQLEI-------EKEALK------KEQDEASFERLAELRDELAELEEELEALKA 461
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 1920237946 1642 elalRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAE 1684
Cdd:COG0542    462 ----RWEAEKELIEEIQELKEELEQRYGKIPELEKELAELEEE 500
PRK09039 PRK09039
peptidoglycan-binding protein;
2364-2492 4.08e-03

peptidoglycan-binding protein;


Pssm-ID: 181619 [Multi-domain]  Cd Length: 343  Bit Score: 43.03  E-value: 4.08e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2364 EELGKLKARIEAENRALVLRDKDSAQrlLQEeaekmkQVAEEAARLSVAAQEAARLRQLAEEdLAQQRALAEKMLKEKMQ 2443
Cdd:PRK09039    53 SALDRLNSQIAELADLLSLERQGNQD--LQD------SVANLRASLSAAEAERSRLQALLAE-LAGAGAAAEGRAGELAQ 123
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1920237946 2444 AVQEATRLKAEA----ELLQQQKELAQEQARRLQ--------EDKEQMAQ----------QLAQETQGFQK 2492
Cdd:PRK09039   124 ELDSEKQVSARAlaqvELLNQQIAALRRQLAALEaaldasekRDRESQAKiadlgrrlnvALAQRVQELNR 194
hsdR PRK11448
type I restriction enzyme EcoKI subunit R; Provisional
1805-1905 4.13e-03

type I restriction enzyme EcoKI subunit R; Provisional


Pssm-ID: 236912 [Multi-domain]  Cd Length: 1123  Bit Score: 43.40  E-value: 4.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1805 AEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELA 1884
Cdd:PRK11448   143 LLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEKAAETSQERKQKRKEIT 222
                           90       100
                   ....*....|....*....|.
gi 1920237946 1885 KvRAEMEVLLaskarAEEESR 1905
Cdd:PRK11448   223 D-QAAKRLEL-----SEEETR 237
ERM_helical pfam20492
Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related ...
1812-1905 4.14e-03

Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related proteins, ezrin, radixin and moesin. Ezrin was first identified as a constituent of microvilli, radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions, and moesin as a heparin binding protein. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain, an extended alpha-helical domain and a charged C-terminal domain (pfam00769). Ezrin, radixin and merlin also contain a polyproline linker region between the helical and C-terminal domains. The N-terminal domain is highly conserved and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily, designated the FERM domain. ERM proteins crosslink actin filaments with plasma membranes. They co-localize with CD44 at actin filament plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains. This is the alpha-helical domain, which is involved in intramolecular masking of protein-protein interaction sites, regulating the activity of these proteins.


Pssm-ID: 466641 [Multi-domain]  Cd Length: 120  Bit Score: 40.29  E-value: 4.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1812 QRELAEQELEKQRQLAEGTAQQRLAAEQELIRLraeteqgEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEME 1891
Cdd:pfam20492   21 ETKKAQEELEESEETAEELEEERRQAEEEAERL-------EQKRQEAEEEKERLEESAEMEAEEKEQLEAELAEAQEEIA 93
                           90
                   ....*....|....
gi 1920237946 1892 VLLASKARAEEESR 1905
Cdd:pfam20492   94 RLEEEVERKEEEAR 107
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involved in recombination, ...
1623-2605 4.29e-03

rad50; All proteins in this family for which functions are known are involved in recombination, recombinational repair, and/or non-homologous end joining. They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 43.50  E-value: 4.29e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1623 ERLRRQVQDETQRKRQAEAELALRVQAEAEAA--REKQRALQALEEL------RLQAEEAERRLRQAEAERARQVQVALE 1694
Cdd:TIGR00606  189 ETLRQVRQTQGQKVQEHQMELKYLKQYKEKACeiRDQITSKEAQLESsreivkSYENELDPLKNRLKEIEHNLSKIMKLD 268
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1695 ---TAQRSAEAELQSEHASFAEKTAQL----ERTLKE-EHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANE 1766
Cdd:TIGR00606  269 neiKALKSRKKQMEKDNSELELKMEKVfqgtDEQLNDlYHNHQRTVREKERELVDCQRELEKLNKERRLLNQEKTELLVE 348
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1767 ALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQrelaEQELEKQRQLAEG-TAQQRLAAEQELIRLR 1845
Cdd:TIGR00606  349 QGRLQLQADRHQEHIRARDSLIQSLATRLELDGFERGPFSERQIKN----FHTLVIERQEDEAkTAAQLCADLQSKERLK 424
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1846 aeteqgEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKAR--AEEESRSTSEKSKQRLEAEAGRFR 1923
Cdd:TIGR00606  425 ------QEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRilELDQELRKAERELSKAEKNSLTET 498
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1924 ELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLaaiseaTRLKTEA-EIALKEKEAENERLRRLAEDEAFQ 2002
Cdd:TIGR00606  499 LKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEML------TKDKMDKdEQIRKIKSRHSDELTSLLGYFPNK 572
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2003 RRLLE--EQAAQHKADIEARLAQLRKasesELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAElELELGRIRGT 2080
Cdd:TIGR00606  573 KQLEDwlHSKSKEINQTRDRLAKLNK----ELASLEQNKNHINNELESKEEQLSSYEDKLFDVCGSQDE-ESDLERLKEE 647
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2081 AEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEARRLRERAEQESAR 2160
Cdd:TIGR00606  648 IEKSSKQRAMLAGATAVYSQFITQLTDENQSCCPVCQRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDE 727
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2161 QLQLAqEAAQKRLQAEEKAhafavqqkeqelqqtlqqeqsvLERLRSEAEAARRAAEEAEAARERAEREAAQSRRQVEEA 2240
Cdd:TIGR00606  728 MLGLA-PGRQSIIDLKEKE----------------------IPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESA 784
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2241 ERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQaADAEMEKHKQFAEQALRQKAQVEQEltalrlq 2320
Cdd:TIGR00606  785 KVCLTDVTIMERFQMELKDVERKIAQQAAKLQGSDLDRTVQQVNQEK-QEKQHELDTVVSKIELNRKLIQDQQ------- 856
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2321 lEETDHQKSILDeELQRLKAEVTEAARQRGQVEEELFSLRVQMEELgklkARIEAENRALVLRDKDSAQRLLQEEAEKMK 2400
Cdd:TIGR00606  857 -EQIQHLKSKTN-ELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSL----IREIKDAKEQDSPLETFLEKDQQEKEELIS 930
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2401 QVAEEAARLSVAAQE-AARLRQLAEEDLAQQRALAEKMLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQM 2479
Cdd:TIGR00606  931 SKETSNKKAQDKVNDiKEKVKNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLEECEKHQEKINEDMRLMRQDIDTQ 1010
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2480 AQQLAQETQGFQKTLETERQRQLEMSAE---AERLRLRVAEMSRAQARAEEDARRFRKQaedigERLYRTELATQEKVML 2556
Cdd:TIGR00606 1011 KIQERWLQDNLTLRKRENELKEVEEELKqhlKEMGQMQVLQMKQEHQKLEENIDLIKRN-----HVLALGRQKGYEKEIK 1085
                          970       980       990      1000      1010
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237946 2557 VQTLETQRQQSDRDAERLREAIAELEHEKDKLKQ--------EAQLLQLKSEEMQTV 2605
Cdd:TIGR00606 1086 HFKKELREPQFRDAEEKYREMMIVMRTTELVNKDldiyyktlDQAIMKFHSMKMEEI 1142
CH_PARVA_rpt2 cd21337
second calponin homology (CH) domain found in alpha-parvin; Alpha-parvin, also called ...
179-287 4.32e-03

second calponin homology (CH) domain found in alpha-parvin; Alpha-parvin, also called actopaxin, calponin-like integrin-linked kinase-binding protein (CH-ILKBP), or matrix-remodeling-associated protein 2, plays a role in sarcomere organization and in smooth muscle cell contraction. It is required for normal development of the embryonic cardiovascular system, and for normal septation of the heart outflow tract. It is also involved in the reorganization of the actin cytoskeleton, the formation of lamellipodia and ciliogenesis, as well as in the establishment of cell polarity, cell adhesion, cell spreading, and directed cell migration. Alpha-parvin contains two copies of the CH domain. This model corresponds to the second CH domain. CH domains are actin filament (F-actin) binding motifs.


Pssm-ID: 409186  Cd Length: 129  Bit Score: 40.36  E-value: 4.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  179 ADERDRVQKKTFTKWVNKHLIKAQRHISDLYEDLRDGHNLISLLEVLSGDSLPREKGRMR----FHKLQNVQIALDYLRH 254
Cdd:cd21337     14 APDKLNVVKKTLITFVNKHLNKLNLEVTELETQFADGVYLVLLMGLLEGYFVPLHSFFLTpdsfEQKVLNVSFAFELMQD 93
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1920237946  255 RQVKLVNIRNDDIADGNPKLTLGLIWTIILHFQ 287
Cdd:cd21337     94 GGLEKPKPRPEDIVNCDLKSTLRVLYNLFTKYR 126
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1446-1779 4.41e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 42.97  E-value: 4.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1446 RTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQE 1525
Cdd:COG4372     12 RLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQA 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1526 EVARREEvaveAQEQKRSIQEELQHLRQsseaEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALR 1605
Cdd:COG4372     92 AQAELAQ----AQEELESLQEEAEELQE----ELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQ 163
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1606 ARAEEAEAQ-----KRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQ 1680
Cdd:COG4372    164 EELAALEQElqalsEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALE 243
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1681 AEAERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERW 1760
Cdd:COG4372    244 LEEDKEELLEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLE 323
                          330
                   ....*....|....*....
gi 1920237946 1761 QLKANEALRLRLQAEEVAQ 1779
Cdd:COG4372    324 LAKKLELALAILLAELADL 342
PspA_IM30 pfam04012
PspA/IM30 family; This family includes PspA, a protein that suppresses sigma54-dependent ...
1462-1678 4.44e-03

PspA/IM30 family; This family includes PspA, a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria, and its homolog in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma(54) containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator.


Pssm-ID: 461130 [Multi-domain]  Cd Length: 215  Bit Score: 41.97  E-value: 4.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1462 FISETLRRMEEEERLAEQQRAEERERLAEVEaalekqRQLAEAHAQAKaQAEREAqglqRRMQEEVARREEVAVEAQEQK 1541
Cdd:pfam04012   12 NIHEGLDKAEDPEKMLEQAIRDMQSELVKAR------QALAQTIARQK-QLERRL----EQQTEQAKKLEEKAQAALTKG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1542 RsiqeelQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEE 1621
Cdd:pfam04012   81 N------EELAREALAEKKSLEKQAEALETQLAQQRSAVEQLRKQLAALETKIQQLKAKKNLLKARLKAAKAQEAVQTSL 154
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237946 1622 AERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQaLEELRLQAEEAERRL 1678
Cdd:pfam04012  155 GSLSTSSATDSFERIEEKIEEREARADAAAELASAVDLDAK-LEQAGIQMEVSEDVL 210
SAC6 COG5069
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton];
188-400 4.48e-03

Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton];


Pssm-ID: 227401 [Multi-domain]  Cd Length: 612  Bit Score: 43.39  E-value: 4.48e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  188 KTFTKWVNKHLIKAQrhISDLYEDLRDGHNLISLLEVLSGD---SLPREKGR-------MRFHKLQNVQIALDYLRHRQV 257
Cdd:COG5069    382 RVFTFWLNSLDVSPE--ITNLFGDLRDQLILLQALSKKLMPmtvTHKLVKKQpasgieeNRFKAFENENYAVDLGITEGF 459
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  258 KLVNIRNDDIADGNpKLTLGLIW-------TIILHFQISDiqvsgqsEDMTAKEKLLLWSQRMV------EGCQGLRCDN 324
Cdd:COG5069    460 SLVGIKGLEILDGI-RLKLTLVWqvlrsntALFNHVLKKD-------GCGLSDSDLCAWLGSLGlkgdkeEGIRSFGDPA 531
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946  325 FTTSWRDGRLFNAIIHrhkPTLIDMNKVyRQTNLENLDQA-------FSVAERDLGVTRLLDPEDVDVPQPdEKSIITYV 397
Cdd:COG5069    532 GSVSGVFYLDVLKGIH---SELVDYDLV-TRGFTEFDDIAdarslaiSSKILRSLGAIIKFLPEDINGVRP-RLDVLTFI 606

                   ...
gi 1920237946  398 SSL 400
Cdd:COG5069    607 ESL 609
PRK12678 PRK12678
transcription termination factor Rho; Provisional
1502-1690 4.50e-03

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 43.35  E-value: 4.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1502 AEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEir 1581
Cdd:PRK12678    67 AATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARK-- 144
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1582 vvRLQLEATERQRGGAEGELQALRARAEEAEA--QKRQAQEEAERLRRQVQDETQRKRQaEAELALRVQAEAEAAREKQR 1659
Cdd:PRK12678   145 --AGEGGEQPATEARADAAERTEEEERDERRRrgDREDRQAEAERGERGRREERGRDGD-DRDRRDRREQGDRREERGRR 221
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1920237946 1660 ALQALEELRLQAEEAERRLRQAEAERARQVQ 1690
Cdd:PRK12678   222 DGGDRRGRRRRRDRRDARGDDNREDRGDRDG 252
MAP7 pfam05672
MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is ...
1465-1578 4.56e-03

MAP7 (E-MAP-115) family; The organization of microtubules varies with the cell type and is presumably controlled by tissue-specific microtubule-associated proteins (MAPs). The 115-kDa epithelial MAP (E-MAP-115/MAP7) has been identified as a microtubule-stabilising protein predominantly expressed in cell lines of epithelial origin. The binding of this microtubule associated protein is nucleotide independent.


Pssm-ID: 461709 [Multi-domain]  Cd Length: 153  Bit Score: 40.79  E-value: 4.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1465 ETLRRMEEEERLAEQQRA-EERERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGL-QRRMQEEVARREEVAVEAQEQKR 1542
Cdd:pfam05672   11 EAARILAEKRRQAREQRErEEQERLEKEEEERLRKEELRRRAEEERARREEEARRLeEERRREEEERQRKAEEEAEEREQ 90
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 1920237946 1543 SIQEELQHLRQSSEAeiqAKARQVEAAERSRLRIEE 1578
Cdd:pfam05672   91 REQEEQERLQKQKEE---AEAKAREEAERQRQEREK 123
PRK12704 PRK12704
phosphodiesterase; Provisional
1597-1706 4.58e-03

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 43.23  E-value: 4.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1597 AEGELQALRARAE-EAEAQKR----QAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALE-ELRLQ 1670
Cdd:PRK12704    36 AEEEAKRILEEAKkEAEAIKKeallEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREeELEKK 115
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 1920237946 1671 AEEAERRLRQAEAERARqvqvaLETAQRSAEAELQS 1706
Cdd:PRK12704   116 EKELEQKQQELEKKEEE-----LEELIEEQLQELER 146
TolA COG3064
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
1613-2085 4.82e-03

Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442298 [Multi-domain]  Cd Length: 485  Bit Score: 43.10  E-value: 4.82e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1613 AQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQR--ALQALEELRLQAEEAERRLRQAEAERARQvQ 1690
Cdd:COG3064      1 AQEALEEKAAEAAAQERLEQAEAEKRAAAEAEQKAKEEAEEERLAELeaKRQAEEEAREAKAEAEQRAAELAAEAAKK-L 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1691 VALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVA----VVQLREEATRRAQQQAEAERARAEAERELERWQLKANE 1766
Cdd:COG3064     80 AEAEKAAAEAEKKAAAEKAKAAKEAEAAAAAEKAAAAAekekAEEAKRKAEEEAKRKAEEERKAAEAEAAAKAEAEAARA 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1767 ALRLRLQAEEVAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRA 1846
Cdd:COG3064    160 AAAAAAAAAAAAARAAAGAAAALVAAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAARAAAASREAALAAVEA 239
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1847 ETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELA 1926
Cdd:COG3064    240 TEEAALGGAEEAADLAAVGVLGAALAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAALAAELLGAVAAEEAVLA 319
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1927 EEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAEDEAFQRRLL 2006
Cdd:COG3064    320 AAAAAGALVVRGGGAASLEAALSLLAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLADVEEAAGAGILAAAGGGG 399
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1920237946 2007 EEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTL 2085
Cdd:COG3064    400 LLGLRLDLGAALLEAASAVELRVLLALAGAAGAVVALLVKLVADLAGGLVGIGKALTGDADALLGILKAVALDGGAVLA 478
Apolipoprotein pfam01442
Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a ...
1486-1657 4.99e-03

Apolipoprotein A1/A4/E domain; These proteins contain several 22 residue repeats which form a pair of alpha helices. This family includes: Apolipoprotein A-I. Apolipoprotein A-IV. Apolipoprotein E.


Pssm-ID: 460211 [Multi-domain]  Cd Length: 175  Bit Score: 41.10  E-value: 4.99e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1486 ERLAEVEAALEK-QRQLAEAHAQAKAQAEREAQGLQRRMQEEV-ARREEVAVEAQEQKRSIQEELQHLRQsseaeiQAKA 1563
Cdd:pfam01442    4 DSLDELSTYAEElQEQLGPVAQELVDRLEKETEALRERLQKDLeEVRAKLEPYLEELQAKLGQNVEELRQ------RLEP 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1564 RQVEAAERSRLRIEEEIRVVRlqlEATERQRGGAEGELQALRARAEE-AEAQKRQAQEEAERLRRQVQDETQ----RKRQ 1638
Cdd:pfam01442   78 YTEELRKRLNADAEELQEKLA---PYGEELRERLEQNVDALRARLAPyAEELRQKLAERLEELKESLAPYAEevqaQLSQ 154
                          170
                   ....*....|....*....
gi 1920237946 1639 AEAELALRVQAEAEAAREK 1657
Cdd:pfam01442  155 RLQELREKLEPQAEDLREK 173
PLN02939 PLN02939
transferase, transferring glycosyl groups
2300-2553 5.15e-03

transferase, transferring glycosyl groups


Pssm-ID: 215507 [Multi-domain]  Cd Length: 977  Bit Score: 43.35  E-value: 5.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2300 AEQALRQKAQVEQELTALRLQLEETDHQK----------SILDEELQRLKAEVTEAARQRGQVEEELF-SLRVQMEELGK 2368
Cdd:PLN02939   158 LEKILTEKEALQGKINILEMRLSETDARIklaaqekihvEILEEQLEKLRNELLIRGATEGLCVHSLSkELDVLKEENML 237
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2369 LKARIEAEnRALVLRDKDSAQRLLQEEAEKM---KQVAEEAARLSVAAQEAARLRQLAEED------------------- 2426
Cdd:PLN02939   238 LKDDIQFL-KAELIEVAETEERVFKLEKERSlldASLRELESKFIVAQEDVSKLSPLQYDCwwekvenlqdlldratnqv 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2427 ------LAQQRALAEK--MLKEKMQAVQEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTL---- 2494
Cdd:PLN02939   317 ekaalvLDQNQDLRDKvdKLEASLKEANVSKFSSYKVELLQQKLKLLEERLQASDHEIHSYIQLYQESIKEFQDTLsklk 396
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1920237946 2495 ETERQRQLEMSAEA------ERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEK 2553
Cdd:PLN02939   397 EESKKRSLEHPADDmpsefwSRILLLIDGWLLEKKISNNDAKLLREMVWKRDGRIREAYLSCKGK 461
hsdR PRK11448
type I restriction enzyme EcoKI subunit R; Provisional
1918-2027 5.19e-03

type I restriction enzyme EcoKI subunit R; Provisional


Pssm-ID: 236912 [Multi-domain]  Cd Length: 1123  Bit Score: 43.40  E-value: 5.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1918 EAGRFRELAEEAARLRALAEEAKRQRQLAEE--------DAVRQRAEAERVLAEKLAAISEATRLKTEAEIA-LKEKEAE 1988
Cdd:PRK11448   130 KPGPFVPPEDPENLLHALQQEVLTLKQQLELqarekaqsQALAEAQQQELVALEGLAAELEEKQQELEAQLEqLQEKAAE 209
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 1920237946 1989 NErlrrlAEDEAFQRRLLEEQAAQHKAD-IEARL---AQLRKA 2027
Cdd:PRK11448   210 TS-----QERKQKRKEITDQAAKRLELSeEETRIlidQQLRKA 247
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
1304-1575 5.31e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 42.44  E-value: 5.31e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1304 RERVTLLLERWQAVLAQTDVR---QRELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKAL 1380
Cdd:COG4942      2 RKLLLLALLLALAAAAQADAAaeaEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAAL 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1381 LEDIERHGEKVEECQRFAKQYINAIKdyELQLVTYKAQLEPvaspakKPKVQSGSESIIQEYvdlrtRYSELSTLTSQYI 1460
Cdd:COG4942     82 EAELAELEKEIAELRAELEAQKEELA--ELLRALYRLGRQP------PLALLLSPEDFLDAV-----RRLQYLKYLAPAR 148
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1461 RFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRqlaeahaQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQ 1540
Cdd:COG4942    149 REQAEELRADLAELAALRAELEAERAELEALLAELEEER-------AALEALKAERQKLLARLEKELAELAAELAELQQE 221
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 1920237946 1541 KRSIQEELQHLRQSSEAEIQAKARQVEAAERSRLR 1575
Cdd:COG4942    222 AEELEALIARLEAEAAAAAERTPAAGFAALKGKLP 256
SPFH_like_u3 cd03406
Uncharacterized family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily; This ...
1463-1571 5.49e-03

Uncharacterized family; SPFH (stomatin, prohibitin, flotillin, and HflK/C) superfamily; This model summarizes an uncharacterized family of proteins similar to stomatin, prohibitin, flotillin, HflK/C (SPFH) and podocin. The conserved domain common to the SPFH superfamily has also been referred to as the Band 7 domain. Many superfamily members are associated with lipid rafts. Individual proteins of the SPFH superfamily may cluster to form membrane microdomains which may in turn recruit multiprotein complexes. Microdomains formed from flotillin proteins may in addition be dynamic units with their own regulatory functions. Flotillins have been implicated in signal transduction, vesicle trafficking, cytoskeleton rearrangement and are known to interact with a variety of proteins. Stomatin interacts with and regulates members of the degenerin/epithelial Na+ channel family in mechanosensory cells of Caenorhabditis elegans and vertebrate neurons and participates in trafficking of Glut1 glucose transporters. Prohibitin may act as a chaperone for the stabilization of mitochondrial proteins. Prokaryotic HflK/C plays a role in the decision between lysogenic and lytic cycle growth during lambda phage infection. Flotillins have been implicated in the progression of prion disease, in the pathogenesis of neurodegenerative diseases such as Parkinson's and Alzheimer's disease, and in cancer invasion and metastasis. Mutations in the podocin gene give rise to autosomal recessive steroid-resistant nephrotic syndrome.


Pssm-ID: 259804 [Multi-domain]  Cd Length: 293  Bit Score: 42.28  E-value: 5.49e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1463 ISETLRR----MEEEE-RLAeqqRAEERERLAEVEAALEKQRQLAEahaqakaqAEREAQGLQRRMQEEVARReevavEA 1537
Cdd:cd03406    160 IPEAIRRnyeaMEAEKtKLL---IAEQHQKVVEKEAETERKRAVIE--------AEKDAEVAKIQMQQKIMEK-----EA 223
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 1920237946 1538 QEQKRSIQEELQHLRQSS--EAEIQAKARQVEAAER 1571
Cdd:cd03406    224 EKKISEIEDEMHLAREKAraDAEYYRALREAEANKL 259
WEMBL pfam05701
Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required ...
1467-1850 5.56e-03

Weak chloroplast movement under blue light; WEMBL consists of several plant proteins required for the chloroplast avoidance response under high intensity blue light. This avoidance response consists of the relocation of chloroplasts to the anticlinal side of exposed cells. WEMBL acts in association with PMI2 to maintain the velocity of chloroplast photo-relocation movement via the regulation of cp-actin filaments. Thus several member sequences are described as "myosin heavy chain-like".


Pssm-ID: 461718 [Multi-domain]  Cd Length: 562  Bit Score: 42.71  E-value: 5.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1467 LRRMEEEERLAEQQRAEERERLAEVEAALEKQRQL-AEAHAQAKAQAEREAQGLQRRMQEEVARREEVAveaqeqkrSIQ 1545
Cdd:pfam05701  228 LKQAEEELQRLNQQLLSAKDLKSKLETASALLLDLkAELAAYMESKLKEEADGEGNEKKTSTSIQAALA--------SAK 299
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1546 EELQHLRQSSE-AEIQAKARQVeAAERSRLRIEEEIRVVrlqleATERQRGGA--------EGELQALRARAEEAEAQKR 1616
Cdd:pfam05701  300 KELEEVKANIEkAKDEVNCLRV-AAASLRSELEKEKAEL-----ASLRQREGMasiavsslEAELNRTKSEIALVQAKEK 373
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1617 QAQEEAERLRRQVQdetqrkrqaeaelalrvQAEAEAAREKQRALQALEELRLQAEEAErrlrQAEAErARQVQVALETA 1696
Cdd:pfam05701  374 EAREKMVELPKQLQ-----------------QAAQEAEEAKSLAQAAREELRKAKEEAE----QAKAA-ASTVESRLEAV 431
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1697 QRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATrraqqqaeaeraraeaerelerwqLKANEALRLRLQAEE 1776
Cdd:pfam05701  432 LKEIEAAKASEKLALAAIKALQESESSAESTNQEDSPRGVT------------------------LSLEEYYELSKRAHE 487
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 1777 VAQQKSLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQ 1850
Cdd:pfam05701  488 AEELANKRVAEAVSQIEEAKESELRSLEKLEEVNREMEERKEALKIALEKAEKAKEGKLAAEQELRKWRAEHEQ 561
Nop14 pfam04147
Nop14-like family; Emg1 and Nop14 are novel proteins whose interaction is required for the ...
1862-1999 5.62e-03

Nop14-like family; Emg1 and Nop14 are novel proteins whose interaction is required for the maturation of the 18S rRNA and for 40S ribosome production.


Pssm-ID: 461196  Cd Length: 835  Bit Score: 42.99  E-value: 5.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1862 LARLQREAAAATQKRRELEAELAKVRAEM--EVLLASKARAEEesrstseKSKQRLEAEAGRFR---ELAEEAARLRALA 1936
Cdd:pfam04147  139 LKRVRRAHFGGGEDDEEEEPERKKSKKEVmeEVIAKSKLHKYE-------RQKAKEEDEELREEldkELKDLRSLLSGSK 211
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1920237946 1937 EEAKRQRQLAEEDAVRQRAEAE-----RVLA-EKLAAISEatRLKTEAEIALKEKE----AENERLRRLAEDE 1999
Cdd:pfam04147  212 RPKPEQAKKPEEKPDRKKPDDDydklvRELAfDKRAKPSD--RTKTEEELAEEEKErlekLEEERLRRMRGEE 282
Cast pfam10174
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ...
2297-2488 5.69e-03

RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.


Pssm-ID: 431111 [Multi-domain]  Cd Length: 766  Bit Score: 42.89  E-value: 5.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2297 KQFAEQALRQKAQVEQELTALRlqleETDHQKSILDEELQRLkaevtEAARQRGQVEEELFSLRVQMEELGKLKARIEAE 2376
Cdd:pfam10174  568 ARYKEESGKAQAEVERLLGILR----EVENEKNDKDKKIAEL-----ESLTLRQMKEQNKKVANIKHGQQEMKKKGAQLL 638
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2377 NRALVLRD---KDSAQRLLQE---EAEKMKQVAEEA-ARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEKMQAVQEAT 2449
Cdd:pfam10174  639 EEARRREDnlaDNSQQLQLEElmgALEKTRQELDATkARLSSTQQSLAEKDGHLTNLRAERRKQLEEILEMKQEALLAAI 718
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 1920237946 2450 RLK----AEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQ 2488
Cdd:pfam10174  719 SEKdaniALLELSSSKKKKTQEEVMALKREKDRLVHQLKQQTQ 761
2A1904 TIGR00927
K+-dependent Na+/Ca2+ exchanger; [Transport and binding proteins, Cations and iron carrying ...
1471-1658 5.75e-03

K+-dependent Na+/Ca2+ exchanger; [Transport and binding proteins, Cations and iron carrying compounds]


Pssm-ID: 273344 [Multi-domain]  Cd Length: 1096  Bit Score: 43.06  E-value: 5.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1471 EEEERLAEQQRAEERERLAEVEaalEKQRQLAEAHAQAKAQAER--------EAQGLQRRMQ-----EEVARREEVAVEA 1537
Cdd:TIGR00927  649 GERPTEAEGENGEESGGEAEQE---GETETKGENESEGEIPAERkgeqegegEIEAKEADHKgeteaEEVEHEGETEAEG 725
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1538 QEQKRSIQ--EELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEirvvrlqlEATERQRGGAEGELQA---LRARAEEAE 1612
Cdd:TIGR00927  726 TEDEGEIEtgEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGE--------TEAEGKEDEDEGEIQAgedGEMKGDEGA 797
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 1920237946 1613 AQKRQAQEEAERlRRQVQDETQRKRQAEAELALRVQAEAEAAREKQ 1658
Cdd:TIGR00927  798 EGKVEHEGETEA-GEKDEHEGQSETQADDTEVKDETGEQELNAENQ 842
TPH pfam13868
Trichohyalin-plectin-homology domain; This family is a mixture of two different families of ...
2362-2657 5.83e-03

Trichohyalin-plectin-homology domain; This family is a mixture of two different families of eukaryotic proteins. Trichoplein, or mitostatin, was first defined as a meiosis-specific nuclear structural protein. It has since been linked with mitochondrial movement. It is associated with the mitochondrial outer membrane, and over-expression leads to a reduction in mitochondrial motility, whereas lack of it enhances mitochondrial movement. The activity appears to be mediated through binding the mitochondria to the actin intermediate filaments (IFs). The family is in the trichohyalin-plectin-homology domain.


Pssm-ID: 464007 [Multi-domain]  Cd Length: 341  Bit Score: 42.21  E-value: 5.83e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2362 QMEELGKLKARIEAENRALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKEK 2441
Cdd:pfam13868   27 QIAEKKRIKAEEKEEERRLDEMMEEERERALEEEEEKEEERKEERKRYRQELEEQIEEREQKRQEEYEEKLQEREQMDEI 106
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2442 MQAVQEATRLKAEAELLQQQKELA--QEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMS 2519
Cdd:pfam13868  107 VERIQEEDQAEAEEKLEKQRQLREeiDEFNEEQAEWKELEKEEEREEDERILEYLKEKAEREEEREAEREEIEEEKEREI 186
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2520 RAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKS 2599
Cdd:pfam13868  187 ARLRAQQEKAQDEKAERDELRAKLYQEEQERKERQKEREEAEKKARQRQELQQAREEQIELKERRLAEEAEREEEEFERM 266
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1920237946 2600 EEMQTVRQEQLLQETQALQQSFLSEKDSLLQ----RERCIEQEKAKLEQLFQDEVAKAQALR 2657
Cdd:pfam13868  267 LRKQAEDEEIEQEEAEKRRMKRLEHRRELEKqieeREEQRAAEREEELEEGERLREEEAERR 328
PRK07353 PRK07353
F0F1 ATP synthase subunit B'; Validated
1468-1563 5.88e-03

F0F1 ATP synthase subunit B'; Validated


Pssm-ID: 235999 [Multi-domain]  Cd Length: 140  Bit Score: 40.37  E-value: 5.88e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1468 RRMEEEERLAEQQRAEERERLAEVEA-ALEKQRQLAEAHAQAK---AQAEREAQGLqrrmqeevaRREEVAV---EAQEQ 1540
Cdd:PRK07353    32 KVVEEREDYIRTNRAEAKERLAEAEKlEAQYEQQLASARKQAQaviAEAEAEADKL---------AAEALAEaqaEAQAS 102
                           90       100
                   ....*....|....*....|...
gi 1920237946 1541 KRSIQEELQHLRQSSEAEIQAKA 1563
Cdd:PRK07353   103 KEKARREIEQQKQAALAQLEQQV 125
ATAD3_N pfam12037
ATPase family AAA domain-containing protein 3, N-terminal; This is the conserved N-terminal ...
1804-1944 6.04e-03

ATPase family AAA domain-containing protein 3, N-terminal; This is the conserved N-terminal domain of ATPase family AAA domain-containing protein 3 (ATAD3), which is involved in dimerization and interacts with the inner surface of the outer mitochondrial membrane. This domain is found associated with the AAA ATPase domain (pfam00004). ATAD3 is essential for mitochondrial network organization, mitochondrial metabolism and cell growth at the organism and cellular levels. It may also play an important role in mitochondrial protein synthesis.


Pssm-ID: 463442 [Multi-domain]  Cd Length: 264  Bit Score: 41.89  E-value: 6.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1804 KAEEQaVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEA-- 1881
Cdd:pfam12037   52 KKQEQ-TRQAELQAKIKEYEAAQEQLKIERQRVEYEERRKTLQEETKQKQQRAQYQDELARKRYQDQLEAQRRRNEELlr 130
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237946 1882 ---ELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEA-GRFRELAE-EAARLRALAEEAKRQRQ 1944
Cdd:pfam12037  131 kqeESVAKQEAMRIQAQRRQTEEHEAELRRETERAKAEAEAeARAKEEREnEDLNLEQLREKANEERE 198
PRK11637 PRK11637
AmiB activator; Provisional
1344-1544 6.36e-03

AmiB activator; Provisional


Pssm-ID: 236942 [Multi-domain]  Cd Length: 428  Bit Score: 42.37  E-value: 6.36e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1344 PLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLEDIERHGEKVEECQRfakqyinAIKDYELQLVTYKAQL-EPV 1422
Cdd:PRK11637    37 AFSAHASDNRDQLKSIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAISQASR-------KLRETQNTLNQLNKQIdELN 109
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1423 ASPAKKPKVQSGSESIIQEYVDLRTRYSE-------LSTLTSQ-------YIRFISE----------------TLRRMEE 1472
Cdd:PRK11637   110 ASIAKLEQQQAAQERLLAAQLDAAFRQGEhtglqliLSGEESQrgerilaYFGYLNQarqetiaelkqtreelAAQKAEL 189
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1473 EERLAEQQ----------------RAEERERLAEVEAALEK-QRQLAE--------------AHAQAKAQAEREAQGLQR 1521
Cdd:PRK11637   190 EEKQSQQKtllyeqqaqqqkleqaRNERKKTLTGLESSLQKdQQQLSElranesrlrdsiarAEREAKARAEREAREAAR 269
                          250       260
                   ....*....|....*....|....
gi 1920237946 1522 -RMQEEVARREEVAVEAQEQKRSI 1544
Cdd:PRK11637   270 vRDKQKQAKRKGSTYKPTESERSL 293
TolA COG3064
Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];
1468-1875 6.44e-03

Membrane protein TolA involved in colicin uptake [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442298 [Multi-domain]  Cd Length: 485  Bit Score: 42.72  E-value: 6.44e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1468 RRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEAHAQAK-AQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQE 1546
Cdd:COG3064     24 EKRAAAEAEQKAKEEAEEERLAELEAKRQAEEEAREAKAEAEqRAAELAAEAAKKLAEAEKAAAEAEKKAAAEKAKAAKE 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1547 ELQHLRQSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLR 1626
Cdd:COG3064    104 AEAAAAAEKAAAAAEKEKAEEAKRKAEEEAKRKAEEERKAAEAEAAAKAEAEAARAAAAAAAAAAAAAARAAAGAAAALV 183
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1627 RQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQS 1706
Cdd:COG3064    184 AAAAAAVEAADTAAAAAAALAAAAAAAAADAALLALAVAARAAAASREAALAAVEATEEAALGGAEEAADLAAVGVLGAA 263
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1707 EHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQLKANEALRLRLQAEEVAQQKSLTQA 1786
Cdd:COG3064    264 LAAAAAGAAALSSGLVVVAAALAGLAAAAAGLVLDDSAALAAELLGAVAAEEAVLAAAAAAGALVVRGGGAASLEAALSL 343
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1787 EAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQ 1866
Cdd:COG3064    344 LAAGAAAAAAGAGALATGALGDALAAEAAGALLLGKLADVEEAAGAGILAAAGGGGLLGLRLDLGAALLEAASAVELRVL 423

                   ....*....
gi 1920237946 1867 REAAAATQK 1875
Cdd:COG3064    424 LALAGAAGA 432
Plectin pfam00681
Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous ...
3811-3849 6.52e-03

Plectin repeat; This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.


Pssm-ID: 459901  Cd Length: 39  Bit Score: 37.31  E-value: 6.52e-03
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1920237946 3811 YLYGTGCVAGIYRPGSRQTLTIYQALKKGQLSAEVARQL 3849
Cdd:pfam00681    1 LLEAQAATGGIIDPVTGERLSVEEAVKRGLIDPETAQKL 39
PRK12678 PRK12678
transcription termination factor Rho; Provisional
1557-1710 6.58e-03

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 42.58  E-value: 6.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1557 AEIQAKARQVEAAERSRLRIEEEIrvvrlqlEATERQRGGAEGElqALRARAEEAEAQKRQAQEEAERLRRQVQDETQRK 1636
Cdd:PRK12678    29 PELRALAKQLGIKGTSGMRKGELI-------AAIKEARGGGAAA--AAATPAAPAAAARRAARAAAAARQAEQPAAEAAA 99
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1920237946 1637 RQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQVQVALETAQRSAEAELQSEHAS 1710
Cdd:PRK12678   100 AKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERR 173
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
2046-2608 6.79e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 42.74  E-value: 6.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2046 RRQVEEEILALKgSFEKAAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQlAAEEERRRREAEERVQKSLAAEEE 2125
Cdd:PRK03918   147 REKVVRQILGLD-DYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEK-ELEEVLREINEISSELPELREELE 224
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2126 AARQRKaalEEVERLKAKVEEARRLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERL 2205
Cdd:PRK03918   225 KLEKEV---KELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEF 301
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2206 RSEAEAARRAAEEAEAARERAEREAaqsRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQ 2285
Cdd:PRK03918   302 YEEYLDELREIEKRLSRLEEEINGI---EERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLK 378
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2286 KQAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLK----------AEVTEAARQRGQVE-- 2353
Cdd:PRK03918   379 KRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKkakgkcpvcgRELTEEHRKELLEEyt 458
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2354 EELFSLRVQMEELGKLKARIEAENRAL-VLRDKDSAQRLLQEEAEKMKQVAEE-----AARLSVAAQEAARLRQLAEEDL 2427
Cdd:PRK03918   459 AELKRIEKELKEIEEKERKLRKELRELeKVLKKESELIKLKELAEQLKELEEKlkkynLEELEKKAEEYEKLKEKLIKLK 538
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2428 AQQRALAE-----KMLKEKMQAVQEATRlKAEAELLQQQKELAQEQARRLQEDKEQMaqqlaQETQGFQK---TLETERQ 2499
Cdd:PRK03918   539 GEIKSLKKeleklEELKKKLAELEKKLD-ELEEELAELLKELEELGFESVEELEERL-----KELEPFYNeylELKDAEK 612
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2500 RQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDIGERLYRTELATQEKVMLvqtletqrqQSDRDAERLREAIA 2579
Cdd:PRK03918   613 ELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYL---------ELSRELAGLRAELE 683
                          570       580
                   ....*....|....*....|....*....
gi 1920237946 2580 ELEHEKDKLKQEAQLLQLKSEEMQTVRQE 2608
Cdd:PRK03918   684 ELEKRREEIKKTLEKLKEELEEREKAKKE 712
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
2550-2800 6.81e-03

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kinds of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 43.04  E-value: 6.81e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2550 TQEKVMLVQTLETQRQQSDRDAERLREaIAELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLL 2629
Cdd:pfam02463  165 SRLKRKKKEALKKLIEETENLAELIID-LEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQ 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2630 QRERCIEQEKAKLEQLFQDEVAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQ 2709
Cdd:pfam02463  244 ELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEK 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2710 EKLLAEENQRLRERLQHLEEERRAALARSEEIAPSRAAAARALPNGQDAADGPAAAAEPEHAFD-GLRRKVPAQRLQEVG 2788
Cdd:pfam02463  324 KKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSsAAKLKEEELELKSEE 403
                          250
                   ....*....|..
gi 1920237946 2789 VLSAEELQQLAQ 2800
Cdd:pfam02463  404 EKEAQLLLELAR 415
CHASE3 COG5278
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
1246-1689 6.81e-03

Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];


Pssm-ID: 444089 [Multi-domain]  Cd Length: 530  Bit Score: 42.59  E-value: 6.81e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1246 ATLPELEATKAALKKLRAQAEAQQPVFDALRDELRGAQEVGERLQQRHGERDVEVERWRERVTLLLERWQAVLAQTDVRQ 1325
Cdd:COG5278     83 EARAEIDELLAELRSLTADNPEQQARLDELEALIDQWLAELEQVIALRRAGGLEAALALVRSGEGKALMDEIRARLLLLA 162
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1326 RELEQLGRQLRYYRESADPLGAWLRDAKQRQEQIQAVPLANSQAVREQLRQEKALLEDIERHGEKVEECQRFAKQYINAI 1405
Cdd:COG5278    163 LALAALLLAAAALLLLLLALAALLALAELLLLALARALAALLLLLLLEAELAAAAALLAAAAALAALAALELLAALALAL 242
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1406 KDYELQLVTYKAQLEPVASPAKKPKVQSGSESIIQEYVDLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEER 1485
Cdd:COG5278    243 ALLLAALLLALLAALALAALLAAALLALAALLLALAAAAALAAAAALELAAAEALALAELELELLLAAAAAAAAAAAAAA 322
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1486 ERLAEVEAALEKQRQLAEAHAQAKAQAEREAQGLQRRMQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQ 1565
Cdd:COG5278    323 AALAALLALALATALAAAAAALALLAALLAEAAAAAAEEAEAAAEAAAAALAGLAEVEAEGAAEAVELEVLAIAAAAAAA 402
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1566 VEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELAL 1645
Cdd:COG5278    403 AAEAAAAAAAAAAASAAEALELAEALAEALALAEEEALALAAASSELAEAGAALALAAAEALAEELAAVAALAALAAAAA 482
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....
gi 1920237946 1646 RVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAEAERARQV 1689
Cdd:COG5278    483 ALAEAEAAAALAAAAALSLALALAALLLAAAEAALAAALAAALA 526
tolA PRK09510
cell envelope integrity inner membrane protein TolA; Provisional
1860-2085 6.93e-03

cell envelope integrity inner membrane protein TolA; Provisional


Pssm-ID: 236545 [Multi-domain]  Cd Length: 387  Bit Score: 42.10  E-value: 6.93e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1860 EELARLQREAAAATQKRRELEaelaKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEagrfRELAEEAARLralaeEA 1939
Cdd:PRK09510    62 EQYNRQQQQQKSAKRAEEQRK----KKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQ----KKQAEEAAKQ-----AA 128
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1940 KRQRQlAEEDAVRQRAEAervlaeKLAAISEATRLKTEAEIALKEKEA-ENERLRRLAEDEAfQRRLLEEQAAQHKADIE 2018
Cdd:PRK09510   129 LKQKQ-AEEAAAKAAAAA------KAKAEAEAKRAAAAAKKAAAEAKKkAEAEAAKKAAAEA-KKKAEAEAAAKAAAEAK 200
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237946 2019 ARLAQLRKASESELERQKGLVEdtlrQRRQVEEEILALKGSFEKAAAGKAELELELGRIRGTAEDTL 2085
Cdd:PRK09510   201 KKAEAEAKKKAAAEAKKKAAAE----AKAAAAKAAAEAKAAAEKAAAAKAAEKAAAAKAAAEVDDLF 263
ERM_helical pfam20492
Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related ...
1573-1690 6.97e-03

Ezrin/radixin/moesin, alpha-helical domain; The ERM family consists of three closely-related proteins, ezrin, radixin and moesin. Ezrin was first identified as a constituent of microvilli, radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions, and moesin as a heparin binding protein. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain, an extended alpha-helical domain and a charged C-terminal domain (pfam00769). Ezrin, radixin and merlin also contain a polyproline linker region between the helical and C-terminal domains. The N-terminal domain is highly conserved and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily, designated the FERM domain. ERM proteins crosslink actin filaments with plasma membranes. They co-localize with CD44 at actin filament plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains. This is the alpha-helical domain, which is involved in intramolecular masking of protein-protein interaction sites, regulating the activity of these proteins.


Pssm-ID: 466641 [Multi-domain]  Cd Length: 120  Bit Score: 39.52  E-value: 6.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1573 RLRIEEEIRVVRLQLEATERQRGGAEGELQALRARAEEAEAQKRQAQEEAERL---RRQVQDETQR-KRQAEAELALRVQ 1648
Cdd:pfam20492    1 REEAEREKQELEERLKQYEEETKKAQEELEESEETAEELEEERRQAEEEAERLeqkRQEAEEEKERlEESAEMEAEEKEQ 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1920237946 1649 AEAEAAREKQRALQALEELRLQAEEAERrlRQAEAERARQVQ 1690
Cdd:pfam20492   81 LEAELAEAQEEIARLEEEVERKEEEARR--LQEELEEAREEE 120
hsdR PRK11448
type I restriction enzyme EcoKI subunit R; Provisional
2331-2426 7.08e-03

type I restriction enzyme EcoKI subunit R; Provisional


Pssm-ID: 236912 [Multi-domain]  Cd Length: 1123  Bit Score: 42.63  E-value: 7.08e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2331 LDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENRALvlrdkdsAQRLLQEEAEKMKQVAEEAARLS 2410
Cdd:PRK11448   147 LQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQEL-------EAQLEQLQEKAAETSQERKQKRK 219
                           90
                   ....*....|....*.
gi 1920237946 2411 VAAQEAARLRQLAEED 2426
Cdd:PRK11448   220 EITDQAAKRLELSEEE 235
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
1807-1972 7.14e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 42.12  E-value: 7.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1807 EQAVRQRELAEQELEKQRQLAEGTAQQRLAAEQELIRLRAETEQGEQQRQLLEEELARLQREAAAATQKRRELEAELAKV 1886
Cdd:COG3883    115 SDFLDRLSALSKIADADADLLEELKADKAELEAKKAELEAKLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAA 194
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1887 RAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLA 1966
Cdd:COG3883    195 EAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAASAAGAGAAGAAGAAAGSAGAAGAAAGAAGA 274

                   ....*.
gi 1920237946 1967 AISEAT 1972
Cdd:COG3883    275 GAAAAS 280
CHASE3 COG5278
Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];
1832-2243 7.17e-03

Extracytoplasmic sensor domain CHASE3 (specificity unknown) [Signal transduction mechanisms];


Pssm-ID: 444089 [Multi-domain]  Cd Length: 530  Bit Score: 42.59  E-value: 7.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1832 QQRLAAEQELIRLRAETEQGEQQRQLL---EEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTS 1908
Cdd:COG5278     83 EARAEIDELLAELRSLTADNPEQQARLdelEALIDQWLAELEQVIALRRAGGLEAALALVRSGEGKALMDEIRARLLLLA 162
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1909 EKSKQRLEAEAGRFRELAEEAARLRALAEEAKRQRQLAEEDAVRQRAEAERVLAEKLAAISEATRLKTEAEIALKEKEAE 1988
Cdd:COG5278    163 LALAALLLAAAALLLLLLALAALLALAELLLLALARALAALLLLLLLEAELAAAAALLAAAAALAALAALELLAALALAL 242
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1989 NERLRRLAEDEAFQRRLLEEQAAQHKADIEARLAQLRKASESELERQKGLVEDTLRQRRQVEEEILALKGSFEKAAAGKA 2068
Cdd:COG5278    243 ALLLAALLLALLAALALAALLAAALLALAALLLALAAAAALAAAAALELAAAEALALAELELELLLAAAAAAAAAAAAAA 322
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2069 ELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKAKVEEAR 2148
Cdd:COG5278    323 AALAALLALALATALAAAAAALALLAALLAEAAAAAAEEAEAAAEAAAAALAGLAEVEAEGAAEAVELEVLAIAAAAAAA 402
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2149 RLRERAEQESARQLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEEAEAARERAER 2228
Cdd:COG5278    403 AAEAAAAAAAAAAASAAEALELAEALAEALALAEEEALALAAASSELAEAGAALALAAAEALAEELAAVAALAALAAAAA 482
                          410
                   ....*....|....*
gi 1920237946 2229 EAAQSRRQVEEAERL 2243
Cdd:COG5278    483 ALAEAEAAAALAAAA 497
CCDC22 pfam05667
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 ...
1453-1707 7.18e-03

Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 (CCDC22) is involved in regulation of NF-kappa-B signalling; the function may involve association with COMMD8 and a CUL1-dependent E3 ubiquitin ligase complex. It is part of the COMMD/CCDC22/CCDC93 (CCC) complex, which interacts with the multisubunit WASH complex required for endosomal deposition of F-actin and cargo trafficking in conjunction with the retromer. This entry also includes CCDC22 homologs from animals and plants.


Pssm-ID: 461708 [Multi-domain]  Cd Length: 600  Bit Score: 42.71  E-value: 7.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1453 STLTSQYIRFISETLRRMEEEE---------RLAEQQ-RAEERERLaeVEAALEKQRQLAEAHAQAKAQAEREAQGLQRR 1522
Cdd:pfam05667  203 SVVPSLLERNAAELAAAQEWEEewnsqglasRLTPEEyRKRKRTKL--LKRIAEQLRSAALAGTEATSGASRSAQDLAEL 280
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1523 MQEEVARREEVAVEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAAERsRLRIEEEIRVVRLQLEATERQRGGAEGELQ 1602
Cdd:pfam05667  281 LSSFSGSSTTDTGLTKGSRFTHTEKLQFTNEAPAATSSPPTKVETEEEL-QQQREEELEELQEQLEDLESSIQELEKEIK 359
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1603 ALRARAEEAEAQKRQAQEEAERLrrqvQDETQRKRQAEAEL--------ALRVQAEAEAAR--------EKQRA--LQAL 1664
Cdd:pfam05667  360 KLESSIKQVEEELEELKEQNEEL----EKQYKVKKKTLDLLpdaeeniaKLQALVDASAQRlvelagqwEKHRVplIEEY 435
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*....
gi 1920237946 1665 EELRL----QAEEAERRLRQAEAERaRQVQVALETAQRSAE--AELQSE 1707
Cdd:pfam05667  436 RALKEaksnKEDESQRKLEEIKELR-EKIKEVAEEAKQKEElyKQLVAE 483
DUF4670 pfam15709
Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins ...
2524-2703 7.18e-03

Domain of unknown function (DUF4670); This family of proteins is found in eukaryotes. Proteins in this family are typically between 373 and 763 amino acids in length.


Pssm-ID: 464815 [Multi-domain]  Cd Length: 522  Bit Score: 42.63  E-value: 7.18e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2524 RAEEDARRFRKQAEDIGERLYRTELATQEKvmlvqtlETQRQQSDRDAERLREAIAELEHEKDKLKQEAQL--------- 2594
Cdd:pfam15709  328 REQEKASRDRLRAERAEMRRLEVERKRREQ-------EEQRRLQQEQLERAEKMREELELEQQRRFEEIRLrkqrleeer 400
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2595 LQLKSEEMQTVRQEQLLQETQALQQSFLSEKDSLLQRERCIEQ-EKAKLEQLFQDEVaKAQALREEQQRQQQQMQQEKQQ 2673
Cdd:pfam15709  401 QRQEEEERKQRLQLQAAQERARQQQEEFRRKLQELQRKKQQEEaERAEAEKQRQKEL-EMQLAEEQKRLMEMAEEERLEY 479
                          170       180       190
                   ....*....|....*....|....*....|
gi 1920237946 2674 LAASMEEARRRQHEAEEGvRRQQEELQRLA 2703
Cdd:pfam15709  480 QRQKQEAEEKARLEAEER-RQKEEEAARLA 508
PRK00106 PRK00106
ribonuclease Y;
2398-2573 7.37e-03

ribonuclease Y;


Pssm-ID: 178867 [Multi-domain]  Cd Length: 535  Bit Score: 42.55  E-value: 7.37e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2398 KMKQvAEEAARLSV--AAQEAARLRQLAEEDLAQQRALAEKMLKEK-----MQAVQEAT--RLKAEAELLQQQKELAQEQ 2468
Cdd:PRK00106    25 KMKS-AKEAAELTLlnAEQEAVNLRGKAERDAEHIKKTAKRESKALkkellLEAKEEARkyREEIEQEFKSERQELKQIE 103
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2469 ARRLQE------------DKEQMAQQLAQETQGFQKTLEtERQRQLEMSAEAERLRL-RVAEMSRAQAR----------- 2524
Cdd:PRK00106   104 SRLTERatsldrkdenlsSKEKTLESKEQSLTDKSKHID-EREEQVEKLEEQKKAELeRVAALSQAEAReiilaetenkl 182
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*....
gi 1920237946 2525 AEEDARRFRKQAEDIGERLYRTelatqEKVMLVQTLetQRQQSDRDAER 2573
Cdd:PRK00106   183 THEIATRIREAEREVKDRSDKM-----AKDLLAQAM--QRLAGEYVTEQ 224
HEC1 COG5185
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ...
2313-2703 7.55e-03

Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 444066 [Multi-domain]  Cd Length: 594  Bit Score: 42.64  E-value: 7.55e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2313 ELTALRLQLEETDHQKSILdEELQRLKAEVTEAARQRGQVEEELFSLrvqmEELGKLKARIEAENRALVLRDKDSAQRLL 2392
Cdd:COG5185    174 QNLKKLEIFGLTLGLLKGI-SELKKAEPSGTVNSIKESETGNLGSES----TLLEKAKEIINIEEALKGFQDPESELEDL 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2393 QEEAEKMKQVAEEAA--RLSVAAQEAARLRQLAEEDLAQQRALAEkmLKEKMQAVQEATRLKAEAELLQQQK---ELAQE 2467
Cdd:COG5185    249 AQTSDKLEKLVEQNTdlRLEKLGENAESSKRLNENANNLIKQFEN--TKEKIAEYTKSIDIKKATESLEEQLaaaEAEQE 326
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2468 QARRLQEDKEQMAQQLAQETQGFQKTLE--TERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAEDigerlyr 2545
Cdd:COG5185    327 LEESKRETETGIQNLTAEIEQGQESLTEnlEAIKEEIENIVGEVELSKSSEELDSFKDTIESTKESLDEIPQN------- 399
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2546 telATQEKVMLVQTLETQRQQSDRDAERLREAI----AELEHEKDKLKQEAQLLQLKSEEMQTVRQEQLLQETQALQQSF 2621
Cdd:COG5185    400 ---QRGYAQEILATLEDTLKAADRQIEELQRQIeqatSSNEEVSKLLNELISELNKVMREADEESQSRLEEAYDEINRSV 476
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2622 LSEKDSLLQRERCIEQEKAKLEqlfqdevAKAQALREEQQRQQQQMQQEKQQLAASMEEARRRQHEAEEgvrRQQEELQR 2701
Cdd:COG5185    477 RSKKEDLNEELTQIESRVSTLK-------ATLEKLRAKLERQLEGVRSKLDQVAESLKDFMRARGYAHI---LALENLIP 546

                   ..
gi 1920237946 2702 LA 2703
Cdd:COG5185    547 AS 548
YqiK COG2268
Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];
1605-1875 7.64e-03

Uncharacterized membrane protein YqiK, contains Band7/PHB/SPFH domain [Function unknown];


Pssm-ID: 441869 [Multi-domain]  Cd Length: 439  Bit Score: 42.17  E-value: 7.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1605 RARAEEAEAQKRQAQEEAERLRRQVQDETQRKRqaeaELALRVQAEAEAAREKQRALQALEELRLQAEeAERRLRQAEAE 1684
Cdd:COG2268    200 DARIAEAEAERETEIAIAQANREAEEAELEQER----EIETARIAEAEAELAKKKAEERREAETARAE-AEAAYEIAEAN 274
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1685 RARQVQVALETAQRSAEAELQsehasfaektaQLERTLKEEhvavvqlreeatrraqqqaeaeraraeaerelerwQLKA 1764
Cdd:COG2268    275 AEREVQRQLEIAEREREIELQ-----------EKEAEREEA-----------------------------------ELEA 308
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1765 NEALRLRLQAEEVAQqksltqaeaekqkeeaerearrrgkaeeqavrqRELAEQELEKQRQLAEGTAQQRLAAeqelirl 1844
Cdd:COG2268    309 DVRKPAEAEKQAAEA---------------------------------EAEAEAEAIRAKGLAEAEGKRALAE------- 348
                          250       260       270
                   ....*....|....*....|....*....|..
gi 1920237946 1845 rAETEQGEQQRQL-LEEELARLQREAAAATQK 1875
Cdd:COG2268    349 -AWNKLGDAAILLmLIEKLPEIAEAAAKPLEK 379
Caldesmon pfam02029
Caldesmon;
1619-1943 7.75e-03

Caldesmon;


Pssm-ID: 460421 [Multi-domain]  Cd Length: 495  Bit Score: 42.16  E-value: 7.75e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1619 QEEAERLRRQVQDETQRKRQAEAELALRVQ---------AEAEAAREKQRALQALEEL-----RLQAEEAERRLRQAEA- 1683
Cdd:pfam02029    4 EEEAARERRRRAREERRRQKEEEEPSGQVTesvepnehnSYEEDSELKPSGQGGLDEEeafldRTAKREERRQKRLQEAl 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1684 ERARQVQVALETAQ---RSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERw 1760
Cdd:pfam02029   84 ERQKEFDPTIADEKesvAERKENNEEEENSSWEKEEKRDSRLGRYKEEETEIREKEYQENKWSTEVRQAEEEGEEEEDK- 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1761 QLKANEALRLRLQAEEVAQQK-------SLTQAEAEKQKEEAEREARRRGKAEEQAVRQRELAEQELEKQRQLAEGTAQQ 1833
Cdd:pfam02029  163 SEEAEEVPTENFAKEEVKDEKikkekkvKYESKVFLDQKRGHPEVKSQNGEEEVTKLKVTTKRRQGGLSQSQEREEEAEV 242
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1834 RLAAEQELIRLRaeteqgeQQRQLLEEElarlqrEAAAATQKRRELEAELAKVRAEMEVllASKARAEEESRSTSEKSKQ 1913
Cdd:pfam02029  243 FLEAEQKLEELR-------RRRQEKESE------EFEKLRQKQQEAELELEELKKKREE--RRKLLEEEEQRRKQEEAER 307
                          330       340       350
                   ....*....|....*....|....*....|
gi 1920237946 1914 RLEAEAGRfRELAEEAARLRALAEEaKRQR 1943
Cdd:pfam02029  308 KLREEEEK-RRMKEEIERRRAEAAE-KRQK 335
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
1444-1772 7.79e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 42.20  E-value: 7.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1444 DLRTRYSELSTLTSQYIRFISETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQLAEahaQAKAQAEREAQGLQRRM 1523
Cdd:COG4372     17 GLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELE---QLEEELEELNEQLQAAQ 93
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1524 QEEVARREEVAvEAQEQKRSIQEELQHLR-QSSEAEIQAKARQVEAAERSRLRIEEEIRVVRLQLEATERQRGGAEGELQ 1602
Cdd:COG4372     94 AELAQAQEELE-SLQEEAEELQEELEELQkERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQE 172
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1603 ALRARAEEAEAQKRQAQEEAERLRRQVQDETQRKRQAEAELALRVQAEAEAAREKQRALQALEELRLQAEEAERRLRQAE 1682
Cdd:COG4372    173 LQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELL 252
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1683 AERARQVQVALETAQRSAEAELQSEHASFAEKTAQLERTLKEEHVAVVQLREEATRRAQQQAEAERARAEAERELERWQL 1762
Cdd:COG4372    253 EEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKKLELAL 332
                          330
                   ....*....|
gi 1920237946 1763 KANEALRLRL 1772
Cdd:COG4372    333 AILLAELADL 342
PLEC smart00250
Plectin repeat;
3810-3846 8.01e-03

Plectin repeat;


Pssm-ID: 197605  Cd Length: 38  Bit Score: 37.08  E-value: 8.01e-03
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1920237946  3810 RYLYGTGCVAGIYRPGSRQTLTIYQALKKGQLSAEVA 3846
Cdd:smart00250    2 RLLEAQSAIGGIIDPETGQKLSVEEALRRGLIDPETG 38
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
2446-2731 8.09e-03

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 42.19  E-value: 8.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2446 QEATRLKAEAELLQQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSA---EAERLRLRVAEMSRAQ 2522
Cdd:pfam07888   41 QERAELLQAQEAANRQREKEKERYKRDREQWERQRRELESRVAELKEELRQSREKHEELEEkykELSASSEELSEEKDAL 120
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2523 ARAEEDAR-RFRKQAEDIgerlyrtELATQEKVMLVQTLETQRQQSDRDAERLREAIAELEHEKDKLKQEAQLLQLKSEE 2601
Cdd:pfam07888  121 LAQRAAHEaRIRELEEDI-------KTLTQRVLERETELERMKERAKKAGAQRKEEEAERKQLQAKLQQTEEELRSLSKE 193
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2602 MQTVRQEQLLQETQA--LQQSFLSEKDSLLQRERCIEQEKAKLEQL----------------FQDEVAKAQALREEQQRQ 2663
Cdd:pfam07888  194 FQELRNSLAQRDTQVlqLQDTITTLTQKLTTAHRKEAENEALLEELrslqerlnaserkvegLGEELSSMAAQRDRTQAE 273
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1920237946 2664 QQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLLAEENQRLRERLQHLEEER 2731
Cdd:pfam07888  274 LHQARLQAAQLTLQLADASLALREGRARWAQERETLQQSAEADKDRIEKLSAELQRLEERLQEERMER 341
Golgin_A5 pfam09787
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ...
1436-1600 8.15e-03

Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.


Pssm-ID: 462900 [Multi-domain]  Cd Length: 305  Bit Score: 41.67  E-value: 8.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1436 ESIIQEYVDLRTrysELSTLTSQYirfiSETLRRMEEEERLAEQQRAEERERLAEVEAALEKQRQ----LAEAHAQAKAQ 1511
Cdd:pfam09787   64 QKLRGQIQQLRT---ELQELEAQQ----QEEAESSREQLQELEEQLATERSARREAEAELERLQEelryLEEELRRSKAT 136
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1512 aereaqgLQRRMQEEVARREEVA--VEAQEQKRSIQEEL-QHLRQSSEAEIQaKARQVEA--AERSRLRieeeirvvrLQ 1586
Cdd:pfam09787  137 -------LQSRIKDREAEIEKLRnqLTSKSQSSSSQSELeNRLHQLTETLIQ-KQTMLEAlsTEKNSLV---------LQ 199
                          170
                   ....*....|....
gi 1920237946 1587 LEATERQRGGAEGE 1600
Cdd:pfam09787  200 LERMEQQIKELQGE 213
CCDC34 pfam13904
Coiled-coil domain-containing protein 3; This family is found in eukaryotes; it has several ...
1465-1569 8.30e-03

Coiled-coil domain-containing protein 3; This family is found in eukaryotes; it has several conserved tryptophan residues. The function is not known.


Pssm-ID: 464032 [Multi-domain]  Cd Length: 221  Bit Score: 41.23  E-value: 8.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1465 ETLRRMEEEERLAEQQRAEERERLAE--VEAALEKQRQLAEAHAQAKAQ-------AEREAQGLQRRMQEEVARR-EEVA 1534
Cdd:pfam13904   69 QKELQAQKEEREKEEQEAELRKRLAKekYQEWLQRKARQQTKKREESHKqkaaesaSKSLAKPERKVSQEEAKEVlQEWE 148
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 1920237946 1535 VEAQEQKRSIQEELQHLRQSSEAEIQAKARQVEAA 1569
Cdd:pfam13904  149 RKKLEQQQRKREEEQREQLKKEEEEQERKQLAEKA 183
mukB PRK04863
chromosome partition protein MukB;
2287-2740 8.34e-03

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 42.64  E-value: 8.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2287 QAADAEMEKHKQFAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQ-----VEEELFSLRV 2361
Cdd:PRK04863   431 GLPDLTADNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAAHSQFEQAYQLVRKIAGEVSRSEAWdvareLLRRLREQRH 510
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2362 QMEELGKLKARI-EAENRalvLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAEKMLKE 2440
Cdd:PRK04863   511 LAEQLQQLRMRLsELEQR---LRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEELEARLESLSESVSEARERRMALRQQ 587
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2441 KMQAVQEATRLKAEAELLQQqkelAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSR 2520
Cdd:PRK04863   588 LEQLQARIQRLAARAPAWLA----AQDALARLREQSGEEFEDSQDVTEYMQQLLERERELTVERDELAARKQALDEEIER 663
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2521 AQARAEEDARRFRKQAEDIG-------------------ERLY---------------RTELATQEKVMLVQTLETQrqq 2566
Cdd:PRK04863   664 LSQPGGSEDPRLNALAERFGgvllseiyddvsledapyfSALYgparhaivvpdlsdaAEQLAGLEDCPEDLYLIEG--- 740
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2567 sdrDAERLREAIAEL-EHEKDKLKQEAQLlQLKSEEMQTV----------RQEQLLQETQALQQSF--LSEKDSLLQRer 2633
Cdd:PRK04863   741 ---DPDSFDDSVFSVeELEKAVVVKIADR-QWRYSRFPEVplfgraarekRIEQLRAEREELAERYatLSFDVQKLQR-- 814
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2634 cieqekakLEQLFQDEVAKAQALreeqqRQQQQMQQEKQQLAASMEEARRRQHEAEEGVRRQQEELQRLAQQQQQQEKLL 2713
Cdd:PRK04863   815 --------LHQAFSRFIGSHLAV-----AFEADPEAELRQLNRRRVELERALADHESQEQQQRSQLEQAKEGLSALNRLL 881
                          490       500
                   ....*....|....*....|....*..
gi 1920237946 2714 AEENQRLRERLQHLEEERRAALARSEE 2740
Cdd:PRK04863   882 PRLNLLADETLADRVEEIREQLDEAEE 908
Golgin_A5 pfam09787
Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining ...
2287-2539 8.74e-03

Golgin subfamily A member 5; Members of this family of proteins are involved in maintaining Golgi structure. They stimulate the formation of Golgi stacks and ribbons, and are involved in intra-Golgi retrograde transport. Two main interactions have been characterized: one with RAB1A that has been activated by GTP-binding and another with isoform CASP of CUTL1.


Pssm-ID: 462900 [Multi-domain]  Cd Length: 305  Bit Score: 41.67  E-value: 8.74e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2287 QAADAEMEKHKQFAEQALRQKAQVEQEL------------TALRLQLEETDHQKSILDEELQRLKAEV----TEAARQRG 2350
Cdd:pfam09787    3 ESAKQELADYKQKAARILQSKEKLIASLkegsgvegldssTALTLELEELRQERDLLREEIQKLRGQIqqlrTELQELEA 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2351 QVEEELFSLRvqmEELGKLKARIEAENRAlvlrdKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQ 2430
Cdd:pfam09787   83 QQQEEAESSR---EQLQELEEQLATERSA-----RREAEAELERLQEELRYLEEELRRSKATLQSRIKDREAEIEKLRNQ 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2431 raLAEKMLKEKMQAVQEAtRLKAEAELLQQQkelaQEQARRLQEDKEQMAQQLAQ-ETQGFQKTLETERQRQLEMSA--- 2506
Cdd:pfam09787  155 --LTSKSQSSSSQSELEN-RLHQLTETLIQK----QTMLEALSTEKNSLVLQLERmEQQIKELQGEGSNGTSINMEGisd 227
                          250       260       270
                   ....*....|....*....|....*....|....
gi 1920237946 2507 -EAERLRLRVAEMSRAQARAEEDARRFRKQAEDI 2539
Cdd:pfam09787  228 gEGTRLRNVPGLFSESDSDRAGMYGKVRKAASVI 261
COG3899 COG3899
Predicted ATPase [General function prediction only];
2063-2581 8.98e-03

Predicted ATPase [General function prediction only];


Pssm-ID: 443106 [Multi-domain]  Cd Length: 1244  Bit Score: 42.54  E-value: 8.98e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2063 AAAGKAELELELGRIRGTAEDTLRSKEQAEQEAARQRQLAAEEERRRREAEERVQKSLAAEEEAARQRKAALEEVERLKA 2142
Cdd:COG3899    736 PPDPEEEYRLALLLELAEALYLAGRFEEAEALLERALAARALAALAALRHGNPPASARAYANLGLLLLGDYEEAYEFGEL 815
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2143 KVEEARRLRERAEQESAR----QLQLAQEAAQKRLQAEEKAHAFAVQQKEQELQQTLQQEQSVLERLRSEAEAARRAAEE 2218
Cdd:COG3899    816 ALALAERLGDRRLEARALfnlgFILHWLGPLREALELLREALEAGLETGDAALALLALAAAAAAAAAAAALAAAAAAAAR 895
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2219 AEAARERAEREAAQSRRQVEEAERLKQSAEEQAQAQAQAQAAAEKLRKEAEQEAARRAQAEQAALRQKQAADAEMEKHKQ 2298
Cdd:COG3899    896 LLAAAAAALAAAAAAAALAAAELARLAAAAAAAAALALAAAAAAAAAAALAAAAAAAALAAALALAAAAAAAAAAALAAA 975
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2299 FAEQALRQKAQVEQELTALRLQLEETDHQKSILDEELQRLKAEVTEAARQRGQVEEELFSLRVQMEELGKLKARIEAENR 2378
Cdd:COG3899    976 AAAAAAAAAAAAAAALEAAAAALLALLAAAAAAAAAAAALAAALLAAALAALAAAAAAAALLAAAAALALLAALAAAAAA 1055
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2379 ALVLRDKDSAQRLLQEEAEKMKQVAEEAARLSVAAQEAARLRQLAEEDLAQQRALAekmlkekmqavqeatRLKAEAELL 2458
Cdd:COG3899   1056 AAAAAALAAAAALLAAAAAAAAAAAAAAAAAALAAALAAAALAAAAAAALALAAAL---------------AALALAAAL 1120
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 2459 QQQKELAQEQARRLQEDKEQMAQQLAQETQGFQKTLETERQRQLEMSAEAERLRLRVAEMSRAQARAEEDARRFRKQAED 2538
Cdd:COG3899   1121 AALALAAAARAAAALLLLAAALALALAALLLLAALLLALALLLLALAALALAAALAALAAALLAAAAAAAAAAALLAALL 1200
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|...
gi 1920237946 2539 IGERLYRTELATQEKVMLVQTLETQRQQSDRDAERLREAIAEL 2581
Cdd:COG3899   1201 ALAARLAALLALALLALEAAALLLLLLLAALALAAALLALRLL 1243
PLN02316 PLN02316
synthase/transferase
1609-1672 9.06e-03

synthase/transferase


Pssm-ID: 215180 [Multi-domain]  Cd Length: 1036  Bit Score: 42.55  E-value: 9.06e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1920237946 1609 EEAEAQKRQAQEEAERLRrqvQDETQRKRQAE--AELALRVQAEAEAAREKQRALQALEELRLQAE 1672
Cdd:PLN02316   253 EKRRELEKLAKEEAERER---QAEEQRRREEEkaAMEADRAQAKAEVEKRREKLQNLLKKASRSAD 315
SPEC cd00176
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ...
1852-2027 9.58e-03

Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here


Pssm-ID: 238103 [Multi-domain]  Cd Length: 213  Bit Score: 40.89  E-value: 9.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1852 EQQRQLLEEELARLQREAAAATQKRRELEAELAKVRAEMEVLLASKARAEEESRSTSEKSKQRLEAEAGRFRELAEEAA- 1930
Cdd:cd00176     18 EKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNELGEQLIEEGHPDAEEIQERLEELNQRWEELRELAEe 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1931 RLRALAEEAKRQRQLAEEDAVRQRAEAervlAEKLAAISEATRLKTEAEIALKEKEAENERLRRLAED----EAFQRRLL 2006
Cdd:cd00176     98 RRQRLEEALDLQQFFRDADDLEQWLEE----KEAALASEDLGKDLESVEELLKKHKELEEELEAHEPRlkslNELAEELL 173
                          170       180
                   ....*....|....*....|.
gi 1920237946 2007 EEQAAQHKADIEARLAQLRKA 2027
Cdd:cd00176    174 EEGHPDADEEIEEKLEELNER 194
PspA_IM30 pfam04012
PspA/IM30 family; This family includes PspA a protein that suppresses sigma54-dependent ...
1597-1736 9.89e-03

PspA/IM30 family; This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homolog in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma(54) containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator.


Pssm-ID: 461130 [Multi-domain]  Cd Length: 215  Bit Score: 40.82  E-value: 9.89e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1920237946 1597 AEGELQALRARAEEAEAQKRQAQEEAERLRRQVQdetqrKRQAEAELALRVQAEA---EAAREKQ--------------R 1659
Cdd:pfam04012   34 MQSELVKARQALAQTIARQKQLERRLEQQTEQAK-----KLEEKAQAALTKGNEElarEALAEKKslekqaealetqlaQ 108
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1920237946 1660 ALQALEELRLQAEEAERRLRQAEAERaRQVQVALETAQRSAEAELQSEHASFAEKTAQLERTlkEEHVAVVQLREEA 1736
Cdd:pfam04012  109 QRSAVEQLRKQLAALETKIQQLKAKK-NLLKARLKAAKAQEAVQTSLGSLSTSSATDSFERI--EEKIEEREARADA 182
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
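
The per-hit summary lines above ("Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...") can be checked against the E-value threshold of 0.01 listed in the search parameters. The following is a minimal Python sketch, not part of the CD-Search output. The names HitSummary, parse_summary and passes_threshold are hypothetical, and the regular expression assumes the summary lines are laid out exactly as they appear in this listing.

```python
import re
from typing import NamedTuple, Optional


class HitSummary(NamedTuple):
    """One 'Pssm-ID ...' summary line (hypothetical container, not an NCBI type)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumed layout, taken from the summary lines in this listing:
# "Pssm-ID: 464815 [Multi-domain]  Cd Length: 522  Bit Score: 42.63  E-value: 7.18e-03"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<score>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.]+(?:[eE][+-]?\d+)?)"
)


def parse_summary(line: str) -> Optional[HitSummary]:
    """Return the parsed fields, or None if the line is not a summary line."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return HitSummary(
        pssm_id=int(m["pssm"]),
        multi_domain=m["multi"] is not None,
        cd_length=int(m["length"]),
        bit_score=float(m["score"]),
        e_value=float(m["evalue"]),
    )


def passes_threshold(hit: HitSummary, threshold: float = 0.01) -> bool:
    """True if the hit's E-value is at or below the threshold (0.01 in this search)."""
    return hit.e_value <= threshold


if __name__ == "__main__":
    example = "Pssm-ID: 197605  Cd Length: 38  Bit Score: 37.08  E-value: 8.01e-03"
    hit = parse_summary(example)
    print(hit, passes_threshold(hit))
```

Every hit in this part of the listing has an E-value between 7.18e-03 and 9.89e-03, so each one passes the 0.01 cutoff only narrowly.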

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.