NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|109659847|ref|NP_001035924|]
View 

filamin A-interacting protein 1-like isoform 3 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
CortBP2 pfam09727
Cortactin-binding protein-2; This entry is the first approximately 250 residues of ...
66-248 3.77e-64

Cortactin-binding protein-2; This entry is the first approximately 250 residues of cortactin-binding protein 2. In addition to being a positional candidate for autism this protein is expressed at highest levels in the brain in humans. The human protein has six associated ankyrin repeat domains pfam00023 towards the C-terminus which act as protein-protein interaction domains.


:

Pssm-ID: 462860 [Multi-domain]  Cd Length: 187  Bit Score: 215.16  E-value: 3.77e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847    66 DLSRDDLLFLLSILEGELQARDEVIGILKAEKMDLALLEAQYGFVTPKKVLEALQRDAFQAKSTPWQEDIYE----KPMN 141
Cdd:pfam09727    1 DLSKDDLLKLLSILEGELQARDIVIAVLKAEKVKQLLLEARYGFKYPSDPLLALQRDSELLRDQSQDEDVYEamyeKPLA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   142 ELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLIDQEIKSQEEKEQEK 221
Cdd:pfam09727   81 ELEKLVEKQRETQRRMLEQLAAAEKRHRRVIRELEEEKRKHARDTAQGDDFTYLLEKERERLKQELEQEKAQQKRLEKEL 160
                          170       180
                   ....*....|....*....|....*..
gi 109659847   222 EKRVTTLKEELTKLKSFALMVVDEQQR 248
Cdd:pfam09727  161 KKLLEKLEEELSKQKQIALLLVKERKR 187
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
224-773 1.04e-18

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 92.43  E-value: 1.04e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   224 RVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFH 303
Cdd:TIGR02168  233 RLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLA 312
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   304 QDQDTIM---AKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKIskgEYGNAGIMAEVEELRKRVL 380
Cdd:TIGR02168  313 NLERQLEeleAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEEL---ESRLEELEEQLETLRSKVA 389
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   381 DMEGK----DEELIKMEEQCRDLNKRLERET---------LQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCN 447
Cdd:TIGR02168  390 QLELQiaslNNEIERLEARLERLEDRRERLQqeieellkkLEEAELKELQAELEELEEELEELQEELERLEEALEELREE 469
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   448 LEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMF--------VDE--RKTMSEKL------ 511
Cdd:TIGR02168  470 LEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILgvlselisVDEgyEAAIEAALggrlqa 549
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   512 ---KKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKrALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNML 588
Cdd:TIGR02168  550 vvvENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTE-IQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGVLVV 628
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   589 KNRLQSLE-AIEKDFLKNKLNQD--------------SGKSTTALHQENN-------------KIKELSQEVERLKLKLK 640
Cdd:TIGR02168  629 DDLDNALElAKKLRPGYRIVTLDgdlvrpggvitggsAKTNSSILERRREieeleekieeleeKIAELEKALAELRKELE 708
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   641 DMKAIEDDLMKTEDEYET----LERRYANERDKAQFLSKELEHVKMELAKYKLAEKTETSHEQWLFKRLQEEEAKSGHLS 716
Cdd:TIGR02168  709 ELEEELEQLRKELEELSRqisaLRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE 788
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 109659847   717 REVDALKEkihEYMATEDLICHLQGDHSVLQKKLNQQENRNRDLGREIENLTKELER 773
Cdd:TIGR02168  789 AQIEQLKE---ELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLED 842
 
Name Accession Description Interval E-value
CortBP2 pfam09727
Cortactin-binding protein-2; This entry is the first approximately 250 residues of ...
66-248 3.77e-64

Cortactin-binding protein-2; This entry is the first approximately 250 residues of cortactin-binding protein 2. In addition to being a positional candidate for autism this protein is expressed at highest levels in the brain in humans. The human protein has six associated ankyrin repeat domains pfam00023 towards the C-terminus which act as protein-protein interaction domains.


Pssm-ID: 462860 [Multi-domain]  Cd Length: 187  Bit Score: 215.16  E-value: 3.77e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847    66 DLSRDDLLFLLSILEGELQARDEVIGILKAEKMDLALLEAQYGFVTPKKVLEALQRDAFQAKSTPWQEDIYE----KPMN 141
Cdd:pfam09727    1 DLSKDDLLKLLSILEGELQARDIVIAVLKAEKVKQLLLEARYGFKYPSDPLLALQRDSELLRDQSQDEDVYEamyeKPLA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   142 ELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLIDQEIKSQEEKEQEK 221
Cdd:pfam09727   81 ELEKLVEKQRETQRRMLEQLAAAEKRHRRVIRELEEEKRKHARDTAQGDDFTYLLEKERERLKQELEQEKAQQKRLEKEL 160
                          170       180
                   ....*....|....*....|....*..
gi 109659847   222 EKRVTTLKEELTKLKSFALMVVDEQQR 248
Cdd:pfam09727  161 KKLLEKLEEELSKQKQIALLLVKERKR 187
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
224-773 1.04e-18

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 92.43  E-value: 1.04e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   224 RVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFH 303
Cdd:TIGR02168  233 RLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLA 312
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   304 QDQDTIM---AKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKIskgEYGNAGIMAEVEELRKRVL 380
Cdd:TIGR02168  313 NLERQLEeleAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEEL---ESRLEELEEQLETLRSKVA 389
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   381 DMEGK----DEELIKMEEQCRDLNKRLERET---------LQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCN 447
Cdd:TIGR02168  390 QLELQiaslNNEIERLEARLERLEDRRERLQqeieellkkLEEAELKELQAELEELEEELEELQEELERLEEALEELREE 469
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   448 LEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMF--------VDE--RKTMSEKL------ 511
Cdd:TIGR02168  470 LEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILgvlselisVDEgyEAAIEAALggrlqa 549
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   512 ---KKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKrALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNML 588
Cdd:TIGR02168  550 vvvENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTE-IQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGVLVV 628
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   589 KNRLQSLE-AIEKDFLKNKLNQD--------------SGKSTTALHQENN-------------KIKELSQEVERLKLKLK 640
Cdd:TIGR02168  629 DDLDNALElAKKLRPGYRIVTLDgdlvrpggvitggsAKTNSSILERRREieeleekieeleeKIAELEKALAELRKELE 708
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   641 DMKAIEDDLMKTEDEYET----LERRYANERDKAQFLSKELEHVKMELAKYKLAEKTETSHEQWLFKRLQEEEAKSGHLS 716
Cdd:TIGR02168  709 ELEEELEQLRKELEELSRqisaLRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE 788
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 109659847   717 REVDALKEkihEYMATEDLICHLQGDHSVLQKKLNQQENRNRDLGREIENLTKELER 773
Cdd:TIGR02168  789 AQIEQLKE---ELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLED 842
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
224-775 6.33e-17

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 86.53  E-value: 6.33e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  224 RVTTLKEELTKLKSFALmvVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFH 303
Cdd:COG1196   214 RYRELKEELKELEAELL--LLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEY 291
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  304 QDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVLDME 383
Cdd:COG1196   292 ELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAE 371
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  384 GKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEK--------LSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTT 455
Cdd:COG1196   372 AELAEAEEELEELAEELLEALRAAAELAAQLEELEEaeeallerLERLEEELEELEEALAELEEEEEEEEEALEEAAEEE 451
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  456 KQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSE-----KLKKTEDKLQAASSQLQVEQn 530
Cdd:COG1196   452 AELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGflegvKAALLLAGLRGLAGAVAVLI- 530
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  531 KVTTVTEKLIEETKRALkSKTDVEEKMYSVTKERDDLK------------NKLKAEEEKGNDLLSRVNMLKNRLQSLEAI 598
Cdd:COG1196   531 GVEAAYEAALEAALAAA-LQNIVVEDDEVAAAAIEYLKaakagratflplDKIRARAALAAALARGAIGAAVDLVASDLR 609
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  599 EKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELE 678
Cdd:COG1196   610 EADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLA 689
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  679 HVKMELAKYKLAEKTETSHEQWLFKRLQEEEAKSGHLSREVDALKEKIHEYMATEDLICHLQG--------DHSVLQKKL 750
Cdd:COG1196   690 EEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEAleelpeppDLEELEREL 769
                         570       580
                  ....*....|....*....|....*....
gi 109659847  751 NQQENRNRDLG----REIENLTKELERYR 775
Cdd:COG1196   770 ERLEREIEALGpvnlLAIEEYEELEERYD 798
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
142-731 1.02e-16

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 85.50  E-value: 1.02e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  142 ELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKR------KHKEYMEKSDEFICLLEQECERLKKLIDQ------ 209
Cdd:PRK03918  190 NIEELIKEKEKELEEVLREINEISSELPELREELEKLEKevkeleELKEEIEELEKELESLEGSKRKLEEKIREleerie 269
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  210 EIKSQEEKEQEKEKRVTTLK---EELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKlalaEARVQEEEQ 286
Cdd:PRK03918  270 ELKKEIEELEEKVKELKELKekaEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEK----EERLEELKK 345
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  287 KATRLEKELQtQTTKFHQDQDTIMAKLTNEdsqnRQLQQKLAALsrqidELEETNRSLRKAEEELQDIKEKISKgeygna 366
Cdd:PRK03918  346 KLKELEKRLE-ELEERHELYEEAKAKKEEL----ERLKKRLTGL-----TPEKLEKELEELEKAKEEIEEEISK------ 409
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  367 gIMAEVEELRKRVLDMEGKDEELIKMEEQC----RDLNKRLERETLqsKDFKLEVEKLSKRimaLEKLEDAFNKSKQECY 442
Cdd:PRK03918  410 -ITARIGELKKEIKELKKAIEELKKAKGKCpvcgRELTEEHRKELL--EEYTAELKRIEKE---LKEIEEKERKLRKELR 483
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  443 SLKCNLEKERmttkqlsqELESLKVRIKELEAIESRLEKTeftlkeDLTKLKtltvmfvdERKTMSEKLKKTEDKLQAAS 522
Cdd:PRK03918  484 ELEKVLKKES--------ELIKLKELAEQLKELEEKLKKY------NLEELE--------KKAEEYEKLKEKLIKLKGEI 541
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  523 SQLqveqnkvttvtEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLkaeEEKGndlLSRVNMLKNRLQSLEAIEKDF 602
Cdd:PRK03918  542 KSL-----------KKELEKLEELKKKLAELEKKLDELEEELAELLKEL---EELG---FESVEELEERLKELEPFYNEY 604
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  603 LKNK--------LNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAI--EDDLMKTEDEYETLERRYANERDKAQF 672
Cdd:PRK03918  605 LELKdaekelerEEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKysEEEYEELREEYLELSRELAGLRAELEE 684
                         570       580       590       600       610
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847  673 LSKELEHVKMELAKYKlAEKTEtsheqwlFKRLQEEEAKSGHLSREVDALKEKIHEYMA 731
Cdd:PRK03918  685 LEKRREEIKKTLEKLK-EELEE-------REKAKKELEKLEKALERVEELREKVKKYKA 735
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
120-810 6.01e-12

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 70.15  E-value: 6.01e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   120 QRDAFQAKSTPWQEDIYEKPMNELDKVVEKHKesyRRILGQLLVAEKSRRQ-----TILELEEEKRKHKE--YMEKSDEF 192
Cdd:pfam15921  246 QLEALKSESQNKIELLLQQHQDRIEQLISEHE---VEITGLTEKASSARSQansiqSQLEIIQEQARNQNsmYMRQLSDL 322
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   193 ---ICLLEQECERLKKLIDQEIKSQEEKEQEKEKRVTTLKEELTKLKSFALMVVDEQQRLTA-------QLTLQRQKIQE 262
Cdd:pfam15921  323 estVSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTERDQFSQESGNLDDQLQKLLAdlhkrekELSLEKEQNKR 402
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   263 LTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNED-------SQNRQLQQKLAALSRQID 335
Cdd:pfam15921  403 LWDRDTGNSITIDHLRRELDDRNMEVQRLEALLKAMKSECQGQMERQMAAIQGKNeslekvsSLTAQLESTKEMLRKVVE 482
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   336 ELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVlDMEGKDEELIKME-EQCRDLNKRLERETLQSKDFK 414
Cdd:pfam15921  483 ELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRV-DLKLQELQHLKNEgDHLRNVQTECEALKLQMAEKD 561
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   415 LEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLK----VRIKELEAIESRLEKTEFTL---- 486
Cdd:pfam15921  562 KVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQLEKEINDRRLELQEFKILKdkkdAKIRELEARVSDLELEKVKLvnag 641
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   487 KEDLTKLKTLTvmfvDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLiEETKRALKSktdveeKMYSVTKERDD 566
Cdd:pfam15921  642 SERLRAVKDIK----QERDQLLNEVKTSRNELNSLSEDYEVLKRNFRNKSEEM-ETTTNKLKM------QLKSAQSELEQ 710
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   567 LKNKLKAEEEKGNDLLS--------------RVNMLKNRLQSLEAI------EKDFLKNKLNQDSGKSTTALHQENNKIK 626
Cdd:pfam15921  711 TRNTLKSMEGSDGHAMKvamgmqkqitakrgQIDALQSKIQFLEEAmtnankEKHFLKEEKNKLSQELSTVATEKNKMAG 790
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   627 EL----SQEvERLKLKLKDMKAIEDDLMKTEDEYETLERRYANE--RDKAQFL--SKELE---HVKMELAKYKLAEKTET 695
Cdd:pfam15921  791 ELevlrSQE-RRLKEKVANMEVALDKASLQFAECQDIIQRQEQEsvRLKLQHTldVKELQgpgYTSNSSMKPRLLQPASF 869
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   696 SHEQWLFKRLQEEEAKSGHLSREVDALKEKiheymATEDLICHLQGDHSVLQKKLNQQENRNRDLGRE---------IEN 766
Cdd:pfam15921  870 TRTHSNVPSSQSTASFLSHHSRKTNALKED-----PTRDLKQLLQELRSVINEEPTVQLSKAEDKGRApslgalddrVRD 944
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*
gi 109659847   767 LTKELERYRHFSKSLRPSLNGRRISDPQVFSKE-VQTEAVDNEPP 810
Cdd:pfam15921  945 CIIESSLRSDICHSSSNSLQTEGSKSSETCSREpVLLHAGELEDP 989
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
140-343 1.96e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 48.22  E-value: 1.96e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  140 MNELDKVVEKHKESYRRILGQLLVAEK---SRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLIDQEIKSQEE 216
Cdd:COG4942    36 IAELEKELAALKKEEKALLKQLAALERriaALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAELLRALYR 115
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  217 KEQEKEKRVTTLKEELTKL-KSFALM--VVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEK 293
Cdd:COG4942   116 LGRQPPLALLLSPEDFLDAvRRLQYLkyLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKA 195
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 109659847  294 ELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRS 343
Cdd:COG4942   196 ERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPA 245
46 PHA02562
endonuclease subunit; Provisional
132-360 6.17e-04

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 43.85  E-value: 6.17e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  132 QEDIYEKPMNELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLIDQei 211
Cdd:PHA02562  196 QIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQ-- 273
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  212 ksqeekeqekekrvttlkeeLTKLKSFaLMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKatrl 291
Cdd:PHA02562  274 --------------------FQKVIKM-YEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEI---- 328
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 109659847  292 EKELQTQTTKFHQDQDTIMAK---LTNEDSQNRQLQqklAALSRQIDELEETNRSLRKAEEELQDIKEKISK 360
Cdd:PHA02562  329 MDEFNEQSKKLLELKNKISTNkqsLITLVDKAKKVK---AAIEELQAEFVDNAEELAKLQDELDKIVKTKSE 397
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
291-430 1.30e-03

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 42.31  E-value: 1.30e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847    291 LEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKA-EEELQDIKEKISKGEYGNAGIM 369
Cdd:smart00787  145 LKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCdPTELDRAKEKLKKLLQEIMIKV 224
                            90       100       110       120       130       140
                    ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 109659847    370 AEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFKL-EVEKLSKRIMALEKL 430
Cdd:smart00787  225 KKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQCRGFTFkEIEKLKEQLKLLQSL 286
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
247-451 7.80e-03

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 39.66  E-value: 7.80e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  247 QRLTAQLTLQRQKIQELTTNAKETHTKLalaearvqeeeqkaTRLEKELQTQTTKFHQDQDTIMAKLTNEDSqnRQLQQK 326
Cdd:cd22656   117 KTIKALLDDLLKEAKKYQDKAAKVVDKL--------------TDFENQTEKDQTALETLEKALKDLLTDEGG--AIARKE 180
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  327 LAALSRQIDELEETNrsLRKAEEELQDIKEKISKGEygnagimAEVEELRKRVLDMEgkdeeliKMEEQCRDLNkrlere 406
Cdd:cd22656   181 IKDLQKELEKLNEEY--AAKLKAKIDELKALIADDE-------AKLAAALRLIADLT-------AADTDLDNLL------ 238
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 109659847  407 tlqskdfklevEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKE 451
Cdd:cd22656   239 -----------ALIGPAIPALEKLQGAWQAIATDLDSLKDLLEDD 272
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
316-671 9.67e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 40.00  E-value: 9.67e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  316 EDSQNRQLQQKLAALSRQIdeLEETNRSLRKAEEEL-QDIKEKISkgeygnagimAEVEELRKRVLDMEGKDEELIKMEE 394
Cdd:NF033838   82 KHTQNVALNKKLSDIKTEY--LYELNVLKEKSEAELtSKTKKELD----------AAFEQFKKDTLEPGKKVAEATKKVE 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  395 QCRDLNKRLERE------TLQSKDFKLEVEKLSKRImalekledafNKSKQECYSLKCNLEKERMTTKQLSQELESLKVR 468
Cdd:NF033838  150 EAEKKAKDQKEEdrrnypTNTYKTLELEIAESDVEV----------KKAELELVKEEAKEPRDEEKIKQAKAKVESKKAE 219
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  469 IKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALK 548
Cdd:NF033838  220 ATRLEKIKTDREKAEEEAKRRADAKLKEAVEKNVATSEQDKPKRRAKRGVLGEPATPDKKENDAKSSDSSVGEETLPSPS 299
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  549 SKTdvEEKMYSVTKERDDLKNKLKAEEEKgndllSRVNMLKNRLQSL--EAIEKDFLKNKLNQDSGKSTTALHQENNKIK 626
Cdd:NF033838  300 LKP--EKKVAEAEKKVEEAKKKAKDQKEE-----DRRNYPTNTYKTLelEIAESDVKVKEAELELVKEEAKEPRNEEKIK 372
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*
gi 109659847  627 ELSQEVERLKLKLKDMKAIEDDLMKTEDEyetlERRYANERDKAQ 671
Cdd:NF033838  373 QAKAKVESKKAEATRLEKIKTDRKKAEEE----AKRKAAEEDKVK 413
 
Name Accession Description Interval E-value
CortBP2 pfam09727
Cortactin-binding protein-2; This entry is the first approximately 250 residues of ...
66-248 3.77e-64

Cortactin-binding protein-2; This entry is the first approximately 250 residues of cortactin-binding protein 2. In addition to being a positional candidate for autism this protein is expressed at highest levels in the brain in humans. The human protein has six associated ankyrin repeat domains pfam00023 towards the C-terminus which act as protein-protein interaction domains.


Pssm-ID: 462860 [Multi-domain]  Cd Length: 187  Bit Score: 215.16  E-value: 3.77e-64
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847    66 DLSRDDLLFLLSILEGELQARDEVIGILKAEKMDLALLEAQYGFVTPKKVLEALQRDAFQAKSTPWQEDIYE----KPMN 141
Cdd:pfam09727    1 DLSKDDLLKLLSILEGELQARDIVIAVLKAEKVKQLLLEARYGFKYPSDPLLALQRDSELLRDQSQDEDVYEamyeKPLA 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   142 ELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLIDQEIKSQEEKEQEK 221
Cdd:pfam09727   81 ELEKLVEKQRETQRRMLEQLAAAEKRHRRVIRELEEEKRKHARDTAQGDDFTYLLEKERERLKQELEQEKAQQKRLEKEL 160
                          170       180
                   ....*....|....*....|....*..
gi 109659847   222 EKRVTTLKEELTKLKSFALMVVDEQQR 248
Cdd:pfam09727  161 KKLLEKLEEELSKQKQIALLLVKERKR 187
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
224-773 1.04e-18

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 92.43  E-value: 1.04e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   224 RVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFH 303
Cdd:TIGR02168  233 RLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLA 312
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   304 QDQDTIM---AKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKIskgEYGNAGIMAEVEELRKRVL 380
Cdd:TIGR02168  313 NLERQLEeleAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEEL---ESRLEELEEQLETLRSKVA 389
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   381 DMEGK----DEELIKMEEQCRDLNKRLERET---------LQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCN 447
Cdd:TIGR02168  390 QLELQiaslNNEIERLEARLERLEDRRERLQqeieellkkLEEAELKELQAELEELEEELEELQEELERLEEALEELREE 469
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   448 LEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMF--------VDE--RKTMSEKL------ 511
Cdd:TIGR02168  470 LEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILgvlselisVDEgyEAAIEAALggrlqa 549
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   512 ---KKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKrALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNML 588
Cdd:TIGR02168  550 vvvENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTE-IQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGVLVV 628
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   589 KNRLQSLE-AIEKDFLKNKLNQD--------------SGKSTTALHQENN-------------KIKELSQEVERLKLKLK 640
Cdd:TIGR02168  629 DDLDNALElAKKLRPGYRIVTLDgdlvrpggvitggsAKTNSSILERRREieeleekieeleeKIAELEKALAELRKELE 708
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   641 DMKAIEDDLMKTEDEYET----LERRYANERDKAQFLSKELEHVKMELAKYKLAEKTETSHEQWLFKRLQEEEAKSGHLS 716
Cdd:TIGR02168  709 ELEEELEQLRKELEELSRqisaLRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE 788
                          570       580       590       600       610
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 109659847   717 REVDALKEkihEYMATEDLICHLQGDHSVLQKKLNQQENRNRDLGREIENLTKELER 773
Cdd:TIGR02168  789 AQIEQLKE---ELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLED 842
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
224-775 6.33e-17

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 86.53  E-value: 6.33e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  224 RVTTLKEELTKLKSFALmvVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFH 303
Cdd:COG1196   214 RYRELKEELKELEAELL--LLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEY 291
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  304 QDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVLDME 383
Cdd:COG1196   292 ELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAE 371
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  384 GKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEK--------LSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTT 455
Cdd:COG1196   372 AELAEAEEELEELAEELLEALRAAAELAAQLEELEEaeeallerLERLEEELEELEEALAELEEEEEEEEEALEEAAEEE 451
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  456 KQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSE-----KLKKTEDKLQAASSQLQVEQn 530
Cdd:COG1196   452 AELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGflegvKAALLLAGLRGLAGAVAVLI- 530
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  531 KVTTVTEKLIEETKRALkSKTDVEEKMYSVTKERDDLK------------NKLKAEEEKGNDLLSRVNMLKNRLQSLEAI 598
Cdd:COG1196   531 GVEAAYEAALEAALAAA-LQNIVVEDDEVAAAAIEYLKaakagratflplDKIRARAALAAALARGAIGAAVDLVASDLR 609
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  599 EKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELE 678
Cdd:COG1196   610 EADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLA 689
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  679 HVKMELAKYKLAEKTETSHEQWLFKRLQEEEAKSGHLSREVDALKEKIHEYMATEDLICHLQG--------DHSVLQKKL 750
Cdd:COG1196   690 EEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEAleelpeppDLEELEREL 769
                         570       580
                  ....*....|....*....|....*....
gi 109659847  751 NQQENRNRDLG----REIENLTKELERYR 775
Cdd:COG1196   770 ERLEREIEALGpvnlLAIEEYEELEERYD 798
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
142-731 1.02e-16

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 85.50  E-value: 1.02e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  142 ELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKR------KHKEYMEKSDEFICLLEQECERLKKLIDQ------ 209
Cdd:PRK03918  190 NIEELIKEKEKELEEVLREINEISSELPELREELEKLEKevkeleELKEEIEELEKELESLEGSKRKLEEKIREleerie 269
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  210 EIKSQEEKEQEKEKRVTTLK---EELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKlalaEARVQEEEQ 286
Cdd:PRK03918  270 ELKKEIEELEEKVKELKELKekaEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEK----EERLEELKK 345
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  287 KATRLEKELQtQTTKFHQDQDTIMAKLTNEdsqnRQLQQKLAALsrqidELEETNRSLRKAEEELQDIKEKISKgeygna 366
Cdd:PRK03918  346 KLKELEKRLE-ELEERHELYEEAKAKKEEL----ERLKKRLTGL-----TPEKLEKELEELEKAKEEIEEEISK------ 409
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  367 gIMAEVEELRKRVLDMEGKDEELIKMEEQC----RDLNKRLERETLqsKDFKLEVEKLSKRimaLEKLEDAFNKSKQECY 442
Cdd:PRK03918  410 -ITARIGELKKEIKELKKAIEELKKAKGKCpvcgRELTEEHRKELL--EEYTAELKRIEKE---LKEIEEKERKLRKELR 483
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  443 SLKCNLEKERmttkqlsqELESLKVRIKELEAIESRLEKTeftlkeDLTKLKtltvmfvdERKTMSEKLKKTEDKLQAAS 522
Cdd:PRK03918  484 ELEKVLKKES--------ELIKLKELAEQLKELEEKLKKY------NLEELE--------KKAEEYEKLKEKLIKLKGEI 541
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  523 SQLqveqnkvttvtEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLkaeEEKGndlLSRVNMLKNRLQSLEAIEKDF 602
Cdd:PRK03918  542 KSL-----------KKELEKLEELKKKLAELEKKLDELEEELAELLKEL---EELG---FESVEELEERLKELEPFYNEY 604
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  603 LKNK--------LNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAI--EDDLMKTEDEYETLERRYANERDKAQF 672
Cdd:PRK03918  605 LELKdaekelerEEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKysEEEYEELREEYLELSRELAGLRAELEE 684
                         570       580       590       600       610
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847  673 LSKELEHVKMELAKYKlAEKTEtsheqwlFKRLQEEEAKSGHLSREVDALKEKIHEYMA 731
Cdd:PRK03918  685 LEKRREEIKKTLEKLK-EELEE-------REKAKKELEKLEKALERVEELREKVKKYKA 735
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
321-772 4.14e-15

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 80.49  E-value: 4.14e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  321 RQLQQKLAALSRQIDELEET-------NRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELR---KRVLDMEGKDEELI 390
Cdd:PRK03918  217 PELREELEKLEKEVKELEELkeeieelEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEekvKELKELKEKAEEYI 296
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  391 KMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLE------KERMTTKQLSQELES 464
Cdd:PRK03918  297 KLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEeleerhELYEEAKAKKEELER 376
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  465 LKVRIK--ELEAIESRLEKTEFTLKEDLTKLKTLTVMF------VDERKTMSEKLKKTEDKLQAASSQLQVEQNKvtTVT 536
Cdd:PRK03918  377 LKKRLTglTPEKLEKELEELEKAKEEIEEEISKITARIgelkkeIKELKKAIEELKKAKGKCPVCGRELTEEHRK--ELL 454
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  537 EKLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEE--KGNDLLSRVNMLKNRLQS--LEAIEKDF-----LKNKL 607
Cdd:PRK03918  455 EEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESEliKLKELAEQLKELEEKLKKynLEELEKKAeeyekLKEKL 534
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  608 NQDSGKSTTaLHQENNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDE-----YETLERR------YANERDKAQFLSKE 676
Cdd:PRK03918  535 IKLKGEIKS-LKKELEKLEELKKKLAELEKKLDELEEELAELLKELEElgfesVEELEERlkelepFYNEYLELKDAEKE 613
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  677 LEHVKMELAkyKLAEKTETSheqwlFKRLQEEEAKSGHLSREVDALKEKI--HEYMATEDLICHLQGDHSVLQKKLNQQE 754
Cdd:PRK03918  614 LEREEKELK--KLEEELDKA-----FEELAETEKRLEELRKELEELEKKYseEEYEELREEYLELSRELAGLRAELEELE 686
                         490
                  ....*....|....*...
gi 109659847  755 NRNRDLGREIENLTKELE 772
Cdd:PRK03918  687 KRREEIKKTLEKLKEELE 704
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
148-773 1.14e-14

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 79.34  E-value: 1.14e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   148 EKHKESYRRILGQLLVAEKSRRQTILELEEEKRKhkeyMEKSDEFICLLEQECERLKKLIDQEIKSQEEKEQEKEKRVTT 227
Cdd:TIGR02169  219 EKREYEGYELLKEKEALERQKEAIERQLASLEEE----LEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKE 294
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   228 LKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLalaEARVQEEEQKATRLEKELQTQTTKFhqdqD 307
Cdd:TIGR02169  295 KIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEEL---EREIEEERKRRDKLTEEYAELKEEL----E 367
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   308 TIMAKLTNEDSQNR-------QLQQKLAALSRQIDELEETNRSL----RKAEEELQDIKEKISKGEYGNAGIMAEVEELR 376
Cdd:TIGR02169  368 DLRAELEEVDKEFAetrdelkDYREKLEKLKREINELKRELDRLqeelQRLSEELADLNAAIAGIEAKINELEEEKEDKA 447
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   377 KRVLDMEGKDEELIKMEEqcrDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTK 456
Cdd:TIGR02169  448 LEIKKQEWKLEQLAADLS---KYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAVEEVLKASIQGVH 524
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   457 QLSQELesLKVRIKELEAIES----RLE----KTEFTLKEDLTKLKT-----LTVMFVDERKTMSEKLKKT--------- 514
Cdd:TIGR02169  525 GTVAQL--GSVGERYATAIEVaagnRLNnvvvEDDAVAKEAIELLKRrkagrATFLPLNKMRDERRDLSILsedgvigfa 602
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   515 ------EDKLQAASSQ-----LQVE----------QNKVTTVTEKLIEET----------KRALKSKTDVEEKMYSVTKE 563
Cdd:TIGR02169  603 vdlvefDPKYEPAFKYvfgdtLVVEdieaarrlmgKYRMVTLEGELFEKSgamtggsrapRGGILFSRSEPAELQRLRER 682
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   564 RDDLKNKLKAEEEKGNDLLSRVNMLKNRL----QSLEAIEKDflKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKL 639
Cdd:TIGR02169  683 LEGLKRELSSLQSELRRIENRLDELSQELsdasRKIGEIEKE--IEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSEL 760
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   640 KDMKA----IEDDLMKTEDEYETLERRYANERdkAQFLSKELEHVKMELAKYKLAEKTETSHEQWLFKRLQEEEAKSGHL 715
Cdd:TIGR02169  761 KELEArieeLEEDLHKLEEALNDLEARLSHSR--IPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQEL 838
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 109659847   716 SREVDALKEKIHEYMATEDLichLQGDHSVLQKKLNQQENRNRDLGREIENLTKELER 773
Cdd:TIGR02169  839 QEQRIDLKEQIKSIEKEIEN---LNGKKEELEEELEELEAALRDLESRLGDLKKERDE 893
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
306-772 2.89e-14

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 77.80  E-value: 2.89e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  306 QDTIMAKLTNEDSQNRQLQQKLaalsrQIDELEETNRSLRKAEEELQDIKEKISKgeygnagIMAEVEELRKRVLDMEgk 385
Cdd:PRK03918  134 QGEIDAILESDESREKVVRQIL-----GLDDYENAYKNLGEVIKEIKRRIERLEK-------FIKRTENIEELIKEKE-- 199
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  386 dEELIKMEEQCRDLNKRLEretlqskDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESL 465
Cdd:PRK03918  200 -KELEEVLREINEISSELP-------ELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEEL 271
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  466 KVRIKELEAIESRLEKTEftlkedltklktltvmfvdERKTMSEKLKKTEDKLQAASSQLQVEQNKVttvtEKLIEETKR 545
Cdd:PRK03918  272 KKEIEELEEKVKELKELK-------------------EKAEEYIKLSEFYEEYLDELREIEKRLSRL----EEEINGIEE 328
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  546 ALKSKTDVEEKMYSVTKERDDLKNKLkAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKSTTALHQENNKI 625
Cdd:PRK03918  329 RIKELEEKEERLEELKKKLKELEKRL-EELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEI 407
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  626 KELSQEVERLKLKLKDMKAIEDDLMK------------TEDEYETLERRY-------ANERDKAQFLSKELEHVKMELAK 686
Cdd:PRK03918  408 SKITARIGELKKEIKELKKAIEELKKakgkcpvcgrelTEEHRKELLEEYtaelkriEKELKEIEEKERKLRKELRELEK 487
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  687 yKLAEKTETSHEQWLFKRLQEEEAKSGHLSREvdALKEKIHEYMATEDLICHLQGDHSVLQ---KKLNQQENRNRDLGRE 763
Cdd:PRK03918  488 -VLKKESELIKLKELAEQLKELEEKLKKYNLE--ELEKKAEEYEKLKEKLIKLKGEIKSLKkelEKLEELKKKLAELEKK 564

                  ....*....
gi 109659847  764 IENLTKELE 772
Cdd:PRK03918  565 LDELEEELA 573
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
142-598 3.06e-14

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 77.77  E-value: 3.06e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  142 ELDKVVEKHKESyRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLK---KLIDQEIKSQEEKE 218
Cdd:PRK02224  238 EADEVLEEHEER-REELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLaeaGLDDADAEAVEARR 316
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  219 QEKEKRVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQ 298
Cdd:PRK02224  317 EELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEEL 396
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  299 TTKFhqdqDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYG----NAGIMAEVEE 374
Cdd:PRK02224  397 RERF----GDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALLEAGKCPECGqpveGSPHVETIEE 472
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  375 LRKRVLDMEgkdEELIKMEEQCRDLNKRLER-----------ETLQSK---------DFKLEVEKLSKRIMAL----EKL 430
Cdd:PRK02224  473 DRERVEELE---AELEDLEEEVEEVEERLERaedlveaedriERLEERredleeliaERRETIEEKRERAEELreraAEL 549
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  431 EDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEfTLKEDLTKLKtltvmfvDERKTMSEK 510
Cdd:PRK02224  550 EAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLERIRTLLAAIA-DAEDEIERLR-------EKREALAEL 621
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  511 LKKTEDKLQAAS---SQL--QVEQNKVttvtEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNK---LKAEEEKGNDLL 582
Cdd:PRK02224  622 NDERRERLAEKRerkRELeaEFDEARI----EEAREDKERAEEYLEQVEEKLDELREERDDLQAEigaVENELEELEELR 697
                         490
                  ....*....|....*.
gi 109659847  583 SRVNMLKNRLQSLEAI 598
Cdd:PRK02224  698 ERREALENRVEALEAL 713
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
276-691 3.23e-14

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 77.79  E-value: 3.23e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   276 LAEARVQEEEQKATRLEKELQTQTTKFHQDQDTI-----MAKLTNEDSQNRQ-LQQKLAALSRQIDELEETNRSLRKaee 349
Cdd:TIGR02168  622 LGGVLVVDDLDNALELAKKLRPGYRIVTLDGDLVrpggvITGGSAKTNSSILeRRREIEELEEKIEELEEKIAELEK--- 698
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   350 ELQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEgkdEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRimaLEK 429
Cdd:TIGR02168  699 ALAELRKELEELEEELEQLRKELEELSRQISALR---KDLARLEAEVEQLEERIAQLSKELTELEAEIEELEER---LEE 772
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   430 LEDAFNKSKQEcyslkcnLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSE 509
Cdd:TIGR02168  773 AEEELAEAEAE-------IEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEE 845
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   510 KLKKTEDKLQAASSqlqvEQNKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLL------- 582
Cdd:TIGR02168  846 QIEELSEDIESLAA----EIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRreleelr 921
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   583 SRVNMLKNRLQSLEAiEKDFLKNKLNQDsGKSTTALHQEN-----NKIKELSQEVERLKLKLKDMKAIEddlMKTEDEYE 657
Cdd:TIGR02168  922 EKLAQLELRLEGLEV-RIDNLQERLSEE-YSLTLEEAEALenkieDDEEEARRRLKRLENKIKELGPVN---LAAIEEYE 996
                          410       420       430
                   ....*....|....*....|....*....|....
gi 109659847   658 TLERRYanerdkaQFLSKELEHVkmELAKYKLAE 691
Cdd:TIGR02168  997 ELKERY-------DFLTAQKEDL--TEAKETLEE 1021
PTZ00121 PTZ00121
MAEBL; Provisional
164-799 1.37e-13

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 75.95  E-value: 1.37e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  164 AEKSRRQTILELEEEkRKHKEYMEKSDEFICLLEQECERLKKLIDQEIKSQEEKEQEKEKRVT-TLKEELTKLKSFALMV 242
Cdd:PTZ00121 1100 AEEAKKTETGKAEEA-RKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEdARKAEEARKAEDAKKA 1178
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  243 VDEQQRLTAQLTLQRQKIQEL--TTNAKETHTKLALAEARVQEEEQKATRLEKelqTQTTKFHQDQDTIMAKLTNEDSQN 320
Cdd:PTZ00121 1179 EAARKAEEVRKAEELRKAEDArkAEAARKAEEERKAEEARKAEDAKKAEAVKK---AEEAKKDAEEAKKAEEERNNEEIR 1255
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  321 RQLQQKLAALSRQIDELEETNRslRKAEEeLQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEGKDEELIKMEE---QCR 397
Cdd:PTZ00121 1256 KFEEARMAHFARRQAAIKAEEA--RKADE-LKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEakkKAD 1332
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  398 DLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEK---ERMTTKQLSQELESLKVRIKELEA 474
Cdd:PTZ00121 1333 AAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKkaeEKKKADEAKKKAEEDKKKADELKK 1412
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  475 IESRLEKTEFTLK--EDLTKLKTLTVMfVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTD 552
Cdd:PTZ00121 1413 AAAAKKKADEAKKkaEEKKKADEAKKK-AEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKK 1491
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  553 VEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKSTTALH--QENNKIKELSQ 630
Cdd:PTZ00121 1492 AEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKkaEEKKKAEEAKK 1571
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  631 EVERLKLKLK---DMKAIE----DDLMKTEDEYETLE----RRYANERDKAQFLSKElEHVKMELAKYKLAEKTETSHEQ 699
Cdd:PTZ00121 1572 AEEDKNMALRkaeEAKKAEeariEEVMKLYEEEKKMKaeeaKKAEEAKIKAEELKKA-EEEKKKVEQLKKKEAEEKKKAE 1650
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  700 WLFKRLQEEEAKSGHLSREVDALKEKIHEYMATEDlichlqgdhsvLQKKLNQQENRNRDLGREIENLTKELERYRHFSK 779
Cdd:PTZ00121 1651 ELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEE-----------DEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAE 1719
                         650       660
                  ....*....|....*....|
gi 109659847  780 SLRPSLNGRRISDPQVFSKE 799
Cdd:PTZ00121 1720 ELKKAEEENKIKAEEAKKEA 1739
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
113-775 2.20e-13

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 75.09  E-value: 2.20e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   113 KKVLEALQRDAFQAKStpwQEDIYEKPMNELDKVVEKHKESYRRILGQLLVAeksrRQTILELEEEKRKHKEYMEKSDEF 192
Cdd:TIGR02168  245 QEELKEAEEELEELTA---ELQELEEKLEELRLEVSELEEEIEELQKELYAL----ANEISRLEQQKQILRERLANLERQ 317
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   193 ICLLEQECERL--KKLIDQEI-KSQEEKEQEKEKRVTTLKEELTKLKsfalmvvDEQQRLTAQLTLQRQKIQELTTNAKE 269
Cdd:TIGR02168  318 LEELEAQLEELesKLDELAEElAELEEKLEELKEELESLEAELEELE-------AELEELESRLEELEEQLETLRSKVAQ 390
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   270 THTKLALAEARVQEEEQKATRLEKELQtqttKFHQDQDTIMAKLtnEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEE 349
Cdd:TIGR02168  391 LELQIASLNNEIERLEARLERLEDRRE----RLQQEIEELLKKL--EEAELKELQAELEELEEELEELQEELERLEEALE 464
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   350 ELQDIKEKISkgeygnagimAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEK 429
Cdd:TIGR02168  465 ELREELEEAE----------QALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSELISVDEG 534
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   430 LEDA----------------FNKSKQECYSLKCNLEKERM--------------TTKQLSQELESLKVRIKELEAIESRL 479
Cdd:TIGR02168  535 YEAAieaalggrlqavvvenLNAAKKAIAFLKQNELGRVTflpldsikgteiqgNDREILKNIEGFLGVAKDLVKFDPKL 614
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   480 EK-----------------------------TEFTLKEDL-------TKLKTLTVMFVDERKTMSEKLKKTEDKLQAASS 523
Cdd:TIGR02168  615 RKalsyllggvlvvddldnalelakklrpgyRIVTLDGDLvrpggviTGGSAKTNSSILERRREIEELEEKIEELEEKIA 694
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   524 QLQVEQNKVTTVTEKLIEETKRALKSKTDVEekmysvtKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEkdfl 603
Cdd:TIGR02168  695 ELEKALAELRKELEELEEELEQLRKELEELS-------RQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEI---- 763
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   604 kNKLNQDSGKSTTALHQENNKIKELSQEVERLKlklKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKME 683
Cdd:TIGR02168  764 -EELEERLEEAEEELAEAEAEIEELEAQIEQLK---EELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERR 839
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   684 LAKYKLAEKTETSHEQWLFKRLQEEEAKSGHLSREVDALKEkihEYMATEDLICHLQGDHSVLQKKLNQQENRNRDLGRE 763
Cdd:TIGR02168  840 LEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLN---ERASLEEALALLRSELEELSEELRELESKRSELRRE 916
                          730
                   ....*....|..
gi 109659847   764 IENLTKELERYR 775
Cdd:TIGR02168  917 LEELREKLAQLE 928
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
172-525 2.73e-13

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 74.72  E-value: 2.73e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   172 ILELEEEKRKHKEYMEKsdeficlLEQECERLKKLIDQEIKSQE--EKEQEKEKRVTTLKEELTKLKSFALM-----VVD 244
Cdd:TIGR02169  165 VAEFDRKKEKALEELEE-------VEENIERLDLIIDEKRQQLErlRREREKAERYQALLKEKREYEGYELLkekeaLER 237
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   245 EQQRLTAQLTLQRQKIQELTtnakethtklalaeARVQEEEQKATRLEKELQTQTTKfhqdqdtIMAKLTNEdsqNRQLQ 324
Cdd:TIGR02169  238 QKEAIERQLASLEEELEKLT--------------EEISELEKRLEEIEQLLEELNKK-------IKDLGEEE---QLRVK 293
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   325 QKLAALSRQI----DELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEgkdEELIKMEEQCRDLN 400
Cdd:TIGR02169  294 EKIGELEAEIasleRSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLT---EEYAELKEELEDLR 370
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   401 KRLEREtlqSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELeaiESRLE 480
Cdd:TIGR02169  371 AELEEV---DKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINEL---EEEKE 444
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*...
gi 109659847   481 KTEFTLKEDLTKLKTLTVMFVDERKTM---SEKLKKTEDKLQAASSQL 525
Cdd:TIGR02169  445 DKALEIKKQEWKLEQLAADLSKYEQELydlKEEYDRVEKELSKLQREL 492
PTZ00121 PTZ00121
MAEBL; Provisional
113-688 4.62e-13

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 74.02  E-value: 4.62e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  113 KKVLEAlqRDAFQAKSTPWQEDIYEKPMNELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEf 192
Cdd:PTZ00121 1237 KDAEEA--KKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEE- 1313
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  193 icllEQECERLKKLIDQEIKSQEEKEQekekrvttlKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTnakETHT 272
Cdd:PTZ00121 1314 ----AKKADEAKKKAEEAKKKADAAKK---------KAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK---EEAK 1377
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  273 KLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQ-------QKLAALSRQIDELEETNRSLR 345
Cdd:PTZ00121 1378 KKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEkkkadeaKKKAEEAKKADEAKKKAEEAK 1457
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  346 KAEEELQDIKEKISKGEYGNAGIMA-EVEELRKRVLDMEGKDEELIKMEEQCR--DLNKRLERETLQSKDFKLEVEKLSK 422
Cdd:PTZ00121 1458 KAEEAKKKAEEAKKADEAKKKAEEAkKADEAKKKAEEAKKKADEAKKAAEAKKkaDEAKKAEEAKKADEAKKAEEAKKAD 1537
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  423 RIMALEKLEDAFNKSKQEcySLKCNLEKERMTTKQLSQELESLKVR-IKELEAIESRLEKTEFTLKEDLTKLKTLTVMFV 501
Cdd:PTZ00121 1538 EAKKAEEKKKADELKKAE--ELKKAEEKKKAEEAKKAEEDKNMALRkAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKA 1615
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  502 DERKTMSEKLKKTEDklqaassqlqvEQNKVTTVTEKLIEETKRALKSKTDVEE-KMYSVTKERDDLKNKLKAEEEKGND 580
Cdd:PTZ00121 1616 EEAKIKAEELKKAEE-----------EKKKVEQLKKKEAEEKKKAEELKKAEEEnKIKAAEEAKKAEEDKKKAEEAKKAE 1684
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  581 LLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKSTTALH-QENNKIKelsqeVERLKLKLKDMKAIEDDLMKTEDEYETL 659
Cdd:PTZ00121 1685 EDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKaEEENKIK-----AEEAKKEAEEDKKKAEEAKKDEEEKKKI 1759
                         570       580
                  ....*....|....*....|....*....
gi 109659847  660 ERRYANERDKAQFLSKELEHVKMELAKYK 688
Cdd:PTZ00121 1760 AHLKKEEEKKAEEIRKEKEAVIEEELDEE 1788
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
283-687 5.38e-13

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 73.56  E-value: 5.38e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   283 EEEQKATRLEKELQtqttKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDEL----EETNRSLRKAEEELQDIKEKI 358
Cdd:TIGR02169  671 SEPAELQRLRERLE----GLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIekeiEQLEQEEEKLKERLEELEEDL 746
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   359 SKGEYGNAGIMAEVEELRKRVLDMEgkdEELIKMEEQCRDLNKRLERETLQSKDFKL-EVEKLSKRIMA-LEKLEDAFNK 436
Cdd:TIGR02169  747 SSLEQEIENVKSELKELEARIEELE---EDLHKLEEALNDLEARLSHSRIPEIQAELsKLEEEVSRIEArLREIEQKLNR 823
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   437 SKQEcyslKCNLEKERMTTKQLSQELESLKVRI-KELEAIESRLEKTEFTLKEDLTKLKTLtvmfvDERKtmsEKLKKTE 515
Cdd:TIGR02169  824 LTLE----KEYLEKEIQELQEQRIDLKEQIKSIeKEIENLNGKKEELEEELEELEAALRDL-----ESRL---GDLKKER 891
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   516 DKLQAASSQLQVEQNKvttvteklieetkralksktdveekmysvtkerddlknkLKAEEEKGNDLLSRvnmLKNRLQSL 595
Cdd:TIGR02169  892 DELEAQLRELERKIEE---------------------------------------LEAQIEKKRKRLSE---LKAKLEAL 929
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   596 EAIEKDFLKNKlnqDSGKSTTALHQENNKIKELSQEVERlklklkDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSK 675
Cdd:TIGR02169  930 EEELSEIEDPK---GEDEEIPEEELSLEDVQAELQRVEE------EIRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEE 1000
                          410
                   ....*....|..
gi 109659847   676 ELEHVKMELAKY 687
Cdd:TIGR02169 1001 ERKAILERIEEY 1012
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
164-520 6.34e-13

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 73.55  E-value: 6.34e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   164 AEKSRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKL---IDQEIKSQEEKEQEKEKRVTTLKEELTKLKSFAL 240
Cdd:TIGR02168  664 GSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKEleeLEEELEQLRKELEELSRQISALRKDLARLEAEVE 743
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   241 MVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFhqdqDTIMAKLTNEDSQN 320
Cdd:TIGR02168  744 QLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREAL----DELRAELTLLNEEA 819
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   321 RQLQQKLAALSRQID----ELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRV-----------LDMEGK 385
Cdd:TIGR02168  820 ANLRERLESLERRIAaterRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERasleealallrSELEEL 899
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   386 DEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMAL-EKLEDAFNKSKQECYSLKCNLEKERMttkQLSQELES 464
Cdd:TIGR02168  900 SEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLqERLSEEYSLTLEEAEALENKIEDDEE---EARRRLKR 976
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 109659847   465 LKVRIKEL--------EAIESRLEKTEF--TLKEDLTK-LKTLTVMFVDERKTMSEKLKKTEDKLQA 520
Cdd:TIGR02168  977 LENKIKELgpvnlaaiEEYEELKERYDFltAQKEDLTEaKETLEEAIEEIDREARERFKDTFDQVNE 1043
PTZ00121 PTZ00121
MAEBL; Provisional
142-728 9.15e-13

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 73.25  E-value: 9.15e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  142 ELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYM-----EKSDEFICLLEQECERLKKLIDQEIKSQEE 216
Cdd:PTZ00121 1198 DARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAkkaeeERNNEEIRKFEEARMAHFARRQAAIKAEEA 1277
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  217 KEQEKEKRVTTLKEELTKLKSFALMVVDEQQRLTAqltlQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQ 296
Cdd:PTZ00121 1278 RKADELKKAEEKKKADEAKKAEEKKKADEAKKKAE----EAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAE 1353
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  297 TQTTKFHQDQDTIMA--KLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYG--NAGIMAEV 372
Cdd:PTZ00121 1354 AAADEAEAAEEKAEAaeKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAkkKAEEKKKA 1433
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  373 EELRKRVLDMEGKDEELIKMEEqcrdlnKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAfnKSKQECYSLKCNLEKER 452
Cdd:PTZ00121 1434 DEAKKKAEEAKKADEAKKKAEE------AKKAEEAKKKAEEAKKADEAKKKAEEAKKADEA--KKKAEEAKKKADEAKKA 1505
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  453 MTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKV 532
Cdd:PTZ00121 1506 AEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEE 1585
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  533 TtvteKLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSG 612
Cdd:PTZ00121 1586 A----KKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKI 1661
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  613 KSTtalhQENNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDE---YETLERRYANERDKAQFLSKELE--HVKMELAKY 687
Cdd:PTZ00121 1662 KAA----EEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEakkAEELKKKEAEEKKKAEELKKAEEenKIKAEEAKK 1737
                         570       580       590       600
                  ....*....|....*....|....*....|....*....|.
gi 109659847  688 KLAEKTETSHEqwlFKRLQEEEAKSGHLSREVDALKEKIHE 728
Cdd:PTZ00121 1738 EAEEDKKKAEE---AKKDEEEKKKIAHLKKEEEKKAEEIRK 1775
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
233-577 3.22e-12

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 71.25  E-value: 3.22e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   233 TKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQ---DQDTI 309
Cdd:TIGR02169  670 RSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEEleeDLSSL 749
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   310 MAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSL--RKAEEELQDIKEKISKgeygnagIMAEVEELRKRVLDMEGKDE 387
Cdd:TIGR02169  750 EQEIENVKSELKELEARIEELEEDLHKLEEALNDLeaRLSHSRIPEIQAELSK-------LEEEVSRIEARLREIEQKLN 822
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   388 ELIKMEEQCRDLNKRLERE----TLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERmttKQLSQELE 463
Cdd:TIGR02169  823 RLTLEKEYLEKEIQELQEQridlKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKER---DELEAQLR 899
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   464 SLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKT--EDKLQAASSQLQVEQNKVTTVTEKLIE 541
Cdd:TIGR02169  900 ELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEElsLEDVQAELQRVEEEIRALEPVNMLAIQ 979
                          330       340       350
                   ....*....|....*....|....*....|....*.
gi 109659847   542 ETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEK 577
Cdd:TIGR02169  980 EYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKK 1015
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
120-810 6.01e-12

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 70.15  E-value: 6.01e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   120 QRDAFQAKSTPWQEDIYEKPMNELDKVVEKHKesyRRILGQLLVAEKSRRQ-----TILELEEEKRKHKE--YMEKSDEF 192
Cdd:pfam15921  246 QLEALKSESQNKIELLLQQHQDRIEQLISEHE---VEITGLTEKASSARSQansiqSQLEIIQEQARNQNsmYMRQLSDL 322
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   193 ---ICLLEQECERLKKLIDQEIKSQEEKEQEKEKRVTTLKEELTKLKSFALMVVDEQQRLTA-------QLTLQRQKIQE 262
Cdd:pfam15921  323 estVSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTERDQFSQESGNLDDQLQKLLAdlhkrekELSLEKEQNKR 402
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   263 LTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNED-------SQNRQLQQKLAALSRQID 335
Cdd:pfam15921  403 LWDRDTGNSITIDHLRRELDDRNMEVQRLEALLKAMKSECQGQMERQMAAIQGKNeslekvsSLTAQLESTKEMLRKVVE 482
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   336 ELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVlDMEGKDEELIKME-EQCRDLNKRLERETLQSKDFK 414
Cdd:pfam15921  483 ELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRV-DLKLQELQHLKNEgDHLRNVQTECEALKLQMAEKD 561
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   415 LEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLK----VRIKELEAIESRLEKTEFTL---- 486
Cdd:pfam15921  562 KVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQLEKEINDRRLELQEFKILKdkkdAKIRELEARVSDLELEKVKLvnag 641
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   487 KEDLTKLKTLTvmfvDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLiEETKRALKSktdveeKMYSVTKERDD 566
Cdd:pfam15921  642 SERLRAVKDIK----QERDQLLNEVKTSRNELNSLSEDYEVLKRNFRNKSEEM-ETTTNKLKM------QLKSAQSELEQ 710
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   567 LKNKLKAEEEKGNDLLS--------------RVNMLKNRLQSLEAI------EKDFLKNKLNQDSGKSTTALHQENNKIK 626
Cdd:pfam15921  711 TRNTLKSMEGSDGHAMKvamgmqkqitakrgQIDALQSKIQFLEEAmtnankEKHFLKEEKNKLSQELSTVATEKNKMAG 790
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   627 EL----SQEvERLKLKLKDMKAIEDDLMKTEDEYETLERRYANE--RDKAQFL--SKELE---HVKMELAKYKLAEKTET 695
Cdd:pfam15921  791 ELevlrSQE-RRLKEKVANMEVALDKASLQFAECQDIIQRQEQEsvRLKLQHTldVKELQgpgYTSNSSMKPRLLQPASF 869
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   696 SHEQWLFKRLQEEEAKSGHLSREVDALKEKiheymATEDLICHLQGDHSVLQKKLNQQENRNRDLGRE---------IEN 766
Cdd:pfam15921  870 TRTHSNVPSSQSTASFLSHHSRKTNALKED-----PTRDLKQLLQELRSVINEEPTVQLSKAEDKGRApslgalddrVRD 944
                          730       740       750       760
                   ....*....|....*....|....*....|....*....|....*
gi 109659847   767 LTKELERYRHFSKSLRPSLNGRRISDPQVFSKE-VQTEAVDNEPP 810
Cdd:pfam15921  945 CIIESSLRSDICHSSSNSLQTEGSKSSETCSREpVLLHAGELEDP 989
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
158-667 1.62e-11

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 68.91  E-value: 1.62e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  158 LGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLiDQEIKSQEEKEQEKEKRVTTLKEELTKLKS 237
Cdd:PRK02224  208 LNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETL-EAEIEDLRETIAETEREREELAEEVRDLRE 286
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  238 FALMVVDEQQRLTAQLTLQR-------QKIQELTTNAKETHTKLALAEARVQEEEQKATRLE---KELQTQTTKFHQDQD 307
Cdd:PRK02224  287 RLEELEEERDDLLAEAGLDDadaeaveARREELEDRDEELRDRLEECRVAAQAHNEEAESLRedaDDLEERAEELREEAA 366
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  308 TIMAKLTNEDSQNRQLQQKLAALSRQIDELEE----TNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRV---- 379
Cdd:PRK02224  367 ELESELEEAREAVEDRREEIEELEEEIEELRErfgdAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVeeae 446
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  380 -LDMEGK---------DEELIKMEEQCRDLNKRLERETLqskDFKLEVEKLSKRIMALEKLedafnkskqecyslkcnle 449
Cdd:PRK02224  447 aLLEAGKcpecgqpveGSPHVETIEEDRERVEELEAELE---DLEEEVEEVEERLERAEDL------------------- 504
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  450 kermttKQLSQELESLKVRikeLEAIESRLEKTEFTLKEDLTKLKTLtvmfvDERKT-MSEKLKKTEDKLQAASSQLQVE 528
Cdd:PRK02224  505 ------VEAEDRIERLEER---REDLEELIAERRETIEEKRERAEEL-----RERAAeLEAEAEEKREAAAEAEEEAEEA 570
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  529 QNKVTTVTEKLiEETKRALKSKTDVEEKMYSVTKERDD---LKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKN 605
Cdd:PRK02224  571 REEVAELNSKL-AELKERIESLERIRTLLAAIADAEDEierLREKREALAELNDERRERLAEKRERKRELEAEFDEARIE 649
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847  606 KLNQDSGKSTTALHQENNKIKELS-----------------QEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANER 667
Cdd:PRK02224  650 EAREDKERAEEYLEQVEEKLDELReerddlqaeigavenelEELEELRERREALENRVEALEALYDEAEELESMYGDLR 728
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
141-678 2.39e-11

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 68.42  E-value: 2.39e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  141 NELDKVVEKHKESYRRILGQLLVAEKSRR----------QTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLIDQE 210
Cdd:COG1196   277 EELELELEEAQAEEYELLAELARLEQDIArleerrreleERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEA 356
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  211 IKSQEEKEQEKEKRVTTLKEELTKLKSFAlmvvDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATR 290
Cdd:COG1196   357 EAELAEAEEALLEAEAELAEAEEELEELA----EELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAE 432
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  291 LEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMA 370
Cdd:COG1196   433 LEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKA 512
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  371 EVEELRKR----VLDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKR------IMALEKLEDAFNKSKQE 440
Cdd:COG1196   513 ALLLAGLRglagAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAkagratFLPLDKIRARAALAAAL 592
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  441 cysLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQA 520
Cdd:COG1196   593 ---ARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRE 669
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  521 ASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEKmysvTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEK 600
Cdd:COG1196   670 LLAALLEAEAELEELAERLAEEELELEEALLAEEEE----ERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEE 745
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 109659847  601 DFLKNKLNQDsgksttalHQENNKIKELSQEVERLKLKLKDMKAIEddlMKTEDEYETLERRYanerdkaQFLSKELE 678
Cdd:COG1196   746 ELLEEEALEE--------LPEPPDLEELERELERLEREIEALGPVN---LLAIEEYEELEERY-------DFLSEQRE 805
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
334-708 2.97e-11

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 68.17  E-value: 2.97e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   334 IDEL---EETNRSLRKAEEELQDIKEKISKgeygnagIMAEVEELRKRVLDMEGKDEELIKMeeqcRDLNKRLEretlqs 410
Cdd:TIGR02169  159 IDEIagvAEFDRKKEKALEELEEVEENIER-------LDLIIDEKRQQLERLRREREKAERY----QALLKEKR------ 221
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   411 kdfKLEVEKLSKRIMALEKledafnkskqecyslkcnlEKERmTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDL 490
Cdd:TIGR02169  222 ---EYEGYELLKEKEALER-------------------QKEA-IERQLASLEEELEKLTEEISELEKRLEEIEQLLEELN 278
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   491 TKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNK 570
Cdd:TIGR02169  279 KKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEE 358
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   571 LKAEEEKGNDLLSRVNMLKNRLQSL------EAIEKDFLKNKLN----------QDSGKSTTALHQENNKIKELSQEV-- 632
Cdd:TIGR02169  359 YAELKEELEDLRAELEEVDKEFAETrdelkdYREKLEKLKREINelkreldrlqEELQRLSEELADLNAAIAGIEAKIne 438
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 109659847   633 --ERLKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKyKLAEKTETSHEQWLFKRLQEE 708
Cdd:TIGR02169  439 leEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAE-AEAQARASEERVRGGRAVEEV 515
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
321-678 3.94e-11

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 67.77  E-value: 3.94e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   321 RQLQQKLAALSRQIDELEETNRSLRKAE------EELQDIKEKISKGEYGNAGimAEVEELRKRvldMEGKDEELIKMEE 394
Cdd:TIGR02168  179 RKLERTRENLDRLEDILNELERQLKSLErqaekaERYKELKAELRELELALLV--LRLEELREE---LEELQEELKEAEE 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   395 QCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKledafnkskqECYSLKCNLEKERMTTKQLSQELESLKVRIKELEA 474
Cdd:TIGR02168  254 ELEELTAELQELEEKLEELRLEVSELEEEIEELQK----------ELYALANEISRLEQQKQILRERLANLERQLEELEA 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   475 IESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEetkralksktdVE 554
Cdd:TIGR02168  324 QLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQ-----------LE 392
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   555 EKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVER 634
Cdd:TIGR02168  393 LQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEE 472
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 109659847   635 LKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELE 678
Cdd:TIGR02168  473 AEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQS 516
PTZ00121 PTZ00121
MAEBL; Provisional
113-672 4.85e-11

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 67.47  E-value: 4.85e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  113 KKVLEALQRDAFQAKStpwQEDIYEKPMNELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEf 192
Cdd:PTZ00121 1314 AKKADEAKKKAEEAKK---KADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEE- 1389
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  193 icllEQECERLKKLIDQEIKsqeekeqekekrvttlKEELTKLKSFALMVVDEQQRLTAqltlQRQKIQELTTNAKEthT 272
Cdd:PTZ00121 1390 ----KKKADEAKKKAEEDKK----------------KADELKKAAAAKKKADEAKKKAE----EKKKADEAKKKAEE--A 1443
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  273 KLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEE--- 349
Cdd:PTZ00121 1444 KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEakk 1523
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  350 --ELQDIKEKISKGEYGNAGIMAEVEELRKRvldmegkdEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLS--KRIM 425
Cdd:PTZ00121 1524 adEAKKAEEAKKADEAKKAEEKKKADELKKA--------EELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAeeARIE 1595
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  426 ALEKLEDAFNKSKQEcySLKcNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLK-EDLTKLKTLTVMF-VDE 503
Cdd:PTZ00121 1596 EVMKLYEEEKKMKAE--EAK-KAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKaEEENKIKAAEEAKkAEE 1672
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  504 RKTMSEKLKKTEDKLQAASSQLQV---EQNKVTTVTEKLIEETKRALKSKTDVEE---KMYSVTKERDDLKNK---LKAE 574
Cdd:PTZ00121 1673 DKKKAEEAKKAEEDEKKAAEALKKeaeEAKKAEELKKKEAEEKKKAEELKKAEEEnkiKAEEAKKEAEEDKKKaeeAKKD 1752
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  575 EEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEV----ERLKLKLKDMKAIEDDLM 650
Cdd:PTZ00121 1753 EEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIieggKEGNLVINDSKEMEDSAI 1832
                         570       580
                  ....*....|....*....|..
gi 109659847  651 KTEDEYETLERRYANERDKAQF 672
Cdd:PTZ00121 1833 KEVADSKNMQLEEADAFEKHKF 1854
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
283-688 1.55e-10

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 65.17  E-value: 1.55e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  283 EEEQKATRLEKELQTQTTKFHQDQDTImakltnedsqnRQLQQKLAALSRQIDELEETNRSLRKAEE------ELQDIKE 356
Cdd:COG4717    71 KELKELEEELKEAEEKEEEYAELQEEL-----------EELEEELEELEAELEELREELEKLEKLLQllplyqELEALEA 139
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  357 KISKGEYGNAGIMAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQS-KDFKLEVEKLSKRImalEKLEDAFN 435
Cdd:COG4717   140 ELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEElQDLAEELEELQQRL---AELEEELE 216
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  436 KSKQECYSLKCNLE--KERMTTKQLSQELESLKVRIK------ELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTM 507
Cdd:COG4717   217 EAQEELEELEEELEqlENELEAAALEERLKEARLLLLiaaallALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLARE 296
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  508 SEKLKKTEDKLQAASSQLQVEQNKVTTVTEKL-----------------IEETKRALKSKTDVEEKMY--SVTKERDDLK 568
Cdd:COG4717   297 KASLGKEAEELQALPALEELEEEELEELLAALglppdlspeellelldrIEELQELLREAEELEEELQleELEQEIAALL 376
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  569 NKLKAEEEKG-----------NDLLSRVNMLKNRLQSL----EAIEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVE 633
Cdd:COG4717   377 AEAGVEDEEElraaleqaeeyQELKEELEELEEQLEELlgelEELLEALDEEELEEELEELEEELEELEEELEELREELA 456
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 109659847  634 RLKLKLKDMKAiEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKYK 688
Cdd:COG4717   457 ELEAELEQLEE-DGELAELLQELEELKAELRELAEEWAALKLALELLEEAREEYR 510
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
278-694 2.75e-10

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 64.68  E-value: 2.75e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  278 EARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEK 357
Cdd:PRK02224  187 GSLDQLKAQIEEKEEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDLRET 266
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  358 ISKGEYGNAGIMAEVEELRKRVLDMEGKD---------------------EELIKMEEQCRDlnkRLERETLQSKDFKLE 416
Cdd:PRK02224  267 IAETEREREELAEEVRDLRERLEELEEERddllaeaglddadaeavearrEELEDRDEELRD---RLEECRVAAQAHNEE 343
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  417 VEKLSKRIMALE----KLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIK----ELEAIESRLEKTEFTLKE 488
Cdd:PRK02224  344 AESLREDADDLEeraeELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGdapvDLGNAEDFLEELREERDE 423
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  489 DLTKLKTLTVmfvdERKTMSEKLKKTEDKLQAAS--------------SQLQVEQNKVTTVTEKL--IEETKRALKSKTD 552
Cdd:PRK02224  424 LREREAELEA----TLRTARERVEEAEALLEAGKcpecgqpvegsphvETIEEDRERVEELEAELedLEEEVEEVEERLE 499
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  553 VEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAiEKDFLKNKLnQDSGKSTTALHQENNK----IKEL 628
Cdd:PRK02224  500 RAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRE-RAAELEAEA-EEKREAAAEAEEEAEEareeVAEL 577
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 109659847  629 SQEVERLKLKLKDMKAIEDDLMKTEDEYETLERRyaNERDKAQflsKELEhvkmELAKYKLAEKTE 694
Cdd:PRK02224  578 NSKLAELKERIESLERIRTLLAAIADAEDEIERL--REKREAL---AELN----DERRERLAEKRE 634
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
156-709 3.65e-10

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 64.68  E-value: 3.65e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   156 RILGQLLVAEKSRRQTILELEEEKRKHKEYMEKS-DEFICLLEQECERLKKLIDQEIKSQEEKEQEKEKRVTTLKEELTK 234
Cdd:TIGR00606  358 RHQEHIRARDSLIQSLATRLELDGFERGPFSERQiKNFHTLVIERQEDEAKTAAQLCADLQSKERLKQEQADEIRDEKKG 437
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   235 LKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKlalaEARVQEEEQKATRLEKELQTQTTKfhQDQDTIMAKLT 314
Cdd:TIGR00606  438 LGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRILEL----DQELRKAERELSKAEKNSLTETLK--KEVKSLQNEKA 511
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   315 NEDSQNRQLQQKLAALSRQIDELEETNRSLRK---AEEELQDIKEKISKGEYGNAGIMAEVEELRKRvldMEGKDEELIK 391
Cdd:TIGR00606  512 DLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDkmdKDEQIRKIKSRHSDELTSLLGYFPNKKQLEDW---LHSKSKEINQ 588
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   392 MEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALE-KLEDAFNKSKQECY--SLKCNLEKERMTTKQLSQELESLKVR 468
Cdd:TIGR00606  589 TRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEdKLFDVCGSQDEESDleRLKEEIEKSSKQRAMLAGATAVYSQF 668
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   469 IKELEAIES-------RLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIE 541
Cdd:TIGR00606  669 ITQLTDENQsccpvcqRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPE 748
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   542 ETKRALKSKTDVEEKMYSVTKERDDLKNkLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKSTTALHQE 621
Cdd:TIGR00606  749 LRNKLQKVNRDIQRLKNDIEEQETLLGT-IMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQGSDLDRTVQQV 827
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   622 NNKIKE-------LSQEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAK----YKLA 690
Cdd:TIGR00606  828 NQEKQEkqheldtVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSlireIKDA 907
                          570
                   ....*....|....*....
gi 109659847   691 EKTETSHEQWLFKRLQEEE 709
Cdd:TIGR00606  908 KEQDSPLETFLEKDQQEKE 926
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
224-777 6.39e-10

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 63.70  E-value: 6.39e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   224 RVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEeeqKATRLEKELQTQTTKFH 303
Cdd:pfam12128  242 EFTKLQQEFNTLESAELRLSHLHFGYKSDETLIASRQEERQETSAELNQLLRTLDDQWKE---KRDELNGELSAADAAVA 318
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   304 QDQdtimAKLTNEDSQNRQLQQ----KLAALSRQID----ELEETNRSLRKAEEELQDIKEKIskgeygNAGIMAEVEEL 375
Cdd:pfam12128  319 KDR----SELEALEDQHGAFLDadieTAAADQEQLPswqsELENLEERLKALTGKHQDVTAKY------NRRRSKIKEQN 388
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   376 rkrVLDMEGKDEELIKMEEQcRDLNKRLERETLQSKDFKLEveklskrimalEKLEDAFNKSKQECYSLKCNLEKERMTT 455
Cdd:pfam12128  389 ---NRDIAGIKDKLAKIREA-RDRQLAVAEDDLQALESELR-----------EQLEAGKLEFNEEEYRLKSRLGELKLRL 453
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   456 KQL---SQELESLKVRIKELEAIESRLEKTeFTLKEDLTklktltvmfvDERKTMSEKLKKTEDKLQAASSQLQVEQNKV 532
Cdd:pfam12128  454 NQAtatPELLLQLENFDERIERAREEQEAA-NAEVERLQ----------SELRQARKRRDQASEALRQASRRLEERQSAL 522
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   533 TTVTEKLIEETKRAL----KSKTDVEEKMYSVTKE----RDDLKNKLKAEEEKGNDLLSRVNMlknRLQSLEAIEKDFLK 604
Cdd:pfam12128  523 DELELQLFPQAGTLLhflrKEAPDWEQSIGKVISPellhRTDLDPEVWDGSVGGELNLYGVKL---DLKRIDVPEWAASE 599
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   605 NKLNQDSGKSTTALHQENNKIKELSQ-------EVERLKLKLKDMKAI----EDDLMKTEDEYETLERR----YANERDK 669
Cdd:pfam12128  600 EELRERLDKAEEALQSAREKQAAAEEqlvqangELEKASREETFARTAlknaRLDLRRLFDEKQSEKDKknkaLAERKDS 679
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   670 AQFLSKELEHVKMELA---KYKLAEKTETSHE---QWLFKRLQEEEAKSGHLSReVDALKEKIHEYMATEDLICHLQGDH 743
Cdd:pfam12128  680 ANERLNSLEAQLKQLDkkhQAWLEEQKEQKREartEKQAYWQVVEGALDAQLAL-LKAAIAARRSGAKAELKALETWYKR 758
                          570       580       590
                   ....*....|....*....|....*....|....
gi 109659847   744 SVlqKKLNQQENRNRDLGREIENLTKELERYRHF 777
Cdd:pfam12128  759 DL--ASLGVDPDVIAKLKREIRTLERKIERIAVR 790
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
169-772 1.85e-09

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 62.30  E-value: 1.85e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   169 RQTILELEEEKRKHKEYMEKSDEFICLLEqECERLKKLIDQEIKSQEEKEQEKEKRVTTLKeeltklksfalMVVDEQQR 248
Cdd:pfam02463  166 RLKRKKKEALKKLIEETENLAELIIDLEE-LKLQELKLKEQAKKALEYYQLKEKLELEEEY-----------LLYLDYLK 233
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   249 LTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKfhqDQDTIMAKLTNEDSQNRQLQQKLA 328
Cdd:pfam02463  234 LNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEE---ELKLLAKEEEELKSELLKLERRKV 310
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   329 ALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMA----EVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLE 404
Cdd:pfam02463  311 DDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREaeeeEEEELEKLQEKLEQLEEELLAKKKLESERLSSAA 390
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   405 RETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEF 484
Cdd:pfam02463  391 KLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKS 470
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   485 TLKEDLTKLKTLTVMFV--------DERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEK 556
Cdd:pfam02463  471 EDLLKETQLVKLQEQLElllsrqklEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVI 550
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   557 MYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEA------------IEKDFLKNKLNQDSGKSTTALHQENNK 624
Cdd:pfam02463  551 VEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLksiavleidpilNLAQLDKATLEADEDDKRAKVVEGILK 630
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   625 IKELSQEVERLKLKL----KDMKAIEDDLMKTEDEYETLE--RRYANERDKAQFLSKELEHVKMELAKYKLAEKTETSHE 698
Cdd:pfam02463  631 DTELTKLKESAKAKEsglrKGVSLEEGLAEKSEVKASLSEltKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKE 710
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   699 QWLFKRLQEEEAKSGHLSREVD-------ALKEKIHEYMATEDLICHLQGDHSVLQKKLNQQENRNRDLGREIENLTKEL 771
Cdd:pfam02463  711 ELKKLKLEAEELLADRVQEAQDkineelkLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEE 790

                   .
gi 109659847   772 E 772
Cdd:pfam02463  791 E 791
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
325-736 2.06e-09

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 61.71  E-value: 2.06e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  325 QKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEygnagimAEVEELRKRVLDMEgKDEELIKMEEQCRDLNKRLE 404
Cdd:COG4717    71 KELKELEEELKEAEEKEEEYAELQEELEELEEELEELE-------AELEELREELEKLE-KLLQLLPLYQELEALEAELA 142
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  405 RETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNL-EKERMTTKQLSQELESLKVRIKELEAIESRLEKTE 483
Cdd:COG4717   143 ELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLsLATEEELQDLAEELEELQQRLAELEEELEEAQEEL 222
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  484 FTLKEDLTKLKTLTvmfvdERKTMSEKLKKTEDKLQAASSQLQVE------------------------------QNKVT 533
Cdd:COG4717   223 EELEEELEQLENEL-----EAAALEERLKEARLLLLIAAALLALLglggsllsliltiagvlflvlgllallfllLAREK 297
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  534 TVTEKLIEETkRALKSKTDVEEKMYSVTKERDDLKNKLKAEE--------EKGNDLLSRVNMLKNRLQ-SLEAIEKDFLK 604
Cdd:COG4717   298 ASLGKEAEEL-QALPALEELEEEELEELLAALGLPPDLSPEEllelldriEELQELLREAEELEEELQlEELEQEIAALL 376
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  605 NKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEyETLERRYANERDKAQFLSKELEHVKMEL 684
Cdd:COG4717   377 AEAGVEDEEELRAALEQAEEYQELKEELEELEEQLEELLGELEELLEALDE-EELEEELEELEEELEELEEELEELREEL 455
                         410       420       430       440       450
                  ....*....|....*....|....*....|....*....|....*....|...
gi 109659847  685 AKYKLA-EKTETSHEqwLFKRLQEEEaksgHLSREVDALKEKIHEYMATEDLI 736
Cdd:COG4717   456 AELEAElEQLEEDGE--LAELLQELE----ELKAELRELAEEWAALKLALELL 502
PRK05771 PRK05771
V-type ATP synthase subunit I; Validated
414-678 5.31e-09

V-type ATP synthase subunit I; Validated


Pssm-ID: 235600 [Multi-domain]  Cd Length: 646  Bit Score: 60.33  E-value: 5.31e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  414 KLEVEKLSKRIMALEKLEDAFNKSKQecYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKteftLKEDLTKL 493
Cdd:PRK05771   39 ELSNERLRKLRSLLTKLSEALDKLRS--YLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKE----LEEEISEL 112
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  494 KtltvmfvDERKTMSEKLKKTEdKLQAASSQLQVEQN-KVTTVTEKLIEETKralksktdveekmYSVTKERDDLKNKLK 572
Cdd:PRK05771  113 E-------NEIKELEQEIERLE-PWGNFDLDLSLLLGfKYVSVFVGTVPEDK-------------LEELKLESDVENVEY 171
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  573 AEEEKGNDLLSRVNmLKNRLQSLEAI--EKDFLKNKLNqDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIEDDLM 650
Cdd:PRK05771  172 ISTDKGYVYVVVVV-LKELSDEVEEElkKLGFERLELE-EEGTPSELIREIKEELEEIEKERESLLEELKELAKKYLEEL 249
                         250       260
                  ....*....|....*....|....*...
gi 109659847  651 KTEDEYetlerrYANERDKAQFLSKELE 678
Cdd:PRK05771  250 LALYEY------LEIELERAEALSKFLK 271
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
208-781 5.78e-09

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 60.57  E-value: 5.78e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   208 DQEIKSQEEKEQEKEKRVTTLKEELTKLKSFALMVVDEQQRLTAQLtlqrQKIQELTTNAKETHTKLALA---------- 277
Cdd:pfam01576    4 EEEMQAKEEELQKVKERQQKAESELKELEKKHQQLCEEKNALQEQL----QAETELCAEAEEMRARLAARkqeleeilhe 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   278 -EARVQEEEQKATrlekELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKE 356
Cdd:pfam01576   80 lESRLEEEEERSQ----QLQNEKKKMQQHIQDLEEQLDEEEAARQKLQLEKVTTEAKIKKLEEDILLLEDQNSKLSKERK 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   357 KISKGEYGNAGIMAEVEELRKRVLDMEGKDEELI-------KMEEQCR----DLNKRLERETL----QSKDFKLEVEKLS 421
Cdd:pfam01576  156 LLEERISEFTSNLAEEEEKAKSLSKLKNKHEAMIsdleerlKKEEKGRqeleKAKRKLEGESTdlqeQIAELQAQIAELR 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   422 KRIMALEK-LEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLkvrikELE-AIESRLEKTEFTLKEDLTKLKTLTVM 499
Cdd:pfam01576  236 AQLAKKEEeLQAALARLEEETAQKNNALKKIRELEAQISELQEDL-----ESErAARNKAEKQRRDLGEELEALKTELED 310
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   500 FVD----------ERKTMSEKLKKT-EDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKERDDLK 568
Cdd:pfam01576  311 TLDttaaqqelrsKREQEVTELKKAlEEETRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQ 390
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   569 NKLKAEEEKGNDLLSRVNMLKNRLQSLEA--IEKDFLKNKLNQDSGKS-------TTALHQENNKIKELSQEVERLKLKL 639
Cdd:pfam01576  391 AELRTLQQAKQDSEHKRKKLEGQLQELQArlSESERQRAELAEKLSKLqselesvSSLLNEAEGKNIKLSKDVSSLESQL 470
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   640 KDMK-----------AIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELA--KYKLAEKTETSHeqwlfkrlQ 706
Cdd:pfam01576  471 QDTQellqeetrqklNLSTRLRQLEDERNSLQEQLEEEEEAKRNVERQLSTLQAQLSdmKKKLEEDAGTLE--------A 542
                          570       580       590       600       610       620       630
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109659847   707 EEEAKSgHLSREVDALKEKIHEYMATEDlichlqgdhsvlqkKLNQQENRnrdLGREIENLTKELERYRHFSKSL 781
Cdd:pfam01576  543 LEEGKK-RLQRELEALTQQLEEKAAAYD--------------KLEKTKNR---LQQELDDLLVDLDHQRQLVSNL 599
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
170-609 1.24e-08

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 59.26  E-value: 1.24e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   170 QTILELEEEKRKHKEYMEKSDEficlLEQECERLKKLID---QEIKSQEEKEQEKEKRVTTLKEELTKLKsfalmvvDEQ 246
Cdd:TIGR04523  201 LLLSNLKKKIQKNKSLESQISE----LKKQNNQLKDNIEkkqQEINEKTTEISNTQTQLNQLKDEQNKIK-------KQL 269
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   247 QRLTAQLTLQRQKIQELTTNAKETHTKLAlaearvQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQK 326
Cdd:TIGR04523  270 SEKQKELEQNNKKIKELEKQLNQLKSEIS------DLNNQKEQDWNKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQ 343
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   327 LAALSRQIDELEETNRSLRKAEEELQDIKEKISKgeyGNAGIMAEVEELRKRVLDMEGK-----------DEELIKMEEQ 395
Cdd:TIGR04523  344 ISQLKKELTNSESENSEKQRELEEKQNEIEKLKK---ENQSYKQEIKNLESQINDLESKiqnqeklnqqkDEQIKKLQQE 420
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   396 CRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQ-----------ECYSLKCNLEKERMTTKQLSQELES 464
Cdd:TIGR04523  421 KELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTREsletqlkvlsrSINKIKQNLEQKQKELKSKEKELKK 500
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   465 LKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQaaSSQLQVEQNKVTTVTEKLIEETK 544
Cdd:TIGR04523  501 LNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDFELK--KENLEKEIDEKNKEIEELKQTQK 578
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 109659847   545 RALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEK--DFLKNKLNQ 609
Cdd:TIGR04523  579 SLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEKLSSIIKniKSKKNKLKQ 645
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
70-774 1.67e-08

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 58.93  E-value: 1.67e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847    70 DDLLFLLSILEGELQARDEVIGILKAEKMDLALLEAqygfVTPKKVLEALQRDAFQAKStpwQEDIYEKPMNELDKVVEK 149
Cdd:TIGR02169  254 EKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQ----LRVKEKIGELEAEIASLER---SIAEKERELEDAEERLAK 326
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   150 HKESYRRILGQLlvaEKSRRQtILELEEEKRKHKEYMEKSDEFICLLEQECERLkkliDQEIKSQEEKEQEKEKRVTTLK 229
Cdd:TIGR02169  327 LEAEIDKLLAEI---EELERE-IEEERKRRDKLTEEYAELKEELEDLRAELEEV----DKEFAETRDELKDYREKLEKLK 398
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   230 EELTKLKSFALMVVDEQQRLTAQLtlqrqkiqelttnaKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQdqdtI 309
Cdd:TIGR02169  399 REINELKRELDRLQEELQRLSEEL--------------ADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQ----L 460
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   310 MAKLTNEDSQNRQLQQKLAALSRQideleetnrsLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEGKDEEL 389
Cdd:TIGR02169  461 AADLSKYEQELYDLKEEYDRVEKE----------LSKLQRELAEAEAQARASEERVRGGRAVEEVLKASIQGVHGTVAQL 530
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   390 IKMEEQ--------------------------CRDLNKR--LERETL----QSKDFKLEVEKLSKR---------IMALE 428
Cdd:TIGR02169  531 GSVGERyataievaagnrlnnvvveddavakeAIELLKRrkAGRATFlplnKMRDERRDLSILSEDgvigfavdlVEFDP 610
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   429 KLEDAFNKSKQECY------SLKCNLEKERMTT--------------------------KQLSQELESLKVRIKELEAIE 476
Cdd:TIGR02169  611 KYEPAFKYVFGDTLvvedieAARRLMGKYRMVTlegelfeksgamtggsraprggilfsRSEPAELQRLRERLEGLKREL 690
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   477 SRlekteftLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKlIEETKRALkskTDVEEK 556
Cdd:TIGR02169  691 SS-------LQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEED-LSSLEQEI---ENVKSE 759
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   557 MYSVTKERDDLKNKLKAEEEKGNDLLSRVNM--LKNRLQSLEAIEKDFLK-----NKLNQDSGKSTTALHQENNKIKELS 629
Cdd:TIGR02169  760 LKELEARIEELEEDLHKLEEALNDLEARLSHsrIPEIQAELSKLEEEVSRiearlREIEQKLNRLTLEKEYLEKEIQELQ 839
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   630 QEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANE----RDKAQFLSKELEHVKMELAKYKLAEKTETSHEQWLFKRL 705
Cdd:TIGR02169  840 EQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAAlrdlESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRL 919
                          730       740       750       760       770       780       790
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109659847   706 QEEEAKSGHLSREVDALKEKIHE--YMATEDLichlqgDHSVLQKKLNQQENRNRDLG----REIENLTKELERY 774
Cdd:TIGR02169  920 SELKAKLEALEEELSEIEDPKGEdeEIPEEEL------SLEDVQAELQRVEEEIRALEpvnmLAIQEYEEVLKRL 988
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
227-577 2.67e-08

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 58.44  E-value: 2.67e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   227 TLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLE---KELQTQTTKFH 303
Cdd:pfam02463  663 EVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQeaqDKINEELKLLK 742
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   304 QDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDEL-EETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVLDM 382
Cdd:pfam02463  743 QKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELaEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQL 822
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   383 EGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQEL 462
Cdd:pfam02463  823 LIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKEL 902
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   463 ESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEE 542
Cdd:pfam02463  903 EEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLLLAKEELGKVNLMAIEE 982
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 109659847   543 TKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEK 577
Cdd:pfam02463  983 FEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQ 1017
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
174-473 3.52e-08

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 58.16  E-value: 3.52e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   174 ELEEEKRKHKEYMEKSDEFICLLEQECERLKklidQEIKSQEEKEQEKEKRVTTLKEELTKLK-----SFALMVVDEQQR 248
Cdd:TIGR02169  727 QLEQEEEKLKERLEELEEDLSSLEQEIENVK----SELKELEARIEELEEDLHKLEEALNDLEarlshSRIPEIQAELSK 802
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   249 LTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLE---KELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQ 325
Cdd:TIGR02169  803 LEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKeqiKSIEKEIENLNGKKEELEEELEELEAALRDLES 882
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   326 KLAALSRQIDELEETNRSLRKAEEELQdikekiskgeygnagimAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLE- 404
Cdd:TIGR02169  883 RLGDLKKERDELEAQLRELERKIEELE-----------------AQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEe 945
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109659847   405 --RETLQSKDFKLEVEKLSKRIMALE----KLEDAFNKSKQECYSLKCNLEKermttkqLSQELESLKVRIKELE 473
Cdd:TIGR02169  946 ipEEELSLEDVQAELQRVEEEIRALEpvnmLAIQEYEEVLKRLDELKEKRAK-------LEEERKAILERIEEYE 1013
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
142-440 3.94e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 57.76  E-value: 3.94e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   142 ELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEficlLEQECERLKKLIDQEIKsqeekeqek 221
Cdd:TIGR02168  737 RLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEE----LEAQIEQLKEELKALRE--------- 803
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   222 ekRVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTK 301
Cdd:TIGR02168  804 --ALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNE 881
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   302 FHQ----------DQDTIMAKLTNEDSQNRQLQQKLAALSrqiDELEETNRSLRKAEEELQDIKEKISKGEYGNAGIM-- 369
Cdd:TIGR02168  882 RASleealallrsELEELSEELRELESKRSELRRELEELR---EKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAea 958
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   370 ------AEVEELRKRVLDMEGKDEEL--IKME--EQCRDLNKRLERETLQSKDFKLEVEKLSKRI-----MALEKLEDAF 434
Cdd:TIGR02168  959 lenkieDDEEEARRRLKRLENKIKELgpVNLAaiEEYEELKERYDFLTAQKEDLTEAKETLEEAIeeidrEARERFKDTF 1038

                   ....*.
gi 109659847   435 NKSKQE 440
Cdd:TIGR02168 1039 DQVNEN 1044
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
330-657 4.35e-08

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 57.64  E-value: 4.35e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  330 LSRQIDELE-ETNRSLR----KAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLE 404
Cdd:COG1196   198 LERQLEPLErQAEKAERyrelKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELE 277
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  405 RETLQSKDFKLEVEKLSKRIMALEKledafnkskqecyslkcNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEF 484
Cdd:COG1196   278 ELELELEEAQAEEYELLAELARLEQ-----------------DIARLEERRRELEERLEELEEELAELEEELEELEEELE 340
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  485 TLKEDLTKLKTltvmfvdERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEKMysvTKER 564
Cdd:COG1196   341 ELEEELEEAEE-------ELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELE---EAEE 410
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  565 DDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLK-NKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMK 643
Cdd:COG1196   411 ALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEeAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAA 490
                         330
                  ....*....|....
gi 109659847  644 AIEDDLMKTEDEYE 657
Cdd:COG1196   491 ARLLLLLEAEADYE 504
SCP-1 pfam05483
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ...
246-771 5.49e-08

Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.


Pssm-ID: 114219 [Multi-domain]  Cd Length: 787  Bit Score: 57.04  E-value: 5.49e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   246 QQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQ 325
Cdd:pfam05483  228 EEEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKTKLQDENLKELIEKKDHLTKELEDIKMSLQR 307
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   326 KLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVldmegkdEELIKMEEQcrdlnkRLER 405
Cdd:pfam05483  308 SMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSL-------EELLRTEQQ------RLEK 374
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   406 ETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFT 485
Cdd:pfam05483  375 NEDQLKIITMELQKKSSELEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQFEKIAEELKGKEQELIFLLQAREKEIHD 454
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   486 LKEDLTKLKTLTVMFVDERKTM-----SEKLKKTEdkLQAASSQLQVEQNKVTTVTEKLIEETK--------------RA 546
Cdd:pfam05483  455 LEIQLTAIKTSEEHYLKEVEDLkteleKEKLKNIE--LTAHCDKLLLENKELTQEASDMTLELKkhqediinckkqeeRM 532
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   547 LKSKTDVEEK-------MYSVTKE----RDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDsgKST 615
Cdd:pfam05483  533 LKQIENLEEKemnlrdeLESVREEfiqkGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKCNNLKKQIENKN--KNI 610
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   616 TALHQENNKIKELSQEverlklKLKDMKAIEDDLMKTEDEYETLERRYANERDKAQflsKELEHVKmeLAKYKLAEKTET 695
Cdd:pfam05483  611 EELHQENKALKKKGSA------ENKQLNAYEIKVNKLELELASAKQKFEEIIDNYQ---KEIEDKK--ISEEKLLEEVEK 679
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847   696 SH--EQWLFKRLQEEEAKSGHLSREVDALKEK-IHEYmatEDLICHLQGDHSVLQKKLNQQENRNRDLGREIENLTKEL 771
Cdd:pfam05483  680 AKaiADEAVKLQKEIDKRCQHKIAEMVALMEKhKHQY---DKIIEERDSELGLYKNKEQEQSSAKAALEIELSNIKAEL 755
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
164-775 6.12e-08

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 57.29  E-value: 6.12e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   164 AEKSRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLIDQEIKsQEEKEQEKEKRVTTLKEELTKLKSFALMVV 243
Cdd:pfam02463  210 LEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQ-EIEKEEEKLAQVLKENKEEEKEKKLQEEEL 288
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   244 DEQQRLTAQLTLQRQKIQELTT----NAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQ 319
Cdd:pfam02463  289 KLLAKEEEELKSELLKLERRKVddeeKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKL 368
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   320 NRQLQQKLAALSRQIDELEETNRsLRKAEEELQDIKEKISKGEygnagimAEVEELRKRVLDMEGKDEELIKMEEQCRDL 399
Cdd:pfam02463  369 EQLEEELLAKKKLESERLSSAAK-LKEEELELKSEEEKEAQLL-------LELARQLEDLLKEEKKEELEILEEEEESIE 440
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   400 NKRLERETLQSKDFKLEVEKLSKRIMaLEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRL 479
Cdd:pfam02463  441 LKQGKLTEEKEELEKQELKLLKDELE-LKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDG 519
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   480 EKTEFTLKEDLTKLKTLTVMFVD---ERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEk 556
Cdd:pfam02463  520 VGGRIISAHGRLGDLGVAVENYKvaiSTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLE- 598
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   557 mysvTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKSTTA--LHQENNKIKELSQEVER 634
Cdd:pfam02463  599 ----IDPILNLAQLDKATLEADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEegLAEKSEVKASLSELTKE 674
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   635 LKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKYKLAE---KTETSHEQWLFKRLQEEEAK 711
Cdd:pfam02463  675 LLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQdkiNEELKLLKQKIDEEEEEEEK 754
                          570       580       590       600       610       620
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 109659847   712 SGHLSREVDALKEKIHEYMATEDLICHLQGDHSVLQKKLNQQENRNRDLGREIENLTKELERYR 775
Cdd:pfam02463  755 SRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLE 818
PTZ00121 PTZ00121
MAEBL; Provisional
113-653 6.13e-08

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 57.46  E-value: 6.13e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  113 KKVLEALQRDAFQAKSTPWQEDIYEKPMNELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEf 192
Cdd:PTZ00121 1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEE- 1455
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  193 icllEQECERLKKLIDQEIKSQEEKEQEKEKRVT---TLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQelTTNAKE 269
Cdd:PTZ00121 1456 ----AKKAEEAKKKAEEAKKADEAKKKAEEAKKAdeaKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKK--ADEAKK 1529
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  270 THTKLALAEARVQEEEQKATRLEK--------ELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETN 341
Cdd:PTZ00121 1530 AEEAKKADEAKKAEEKKKADELKKaeelkkaeEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKA 1609
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  342 RSLRKAEEE---------LQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEGKDEELIKMEEQcrdlnkrlerETLQSKD 412
Cdd:PTZ00121 1610 EEAKKAEEAkikaeelkkAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEE----------DKKKAEE 1679
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  413 FKLEVEKLSKRIMALEKLEDAFNKSKQecysLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTK 492
Cdd:PTZ00121 1680 AKKAEEDEKKAAEALKKEAEEAKKAEE----LKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE 1755
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  493 LKTLTVMFVDERKTMSEKLKKTEdklqaassqlQVEQNKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLK 572
Cdd:PTZ00121 1756 KKKIAHLKKEEEKKAEEIRKEKE----------AVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSK 1825
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  573 -AEEEKGNDLLSRVNMLKNRLQSLEaiEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVErlklKLKDMKAIEDDLMK 651
Cdd:PTZ00121 1826 eMEDSAIKEVADSKNMQLEEADAFE--KHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEIE----EADEIEKIDKDDIE 1899

                  ..
gi 109659847  652 TE 653
Cdd:PTZ00121 1900 RE 1901
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
245-797 1.32e-07

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 55.90  E-value: 1.32e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   245 EQQRLTAQLT-LQRQKIQ----------ELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKL 313
Cdd:pfam05557    3 ELIESKARLSqLQNEKKQmelehkrariELEKKASALKRQLDRESDRNQELQKRIRLLEKREAEAEEALREQAELNRLKK 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   314 TNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEGKDEELIKME 393
Cdd:pfam05557   83 KYLEALNKKLNEKESQLADAREVISCLKNELSELRRQIQRAELELQSTNSELEELQERLDLLKAKASEAEQLRQNLEKQQ 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   394 EQCRDLNKR---LERETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMttkqLSQELESLKVRIK 470
Cdd:pfam05557  163 SSLAEAEQRikeLEFEIQSQEQDSEIVKNSKSELARIPELEKELERLREHNKHLNENIENKLL----LKEEVEDLKRKLE 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   471 ELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMseklkKTEDKLQAASSQLQveQNKVTTVTEK--LIEETKRALK 548
Cdd:pfam05557  239 REEKYREEAATLELEKEKLEQELQSWVKLAQDTGLNL-----RSPEDLSRRIEQLQ--QREIVLKEENssLTSSARQLEK 311
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   549 SKTDVEEKMYSVTKERDDLKNKLKAEEEkgndllsrvnmLKNRLQ---SLEAIEKDFLKNKL-NQDSGKSTT-ALHQENN 623
Cdd:pfam05557  312 ARRELEQELAQYLKKIEDLNKKLKRHKA-----------LVRRLQrrvLLLTKERDGYRAILeSYDKELTMSnYSPQLLE 380
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   624 KIKELSQEVERLKLKLKDMKAieddlmktedEYETLERRYANERDKAQFLSKELEhvkmelakyklaektetsheqwlFK 703
Cdd:pfam05557  381 RIEEAEDMTQKMQAHNEEMEA----------QLSVAEEELGGYKQQAQTLERELQ-----------------------AL 427
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   704 RLQEEEAKSGHLSREVDALKEKIHEYMAT-----------EDLICH--LQGD-----HSVLQKKLNQQENRNRDLGREIE 765
Cdd:pfam05557  428 RQQESLADPSYSKEEVDSLRRKLETLELErqrlreqknelEMELERrcLQGDydpkkTKVLHLSMNPAAEAYQQRKNQLE 507
                          570       580       590
                   ....*....|....*....|....*....|..
gi 109659847   766 NLTKELERYRHFSKSLRPSLNGRRISDPQVFS 797
Cdd:pfam05557  508 KLQAEIERLKRLLKKLEDDLEQVLRLPETTST 539
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
228-433 1.38e-07

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 55.16  E-value: 1.38e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  228 LKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKF----- 302
Cdd:COG4942    32 LQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELaellr 111
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  303 ----HQDQDTIMAKLTNEDSQ------------NRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKekiskgeygna 366
Cdd:COG4942   112 alyrLGRQPPLALLLSPEDFLdavrrlqylkylAPARREQAEELRADLAELAALRAELEAERAELEALL----------- 180
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 109659847  367 gimAEVEELRKRVLDMEGKDEELIKmeeqcrDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDA 433
Cdd:COG4942   181 ---AELEEERAALEALKAERQKLLA------RLEKELAELAAELAELQQEAEELEALIARLEAEAAA 238
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
205-683 1.50e-07

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 55.80  E-value: 1.50e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   205 KLIDQEIKSQEEKEQEKEKRVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEE 284
Cdd:TIGR04523  106 SKINSEIKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNI 185
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   285 E--------------------QKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQI----DELEET 340
Cdd:TIGR04523  186 QknidkiknkllklelllsnlKKKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLnqlkDEQNKI 265
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   341 NRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEGKD--EELIKMEEQCRDLNKRLERETLQSKDFKLEVE 418
Cdd:TIGR04523  266 KKQLSEKQKELEQNNKKIKELEKQLNQLKSEISDLNNQKEQDWNKElkSELKNQEKKLEEIQNQISQNNKIISQLNEQIS 345
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   419 KLSKRIM------------------ALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLE 480
Cdd:TIGR04523  346 QLKKELTnsesensekqreleekqnEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLE 425
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   481 KTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLiEETKRALKSKTDveeKMYSV 560
Cdd:TIGR04523  426 KEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNL-EQKQKELKSKEK---ELKKL 501
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   561 TKERDDLKNKLKAEEEKGNDLLSRVNMLKNRL----QSLEAIEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVERLK 636
Cdd:TIGR04523  502 NEEKKELEEKVKDLTKKISSLKEKIEKLESEKkekeSKISDLEDELNKDDFELKKENLEKEIDEKNKEIEELKQTQKSLK 581
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|....*..
gi 109659847   637 lklKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKME 683
Cdd:TIGR04523  582 ---KKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKE 625
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
143-577 1.82e-07

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 55.54  E-value: 1.82e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  143 LDKVVEKHKESYRRILGQLLVAEKSR-RQTILELEEEKRKHKEYMEKSDEfICLLEQECERLKKlidqeiksqeekeqek 221
Cdd:COG4717    47 LLERLEKEADELFKPQGRKPELNLKElKELEEELKEAEEKEEEYAELQEE-LEELEEELEELEA---------------- 109
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  222 ekRVTTLKEELTKLKSF--ALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQT 299
Cdd:COG4717   110 --ELEELREELEKLEKLlqLLPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLS 187
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  300 TKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEEtNRSLRKAEEELQDIKEKISKGEYG--------------- 364
Cdd:COG4717   188 LATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEE-ELEQLENELEAAALEERLKEARLLlliaaallallglgg 266
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  365 ---------------NAGIMAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFK--LEVEKLSKRIMAL 427
Cdd:COG4717   267 sllsliltiagvlflVLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPpdLSPEELLELLDRI 346
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  428 EKLEDAFNKSKQecyslkcnlEKERMTTKQLSQELESL--KVRIKELEAIESRLEKTEfTLKEDLTKLKTLTVMFVDERK 505
Cdd:COG4717   347 EELQELLREAEE---------LEEELQLEELEQEIAALlaEAGVEDEEELRAALEQAE-EYQELKEELEELEEQLEELLG 416
                         410       420       430       440       450       460       470
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847  506 TMSEKLKKT-----EDKLQAASSQLQVEQNKVTTVTEKL--IEETKRALKSKTDVEEKMYsvtkERDDLKNKLKAEEEK 577
Cdd:COG4717   417 ELEELLEALdeeelEEELEELEEELEELEEELEELREELaeLEAELEQLEEDGELAELLQ----ELEELKAELRELAEE 491
COG5022 COG5022
Myosin heavy chain [General function prediction only];
318-837 3.02e-07

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 55.08  E-value: 3.02e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  318 SQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGnagimaEVEELRKRVLDMEgKDEELIKMEEQCR 397
Cdd:COG5022   806 LGSRKEYRSYLACIIKLQKTIKREKKLRETEEVEFSLKAEVLIQKFG------RSLKAKKRFSLLK-KETIYLQSAQRVE 878
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  398 DLNKRLERETLQSKdfklEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTT-----------KQLSQELESLK 466
Cdd:COG5022   879 LAERQLQELKIDVK----SISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARlkkllnnidleEGPSIEYVKLP 954
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  467 VRIKELEAiESRLEKTEFTLkEDLTKLKTLTVmfvDERKTMSEKLKKTEDKLQAASSQLQVEQNKVttvteKLIEETKRA 546
Cdd:COG5022   955 ELNKLHEV-ESKLKETSEEY-EDLLKKSTILV---REGNKANSELKNFKKELAELSKQYGALQEST-----KQLKELPVE 1024
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  547 LKSKTDVEEKMYSVTKErddLKNKLKAEEEKGNDLLSrVNMLKNRLQSLEaIEKDFLKNKLNQDSGKSTTalhqeNNKIK 626
Cdd:COG5022  1025 VAELQSASKIISSESTE---LSILKPLQKLKGLLLLE-NNQLQARYKALK-LRRENSLLDDKQLYQLEST-----ENLLK 1094
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  627 ElsqeverlkLKLKDMKAIEDDLMKTEDEYETL---ERRYANERDKAQFLSKELEHVKMELAKYKLAEKT---------- 693
Cdd:COG5022  1095 T---------INVKDLEVTNRNLVKPANVLQFIvaqMIKLNLLQEISKFLSQLVNTLEPVFQKLSVLQLEldglfweanl 1165
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  694 -ETSHEQWlFKRLQEEEAKSGH---------------LSREVDALKEKIHEYMATEDLICHL--QGDHSVLQKKLNQQEN 755
Cdd:COG5022  1166 eALPSPPP-FAALSEKRLYQSAlydeksklsssevndLKNELIALFSKIFSGWPRGDKLKKLisEGWVPTEYSTSLKGFN 1244
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  756 RNRDLGREIENLTKelERYRHFSKSLRPSLNGRRISdPQVFSKEVQTEAVDNEPPDYKSLIPLERAVINGQLYEESENQD 835
Cdd:COG5022  1245 NLNKKFDTPASMSN--EKLLSLLNSIDNLLSSYKLE-EEVLPATINSLLQYINVGLFNALRTKASSLRWKSATEVNYNSE 1321

                  ..
gi 109659847  836 ED 837
Cdd:COG5022  1322 EL 1323
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
251-819 5.40e-07

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 54.20  E-value: 5.40e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   251 AQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAAL 330
Cdd:TIGR00618  166 KELLMNLFPLDQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYL 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   331 SRQIDELEETNRSLRKAEEELQDIKEKISKgeygnagiMAEVEELRKRvLDMEGKDEELIKMEEQCRDLNKRLER--ETL 408
Cdd:TIGR00618  246 TQKREAQEEQLKKQQLLKQLRARIEELRAQ--------EAVLEETQER-INRARKAAPLAAHIKAVTQIEQQAQRihTEL 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   409 QSKDFKLEVE-----KLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKvRIKELEAIESRLEKTE 483
Cdd:TIGR00618  317 QSKMRSRAKLlmkraAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQ-HIHTLQQQKTTLTQKL 395
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   484 FTLKEDLTKLKTL--TVMFVDERKtmseklKKTEDKLQAASSQLQVEQnKVTTVTEKLIEETKRALKSKTDVEEKMYSVT 561
Cdd:TIGR00618  396 QSLCKELDILQREqaTIDTRTSAF------RDLQGQLAHAKKQQELQQ-RYAELCAAAITCTAQCEKLEKIHLQESAQSL 468
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   562 KERD----DLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEaiekdflknklnqdsgKSTTALHQENNKIKELSQEVERLKL 637
Cdd:TIGR00618  469 KEREqqlqTKEQIHLQETRKKAVVLARLLELQEEPCPLC----------------GSCIHPNPARQDIDNPGPLTRRMQR 532
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   638 KLKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKYKLAEKTETSHEQWLFKRLQEEEAKSGHLSR 717
Cdd:TIGR00618  533 GEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLAC 612
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   718 EVDALKEKIHEYMATEDLICHLQGDHSVLQKKLNQqenrnrdLGREIENLTKELERYRHFSKSLRPSLNG-RRISDPQVF 796
Cdd:TIGR00618  613 EQHALLRKLQPEQDLQDVRLHLQQCSQELALKLTA-------LHALQLTLTQERVREHALSIRVLPKELLaSRQLALQKM 685
                          570       580
                   ....*....|....*....|...
gi 109659847   797 SKEVQTEAVDNEPPDYKSLIPLE 819
Cdd:TIGR00618  686 QSEKEQLTYWKEMLAQCQTLLRE 708
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
131-606 8.29e-07

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 53.49  E-value: 8.29e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   131 WQEDIYEKPMNELDKVVEKHKESYRRILgQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLID-- 208
Cdd:TIGR04523  194 NKLLKLELLLSNLKKKIQKNKSLESQIS-ELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSek 272
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   209 -QEIKSQEEKEQEKEKRVTTLKEELTKLKSfalmvvDEQQRLTAQLTLQRQKIQELTTNAKethTKLALAEARVQEEEQK 287
Cdd:TIGR04523  273 qKELEQNNKKIKELEKQLNQLKSEISDLNN------QKEQDWNKELKSELKNQEKKLEEIQ---NQISQNNKIISQLNEQ 343
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   288 ATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGNAG 367
Cdd:TIGR04523  344 ISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKEL 423
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   368 IMAEVEELRKRVLDMEGKDEELIK----MEEQCRDLNKRLERETLQSKDFKLEVEKLSKRimaLEKLEDAFNKSKQECYS 443
Cdd:TIGR04523  424 LEKEIERLKETIIKNNSEIKDLTNqdsvKELIIKNLDNTRESLETQLKVLSRSINKIKQN---LEQKQKELKSKEKELKK 500
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   444 LKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKT-----LTVMFVDERKTMSEKLKKTEDKL 518
Cdd:TIGR04523  501 LNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDFelkkeNLEKEIDEKNKEIEELKQTQKSL 580
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   519 QAASSQLQveqnkvttvtEKLIEETKRALKSKTDVEEKMYSVTKERDDLKnKLKAEEEKGNDLLSRVNMLKNRL-QSLEA 597
Cdd:TIGR04523  581 KKKQEEKQ----------ELIDQKEKEKKDLIKEIEEKEKKISSLEKELE-KAKKENEKLSSIIKNIKSKKNKLkQEVKQ 649

                   ....*....
gi 109659847   598 IEKDFLKNK 606
Cdd:TIGR04523  650 IKETIKEIR 658
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
274-546 8.40e-07

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 52.46  E-value: 8.40e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  274 LALAEARVQEEEQKAtrLEKELQtqttKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEetnRSLRKAEEELQD 353
Cdd:COG4942    10 LLALAAAAQADAAAE--AEAELE----QLQQEIAELEKELAALKKEEKALLKQLAALERRIAALA---RRIRALEQELAA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  354 IKEKISKGEygnagimAEVEELRKRvldmegKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDA 433
Cdd:COG4942    81 LEAELAELE-------KEIAELRAE------LEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPA 147
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  434 fnkskqecyslkcnlEKERMttKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLtvmfVDERKTMSEKLKK 513
Cdd:COG4942   148 ---------------RREQA--EELRADLAELAALRAELEAERAELEALLAELEEERAALEAL----KAERQKLLARLEK 206
                         250       260       270
                  ....*....|....*....|....*....|...
gi 109659847  514 TEDKLQAASSQLQVEQNKVTTVTEKLIEETKRA 546
Cdd:COG4942   207 ELAELAAELAELQQEAEELEALIARLEAEAAAA 239
PRK04778 PRK04778
septation ring formation regulator EzrA; Provisional
273-777 1.12e-06

septation ring formation regulator EzrA; Provisional


Pssm-ID: 179877 [Multi-domain]  Cd Length: 569  Bit Score: 52.53  E-value: 1.12e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  273 KLALAEARVQEEEQKATRLEKELQTQTT--KFHQDQDTIMAKltnedsqnrqlqqKLAALSRQIDELEETNRSLR--KAE 348
Cdd:PRK04778   38 KQELENLPVNDELEKVKKLNLTGQSEEKfeEWRQKWDEIVTN-------------SLPDIEEQLFEAEELNDKFRfrKAK 104
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  349 EELQDIKEKISKGEygnagimAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLeRETLQSKDFKL--EVEKLSKRima 426
Cdd:PRK04778  105 HEINEIESLLDLIE-------EDIEQILEELQELLESEEKNREEVEQLKDLYREL-RKSLLANRFSFgpALDELEKQ--- 173
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  427 LEKLEDAFNKSKQEcySLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTeftLKEDLTKLKT----------- 495
Cdd:PRK04778  174 LENLEEEFSQFVEL--TESGDYVEAREILDQLEEELAALEQIMEEIPELLKELQTE---LPDQLQELKAgyrelveegyh 248
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  496 LTVMFVDER-KTMSEKLKKTEDKLqaasSQLQVEqnkvttVTEKLIEETKRALKSKTDVEEKMY----SVTKERDDLKNK 570
Cdd:PRK04778  249 LDHLDIEKEiQDLKEQIDENLALL----EELDLD------EAEEKNEEIQERIDQLYDILEREVkarkYVEKNSDTLPDF 318
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  571 LKAEEEKGNDL-----------------LSRVNMLKNRLQSleaIEKDFLKNKLNQDSGKSTtalhqennkikeLSQEVE 633
Cdd:PRK04778  319 LEHAKEQNKELkeeidrvkqsytlneseLESVRQLEKQLES---LEKQYDEITERIAEQEIA------------YSELQE 383
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  634 RLKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKYKL--------AEKTETSHE-QWLFKR 704
Cdd:PRK04778  384 ELEEILKQLEEIEKEQEKLSEMLQGLRKDELEAREKLERYRNKLHEIKRYLEKSNLpglpedylEMFFEVSDEiEALAEE 463
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109659847  705 LQEEEAKSGHLSREVDALKEKIHE-YMATEDLIchlqgDHSVLQKKLNQQENRNRDLGREIEN-LTKELERYRHF 777
Cdd:PRK04778  464 LEEKPINMEAVNRLLEEATEDVETlEEETEELV-----ENATLTEQLIQYANRYRSDNEEVAEaLNEAERLFREY 533
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
447-786 1.26e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 52.75  E-value: 1.26e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   447 NLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFtLKEDLTKL-KTLTVMFVDERKtmsEKLKKTEDKLQAASSQL 525
Cdd:TIGR02168  180 KLERTRENLDRLEDILNELERQLKSLERQAEKAERYKE-LKAELRELeLALLVLRLEELR---EELEELQEELKEAEEEL 255
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   526 QVEQNKVTTVTEKLieETKRALKSKtdVEEKMYSVTKERDDLKNKLkaeeekgNDLLSRVNMLKNRLQSLEaiekdflkN 605
Cdd:TIGR02168  256 EELTAELQELEEKL--EELRLEVSE--LEEEIEELQKELYALANEI-------SRLEQQKQILRERLANLE--------R 316
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   606 KLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANerdkaqfLSKELEHVKMELA 685
Cdd:TIGR02168  317 QLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEE-------LEEQLETLRSKVA 389
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   686 KYKLAEKTETSHEQWLFKRLQEEEAKSGHLSREVDALKEKIHEYMATEdlichLQGDHSVLQKKLNQQENRNRDLGREIE 765
Cdd:TIGR02168  390 QLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKE-----LQAELEELEEELEELQEELERLEEALE 464
                          330       340
                   ....*....|....*....|.
gi 109659847   766 NLTKELERYRHFSKSLRPSLN 786
Cdd:TIGR02168  465 ELREELEEAEQALDAAERELA 485
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
166-490 1.29e-06

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 52.72  E-value: 1.29e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   166 KSRRQTILELEEEKRKHKEYMEKSDEFICLLE---QECERLKKLIDQEIKSQEEKEQEKEKRVTTLKEELTKLKSfalmv 242
Cdd:TIGR04523  366 EEKQNEIEKLKKENQSYKQEIKNLESQINDLEskiQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNS----- 440
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   243 vdEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTkfhqdqdtimaKLTNEDSQNRQ 322
Cdd:TIGR04523  441 --EIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKEK-----------ELKKLNEEKKE 507
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   323 LQQKLAALSRQIDELEETNRSL----RKAEEELQDIKEKISKGEYG--NAGIMAEVEELRKRVLDMEGKDEELIKMEEQC 396
Cdd:TIGR04523  508 LEEKVKDLTKKISSLKEKIEKLesekKEKESKISDLEDELNKDDFElkKENLEKEIDEKNKEIEELKQTQKSLKKKQEEK 587
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   397 RDLNKRLEretlqsKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIE 476
Cdd:TIGR04523  588 QELIDQKE------KEKKDLIKEIEEKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIKETIKEIRNKW 661
                          330
                   ....*....|....
gi 109659847   477 SRLEKTEFTLKEDL 490
Cdd:TIGR04523  662 PEIIKKIKESKTKI 675
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
257-405 1.40e-06

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 52.61  E-value: 1.40e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  257 RQKIQELTTnakethtKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDtiMAKLTNEDSQNRQLQQKLAALSRQIDE 336
Cdd:COG4913   609 RAKLAALEA-------ELAELEEELAEAEERLEALEAELDALQERREALQR--LAEYSWDEIDVASAEREIAELEAELER 679
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847  337 LEETNRSLRKAEEELQDIKEKISKGEygnagimaevEELRKRVLDMEGKDEELIKMEEQCRDLNKRLER 405
Cdd:COG4913   680 LDASSDDLAALEEQLEELEAELEELE----------EELDELKGEIGRLEKELEQAEEELDELQDRLEA 738
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
197-781 1.83e-06

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 52.48  E-value: 1.83e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   197 EQECERLKKLIDQEIKSQEEKEQEKEKR----VTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHT 272
Cdd:pfam01576  326 EQEVTELKKALEEETRSHEAQLQEMRQKhtqaLEELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQAKQDSEH 405
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   273 KLALAEARVQE--------EEQKATRLEK----------------ELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLA 328
Cdd:pfam01576  406 KRKKLEGQLQElqarlsesERQRAELAEKlsklqselesvssllnEAEGKNIKLSKDVSSLESQLQDTQELLQEETRQKL 485
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   329 ALSRQIDELEETNRSLRKAEEELQDIKEKISKgeygnagimaEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLEREtL 408
Cdd:pfam01576  486 NLSTRLRQLEDERNSLQEQLEEEEEAKRNVER----------QLSTLQAQLSDMKKKLEEDAGTLEALEEGKKRLQRE-L 554
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   409 QSKDFKLEveklsKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRL----EKTEF 484
Cdd:pfam01576  555 EALTQQLE-----EKAAAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSNLEKKQKKFDQMLAEEKAISARYaeerDRAEA 629
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   485 TLKEDLTKLKTLTVMfVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLiEETKRALksktdvEEKMYSVTKER 564
Cdd:pfam01576  630 EAREKETRALSLARA-LEEALEAKEELERTNKQLRAEMEDLVSSKDDVGKNVHEL-ERSKRAL------EQQVEEMKTQL 701
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   565 DDLKNKLKAEEEKgnDLLSRVNMlknrlQSLEAIEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKA 644
Cdd:pfam01576  702 EELEDELQATEDA--KLRLEVNM-----QALKAQFERDLQARDEQGEEKRRQLVKQVRELEAELEDERKQRAQAVAAKKK 774
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   645 IEDDLMKTEDEYETLERRYAN-----ERDKAQF--LSKELEHVKME----LAKYKLAEKTETSHEQWLFkRLQEEEAKSG 713
Cdd:pfam01576  775 LELDLKELEAQIDAANKGREEavkqlKKLQAQMkdLQRELEEARASrdeiLAQSKESEKKLKNLEAELL-QLQEDLAASE 853
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   714 HLSREVDALKEKIHEYMAT---------------EDLICHLQGDHSVLQKKLNQQENRNRDLGREIENLTKELERYRHFS 778
Cdd:pfam01576  854 RARRQAQQERDELADEIASgasgksalqdekrrlEARIAQLEEELEEEQSNTELLNDRLRKSTLQVEQLTTELAAERSTS 933

                   ...
gi 109659847   779 KSL 781
Cdd:pfam01576  934 QKS 936
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
348-773 2.85e-06

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 51.31  E-value: 2.85e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  348 EEELQDIKEKISKGEYGNAGI-MAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMA 426
Cdd:COG4717    48 LERLEKEADELFKPQGRKPELnLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQL 127
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  427 LEKLEDafnkskqecyslkcnLEKERMTTKQLSQELESLKVRIKELEAIESRLEKteftLKEDLTKLKT-LTVMFVDERK 505
Cdd:COG4717   128 LPLYQE---------------LEALEAELAELPERLEELEERLEELRELEEELEE----LEAELAELQEeLEELLEQLSL 188
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  506 TMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLiEETKRALKSktdveekmYSVTKERDDLKNKLKAEEE--------- 576
Cdd:COG4717   189 ATEEELQDLAEELEELQQRLAELEEELEEAQEEL-EELEEELEQ--------LENELEAAALEERLKEARLllliaaall 259
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  577 ----KGNDLLSRVNMLKNRLQSLEAI------EKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIE 646
Cdd:COG4717   260 allgLGGSLLSLILTIAGVLFLVLGLlallflLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEEL 339
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  647 DDLMKTEDEYETLERRYANERDKAQFlsKELEHVKMELAKYKLAEkTETSHEQWL--FKRLQEEEAKSGHLSREVDALKE 724
Cdd:COG4717   340 LELLDRIEELQELLREAEELEEELQL--EELEQEIAALLAEAGVE-DEEELRAALeqAEEYQELKEELEELEEQLEELLG 416
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*....
gi 109659847  725 KIHEYMATEDLIcHLQGDHSVLQKKLNQQENRNRDLGREIENLTKELER 773
Cdd:COG4717   417 ELEELLEALDEE-ELEEELEELEEELEELEEELEELREELAELEAELEQ 464
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
280-711 4.00e-06

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 51.20  E-value: 4.00e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   280 RVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKL----AALSRQIDELEETNRSLRKAEEELQDIK 355
Cdd:TIGR00606  685 RVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLApgrqSIIDLKEKEIPELRNKLQKVNRDIQRLK 764
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   356 EKISKGEYGNAGIMAEvEELRKRVLDMEGKDEELikmEEQCRDLNKRLERET--LQSKDFKLEVEKLSKRIMALEKLEDA 433
Cdd:TIGR00606  765 NDIEEQETLLGTIMPE-EESAKVCLTDVTIMERF---QMELKDVERKIAQQAakLQGSDLDRTVQQVNQEKQEKQHELDT 840
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   434 FNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLtvmfVDERKTMSEKLKK 513
Cdd:TIGR00606  841 VVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIRE----IKDAKEQDSPLET 916
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   514 TEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKE-RDDLKNKLKAEEEKGNDLLSRVNMLKNRL 592
Cdd:TIGR00606  917 FLEKDQQEKEELISSKETSNKKAQDKVNDIKEKVKNIHGYMKDIENKIQDgKDDYLKQKETELNTVNAQLEECEKHQEKI 996
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   593 -QSLEAIEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIE--DDLMKTEDEYETLERRYANERDK 669
Cdd:TIGR00606  997 nEDMRLMRQDIDTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQMQVLQmkQEHQKLEENIDLIKRNHVLALGR 1076
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|..
gi 109659847   670 AQFLSKELEHVKMELAKYKLAEKTETSHEQWLFKRLQEEEAK 711
Cdd:TIGR00606 1077 QKGYEKEIKHFKKELREPQFRDAEEKYREMMIVMRTTELVNK 1118
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
282-771 4.44e-06

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 50.94  E-value: 4.44e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   282 QEEEQKATRLE----KELQTQTTKFHQDQDTIMAKLTNEDSQnrqLQQKLAALSRQIDELEETNRSL--RKAEEE--LQD 353
Cdd:pfam01576    3 QEEEMQAKEEElqkvKERQQKAESELKELEKKHQQLCEEKNA---LQEQLQAETELCAEAEEMRARLaaRKQELEeiLHE 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   354 IKEKISKGEYGNAGIMAEVEELRKRVLDMEGKDEElikmEEQCRDlNKRLERETLQSKdfkleVEKLSKRIMALEKLEDA 433
Cdd:pfam01576   80 LESRLEEEEERSQQLQNEKKKMQQHIQDLEEQLDE----EEAARQ-KLQLEKVTTEAK-----IKKLEEDILLLEDQNSK 149
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   434 FNKSK----QECYSLKCNLEKERMTTKQLSQ-------ELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVD 502
Cdd:pfam01576  150 LSKERklleERISEFTSNLAEEEEKAKSLSKlknkheaMISDLEERLKKEEKGRQELEKAKRKLEGESTDLQEQIAELQA 229
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   503 ERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVE-EKMY--SVTKERDDLKNKLKAEEEKGN 579
Cdd:pfam01576  230 QIAELRAQLAKKEEELQAALARLEEETAQKNNALKKIRELEAQISELQEDLEsERAArnKAEKQRRDLGEELEALKTELE 309
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   580 DLLSRVNMlKNRLQSLEAIEKDFLKNKLnQDSGKSTTALHQE-----NNKIKELSQEVE---RLKLKL-KDMKAIEDDLM 650
Cdd:pfam01576  310 DTLDTTAA-QQELRSKREQEVTELKKAL-EEETRSHEAQLQEmrqkhTQALEELTEQLEqakRNKANLeKAKQALESENA 387
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   651 KTEDEYETL-----ERRYANERDKAQFLSKELEHVKMELAKYKLAEKTE--TSHEQWLFKRLQEEEAKSGHLSREVDALK 723
Cdd:pfam01576  388 ELQAELRTLqqakqDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSklQSELESVSSLLNEAEGKNIKLSKDVSSLE 467
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847   724 EKIHEY-----------MATEDLICHLQGDHSVLQKKLNQQENRNRDLGREIENLTKEL 771
Cdd:pfam01576  468 SQLQDTqellqeetrqkLNLSTRLRQLEDERNSLQEQLEEEEEAKRNVERQLSTLQAQL 526
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
471-767 5.81e-06

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 50.84  E-value: 5.81e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   471 ELEAIESRLEKTEFTLKEDLTKLKTLTvmfvDERKTmSEKLKKTEDKLQAASSQLQVEQNKVTtvtEKLIEETKRALKSK 550
Cdd:TIGR02169  178 ELEEVEENIERLDLIIDEKRQQLERLR----REREK-AERYQALLKEKREYEGYELLKEKEAL---ERQKEAIERQLASL 249
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   551 TDVEEKmysVTKERDDLKNKLKAEEEKGNDLLSRVNmlknRLQSLEAIEkdfLKNKLnqdsGKSTTALHQENNKIKELSQ 630
Cdd:TIGR02169  250 EEELEK---LTEEISELEKRLEEIEQLLEELNKKIK----DLGEEEQLR---VKEKI----GELEAEIASLERSIAEKER 315
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   631 EVERLKLKLKDmkaIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKyklaektetsheqwLFKRLQEEEA 710
Cdd:TIGR02169  316 ELEDAEERLAK---LEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELED--------------LRAELEEVDK 378
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 109659847   711 KSGHLSREVDALKEKIHEYmatEDLICHLQGDHSVLQKKLNQQENRNRDLGREIENL 767
Cdd:TIGR02169  379 EFAETRDELKDYREKLEKL---KREINELKRELDRLQEELQRLSEELADLNAAIAGI 432
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
205-774 6.03e-06

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 50.40  E-value: 6.03e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   205 KLIDQEIKSQEEKEQEKEKRVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEE 284
Cdd:TIGR04523   78 KILEQQIKDLNDKLKKNKDKINKLNSDLSKINSEIKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKL 157
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   285 EQKATRLEK---ELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAAlsrqIDELEETNRSLRKAEEELQDIKEKISKG 361
Cdd:TIGR04523  158 NNKYNDLKKqkeELENELNLLEKEKLNIQKNIDKIKNKLLKLELLLSN----LKKKIQKNKSLESQISELKKQNNQLKDN 233
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   362 eygnagIMAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQEC 441
Cdd:TIGR04523  234 ------IEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSEKQKELEQNNKKIKELEKQLNQLKSEISDLNNQKEQD 307
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   442 YS------LKCNLEKERMTTKQLSQELE---SLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLK 512
Cdd:TIGR04523  308 WNkelkseLKNQEKKLEEIQNQISQNNKiisQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIK 387
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   513 KTEDKLQAASSQLQveqnKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRL 592
Cdd:TIGR04523  388 NLESQINDLESKIQ----NQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTR 463
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   593 QSLEAIEKDFLK--NKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKa 670
Cdd:TIGR04523  464 ESLETQLKVLSRsiNKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISD- 542
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   671 qfLSKELEHVKMELAKYKLAEktetsheqwlfkrlqeeeaksghlsrEVDALKEKIHEymatedlichLQGDHSVLQKKL 750
Cdd:TIGR04523  543 --LEDELNKDDFELKKENLEK--------------------------EIDEKNKEIEE----------LKQTQKSLKKKQ 584
                          570       580
                   ....*....|....*....|....
gi 109659847   751 NQQENRNRDLGREIENLTKELERY 774
Cdd:TIGR04523  585 EEKQELIDQKEKEKKDLIKEIEEK 608
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
312-531 7.64e-06

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 49.44  E-value: 7.64e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  312 KLTNEDSQNRQLQQKLAALSRQID----ELEETNRSLRKAEEELQDIKEKISKGEygnagimAEVEELRKRVldmegkDE 387
Cdd:COG3883    17 QIQAKQKELSELQAELEAAQAELDalqaELEELNEEYNELQAELEALQAEIDKLQ-------AEIAEAEAEI------EE 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  388 ELIKMEEQCRDLNKRLERET-----LQSKDFklevEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERmttKQLSQEL 462
Cdd:COG3883    84 RREELGERARALYRSGGSVSyldvlLGSESF----SDFLDRLSALSKIADADADLLEELKADKAELEAKK---AELEAKL 156
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847  463 ESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNK 531
Cdd:COG3883   157 AELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAA 225
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
356-685 8.01e-06

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 50.02  E-value: 8.01e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  356 EKISKGEYGNAGIMAEVEELR-----KRVLDMEGKDEELI----KMEEQCRDLNKRLE-RETLQSKDFKLEVE----KLS 421
Cdd:COG3206    71 SGLSSLSASDSPLETQIEILKsrpvlERVVDKLNLDEDPLgeeaSREAAIERLRKNLTvEPVKGSNVIEISYTspdpELA 150
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  422 KRImaLEKLEDAFNKskqecYSLKCNLEKERMTTKQLSQELESLKvriKELEAIESRLEktEFTLKEDLTKLKtltvmfv 501
Cdd:COG3206   151 AAV--ANALAEAYLE-----QNLELRREEARKALEFLEEQLPELR---KELEEAEAALE--EFRQKNGLVDLS------- 211
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  502 DERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLieETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDL 581
Cdd:COG3206   212 EEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQL--GSGPDALPELLQSPVIQQLRAQLAELEAELAELSARYTPN 289
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  582 LSRVNMLKNRLQSLEAiekdflknKLNQDSGKSTTALHQEnnkIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLER 661
Cdd:COG3206   290 HPDVIALRAQIAALRA--------QLQQEAQRILASLEAE---LEALQAREASLQAQLAQLEARLAELPELEAELRRLER 358
                         330       340
                  ....*....|....*....|....
gi 109659847  662 RYANERDKAQFLSKELEHVKMELA 685
Cdd:COG3206   359 EVEVARELYESLLQRLEEARLAEA 382
COG5022 COG5022
Myosin heavy chain [General function prediction only];
148-755 9.10e-06

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 50.08  E-value: 9.10e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  148 EKHKESYR-RILGQLLVAEKSRRQTIL--ELEEEKRKHKEY-MEKSDEficLLEQECERLKKLIDQEIksqeeKEQEKEK 223
Cdd:COG5022   821 KLQKTIKReKKLRETEEVEFSLKAEVLiqKFGRSLKAKKRFsLLKKET---IYLQSAQRVELAERQLQ-----ELKIDVK 892
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  224 RVTTLKEELTKLKSFALMVV-DEQQRLTAQLTLQRQKIQELTTNAKETHTKLALA-EARVQEEEQKatrlekeLQTQTTK 301
Cdd:COG5022   893 SISSLKLVNLELESEIIELKkSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSiEYVKLPELNK-------LHEVESK 965
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  302 FHQDQDTIMAKLTNEDSQNRQLQ----------QKLAALSRQIDELEETNRSLRKAEEELQDIK--EKISKGEYGNAGIM 369
Cdd:COG5022   966 LKETSEEYEDLLKKSTILVREGNkanselknfkKELAELSKQYGALQESTKQLKELPVEVAELQsaSKIISSESTELSIL 1045
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  370 AEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQ-------SKDFKLEVEKLSKRIMALEKLEDAFNKSKQecy 442
Cdd:COG5022  1046 KPLQKLKGLLLLENNQLQARYKALKLRRENSLLDDKQLYQlestenlLKTINVKDLEVTNRNLVKPANVLQFIVAQM--- 1122
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  443 sLKCNLEKERMttkqlsqelESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAAS 522
Cdd:COG5022  1123 -IKLNLLQEIS---------KFLSQLVNTLEPVFQKLSVLQLELDGLFWEANLEALPSPPPFAALSEKRLYQSALYDEKS 1192
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  523 SQLQVEqnkVTTVTEKLIEETKR-------------ALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLK 589
Cdd:COG5022  1193 KLSSSE---VNDLKNELIALFSKifsgwprgdklkkLISEGWVPTEYSTSLKGFNNLNKKFDTPASMSNEKLLSLLNSID 1269
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  590 NRLQS-------LEAIEKDFLKN-----------KLNQDSGKSTTALHQ--ENNKIKELSQEVERLKLKLKDMKAIEDDL 649
Cdd:COG5022  1270 NLLSSykleeevLPATINSLLQYinvglfnalrtKASSLRWKSATEVNYnsEELDDWCREFEISDVDEELEELIQAVKVL 1349
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  650 MKTEDEYETLERRYanerdKAQFLSKELEHVKMeLAKYKLAEKtETSHEQWLFKRLQEEEAKS--GHLSREVDALKEKIH 727
Cdd:COG5022  1350 QLLKDDLNKLDELL-----DACYSLNPAEIQNL-KSRYDPADK-ENNLPKEILKKIEALLIKQelQLSLEGKDETEVHLS 1422
                         650       660
                  ....*....|....*....|....*...
gi 109659847  728 EYMATEDLICHLQGDHSVLQKKLNQQEN 755
Cdd:COG5022  1423 EIFSEEKSLISLDRNSIYKEEVLSSLSA 1450
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
284-434 9.26e-06

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 49.86  E-value: 9.26e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  284 EEQKATRLEKELQT-QTTKFHQDQDtimakLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGE 362
Cdd:COG2433   383 EELIEKELPEEEPEaEREKEHEERE-----LTEEEEEIRRLEEQVERLEAEVEELEAELEEKDERIERLERELSEARSEE 457
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847  363 YGNAGIMAEVEELRKRVLDMEGkdeELIKMEEQCRDLNKRLER-ETLQSKDFK------LEVEKLSKRimALEKLEDAF 434
Cdd:COG2433   458 RREIRKDREISRLDREIERLER---ELEEERERIEELKRKLERlKELWKLEHSgelvpvKVVEKFTKE--AIRRLEEEY 531
EzrA pfam06160
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ...
310-662 1.41e-05

Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.


Pssm-ID: 428797 [Multi-domain]  Cd Length: 542  Bit Score: 49.08  E-value: 1.41e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   310 MAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISK------GEYGNAgimaeVEELRKRVLDME 383
Cdd:pfam06160   85 KKALDEIEELLDDIEEDIKQILEELDELLESEEKNREEVEELKDKYRELRKtllanrFSYGPA-----IDELEKQLAEIE 159
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   384 GKDEEL---------IKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRI-MALEKLEDAFNKSKQECYSLK-CNLEKER 452
Cdd:pfam06160  160 EEFSQFeeltesgdyLEAREVLEKLEEETDALEELMEDIPPLYEELKTELpDQLEELKEGYREMEEEGYALEhLNVDKEI 239
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   453 mttKQLSQELESLKVRIKELEaiesrLEKTEFTLKEDLTKLKTLTVMF---VDERKTMSEKLKKTEDKLQAASSQLQVEQ 529
Cdd:pfam06160  240 ---QQLEEQLEENLALLENLE-----LDEAEEALEEIEERIDQLYDLLekeVDAKKYVEKNLPEIEDYLEHAEEQNKELK 311
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   530 NKVTTVTEKLI---EETKRALKSKTDVE--EKMYSVTKERDD--------LKNKLKAEEEKGNDLLSRVNMLKNRLQSLE 596
Cdd:pfam06160  312 EELERVQQSYTlneNELERVRGLEKQLEelEKRYDEIVERLEekevayseLQEELEEILEQLEEIEEEQEEFKESLQSLR 391
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   597 AIEKDFLKNKLNQDSGKSTTALHQEN------------------NKIKELSQEVERLKLklkDMKAIEDDLMKTEDEYET 658
Cdd:pfam06160  392 KDELEAREKLDEFKLELREIKRLVEKsnlpglpesyldyffdvsDEIEDLADELNEVPL---NMDEVNRLLDEAQDDVDT 468

                   ....
gi 109659847   659 LERR 662
Cdd:pfam06160  469 LYEK 472
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
160-723 1.44e-05

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 49.66  E-value: 1.44e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   160 QLLVAEKSRRQTILELEEEKRKHKEyMEKSDEFICLLEQECERLKKLIDQEIKSQEEKEQEKEKRVTTLKEELTKLKSFA 239
Cdd:TIGR00606  232 QLESSREIVKSYENELDPLKNRLKE-IEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLNDLYHNH 310
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   240 LMVVDEQ-------QRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQT--------------- 297
Cdd:TIGR00606  311 QRTVREKerelvdcQRELEKLNKERRLLNQEKTELLVEQGRLQLQADRHQEHIRARDSLIQSLATrleldgfergpfser 390
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   298 QTTKFHQDQDTIMAKLTNEDSQN-RQLQQKLAALSRQIDELEETNRS-----------LRKAEEELQDIKEKISKGEYGN 365
Cdd:TIGR00606  391 QIKNFHTLVIERQEDEAKTAAQLcADLQSKERLKQEQADEIRDEKKGlgrtielkkeiLEKKQEELKFVIKELQQLEGSS 470
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   366 AGIMAEVEELRKRVLDM---------EGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAF-- 434
Cdd:TIGR00606  471 DRILELDQELRKAERELskaeknsltETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDeq 550
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   435 ---NKSKQECY------------SLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKL--KTLT 497
Cdd:TIGR00606  551 irkIKSRHSDEltsllgyfpnkkQLEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYedKLFD 630
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   498 VMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETK-------RALKSK-------TDVEEKMYSVTKE 563
Cdd:TIGR00606  631 VCGSQDEESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQLTDENQsccpvcqRVFQTEaelqefiSDLQSKLRLAPDK 710
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   564 RDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAiEKDFLKNKLNqdsgKSTTALHQENNKIKELSQEVERLKLKLKDMK 643
Cdd:TIGR00606  711 LKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEK-EIPELRNKLQ----KVNRDIQRLKNDIEEQETLLGTIMPEEESAK 785
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   644 AIEDDLMKTEDEYETL---ERRYANERDKAQF--LSKELEHVKMELAKYKLAEKTETSHEQWLFKRLQEEEAKSGHLSRE 718
Cdd:TIGR00606  786 VCLTDVTIMERFQMELkdvERKIAQQAAKLQGsdLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSK 865

                   ....*
gi 109659847   719 VDALK 723
Cdd:TIGR00606  866 TNELK 870
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
144-353 1.45e-05

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 49.53  E-value: 1.45e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  144 DKVVEkHKESYRRILGQLLVAEKSRR---------QTILELEEEKRKHKEYMEKSD-----EFICLLEQECERLkkliDQ 209
Cdd:COG4913   228 DALVE-HFDDLERAHEALEDAREQIEllepirelaERYAAARERLAELEYLRAALRlwfaqRRLELLEAELEEL----RA 302
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  210 EIKSQEEKEQEKEKRVTTLKEELTKLKSfALMVVDEQQ--RLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQK 287
Cdd:COG4913   303 ELARLEAELERLEARLDALREELDELEA-QIRGNGGDRleQLEREIERLERELEERERRRARLEALLAALGLPLPASAEE 381
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 109659847  288 ATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQD 353
Cdd:COG4913   382 FAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIPARLLALRD 447
46 PHA02562
endonuclease subunit; Provisional
273-512 1.86e-05

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 48.86  E-value: 1.86e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  273 KLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQ-QKLAALSRQID-ELEETNRSLRKAEEE 350
Cdd:PHA02562  170 KLNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKyDELVEEAKTIKaEIEELTDELLNLVMD 249
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  351 LQDIKEKISKGEYGNAGIMAEVEELRKrVLDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKL 430
Cdd:PHA02562  250 IEDPSAALNKLNTAAAKIKSKIEQFQK-VIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEI 328
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  431 EDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAiESRLEKTEF-TLKEDLTKL---KTLTVMFVDERKT 506
Cdd:PHA02562  329 MDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQA-EFVDNAEELaKLQDELDKIvktKSELVKEKYHRGI 407

                  ....*.
gi 109659847  507 MSEKLK 512
Cdd:PHA02562  408 VTDLLK 413
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
140-343 1.96e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 48.22  E-value: 1.96e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  140 MNELDKVVEKHKESYRRILGQLLVAEK---SRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLIDQEIKSQEE 216
Cdd:COG4942    36 IAELEKELAALKKEEKALLKQLAALERriaALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAELLRALYR 115
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  217 KEQEKEKRVTTLKEELTKL-KSFALM--VVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEK 293
Cdd:COG4942   116 LGRQPPLALLLSPEDFLDAvRRLQYLkyLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKA 195
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 109659847  294 ELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRS 343
Cdd:COG4942   196 ERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPA 245
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
242-884 2.82e-05

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 48.50  E-value: 2.82e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   242 VVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLE----------KELQTQTTKFHQDQDTIMA 311
Cdd:TIGR00606  194 VRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLESSREIVKSYEneldplknrlKEIEHNLSKIMKLDNEIKA 273
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   312 KLTNE---DSQNRQLQQKLA----ALSRQIDELEETN-RSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRV---- 379
Cdd:TIGR00606  274 LKSRKkqmEKDNSELELKMEkvfqGTDEQLNDLYHNHqRTVREKERELVDCQRELEKLNKERRLLNQEKTELLVEQgrlq 353
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   380 LDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFkLEVEKLSKRIMALEKLEDAFNKSKQECYSLKcnlEKERMTTKQLS 459
Cdd:TIGR00606  354 LQADRHQEHIRARDSLIQSLATRLELDGFERGPF-SERQIKNFHTLVIERQEDEAKTAAQLCADLQ---SKERLKQEQAD 429
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   460 QELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMfVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKL 539
Cdd:TIGR00606  430 EIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGS-SDRILELDQELRKAERELSKAEKNSLTETLKKEVKSLQN 508
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   540 ----IEETKRALKSK-------TDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRV-------NMLKNRLQSLEA---I 598
Cdd:TIGR00606  509 ekadLDRKLRKLDQEmeqlnhhTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLlgyfpnkKQLEDWLHSKSKeinQ 588
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   599 EKDFLKnKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDM---KAIEDDLMKTEDEYETLERRYANERDKAQFLSK 675
Cdd:TIGR00606  589 TRDRLA-KLNKELASLEQNKNHINNELESKEEQLSSYEDKLFDVcgsQDEESDLERLKEEIEKSSKQRAMLAGATAVYSQ 667
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   676 ELEHVKME------LAKYKLAEKTETsheQWLFKRLQEEEAKSGHLSREVDALKEKIHEymATEDLICHLQGDHSVLQ-- 747
Cdd:TIGR00606  668 FITQLTDEnqsccpVCQRVFQTEAEL---QEFISDLQSKLRLAPDKLKSTESELKKKEK--RRDEMLGLAPGRQSIIDlk 742
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   748 -KKLNQQENRNRDLGREIENLTKELERYRHFSKSLRPSLNGRRISDPQVFSKEVQTEAVDNEPPDYKSLIPLERAVINGQ 826
Cdd:TIGR00606  743 eKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQGSDLDR 822
                          650       660       670       680       690
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 109659847   827 LYEESENQDEDPNDEGSVLSFKCSQSTPCPVNRKLWIPWMKSKEGHLQNGKMQTKPNA 884
Cdd:TIGR00606  823 TVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNL 880
PTZ00440 PTZ00440
reticulocyte binding protein 2-like protein; Provisional
197-770 2.98e-05

reticulocyte binding protein 2-like protein; Provisional


Pssm-ID: 240419 [Multi-domain]  Cd Length: 2722  Bit Score: 48.67  E-value: 2.98e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  197 EQECERLKKL----IDQEIKSqeekeqekekrvttLKEELTKLKSFALMVVDEQqrltaqLTLQRQKIQELTTNAKETHT 272
Cdd:PTZ00440  672 KNEYEKLEFMksdnIDNIIKN--------------LKKELQNLLSLKENIIKKQ------LNNIEQDISNSLNQYTIKYN 731
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  273 KLALAEARVQEEEQKATRLEKELQTQTTKF-----HQDQDTIMAKLTNED--SQNRQLQQKLAALSRQIDELEEtnrSLR 345
Cdd:PTZ00440  732 DLKSSIEEYKEEEEKLEVYKHQIINRKNEFilhlyENDKDLPDGKNTYEEflQYKDTILNKENKISNDINILKE---NKK 808
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  346 KAEEELQDIKEKISKGEYGNAGIMAEVEELRKRvLDMEGKDEELIKMEEQCRDLNKRLErETLQskdfklEVEKLSKRIM 425
Cdd:PTZ00440  809 NNQDLLNSYNILIQKLEAHTEKNDEELKQLLQK-FPTEDENLNLKELEKEFNENNQIVD-NIIK------DIENMNKNIN 880
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  426 ALEKLEDAFNKS---KQECYSLKCNLEKERMTTKQLSQELES------------LKVRIKELEAIESRLEKTEFT----- 485
Cdd:PTZ00440  881 IIKTLNIAINRSnsnKQLVEHLLNNKIDLKNKLEQHMKIINTdniiqkneklnlLNNLNKEKEKIEKQLSDTKINnlkmq 960
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  486 LKEDLTKLKTLTVMFVDERKTMSEKLKKTED-------KLQAASSQLQVEQNKVTTVT----EKLIEETKRALKSK-TDV 553
Cdd:PTZ00440  961 IEKTLEYYDKSKENINGNDGTHLEKLDKEKDewehfksEIDKLNVNYNILNKKIDDLIkkqhDDIIELIDKLIKEKgKEI 1040
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  554 EEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNklnqdsgksttalhqennkIKELSQEVE 633
Cdd:PTZ00440 1041 EEKVDQYISLLEKMKTKLSSFHFNIDIKKYKNPKIKEEIKLLEEKVEALLKK-------------------IDENKNKLI 1101
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  634 RLKLKLKDMKAIEDDLM-KTEDEYET----LERRYANERDkaqfLSKELEHVKMELAKYKLAEKTETSHEQWLF----KR 704
Cdd:PTZ00440 1102 EIKNKSHEHVVNADKEKnKQTEHYNKkkksLEKIYKQMEK----TLKELENMNLEDITLNEVNEIEIEYERILIdhivEQ 1177
                         570       580       590       600       610       620
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 109659847  705 LQEEEAKSGHLSREVDALKEKIHEYMAteDLICHLQGDHSVLqkKLNQQENRNRDLGREIENLTKE 770
Cdd:PTZ00440 1178 INNEAKKSKTIMEEIESYKKDIDQVKK--NMSKERNDHLTTF--EYNAYYDKATASYENIEELTTE 1239
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
247-409 3.77e-05

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 46.46  E-value: 3.77e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  247 QRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNrqLQQK 326
Cdd:COG1579    20 DRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNNKEYEA--LQKE 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  327 LAALSRQIDELEEtnrSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRV-LDMEGKDEELIKMEEQCRDLNKRLER 405
Cdd:COG1579    98 IESLKRRISDLED---EILELMERIEELEEELAELEAELAELEAELEEKKAELdEELAELEAELEELEAEREELAAKIPP 174

                  ....
gi 109659847  406 ETLQ 409
Cdd:COG1579   175 ELLA 178
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
592-775 4.68e-05

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 47.45  E-value: 4.68e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  592 LQSLEAIEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKAQ 671
Cdd:COG4717    40 LAFIRAMLLERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELE 119
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  672 FLSKELEHVKMELAKYKLAEKTETSHEQWlfKRLQEEEAKSGHLSREVDALKEKIHEY------------MATEDLICHL 739
Cdd:COG4717   120 KLEKLLQLLPLYQELEALEAELAELPERL--EELEERLEELRELEEELEELEAELAELqeeleelleqlsLATEEELQDL 197
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 109659847  740 QGDHSVLQKKLNQQENRNRDLGREIENLTKELERYR 775
Cdd:COG4717   198 AEELEELQQRLAELEEELEEAQEELEELEEELEQLE 233
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
456-712 5.68e-05

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 46.68  E-value: 5.68e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  456 KQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLktltvmfvderktmSEKLKKTEDKLQAASSQLQVEQNKVTTV 535
Cdd:COG4942    23 AEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAAL--------------ERRIAALARRIRALEQELAALEAELAEL 88
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  536 TEKlIEETKRALKsktdveekmysvtKERDDLKNKLKAEEEKGNdlLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKST 615
Cdd:COG4942    89 EKE-IAELRAELE-------------AQKEELAELLRALYRLGR--QPPLALLLSPEDFLDAVRRLQYLKYLAPARREQA 152
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  616 TALHQENNKIKELSQEVERLKLKLKDMKAieddlmKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKYKLAEKTET 695
Cdd:COG4942   153 EELRADLAELAALRAELEAERAELEALLA------ELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELE 226
                         250
                  ....*....|....*..
gi 109659847  696 SheqwLFKRLQEEEAKS 712
Cdd:COG4942   227 A----LIARLEAEAAAA 239
SMC_N pfam02463
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ...
426-782 7.58e-05

RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.


Pssm-ID: 426784 [Multi-domain]  Cd Length: 1161  Bit Score: 47.27  E-value: 7.58e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   426 ALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERK 505
Cdd:pfam02463  174 ALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEI 253
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   506 TMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKtdvEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRV 585
Cdd:pfam02463  254 ESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEEL---KSELLKLERRKVDDEEKLKESEKEKKKAEKEL 330
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   586 NMLKNRLQSLEAIEKDFLKNKLNQDSGKSTTALHQENNKIKElsqevERLKLKLKDMKAIEDDLMKTEDEYETLERRYAN 665
Cdd:pfam02463  331 KKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLE-----EELLAKKKLESERLSSAAKLKEEELELKSEEEK 405
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   666 ERDKAQFLSKELEHVKMELAKYKLAEKTETSHEQWLFKR--------LQEEEAKSGHLSREVDALKEKIHEYMATEDLIC 737
Cdd:pfam02463  406 EAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGklteekeeLEKQELKLLKDELELKKSEDLLKETQLVKLQEQ 485
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*
gi 109659847   738 HLQGDHSVLQKKLNQQENRNRDLGREIENLTKELERYRHFSKSLR 782
Cdd:pfam02463  486 LELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGR 530
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
224-355 9.09e-05

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 46.55  E-value: 9.09e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  224 RVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKAT---------RLEKE 294
Cdd:COG3206   220 QLSELESQLAEARAELAEAEARLAALRAQLGSGPDALPELLQSPVIQQLRAQLAELEAELAELSARytpnhpdviALRAQ 299
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 109659847  295 LQTQTTKFHQDQDTIMAKLtneDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIK 355
Cdd:COG3206   300 IAALRAQLQQEAQRILASL---EAELEALQAREASLQAQLAQLEARLAELPELEAELRRLE 357
COG5283 COG5283
Phage-related tail protein [Mobilome: prophages, transposons];
237-403 9.52e-05

Phage-related tail protein [Mobilome: prophages, transposons];


Pssm-ID: 444094 [Multi-domain]  Cd Length: 747  Bit Score: 46.77  E-value: 9.52e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  237 SFALMVVDeqQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMA---KL 313
Cdd:COG5283     2 QVILGAVD--KPFKSALESAKQRVAALAQALKALEAPTRALARALERAKQAAARLQTKYNKLRQSLQRLRQALDQagiDT 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  314 TNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISK-GEYGNAGI-------------------MAEVe 373
Cdd:COG5283    80 RQLSAAQRRLRSSLEQTNRQLERQQQRLARLGARQDRLKAARARLQRlAGAGAAAAaigaalaasvkpaidfedaMADV- 158
                         170       180       190
                  ....*....|....*....|....*....|
gi 109659847  374 elrKRVLDMEGKDEELIKMEEQCRDLNKRL 403
Cdd:COG5283   159 ---AATVDLDKSSEQFKALGKQARELSAQT 185
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
383-586 1.29e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 46.45  E-value: 1.29e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  383 EGKDEELIKMEEQCRDLNKRLEretlqskDFKLEVEKLSKRIMALEKLEDAFN-----KSKQECYSlkcNLEKERMTTKQ 457
Cdd:COG4913   613 AALEAELAELEEELAEAEERLE-------ALEAELDALQERREALQRLAEYSWdeidvASAEREIA---ELEAELERLDA 682
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  458 LSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKtltvmfvDERKTMSEKLKKTEDKLQAASSQLQVEQnkvttvtE 537
Cdd:COG4913   683 SSDDLAALEEQLEELEAELEELEEELDELKGEIGRLE-------KELEQAEEELDELQDRLEAAEDLARLEL-------R 748
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*....
gi 109659847  538 KLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVN 586
Cdd:COG4913   749 ALLEERFAAALGDAVERELRENLEERIDALRARLNRAEEELERAMRAFN 797
PRK06975 PRK06975
bifunctional uroporphyrinogen-III synthetase/uroporphyrin-III C-methyltransferase; Reviewed
246-352 1.30e-04

bifunctional uroporphyrinogen-III synthetase/uroporphyrin-III C-methyltransferase; Reviewed


Pssm-ID: 235899 [Multi-domain]  Cd Length: 656  Bit Score: 46.25  E-value: 1.30e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  246 QQRL--TAQLTLQRQkiqelttNAKETHTklalAEARVQEEEqkatrlekeLQTQTTKFHQDQDTIMAKLTNEDSQNRQL 323
Cdd:PRK06975  345 NRKVdrLDQELVQRQ-------QANDAQT----AELRVKTEQ---------AQASVHQLDSQFAQLDGKLADAQSAQQAL 404
                          90       100       110
                  ....*....|....*....|....*....|.
gi 109659847  324 QQKLAALSRQID--ELEETNRSLRKAEEELQ 352
Cdd:PRK06975  405 EQQYQDLSRNRDdwMIAEVEQMLSSASQQLQ 435
PRK12704 PRK12704
phosphodiesterase; Provisional
344-493 1.41e-04

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 45.92  E-value: 1.41e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  344 LRKAEEELQDIKEKISKgeygnagimaEVEELRKRVLdMEGKDEELIKMEEQCRDLNKRleRETLQSKDFKLE--VEKLS 421
Cdd:PRK12704   33 IKEAEEEAKRILEEAKK----------EAEAIKKEAL-LEAKEEIHKLRNEFEKELRER--RNELQKLEKRLLqkEENLD 99
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  422 KRIMALEKLEDAFNKSKQECYSLKCNLEK-----ERMTTKQLsQELEslkvRIKEL---EAIESRLEKTEFTLKEDLTKL 493
Cdd:PRK12704  100 RKLELLEKREEELEKKEKELEQKQQELEKkeeelEELIEEQL-QELE----RISGLtaeEAKEILLEKVEEEARHEAAVL 174
46 PHA02562
endonuclease subunit; Provisional
198-401 1.60e-04

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 45.78  E-value: 1.60e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  198 QECERLKKLIDQEIKS----QEEKEQEKEKRVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTK 273
Cdd:PHA02562  184 QTLDMKIDHIQQQIKTynknIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTA 263
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  274 LALAEARVQEEEQKATRLEK--ELQTQTTKFHQDQDtIMAKLTNedsQNRQLQQKLAALSRQIDELEETNRSLRKAEEEL 351
Cdd:PHA02562  264 AAKIKSKIEQFQKVIKMYEKggVCPTCTQQISEGPD-RITKIKD---KLKELQHSLEKLDTAIDELEEIMDEFNEQSKKL 339
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 109659847  352 QDIKEKISKGEYG-------NAGIMAEVEELRKRVLDmegKDEELIKMEEQCRDLNK 401
Cdd:PHA02562  340 LELKNKISTNKQSlitlvdkAKKVKAAIEELQAEFVD---NAEELAKLQDELDKIVK 393
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
329-785 1.82e-04

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 45.81  E-value: 1.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   329 ALSRQIDELEETNRSLrKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETL 408
Cdd:TIGR00606  170 ALKQKFDEIFSATRYI-KALETLRQVRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLESSREIVKSYENELD 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   409 QSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQEleslkvrikELEAIESRLEKTEFTLKE 488
Cdd:TIGR00606  249 PLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDE---------QLNDLYHNHQRTVREKER 319
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   489 DLTKLKTLTVMFVDERKTMSEklKKTEDKLQAASSQLQVEQNKVTTVTEKLiEETKRALKSKTDVEEKMYSVTKErddLK 568
Cdd:TIGR00606  320 ELVDCQRELEKLNKERRLLNQ--EKTELLVEQGRLQLQADRHQEHIRARDS-LIQSLATRLELDGFERGPFSERQ---IK 393
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   569 NKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKSTT------ALHQENNKIKELSQEVERLKLKLKDM 642
Cdd:TIGR00606  394 NFHTLVIERQEDEAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTielkkeILEKKQEELKFVIKELQQLEGSSDRI 473
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   643 KAIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKYKLAEKTE------TSHEQWLF---KRLQEEEAKSG 713
Cdd:TIGR00606  474 LELDQELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEqlnhhtTTRTQMEMltkDKMDKDEQIRK 553
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 109659847   714 HLSREVDALKEKIHEYMATEDLICHLQGdhsvLQKKLNQQENRNRDLGREIENLTKELERYRHFSKSLRPSL 785
Cdd:TIGR00606  554 IKSRHSDELTSLLGYFPNKKQLEDWLHS----KSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQL 621
PRK12704 PRK12704
phosphodiesterase; Provisional
256-427 1.91e-04

phosphodiesterase; Provisional


Pssm-ID: 237177 [Multi-domain]  Cd Length: 520  Bit Score: 45.54  E-value: 1.91e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  256 QRQKIQELTTNAKETHTKLALAEARvQEEEQKATRLEKE-------LQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLA 328
Cdd:PRK12704   39 EAKRILEEAKKEAEAIKKEALLEAK-EEIHKLRNEFEKElrerrneLQKLEKRLLQKEENLDRKLELLEKREEELEKKEK 117
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  329 ALSRQIDELEETNRSLRKAEEELQDIKEKISkgeygnaGIMAevEELRKRVLDmegkdeeliKMEEQCRD----LNKRLE 404
Cdd:PRK12704  118 ELEQKQQELEKKEEELEELIEEQLQELERIS-------GLTA--EEAKEILLE---------KVEEEARHeaavLIKEIE 179
                         170       180
                  ....*....|....*....|...
gi 109659847  405 RETlqskdfKLEVEKLSKRIMAL 427
Cdd:PRK12704  180 EEA------KEEADKKAKEILAQ 196
PRK01156 PRK01156
chromosome segregation protein; Provisional
166-792 1.93e-04

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 45.66  E-value: 1.93e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  166 KSRRQTILELEEEKRKHKEYMEKSDEFICLLEQEcERLKKLIDQEIKSQEEKEQEKEKRVTTLKEELTKLKSFAlmvvDE 245
Cdd:PRK01156  176 DMLRAEISNIDYLEEKLKSSNLELENIKKQIADD-EKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLE----DM 250
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  246 QQRLTAQLtlqrqkiqelttnaKETHTKLALAEARVQEEEQKATRLEKelqTQTTKFHQDQDTIMAKLTNEdSQNRQLQQ 325
Cdd:PRK01156  251 KNRYESEI--------------KTAESDLSMELEKNNYYKELEERHMK---IINDPVYKNRNYINDYFKYK-NDIENKKQ 312
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  326 KLAALSRQIDELEETNRSLRKAEEELQDIKEKISkgeygnagimaEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLER 405
Cdd:PRK01156  313 ILSNIDAEINKYHAIIKKLSVLQKDYNDYIKKKS-----------RYDDLNNQILELEGYEMDYNSYLKSIESLKKKIEE 381
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  406 ETLQSKDFKLEVEKLSKRIMALEKledafnkskqecySLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTE-- 483
Cdd:PRK01156  382 YSKNIERMSAFISEILKIQEIDPD-------------AIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRNMem 448
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  484 -----------FTLKEDltKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQlQVEQNKVTTVTEKliEETKRALKSKTD 552
Cdd:PRK01156  449 lngqsvcpvcgTTLGEE--KSNHIINHYNEKKSRLEEKIREIEIEVKDIDEK-IVDLKKRKEYLES--EEINKSINEYNK 523
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  553 VEEKMYSVTKERDDLkNKLKAEEEKGNDLLSRVNMLKnrLQSLEAIEKDFLkNKLNQDSGKSTTALHQE----NNKIKEL 628
Cdd:PRK01156  524 IESARADLEDIKIKI-NELKDKHDKYEEIKNRYKSLK--LEDLDSKRTSWL-NALAVISLIDIETNRSRsneiKKQLNDL 599
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  629 SQEVERLKLKLKDMKA-IEDDLMKTEDEYETLERRYANERDKA---QFLSKELEHVKMELAKYKLAEKTETSheqwLFKR 704
Cdd:PRK01156  600 ESRLQEIEIGFPDDKSyIDKSIREIENEANNLNNKYNEIQENKiliEKLRGKIDNYKKQIAEIDSIIPDLKE----ITSR 675
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  705 LQEEEAKSGHLSREVDALKEKIHEYMATedlichlqgdHSVLQKKLNQQENRNRDLGREIENLTK------ELERYRH-F 777
Cdd:PRK01156  676 INDIEDNLKKSRKALDDAKANRARLEST----------IEILRTRINELSDRINDINETLESMKKikkaigDLKRLREaF 745
                         650
                  ....*....|....*
gi 109659847  778 SKSLRPSLNGRRISD 792
Cdd:PRK01156  746 DKSGVPAMIRKSASQ 760
CCCAP pfam15964
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in ...
224-607 2.09e-04

Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in eukaryotes. CCCAP is also known as SDCCAG8, serologically defined colon cancer antigen 8. It is associated with the centrosome.


Pssm-ID: 435040 [Multi-domain]  Cd Length: 703  Bit Score: 45.28  E-value: 2.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   224 RVTTLKEEL-TKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKF 302
Cdd:pfam15964  304 RLTKERDDLmSALVSVRSSLAEAQQRESSAYEQVKQAVQMTEEANFEKTKALIQCEQLKSELERQKERLEKELASQQEKR 383
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   303 HQDQDTIMAKLTNEDSQNRQ----LQQKLAALSRQIDELEETNRSLRKAEEELQ---------------DIKEKISKGEY 363
Cdd:pfam15964  384 AQEKEALRKEMKKEREELGAtmlaLSQNVAQLEAQVEKVTREKNSLVSQLEEAQkqlasqemdvtkvcgEMRYQLNQTKM 463
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   364 GNAGIMAEVEELRKRVL-DMEGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKrimALEKLEDAFNKSKQECY 442
Cdd:pfam15964  464 KKDEAEKEHREYRTKTGrQLEIKDQEIEKLGLELSESKQRLEQAQQDAARAREECLKLTE---LLGESEHQLHLTRLEKE 540
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   443 SLKCNLEKE-RMTTKQLSQELESLKVRIKELEAiesRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAA 521
Cdd:pfam15964  541 SIQQSFSNEaKAQALQAQQREQELTQKMQQMEA---QHDKTVNEQYSLLTSQNTFIAKLKEECCTLAKKLEEITQKSRSE 617
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   522 SSQLQVEQNKVTTVTEKLIEETKRaLKSKTDVEEKMYSVTKER-DDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLeAIEK 600
Cdd:pfam15964  618 VEQLSQEKEYLQDRLEKLQKRNEE-LEEQCVQHGRMHERMKQRlRQLDKHCQATAQQLVQLLSKQNQLFKERQNL-TEEV 695

                   ....*..
gi 109659847   601 DFLKNKL 607
Cdd:pfam15964  696 QSLRSQV 702
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
224-409 2.40e-04

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 45.39  E-value: 2.40e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  224 RVTTLKEEL----TKLKSFA----LMVVDEQ-QRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLE-- 292
Cdd:COG3206   183 QLPELRKELeeaeAALEEFRqkngLVDLSEEaKLLLQQLSELESQLAEARAELAEAEARLAALRAQLGSGPDALPELLqs 262
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  293 ---KELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDelEETNRSLRKAEEELQDIKEKIskgeygnAGIM 369
Cdd:COG3206   263 pviQQLRAQLAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQ--QEAQRILASLEAELEALQARE-------ASLQ 333
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|
gi 109659847  370 AEVEELRKRVLDMEGKDEELIKMEEQcRDLNKRLERETLQ 409
Cdd:COG3206   334 AQLAQLEARLAELPELEAELRRLERE-VEVARELYESLLQ 372
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
63-409 3.47e-04

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 44.93  E-value: 3.47e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   63 QAEDLSRDDLLFLLSILEGELQARDEVIGILKAEKMDLALLEAQY----GFVTPKKVLEALQRDAFQAKSTPWQEDIYEK 138
Cdd:COG1196   459 EALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYegflEGVKAALLLAGLRGLAGAVAVLIGVEAAYEA 538
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  139 PMNE------LDKVVEKHKESYRRIlgQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEFI--------CLLEQECERLK 204
Cdd:COG1196   539 ALEAalaaalQNIVVEDDEVAAAAI--EYLKAAKAGRATFLPLDKIRARAALAAALARGAIgaavdlvaSDLREADARYY 616
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  205 KLIDQE---------IKSQEEKEQEKEKRVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLA 275
Cdd:COG1196   617 VLGDTLlgrtlvaarLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELE 696
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  276 LAEARVQEEEQKATRLEKELQTQTtkfHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIK 355
Cdd:COG1196   697 EALLAEEEEERELAEAEEERLEEE---LEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLE 773
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....
gi 109659847  356 EKISKGEYGNAGIMAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQ 409
Cdd:COG1196   774 REIEALGPVNLLAIEEYEELEERYDFLSEQREDLEEARETLEEAIEEIDRETRE 827
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
328-496 3.54e-04

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 44.85  E-value: 3.54e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  328 AALSRQI-----DELEETNRSLRKAEEELQDIKEKISKGEygnagimAEVEELRKRVLDMEG----KDEELIKMEEQCRD 398
Cdd:COG2433   380 EALEELIekelpEEEPEAEREKEHEERELTEEEEEIRRLE-------EQVERLEAEVEELEAeleeKDERIERLERELSE 452
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  399 LNKRLERETLQSKdfklEVEKLSKRIMALEKledafnkskqecyslkcnlekermttkqlsqELESLKVRIKELEAIESR 478
Cdd:COG2433   453 ARSEERREIRKDR----EISRLDREIERLER-------------------------------ELEEERERIEELKRKLER 497
                         170
                  ....*....|....*....
gi 109659847  479 LEKTEFTL-KEDLTKLKTL 496
Cdd:COG2433   498 LKELWKLEhSGELVPVKVV 516
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
232-440 5.95e-04

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 43.99  E-value: 5.95e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  232 LTKLKSFALMVVDEQQRLTAQLTLQRQKIQEL-------TTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQ 304
Cdd:COG4717   293 LAREKASLGKEAEELQALPALEELEEEELEELlaalglpPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEI 372
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  305 DQDTIMAKLTNEDS---------QNRQLQQKLAALSRQIDELEETNRSLRKA------EEELQDIKEKISKGEygnagim 369
Cdd:COG4717   373 AALLAEAGVEDEEElraaleqaeEYQELKEELEELEEQLEELLGELEELLEAldeeelEEELEELEEELEELE------- 445
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 109659847  370 AEVEELRKRVLDMEgkdEELIKMEEQCRDLNKRLERETLQSKdFKLEVEKLSKRIMALEKLEDAFNKSKQE 440
Cdd:COG4717   446 EELEELREELAELE---AELEQLEEDGELAELLQELEELKAE-LRELAEEWAALKLALELLEEAREEYREE 512
46 PHA02562
endonuclease subunit; Provisional
132-360 6.17e-04

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 43.85  E-value: 6.17e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  132 QEDIYEKPMNELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLIDQei 211
Cdd:PHA02562  196 QIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQ-- 273
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  212 ksqeekeqekekrvttlkeeLTKLKSFaLMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKatrl 291
Cdd:PHA02562  274 --------------------FQKVIKM-YEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEI---- 328
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 109659847  292 EKELQTQTTKFHQDQDTIMAK---LTNEDSQNRQLQqklAALSRQIDELEETNRSLRKAEEELQDIKEKISK 360
Cdd:PHA02562  329 MDEFNEQSKKLLELKNKISTNkqsLITLVDKAKKVK---AAIEELQAEFVDNAEELAKLQDELDKIVKTKSE 397
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
310-700 9.38e-04

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 43.50  E-value: 9.38e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   310 MAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEE-LQDIKEKISKGEYGN--AGIMAEVEEL---RKRVLD-- 381
Cdd:TIGR01612  485 IDENSKQDNTVKLILMRMKDFKDIIDFMELYKPDEVPSKNIiGFDIDQNIKAKLYKEieAGLKESYELAknwKKLIHEik 564
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   382 --MEGKDEELIKMEEQCRDL-NKRLE--RETLQSKDFKLEVEklskrimalEKLEDAFNKSK--QECYSLKCNLEKERMT 454
Cdd:TIGR01612  565 keLEEENEDSIHLEKEIKDLfDKYLEidDEIIYINKLKLELK---------EKIKNISDKNEyiKKAIDLKKIIENNNAY 635
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   455 TKQLS-----QELESLKVRIKELEAIESRLEKTeftLKEDLTKLKTLTVMFVDErktmsEKLKKTEDK--LQAASSQLQV 527
Cdd:TIGR01612  636 IDELAkispyQVPEHLKNKDKIYSTIKSELSKI---YEDDIDALYNELSSIVKE-----NAIDNTEDKakLDDLKSKIDK 707
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   528 EQNKVTTVTEKLIE------ETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNrlqslEAIEKD 601
Cdd:TIGR01612  708 EYDKIQNMETATVElhlsniENKKNELLDIIVEIKKHIHGEINKDLNKILEDFKNKEKELSNKINDYAK-----EKDELN 782
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   602 FLKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLerryanerdKAQFLSKELEHVK 681
Cdd:TIGR01612  783 KYKSKISEIKNHYNDQINIDNIKDEDAKQNYDKSKEYIKTISIKEDEIFKIINEMKFM---------KDDFLNKVDKFIN 853
                          410
                   ....*....|....*....
gi 109659847   682 MElakYKLAEKTETSHEQW 700
Cdd:TIGR01612  854 FE---NNCKEKIDSEHEQF 869
HOOK pfam05622
HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from ...
322-801 9.87e-04

HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organizms. The different members of the human gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three human HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas this central coiled-coil motif mediates homodimerization and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes, whereas both HOOK1 and HOOK2 are localized to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head. This entry includes the central coiled-coiled domain and the divergent C-terminal domain.


Pssm-ID: 461694 [Multi-domain]  Cd Length: 528  Bit Score: 43.14  E-value: 9.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   322 QLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVldmEGKDEELIKMeEQCRDlnk 401
Cdd:pfam05622   11 ELAQRCHELDQQVSLLQEEKNSLQQENKKLQERLDQLESGDDSGTPGGKKYLLLQKQL---EQLQEENFRL-ETARD--- 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   402 rleretlqskDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMT---TKQLSQELESLKVRIKELEaiesr 478
Cdd:pfam05622   84 ----------DYRIKCEELEKEVLELQHRNEELTSLAEEAQALKDEMDILRESsdkVKKLEATVETYKKKLEDLG----- 148
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   479 lektefTLKEDLTKLKTLTVMFVDERKTMSEKLKKTedklQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTD---VEE 555
Cdd:pfam05622  149 ------DLRRQVKLLEERNAEYMQRTLQLEEELKKA----NALRGQLETYKRQVQELHGKLSEESKKADKLEFEykkLEE 218
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   556 KMYSVTKERDDLKNK----------LKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLnqdsgKSTTALHQENNKI 625
Cdd:pfam05622  219 KLEALQKEKERLIIErdtlretneeLRCAQLQQAELSQADALLSPSSDPGDNLAAEIMPAEI-----REKLIRLQHENKM 293
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   626 KELSQEvERLKLKLKDMKAIEDDLMKTEDEYETlERRYANERdkaqflSKELEHVKMELAKYKLAEKTETSHEQWLFKRL 705
Cdd:pfam05622  294 LRLGQE-GSYRERLTELQQLLEDANRRKNELET-QNRLANQR------ILELQQQVEELQKALQEQGSKAEDSSLLKQKL 365
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   706 QEEEAKSGHLSREVDALKEKIHEYMATEDLICHLQGDHsvLQKKLNQQENRNRDLGreiENLTKELERYRHFSKSLRPSL 785
Cdd:pfam05622  366 EEHLEKLHEAQSELQKKKEQIEELEPKQDSNLAQKIDE--LQEALRKKDEDMKAME---ERYKKYVEKAKSVIKTLDPKQ 440
                          490
                   ....*....|....*.
gi 109659847   786 NGRRISDPQVFSKEVQ 801
Cdd:pfam05622  441 NPASPPEIQALKNQLL 456
COG2433 COG2433
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
535-722 1.00e-03

Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];


Pssm-ID: 441980 [Multi-domain]  Cd Length: 644  Bit Score: 43.31  E-value: 1.00e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  535 VTEKLIEEtkralksKTDVEEKMYSVTKERDDlkNKLKAEEEKGNDLLSRVNMLKNRLQSLEaiekdflknklnqdsgks 614
Cdd:COG2433   381 ALEELIEK-------ELPEEEPEAEREKEHEE--RELTEEEEEIRRLEEQVERLEAEVEELE------------------ 433
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  615 ttalhqenNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANerdkaqfLSKELEHVKMELAkyKLAEKTE 694
Cdd:COG2433   434 --------AELEEKDERIERLERELSEARSEERREIRKDREISRLDREIER-------LERELEEERERIE--ELKRKLE 496
                         170       180
                  ....*....|....*....|....*....
gi 109659847  695 tsheqwLFKRLQEEEAKSGHLS-REVDAL 722
Cdd:COG2433   497 ------RLKELWKLEHSGELVPvKVVEKF 519
PRK05771 PRK05771
V-type ATP synthase subunit I; Validated
507-726 1.15e-03

V-type ATP synthase subunit I; Validated


Pssm-ID: 235600 [Multi-domain]  Cd Length: 646  Bit Score: 42.99  E-value: 1.15e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  507 MSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKERddlKNKLKAEEEKGNDLLSRVN 586
Cdd:PRK05771   51 LLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEE---ISELENEIKELEQEIERLE 127
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  587 MLKN------RLQSLE-------AIEKDFLKNKLNQDSGKSTTALHQENNKI-------KELSQEVERLkLKLKDMKAIE 646
Cdd:PRK05771  128 PWGNfdldlsLLLGFKyvsvfvgTVPEDKLEELKLESDVENVEYISTDKGYVyvvvvvlKELSDEVEEE-LKKLGFERLE 206
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  647 -DDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKYKLAEKTETSHEqwlfkrLQEEE-----AKSGHL----- 715
Cdd:PRK05771  207 lEEEGTPSELIREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIE------LERAEalskfLKTDKTfaieg 280
                         250
                  ....*....|....
gi 109659847  716 ---SREVDALKEKI 726
Cdd:PRK05771  281 wvpEDRVKKLKELI 294
Spc7 smart00787
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ...
291-430 1.30e-03

Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.


Pssm-ID: 197874 [Multi-domain]  Cd Length: 312  Bit Score: 42.31  E-value: 1.30e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847    291 LEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKA-EEELQDIKEKISKGEYGNAGIM 369
Cdd:smart00787  145 LKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCdPTELDRAKEKLKKLLQEIMIKV 224
                            90       100       110       120       130       140
                    ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 109659847    370 AEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFKL-EVEKLSKRIMALEKL 430
Cdd:smart00787  225 KKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQCRGFTFkEIEKLKEQLKLLQSL 286
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
219-798 1.35e-03

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 42.70  E-value: 1.35e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   219 QEKEKRVTTLKEEL----TKLKSFALMVVD-----------------EQQRLTAQLTLQRQKIQELTTNAKETHTKLALA 277
Cdd:TIGR04523   36 KQLEKKLKTIKNELknkeKELKNLDKNLNKdeekinnsnnkikileqQIKDLNDKLKKNKDKINKLNSDLSKINSEIKND 115
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   278 EARVQEEEQKATRLEKELQtqttKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELE----ETNRSLRKAEEELQD 353
Cdd:TIGR04523  116 KEQKNKLEVELNKLEKQKK----ENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELEnelnLLEKEKLNIQKNIDK 191
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   354 IKEKISKGEY----------GNAGIMAEVEELRKRVL-----------DMEGKDEELIKMEEQCRDLNKRLERETLQSKD 412
Cdd:TIGR04523  192 IKNKLLKLELllsnlkkkiqKNKSLESQISELKKQNNqlkdniekkqqEINEKTTEISNTQTQLNQLKDEQNKIKKQLSE 271
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   413 FKLEVEKLSKRImalEKLEDAFNKSKQECYSLkcNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTK 492
Cdd:TIGR04523  272 KQKELEQNNKKI---KELEKQLNQLKSEISDL--NNQKEQDWNKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQ 346
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   493 LKTLTVMFVDERKTMSEKLKKTEDKLQ-----------------AASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEE 555
Cdd:TIGR04523  347 LKKELTNSESENSEKQRELEEKQNEIEklkkenqsykqeiknleSQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEK 426
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   556 KMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLK--NKLNQDSGKSTTALHQENNKIKELSQEVE 633
Cdd:TIGR04523  427 EIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRsiNKIKQNLEQKQKELKSKEKELKKLNEEKK 506
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   634 RLKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKaqfLSKELEHVKMELAKYKLAEK--------TETSHEQWLFKRL 705
Cdd:TIGR04523  507 ELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISD---LEDELNKDDFELKKENLEKEideknkeiEELKQTQKSLKKK 583
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   706 QEE-EAKSGHLSREVDALKEKIHEYMATEdlichlqgdhSVLQKKLNQQENRNRDLGREIENLTKELERYRHFSKSLRPS 784
Cdd:TIGR04523  584 QEEkQELIDQKEKEKKDLIKEIEEKEKKI----------SSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIKET 653
                          650
                   ....*....|....
gi 109659847   785 LNGRRISDPQVFSK 798
Cdd:TIGR04523  654 IKEIRNKWPEIIKK 667
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
350-550 1.44e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 41.45  E-value: 1.44e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  350 ELQDIKEKISKgeygnagIMAEVEELRKRVLDMEgkdEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEK 429
Cdd:COG1579    11 DLQELDSELDR-------LEHRLKELPAELAELE---DELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  430 LEDAFNKSKQecYslkcnlekermttKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKtltvmfvDERKTMSE 509
Cdd:COG1579    81 QLGNVRNNKE--Y-------------EALQKEIESLKRRISDLEDEILELMERIEELEEELAELE-------AELAELEA 138
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|.
gi 109659847  510 KLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSK 550
Cdd:COG1579   139 ELEEKKAELDEELAELEAELEELEAEREELAAKIPPELLAL 179
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
137-660 1.83e-03

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 42.47  E-value: 1.83e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   137 EKPMNELDKVVEKHKESYRRILGQLLVAEKSRRqtilELEEEKRKHKEYMEKSDEFICLLEQECERL------------- 203
Cdd:pfam01576  172 EEKAKSLSKLKNKHEAMISDLEERLKKEEKGRQ----ELEKAKRKLEGESTDLQEQIAELQAQIAELraqlakkeeelqa 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   204 ----------------KKL---------IDQEIKSQEEKEQEKEKRVTTLKEELTKLKSfalMVVDEQQRLTAQLTLQRQ 258
Cdd:pfam01576  248 alarleeetaqknnalKKIreleaqiseLQEDLESERAARNKAEKQRRDLGEELEALKT---ELEDTLDTTAAQQELRSK 324
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   259 KIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELE 338
Cdd:pfam01576  325 REQEVTELKKALEEETRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQAKQDSE 404
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   339 etnRSLRKAEEELQDIKEKISKGEYGNA-------GIMAEVEELRKRVLDMEGKD----EELIKMEEQCRDLNKRLERET 407
Cdd:pfam01576  405 ---HKRKKLEGQLQELQARLSESERQRAelaeklsKLQSELESVSSLLNEAEGKNiklsKDVSSLESQLQDTQELLQEET 481
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   408 LQ-----SKDFKLEVEKLSKRIMALEKLEDAFNKSKQ------ECYSLKCNLEKERMTT-------KQLSQELESLKVRI 469
Cdd:pfam01576  482 RQklnlsTRLRQLEDERNSLQEQLEEEEEAKRNVERQlstlqaQLSDMKKKLEEDAGTLealeegkKRLQRELEALTQQL 561
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   470 KELEAIESRLEKTEFTLKEDLTKLktltVMFVDERKTMSEKLKKTEDKLqaasSQLQVEQNkvtTVTEKLIEETKRALKS 549
Cdd:pfam01576  562 EEKAAAYDKLEKTKNRLQQELDDL----LVDLDHQRQLVSNLEKKQKKF----DQMLAEEK---AISARYAEERDRAEAE 630
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   550 KTDVEEKMYSVTKERDDLKNKlKAEEEKGNdllsrvnmlknrlQSLEAIEKDFLKNKlnQDSGKSttaLHQENNKIKELS 629
Cdd:pfam01576  631 AREKETRALSLARALEEALEA-KEELERTN-------------KQLRAEMEDLVSSK--DDVGKN---VHELERSKRALE 691
                          570       580       590
                   ....*....|....*....|....*....|.
gi 109659847   630 QEVERLKLKLKDMkaiEDDLMKTEDEYETLE 660
Cdd:pfam01576  692 QQVEEMKTQLEEL---EDELQATEDAKLRLE 719
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
132-636 1.84e-03

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 42.73  E-value: 1.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   132 QEDIYE---KPMNELDKVvEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSdeficlLEQECERLkklid 208
Cdd:TIGR01612 1185 KKNIYDeikKLLNEIAEI-EKDKTSLEEVKGINLSYGKNLGKLFLEKIDEEKKKSEHMIKA------MEAYIEDL----- 1252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   209 QEIKSQEEKEQEKEKRVTTLKEELtklKSFALMVVDEQQRLTAQltlqrQKIQELTTNAKETHTKLALAEARvqeeEQKA 288
Cdd:TIGR01612 1253 DEIKEKSPEIENEMGIEMDIKAEM---ETFNISHDDDKDHHIIS-----KKHDENISDIREKSLKIIEDFSE----ESDI 1320
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   289 TRLEKELQT---QTTKFHQDQDTIMAKLTN-----EDSQNRQLQQKLAALSRQIDELEET-NRSLRKAEEELQDIKEKIS 359
Cdd:TIGR01612 1321 NDIKKELQKnllDAQKHNSDINLYLNEIANiynilKLNKIKKIIDEVKEYTKEIEENNKNiKDELDKSEKLIKKIKDDIN 1400
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   360 KGEYGNA-----------GIMAEVEELRKRVLDMEGKDEELIKMEEQCRDlNKRLERETLQSKDFKLE-VEKLSK----- 422
Cdd:TIGR01612 1401 LEECKSKiestlddkdidECIKKIKELKNHILSEESNIDTYFKNADENNE-NVLLLFKNIEMADNKSQhILKIKKdnatn 1479
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   423 ----RIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEaIESRLEKTEFTLKEDLTKLKTLTV 498
Cdd:TIGR01612 1480 dhdfNINELKEHIDKSKGCKDEADKNAKAIEKNKELFEQYKKDVTELLNKYSALA-IKNKFAKTKKDSEIIIKEIKDAHK 1558
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   499 MFVDERKTMSEKLKKTEDKlqaassQLQVE--------QNKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNK 570
Cdd:TIGR01612 1559 KFILEAEKSEQKIKEIKKE------KFRIEddaakndkSNKAAIDIQLSLENFENKFLKISDIKKKINDCLKETESIEKK 1632
                          490       500       510       520       530       540       550
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 109659847   571 LK-----AEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKlnqdsgkstTALHQENNKIKELSQEVERLK 636
Cdd:TIGR01612 1633 ISsfsidSQDTELKENGDNLNSLQEFLESLKDQKKNIEDKK---------KELDELDSEIEKIEIDVDQHK 1694
MukB COG3096
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ...
244-532 1.97e-03

Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442330 [Multi-domain]  Cd Length: 1470  Bit Score: 42.63  E-value: 1.97e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  244 DEQQRLTAQLTLQRQKIQELttnaKETHTKLA--LAEARVQEEEQKATRLEkELQTQTTKFHQDQDTIMAKltneDSQNR 321
Cdd:COG3096   850 RELAQHRAQEQQLRQQLDQL----KEQLQLLNklLPQANLLADETLADRLE-ELREELDAAQEAQAFIQQH----GKALA 920
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  322 QLQQKLAALSRQIDELEETNRSLRKAEEELQDIKekiskgeygnAGIMAeVEELRKRVLDMEGKDEEliKMEEQCRDLNK 401
Cdd:COG3096   921 QLEPLVAVLQSDPEQFEQLQADYLQAKEQQRRLK----------QQIFA-LSEVVQRRPHFSYEDAV--GLLGENSDLNE 987
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  402 RLERETLQSKdfklevEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEK 481
Cdd:COG3096   988 KLRARLEQAE------EARREAREQLRQAQAQYSQYNQVLASLKSSRDAKQQTLQELEQELEELGVQADAEAEERARIRR 1061
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847  482 TEftLKEDLTKL--------KTLTVmFVDERKTMSEKLKKTEDKLQAASSqlQVEQNKV 532
Cdd:COG3096  1062 DE--LHEELSQNrsrrsqleKQLTR-CEAEMDSLQKRLRKAERDYKQERE--QVVQAKA 1115
CagA_N pfam18971
CagA protein; The Helicobacter pylori type IV secretion effector CagA is a major bacterial ...
246-532 2.05e-03

CagA protein; The Helicobacter pylori type IV secretion effector CagA is a major bacterial virulence determinant and critical for gastric carcinogenesis. X-ray crystallographic analysis of the N-terminal CagA fragment (residues 1-876) revealed that the region has a structure comprised of three discrete domains. Domain I constitutes a mobile CagA N terminus, while Domain II tethers CagA to the plasma membrane by interacting with membrane phosphatidylserine. Domain III interacts intramolecularly with the intrinsically disordered C-terminal region, and this interaction potentiates the pathogenic scaffold/hub function of CagA.


Pssm-ID: 408741 [Multi-domain]  Cd Length: 876  Bit Score: 42.45  E-value: 2.05e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   246 QQRLTAQ-LTLQRQK--IQELTTNAKETHTKL-----ALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNED 317
Cdd:pfam18971  562 ENKLTAKgLSLQEANklIKDFLSSNKELAGKAlnfnkAVAEAKSTGNYDEVKKAQKDLEKSLRKREHLEKEVEKKLESKS 641
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   318 SQNRQLQQKLAALSrQIDEL-----EETNR---------SLRKAEEELQDIKEKISKgeygnagimaEVEELRKRVLDME 383
Cdd:pfam18971  642 GNKNKMEAKAQANS-QKDEIfalinKEANRdaraiaytqNLKGIKRELSDKLEKISK----------DLKDFSKSFDEFK 710
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   384 -GKDEELIKMEEQCRDLNKRLeretlqsKDFKLEVEKLSK---RIMALEKLEDAFNKSKQECYSLKCNLE---KERMTTK 456
Cdd:pfam18971  711 nGKNKDFSKAEETLKALKGSV-------KDLGINPEWISKvenLNAALNEFKNGKNKDFSKVTQAKSDLEnsvKDVIINQ 783
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847   457 QLSQELESLKVRIKELEAIE--SRLEKTeftlkedLTKLKTLTvmfvdeRKTMSEKLKKTEDKLQAASSQL-QVEQNKV 532
Cdd:pfam18971  784 KVTDKVDNLNQAVSVAKAMGdfSRVEQV-------LADLKNFS------KEQLAQQAQKNEDFNTGKNSELyQSVKNSV 849
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
321-576 2.29e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 42.21  E-value: 2.29e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  321 RQLQQKLAALSRQIDELEETNRSLRKAEEE---LQDIKEKisKGEYGNAGIMAEVEELRKRVLDMEGKDEELIKMEEQCR 397
Cdd:COG4913   221 PDTFEAADALVEHFDDLERAHEALEDAREQielLEPIREL--AERYAAARERLAELEYLRAALRLWFAQRRLELLEAELE 298
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  398 DLNKRLERETLQSKDFKLEVEKLSKRIMALEK-LEDAFNKSKQEcyslkcnLEKERmttKQLSQELESLKVRIKELEAIE 476
Cdd:COG4913   299 ELRAELARLEAELERLEARLDALREELDELEAqIRGNGGDRLEQ-------LEREI---ERLERELEERERRRARLEALL 368
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  477 SRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQnkvttvtEKLIEETKRALKSKTDVEEK 556
Cdd:COG4913   369 AALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRREL-------RELEAEIASLERRKSNIPAR 441
                         250       260
                  ....*....|....*....|
gi 109659847  557 MYSVtkeRDDLKNKLKAEEE 576
Cdd:COG4913   442 LLAL---RDALAEALGLDEA 458
PRK10246 PRK10246
exonuclease subunit SbcC; Provisional
244-480 2.56e-03

exonuclease subunit SbcC; Provisional


Pssm-ID: 182330 [Multi-domain]  Cd Length: 1047  Bit Score: 42.09  E-value: 2.56e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  244 DEQQRLTAQLTLQRQKIQELTTNAKEThtkLALAEARVQEEEQKATRLEKELQTQTTKFHQDQD------------TIMA 311
Cdd:PRK10246  297 ERIQEQSAALAHTRQQIEEVNTRLQST---MALRARIRHHAAKQSAELQAQQQSLNTWLAEHDRfrqwnnelagwrAQFS 373
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  312 KLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKaeeelQDIKEKISKgeygnagiMAEVEELRKRvldmegkdeeLIK 391
Cdd:PRK10246  374 QQTSDREQLRQWQQQLTHAEQKLNALPAITLTLTA-----DEVAAALAQ--------HAEQRPLRQR----------LVA 430
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  392 MEEQCRDLNKRLERetlqskdfkleveklskrimalekLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKV---- 467
Cdd:PRK10246  431 LHGQIVPQQKRLAQ------------------------LQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVKTiceq 486
                         250
                  ....*....|....*
gi 109659847  468 --RIKELEAIESRLE 480
Cdd:PRK10246  487 eaRIKDLEAQRAQLQ 501
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
198-728 2.62e-03

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 42.08  E-value: 2.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   198 QECERLKKLIDQEIksqeekeqekekrvTTLKEELTklksfalmvvdEQQRLTAQLTLQRQKIQElttnakethtKLALA 277
Cdd:pfam01576  204 QELEKAKRKLEGES--------------TDLQEQIA-----------ELQAQIAELRAQLAKKEE----------ELQAA 248
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   278 EARVQEEEQKATRLEK---ELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQI-DELEETNrslrkAEEELQD 353
Cdd:pfam01576  249 LARLEEETAQKNNALKkirELEAQISELQEDLESERAARNKAEKQRRDLGEELEALKTELeDTLDTTA-----AQQELRS 323
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   354 IKEkiskgeygnagimAEVEELRKRVldmegkDEELIKMEEQCRDLNKRlERETLQSKDFKLEVEKLSKriMALEKLEDA 433
Cdd:pfam01576  324 KRE-------------QEVTELKKAL------EEETRSHEAQLQEMRQK-HTQALEELTEQLEQAKRNK--ANLEKAKQA 381
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   434 FNKSKQEcyslkcnLEKERMTTKQLSQELE----SLKVRIKELEAIESRLEKTEFTLKEDLTKLKTltvmfvdERKTMSE 509
Cdd:pfam01576  382 LESENAE-------LQAELRTLQQAKQDSEhkrkKLEGQLQELQARLSESERQRAELAEKLSKLQS-------ELESVSS 447
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   510 KLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTdveeKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLK 589
Cdd:pfam01576  448 LLNEAEGKNIKLSKDVSSLESQLQDTQELLQEETRQKLNLST----RLRQLEDERNSLQEQLEEEEEAKRNVERQLSTLQ 523
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   590 NRLQSleaiekdfLKNKLNQDSGksttALHQENNKIKELSQEVERLKLKLKDMKAIEDDLMKT----EDEYETLERRYAN 665
Cdd:pfam01576  524 AQLSD--------MKKKLEEDAG----TLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEKTknrlQQELDDLLVDLDH 591
                          490       500       510       520       530       540
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 109659847   666 ERDKAQFLSKELEHVKMELAKYKLAEKTETSHEQWLFKRLQEEEAKSGHLSREVDALKEKIHE 728
Cdd:pfam01576  592 QRQLVSNLEKKQKKFDQMLAEEKAISARYAEERDRAEAEAREKETRALSLARALEEALEAKEE 654
46 PHA02562
endonuclease subunit; Provisional
486-724 2.66e-03

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 41.92  E-value: 2.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  486 LKEDLTKLKTLTVMfvderktmsEKLKKteDKLQAASSQLQVEQNKVTTVTEKL------IEETKRalKSKTDVEEKMYS 559
Cdd:PHA02562  155 LVEDLLDISVLSEM---------DKLNK--DKIRELNQQIQTLDMKIDHIQQQIktynknIEEQRK--KNGENIARKQNK 221
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  560 VTKERDDLKNkLKAEEEKGNDLLSRVNM------------------LKNRLQSLEAIEKDFLKNklnQDSGKSTTALHQE 621
Cdd:PHA02562  222 YDELVEEAKT-IKAEIEELTDELLNLVMdiedpsaalnklntaaakIKSKIEQFQKVIKMYEKG---GVCPTCTQQISEG 297
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  622 NNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEY--------------ETLERRYANERDKAQFLSKELEHVKMELAKY 687
Cdd:PHA02562  298 PDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFneqskkllelknkiSTNKQSLITLVDKAKKVKAAIEELQAEFVDN 377
                         250       260       270
                  ....*....|....*....|....*....|....*..
gi 109659847  688 KLAEKTETSHEQWLFKRLQEEEAKSGHLSREVDALKE 724
Cdd:PHA02562  378 AEELAKLQDELDKIVKTKSELVKEKYHRGIVTDLLKD 414
DUF4407 pfam14362
Domain of unknown function (DUF4407); This family of proteins is found in bacteria. Proteins ...
292-433 3.19e-03

Domain of unknown function (DUF4407); This family of proteins is found in bacteria. Proteins in this family are typically between 366 and 597 amino acids in length. There is a single completely conserved residue R that may be functionally important.


Pssm-ID: 464151 [Multi-domain]  Cd Length: 295  Bit Score: 41.08  E-value: 3.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   292 EKELQTQ-TTKFHQDQDTIMAKLTNE-DSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGNAGI- 368
Cdd:pfam14362  105 EKEIDRElLEIQQEEADAAKAQLAAAyRARLAELEAQIAALDAEIDAAEARLDALQAEARCELDGTPGTGTGVPGDGPVa 184
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847   369 ---MAEVEELRKRVLDMEGK-DEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDA 433
Cdd:pfam14362  185 ktkQAQLDAAQAELAALQAQnDARLAALRAELARLTAERAAARARSQAAIDGDDGLLARLEALNRLTTE 253
235kDa-fam TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
138-660 3.38e-03

reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.


Pssm-ID: 130673 [Multi-domain]  Cd Length: 2757  Bit Score: 41.96  E-value: 3.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   138 KPMNELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEfiCLLEQECERLKKLIDQEIKSQEEK 217
Cdd:TIGR01612  917 KKVDEYIKICENTKESIEKFHNKQNILKEILNKNIDTIKESNLIEKSYKDKFDN--TLIDKINELDKAFKDASLNDYEAK 994
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   218 EQEKEKRVTTLKEELTKLKSFAL-MVVDEQQRLTAQLTlqrQKIQELTTNAKETHTKLALAEARVQEEEQKA-------- 288
Cdd:TIGR01612  995 NNELIKYFNDLKANLGKNKENMLyHQFDEKEKATNDIE---QKIEDANKNIPNIEIAIHTSIYNIIDEIEKEigkniell 1071
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   289 -TRLEKELQTQTTKFHQDQDTImaKLTNEDSQNRQLQQKLAALSRQI-DELEETNRSLRKAEEELQDIKEKisKGEYGNA 366
Cdd:TIGR01612 1072 nKEILEEAEINITNFNEIKEKL--KHYNFDDFGKEENIKYADEINKIkDDIKNLDQKIDHHIKALEEIKKK--SENYIDE 1147
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   367 gIMAEVEELRKrVLDMEGKDEELIKMEEQCRDLNKRLEREtlqsKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKC 446
Cdd:TIGR01612 1148 -IKAQINDLED-VADKAISNDDPEEIEKKIENIVTKIDKK----KNIYDEIKKLLNEIAEIEKDKTSLEEVKGINLSYGK 1221
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   447 NLEKerMTTKQLSQELESLKVRIKELEAIESRLEKteftLKEDLTKLKTLTVMFVDERKTMsEKLKKTEDKLQAASSQLQ 526
Cdd:TIGR01612 1222 NLGK--LFLEKIDEEKKKSEHMIKAMEAYIEDLDE----IKEKSPEIENEMGIEMDIKAEM-ETFNISHDDDKDHHIISK 1294
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   527 VEQNKVTTVTEKlieetkrALKSKTDVEEKmysvtKERDDLKNKLK---AEEEKGNdllSRVNMLKNRLQSLEAIEK-DF 602
Cdd:TIGR01612 1295 KHDENISDIREK-------SLKIIEDFSEE-----SDINDIKKELQknlLDAQKHN---SDINLYLNEIANIYNILKlNK 1359
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 109659847   603 LKNKLNqdsgksttalhqennKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLE 660
Cdd:TIGR01612 1360 IKKIID---------------EVKEYTKEIEENNKNIKDELDKSEKLIKKIKDDINLE 1402
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
169-406 3.69e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 41.44  E-value: 3.69e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  169 RQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLidQEIKSQEEKEQEKEKRVTTLKEELTKLKSFAlmvvDEQQR 248
Cdd:COG4913   616 EAELAELEEELAEAEERLEALEAELDALQERREALQRL--AEYSWDEIDVASAEREIAELEAELERLDASS----DDLAA 689
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  249 LTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLA 328
Cdd:COG4913   690 LEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRE 769
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  329 ALSRQIDELEETnrsLRKAEEELQDIKEKIsKGEYGN--AGIMAEVEELRK--RVLDmEGKDEELIKMEEQCRDLNKRLE 404
Cdd:COG4913   770 NLEERIDALRAR---LNRAEEELERAMRAF-NREWPAetADLDADLESLPEylALLD-RLEEDGLPEYEERFKELLNENS 844

                  ..
gi 109659847  405 RE 406
Cdd:COG4913   845 IE 846
PRK01156 PRK01156
chromosome segregation protein; Provisional
372-816 3.72e-03

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 41.43  E-value: 3.72e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  372 VEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKE 451
Cdd:PRK01156  175 IDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRY 254
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  452 RMTTKQLSQELESLKVRIKELEAIESRLEKTE--------------FTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDk 517
Cdd:PRK01156  255 ESEIKTAESDLSMELEKNNYYKELEERHMKIIndpvyknrnyindyFKYKNDIENKKQILSNIDAEINKYHAIIKKLSV- 333
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  518 LQAASSQLQVEQNKvttvteklIEETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVN-MLKNRLQSLE 596
Cdd:PRK01156  334 LQKDYNDYIKKKSR--------YDDLNNQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIERMSAFISeILKIQEIDPD 405
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  597 AIEKDF--LKNKLNQDSGKsTTALHQENNKIKELSQEVERLKLKL--------------------------KDMKAIEDD 648
Cdd:PRK01156  406 AIKKELneINVKLQDISSK-VSSLNQRIRALRENLDELSRNMEMLngqsvcpvcgttlgeeksnhiinhynEKKSRLEEK 484
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  649 LMKTEDEYETLERRYANERDKAQFLSKE------LEHVKMELAKYKLaEKTETSHEQWLFKRLQEEEAKSGHLSREVDAL 722
Cdd:PRK01156  485 IREIEIEVKDIDEKIVDLKKRKEYLESEeinksiNEYNKIESARADL-EDIKIKINELKDKHDKYEEIKNRYKSLKLEDL 563
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  723 KEKIHEY---MATEDLIC--HLQGDHSVLQKKLNQQENRNRDLGREIENLTKELERYrhfSKSLRPSLNGRRISDPQVFS 797
Cdd:PRK01156  564 DSKRTSWlnaLAVISLIDieTNRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYIDKS---IREIENEANNLNNKYNEIQE 640
                         490
                  ....*....|....*....
gi 109659847  798 KEVQTEAVDNEPPDYKSLI 816
Cdd:PRK01156  641 NKILIEKLRGKIDNYKKQI 659
mukB PRK04863
chromosome partition protein MukB;
257-479 4.14e-03

chromosome partition protein MukB;


Pssm-ID: 235316 [Multi-domain]  Cd Length: 1486  Bit Score: 41.48  E-value: 4.14e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  257 RQKIQELTTNAKETHTKLALAEARVQEEEQkATRLEKELQTQTTKfHQDQDTIMAKLTNEDSQnRQLQQKLAALSRQIDE 336
Cdd:PRK04863  448 QAKEQEATEELLSLEQKLSVAQAAHSQFEQ-AYQLVRKIAGEVSR-SEAWDVARELLRRLREQ-RHLAEQLQQLRMRLSE 524
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  337 LEETNRSLRKAEEELQDIKEKISKGeYGNAgimAEVEELRkrvldmEGKDEELIKMEEQCRDLNKRLERETLQSKDFKLE 416
Cdd:PRK04863  525 LEQRLRQQQRAERLLAEFCKRLGKN-LDDE---DELEQLQ------EELEARLESLSESVSEARERRMALRQQLEQLQAR 594
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 109659847  417 VEKLSKRIM-------ALEKLEDAFN---KSKQECYSLKCN-LEKERmttkQLSQELESLKVRIKELEAIESRL 479
Cdd:PRK04863  595 IQRLAARAPawlaaqdALARLREQSGeefEDSQDVTEYMQQlLERER----ELTVERDELAARKQALDEEIERL 664
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
247-352 4.34e-03

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 41.16  E-value: 4.34e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  247 QRLTAQLTLQRQKIQELTTNAKETH-------TKLALAEARVQEEEQKA-TRLEKELQTQTTKFHQDQDTImAKLTNEDS 318
Cdd:COG3206   266 QQLRAQLAELEAELAELSARYTPNHpdvialrAQIAALRAQLQQEAQRIlASLEAELEALQAREASLQAQL-AQLEARLA 344
                          90       100       110
                  ....*....|....*....|....*....|....
gi 109659847  319 QNRQLQQKLAALSRQIDELEETNRSLRKAEEELQ 352
Cdd:COG3206   345 ELPELEAELRRLEREVEVARELYESLLQRLEEAR 378
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
509-726 4.55e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 40.90  E-value: 4.55e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  509 EKLKKTEDKLQAASSQLQVEQNKvttvTEKLIEETKRALKsktDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNML 588
Cdd:COG4942    23 AEAEAELEQLQQEIAELEKELAA----LKKEEKALLKQLA---ALERRIAALARRIRALEQELAALEAELAELEKEIAEL 95
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  589 KNRLQSLEAIEKDFLKN-KLNQDSGKSTTALHQENnkIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANER 667
Cdd:COG4942    96 RAELEAQKEELAELLRAlYRLGRQPPLALLLSPED--FLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAER 173
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847  668 DKAQFLSKELEHVKMELAKYKlAEKTETSHEqwLFKRLQEEEAKSGHLSREVDALKEKI 726
Cdd:COG4942   174 AELEALLAELEEERAALEALK-AERQKLLAR--LEKELAELAAELAELQQEAEELEALI 229
sbcc TIGR00618
exonuclease SbcC; All proteins in this family for which functions are known are part of an ...
148-365 4.56e-03

exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 129705 [Multi-domain]  Cd Length: 1042  Bit Score: 41.11  E-value: 4.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   148 EKHKESYRRILGQLLVAEKSRRQTILELEEEKRK---HKEYMEKSDEFICLLEQECERLKKLIDQEIKSQEEKEQEKEKR 224
Cdd:TIGR00618  658 ERVREHALSIRVLPKELLASRQLALQKMQSEKEQltyWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAR 737
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   225 VTTLKEELTKLKSFALMV----VDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTT 300
Cdd:TIGR00618  738 EDALNQSLKELMHQARTVlkarTEAHFNNNEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDED 817
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109659847   301 KFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGN 365
Cdd:TIGR00618  818 ILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQAKIIQLSDKLNGIN 882
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
95-301 5.32e-03

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 40.88  E-value: 5.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847    95 AEKMDLALLEAQYGFVTPKKVLEALQRDAFQAKSTPWQEdIYEKPMNELDKVVEKHKESYRRIlGQLLVAEKSRRQTILE 174
Cdd:pfam17380  401 ARKVKILEEERQRKIQQQKVEMEQIRAEQEEARQREVRR-LEEERAREMERVRLEEQERQQQV-ERLRQQEEERKRKKLE 478
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   175 LEEEKRKHKEYMEKSDEficLLEQECERLKKLIDQEIKSQEEKEQEKEKRVTTLKEELTKLKSfalmvvdEQQRLTAQLT 254
Cdd:pfam17380  479 LEKEKRDRKRAEEQRRK---ILEKELEERKQAMIEEERKRKLLEKEMEERQKAIYEEERRREA-------EEERRKQQEM 548
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|...
gi 109659847   255 LQRQKIQELTTNAKETHTKLALAEA------RVQEEEQKATRLEKELQTQTTK 301
Cdd:pfam17380  549 EERRRIQEQMRKATEERSRLEAMEReremmrQIVESEKARAEYEATTPITTIK 601
RecN COG0497
DNA repair ATPase RecN [Replication, recombination and repair];
227-433 6.24e-03

DNA repair ATPase RecN [Replication, recombination and repair];


Pssm-ID: 440263 [Multi-domain]  Cd Length: 555  Bit Score: 40.44  E-value: 6.24e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  227 TLKEELTKLKSFALMVVDEQQRLTAQLT------LQRQKIQELTT------NAKETHTKLALAEARVQEEEQKATRLEKE 294
Cdd:COG0497   169 ALKKELEELRADEAERARELDLLRFQLEeleaaaLQPGEEEELEEerrrlsNAEKLREALQEALEALSGGEGGALDLLGQ 248
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  295 LQTQttkfhqdqdtiMAKLTNEDSQnrqLQQKLAALSRQIDELEETNRSLRKA-------EEELQDIKEKIS-----KGE 362
Cdd:COG0497   249 ALRA-----------LERLAEYDPS---LAELAERLESALIELEEAASELRRYldslefdPERLEEVEERLAllrrlARK 314
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 109659847  363 YGN--AGIMAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETlqskdfklevEKLSK-RIMALEKLEDA 433
Cdd:COG0497   315 YGVtvEELLAYAEELRAELAELENSDERLEELEAELAEAEAELLEAA----------EKLSAaRKKAAKKLEKA 378
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
372-669 6.30e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 40.80  E-value: 6.30e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  372 VEELRKRVL----DMEGKDEELIKMEEqcrDLNKRLERETLQSKDFKLEVEK------LSKRI--MALEKLEdafnkskq 439
Cdd:PTZ00108 1037 VKELKKLGYvrfkDIIKKKSEKITAEE---EEGAEEDDEADDEDDEEELGAAvsydylLSMPIwsLTKEKVE-------- 1105
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  440 ecyslkcNLEKERMTTKQLSQELESLKVR---IKELEAIESRLEKTEFTLKEDLTKLKTLTVM-FVDERKTMSEKLKKTE 515
Cdd:PTZ00108 1106 -------KLNAELEKKEKELEKLKNTTPKdmwLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKtKGKASKLRKPKLKKKE 1178
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  516 DKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSL 595
Cdd:PTZ00108 1179 KKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDND 1258
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 109659847  596 EAIEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKD--MKAIEDDLMKTEDEYE--TLERRYANERDK 669
Cdd:PTZ00108 1259 EFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSptKKKVKKRLEGSLAALKkkKKSEKKTARKKK 1336
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
507-671 6.55e-03

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 40.20  E-value: 6.55e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  507 MSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKlIEETKRALKS-KTDVEEKMYSVTKERDDLKNKLKAEEEKGN------ 579
Cdd:COG3883    28 LQAELEAAQAELDALQAELEELNEEYNELQAE-LEALQAEIDKlQAEIAEAEAEIEERREELGERARALYRSGGsvsyld 106
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  580 ---------DLLSRVNMLK----NRLQSLEAIEKDflKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIE 646
Cdd:COG3883   107 vllgsesfsDFLDRLSALSkiadADADLLEELKAD--KAELEAKKAELEAKLAELEALKAELEAAKAELEAQQAEQEALL 184
                         170       180
                  ....*....|....*....|....*
gi 109659847  647 DDLMKTEDEYETLERRYANERDKAQ 671
Cdd:COG3883   185 AQLSAEEAAAEAQLAELEAELAAAE 209
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
624-775 7.30e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 40.44  E-value: 7.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   624 KIKELSQEVERL---KLKLKDMKAIEDDLMKTEdEYETLERRYANERDKAQfLSKELEHVKMELAKY--KLAEKTETSHE 698
Cdd:TIGR02169  192 IIDEKRQQLERLrreREKAERYQALLKEKREYE-GYELLKEKEALERQKEA-IERQLASLEEELEKLteEISELEKRLEE 269
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   699 qwLFKRLQEEEAKSGHL-SREVDALKEKIHEYMA----TEDLICHLQGDHSVLQKKLNQQENRNRDLGREIENLTKELER 773
Cdd:TIGR02169  270 --IEQLLEELNKKIKDLgEEEQLRVKEKIGELEAeiasLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEE 347

                   ..
gi 109659847   774 YR 775
Cdd:TIGR02169  348 ER 349
ClyA_Cry6Aa-like cd22656
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ...
247-451 7.80e-03

Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.


Pssm-ID: 439154 [Multi-domain]  Cd Length: 309  Bit Score: 39.66  E-value: 7.80e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  247 QRLTAQLTLQRQKIQELTTNAKETHTKLalaearvqeeeqkaTRLEKELQTQTTKFHQDQDTIMAKLTNEDSqnRQLQQK 326
Cdd:cd22656   117 KTIKALLDDLLKEAKKYQDKAAKVVDKL--------------TDFENQTEKDQTALETLEKALKDLLTDEGG--AIARKE 180
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  327 LAALSRQIDELEETNrsLRKAEEELQDIKEKISKGEygnagimAEVEELRKRVLDMEgkdeeliKMEEQCRDLNkrlere 406
Cdd:cd22656   181 IKDLQKELEKLNEEY--AAKLKAKIDELKALIADDE-------AKLAAALRLIADLT-------AADTDLDNLL------ 238
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 109659847  407 tlqskdfklevEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKE 451
Cdd:cd22656   239 -----------ALIGPAIPALEKLQGAWQAIATDLDSLKDLLEDD 272
CortBP2 pfam09727
Cortactin-binding protein-2; This entry is the first approximately 250 residues of ...
387-505 8.88e-03

Cortactin-binding protein-2; This entry is the first approximately 250 residues of cortactin-binding protein 2. In addition to being a positional candidate for autism this protein is expressed at highest levels in the brain in humans. The human protein has six associated ankyrin repeat domains pfam00023 towards the C-terminus which act as protein-protein interaction domains.


Pssm-ID: 462860 [Multi-domain]  Cd Length: 187  Bit Score: 38.74  E-value: 8.88e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847   387 EELIKMEEQCRDLNKRLERETLqskdfklEVEKLSKRimALEKLEDafNKSKQECYSLK-----CNLEKERmttKQLSQE 461
Cdd:pfam09727   80 AELEKLVEKQRETQRRMLEQLA-------AAEKRHRR--VIRELEE--EKRKHARDTAQgddftYLLEKER---ERLKQE 145
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 109659847   462 LESLKVRIKELEAiesRLEKTEFTLKEDLTKLKTLTVMFVDERK 505
Cdd:pfam09727  146 LEQEKAQQKRLEK---ELKKLLEKLEEELSKQKQIALLLVKERK 186
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
316-671 9.67e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 40.00  E-value: 9.67e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  316 EDSQNRQLQQKLAALSRQIdeLEETNRSLRKAEEEL-QDIKEKISkgeygnagimAEVEELRKRVLDMEGKDEELIKMEE 394
Cdd:NF033838   82 KHTQNVALNKKLSDIKTEY--LYELNVLKEKSEAELtSKTKKELD----------AAFEQFKKDTLEPGKKVAEATKKVE 149
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  395 QCRDLNKRLERE------TLQSKDFKLEVEKLSKRImalekledafNKSKQECYSLKCNLEKERMTTKQLSQELESLKVR 468
Cdd:NF033838  150 EAEKKAKDQKEEdrrnypTNTYKTLELEIAESDVEV----------KKAELELVKEEAKEPRDEEKIKQAKAKVESKKAE 219
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  469 IKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALK 548
Cdd:NF033838  220 ATRLEKIKTDREKAEEEAKRRADAKLKEAVEKNVATSEQDKPKRRAKRGVLGEPATPDKKENDAKSSDSSVGEETLPSPS 299
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847  549 SKTdvEEKMYSVTKERDDLKNKLKAEEEKgndllSRVNMLKNRLQSL--EAIEKDFLKNKLNQDSGKSTTALHQENNKIK 626
Cdd:NF033838  300 LKP--EKKVAEAEKKVEEAKKKAKDQKEE-----DRRNYPTNTYKTLelEIAESDVKVKEAELELVKEEAKEPRNEEKIK 372
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|....*
gi 109659847  627 ELSQEVERLKLKLKDMKAIEDDLMKTEDEyetlERRYANERDKAQ 671
Cdd:NF033838  373 QAKAKVESKKAEATRLEKIKTDRKKAEEE----AKRKAAEEDKVK 413
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH