|
Name |
Accession |
Description |
Interval |
E-value |
| CortBP2 |
pfam09727 |
Cortactin-binding protein-2; This entry is the first approximately 250 residues of ... |
66-248 |
3.77e-64 |
|
Cortactin-binding protein-2; This entry is the first approximately 250 residues of cortactin-binding protein 2. In addition to being a positional candidate for autism this protein is expressed at highest levels in the brain in humans. The human protein has six associated ankyrin repeat domains pfam00023 towards the C-terminus which act as protein-protein interaction domains.
Pssm-ID: 462860 [Multi-domain] Cd Length: 187 Bit Score: 215.16 E-value: 3.77e-64
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 66 DLSRDDLLFLLSILEGELQARDEVIGILKAEKMDLALLEAQYGFVTPKKVLEALQRDAFQAKSTPWQEDIYE----KPMN 141
Cdd:pfam09727 1 DLSKDDLLKLLSILEGELQARDIVIAVLKAEKVKQLLLEARYGFKYPSDPLLALQRDSELLRDQSQDEDVYEamyeKPLA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 142 ELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLIDQEIKSQEEKEQEK 221
Cdd:pfam09727 81 ELEKLVEKQRETQRRMLEQLAAAEKRHRRVIRELEEEKRKHARDTAQGDDFTYLLEKERERLKQELEQEKAQQKRLEKEL 160
|
170 180
....*....|....*....|....*..
gi 109659847 222 EKRVTTLKEELTKLKSFALMVVDEQQR 248
Cdd:pfam09727 161 KKLLEKLEEELSKQKQIALLLVKERKR 187
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
224-773 |
1.04e-18 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 92.43 E-value: 1.04e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 224 RVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFH 303
Cdd:TIGR02168 233 RLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLA 312
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 304 QDQDTIM---AKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKIskgEYGNAGIMAEVEELRKRVL 380
Cdd:TIGR02168 313 NLERQLEeleAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEEL---ESRLEELEEQLETLRSKVA 389
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 381 DMEGK----DEELIKMEEQCRDLNKRLERET---------LQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCN 447
Cdd:TIGR02168 390 QLELQiaslNNEIERLEARLERLEDRRERLQqeieellkkLEEAELKELQAELEELEEELEELQEELERLEEALEELREE 469
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 448 LEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMF--------VDE--RKTMSEKL------ 511
Cdd:TIGR02168 470 LEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILgvlselisVDEgyEAAIEAALggrlqa 549
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 512 ---KKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKrALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNML 588
Cdd:TIGR02168 550 vvvENLNAAKKAIAFLKQNELGRVTFLPLDSIKGTE-IQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGVLVV 628
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 589 KNRLQSLE-AIEKDFLKNKLNQD--------------SGKSTTALHQENN-------------KIKELSQEVERLKLKLK 640
Cdd:TIGR02168 629 DDLDNALElAKKLRPGYRIVTLDgdlvrpggvitggsAKTNSSILERRREieeleekieeleeKIAELEKALAELRKELE 708
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 641 DMKAIEDDLMKTEDEYET----LERRYANERDKAQFLSKELEHVKMELAKYKLAEKTETSHEQWLFKRLQEEEAKSGHLS 716
Cdd:TIGR02168 709 ELEEELEQLRKELEELSRqisaLRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELE 788
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|....*..
gi 109659847 717 REVDALKEkihEYMATEDLICHLQGDHSVLQKKLNQQENRNRDLGREIENLTKELER 773
Cdd:TIGR02168 789 AQIEQLKE---ELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLED 842
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
224-775 |
6.33e-17 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 86.53 E-value: 6.33e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 224 RVTTLKEELTKLKSFALmvVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFH 303
Cdd:COG1196 214 RYRELKEELKELEAELL--LLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEY 291
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 304 QDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVLDME 383
Cdd:COG1196 292 ELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAE 371
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 384 GKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEK--------LSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTT 455
Cdd:COG1196 372 AELAEAEEELEELAEELLEALRAAAELAAQLEELEEaeeallerLERLEEELEELEEALAELEEEEEEEEEALEEAAEEE 451
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 456 KQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSE-----KLKKTEDKLQAASSQLQVEQn 530
Cdd:COG1196 452 AELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGflegvKAALLLAGLRGLAGAVAVLI- 530
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 531 KVTTVTEKLIEETKRALkSKTDVEEKMYSVTKERDDLK------------NKLKAEEEKGNDLLSRVNMLKNRLQSLEAI 598
Cdd:COG1196 531 GVEAAYEAALEAALAAA-LQNIVVEDDEVAAAAIEYLKaakagratflplDKIRARAALAAALARGAIGAAVDLVASDLR 609
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 599 EKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELE 678
Cdd:COG1196 610 EADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLA 689
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 679 HVKMELAKYKLAEKTETSHEQWLFKRLQEEEAKSGHLSREVDALKEKIHEYMATEDLICHLQG--------DHSVLQKKL 750
Cdd:COG1196 690 EEELELEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEAleelpeppDLEELEREL 769
|
570 580
....*....|....*....|....*....
gi 109659847 751 NQQENRNRDLG----REIENLTKELERYR 775
Cdd:COG1196 770 ERLEREIEALGpvnlLAIEEYEELEERYD 798
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
142-731 |
1.02e-16 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 85.50 E-value: 1.02e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 142 ELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKR------KHKEYMEKSDEFICLLEQECERLKKLIDQ------ 209
Cdd:PRK03918 190 NIEELIKEKEKELEEVLREINEISSELPELREELEKLEKevkeleELKEEIEELEKELESLEGSKRKLEEKIREleerie 269
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 210 EIKSQEEKEQEKEKRVTTLK---EELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKlalaEARVQEEEQ 286
Cdd:PRK03918 270 ELKKEIEELEEKVKELKELKekaEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEK----EERLEELKK 345
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 287 KATRLEKELQtQTTKFHQDQDTIMAKLTNEdsqnRQLQQKLAALsrqidELEETNRSLRKAEEELQDIKEKISKgeygna 366
Cdd:PRK03918 346 KLKELEKRLE-ELEERHELYEEAKAKKEEL----ERLKKRLTGL-----TPEKLEKELEELEKAKEEIEEEISK------ 409
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 367 gIMAEVEELRKRVLDMEGKDEELIKMEEQC----RDLNKRLERETLqsKDFKLEVEKLSKRimaLEKLEDAFNKSKQECY 442
Cdd:PRK03918 410 -ITARIGELKKEIKELKKAIEELKKAKGKCpvcgRELTEEHRKELL--EEYTAELKRIEKE---LKEIEEKERKLRKELR 483
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 443 SLKCNLEKERmttkqlsqELESLKVRIKELEAIESRLEKTeftlkeDLTKLKtltvmfvdERKTMSEKLKKTEDKLQAAS 522
Cdd:PRK03918 484 ELEKVLKKES--------ELIKLKELAEQLKELEEKLKKY------NLEELE--------KKAEEYEKLKEKLIKLKGEI 541
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 523 SQLqveqnkvttvtEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLkaeEEKGndlLSRVNMLKNRLQSLEAIEKDF 602
Cdd:PRK03918 542 KSL-----------KKELEKLEELKKKLAELEKKLDELEEELAELLKEL---EELG---FESVEELEERLKELEPFYNEY 604
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 603 LKNK--------LNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAI--EDDLMKTEDEYETLERRYANERDKAQF 672
Cdd:PRK03918 605 LELKdaekelerEEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKysEEEYEELREEYLELSRELAGLRAELEE 684
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847 673 LSKELEHVKMELAKYKlAEKTEtsheqwlFKRLQEEEAKSGHLSREVDALKEKIHEYMA 731
Cdd:PRK03918 685 LEKRREEIKKTLEKLK-EELEE-------REKAKKELEKLEKALERVEELREKVKKYKA 735
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
321-772 |
4.14e-15 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 80.49 E-value: 4.14e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 321 RQLQQKLAALSRQIDELEET-------NRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELR---KRVLDMEGKDEELI 390
Cdd:PRK03918 217 PELREELEKLEKEVKELEELkeeieelEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEekvKELKELKEKAEEYI 296
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 391 KMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLE------KERMTTKQLSQELES 464
Cdd:PRK03918 297 KLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEeleerhELYEEAKAKKEELER 376
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 465 LKVRIK--ELEAIESRLEKTEFTLKEDLTKLKTLTVMF------VDERKTMSEKLKKTEDKLQAASSQLQVEQNKvtTVT 536
Cdd:PRK03918 377 LKKRLTglTPEKLEKELEELEKAKEEIEEEISKITARIgelkkeIKELKKAIEELKKAKGKCPVCGRELTEEHRK--ELL 454
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 537 EKLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEE--KGNDLLSRVNMLKNRLQS--LEAIEKDF-----LKNKL 607
Cdd:PRK03918 455 EEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESEliKLKELAEQLKELEEKLKKynLEELEKKAeeyekLKEKL 534
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 608 NQDSGKSTTaLHQENNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDE-----YETLERR------YANERDKAQFLSKE 676
Cdd:PRK03918 535 IKLKGEIKS-LKKELEKLEELKKKLAELEKKLDELEEELAELLKELEElgfesVEELEERlkelepFYNEYLELKDAEKE 613
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 677 LEHVKMELAkyKLAEKTETSheqwlFKRLQEEEAKSGHLSREVDALKEKI--HEYMATEDLICHLQGDHSVLQKKLNQQE 754
Cdd:PRK03918 614 LEREEKELK--KLEEELDKA-----FEELAETEKRLEELRKELEELEKKYseEEYEELREEYLELSRELAGLRAELEELE 686
|
490
....*....|....*...
gi 109659847 755 NRNRDLGREIENLTKELE 772
Cdd:PRK03918 687 KRREEIKKTLEKLKEELE 704
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
148-773 |
1.14e-14 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 79.34 E-value: 1.14e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 148 EKHKESYRRILGQLLVAEKSRRQTILELEEEKRKhkeyMEKSDEFICLLEQECERLKKLIDQEIKSQEEKEQEKEKRVTT 227
Cdd:TIGR02169 219 EKREYEGYELLKEKEALERQKEAIERQLASLEEE----LEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKE 294
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 228 LKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLalaEARVQEEEQKATRLEKELQTQTTKFhqdqD 307
Cdd:TIGR02169 295 KIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEEL---EREIEEERKRRDKLTEEYAELKEEL----E 367
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 308 TIMAKLTNEDSQNR-------QLQQKLAALSRQIDELEETNRSL----RKAEEELQDIKEKISKGEYGNAGIMAEVEELR 376
Cdd:TIGR02169 368 DLRAELEEVDKEFAetrdelkDYREKLEKLKREINELKRELDRLqeelQRLSEELADLNAAIAGIEAKINELEEEKEDKA 447
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 377 KRVLDMEGKDEELIKMEEqcrDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTK 456
Cdd:TIGR02169 448 LEIKKQEWKLEQLAADLS---KYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAVEEVLKASIQGVH 524
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 457 QLSQELesLKVRIKELEAIES----RLE----KTEFTLKEDLTKLKT-----LTVMFVDERKTMSEKLKKT--------- 514
Cdd:TIGR02169 525 GTVAQL--GSVGERYATAIEVaagnRLNnvvvEDDAVAKEAIELLKRrkagrATFLPLNKMRDERRDLSILsedgvigfa 602
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 515 ------EDKLQAASSQ-----LQVE----------QNKVTTVTEKLIEET----------KRALKSKTDVEEKMYSVTKE 563
Cdd:TIGR02169 603 vdlvefDPKYEPAFKYvfgdtLVVEdieaarrlmgKYRMVTLEGELFEKSgamtggsrapRGGILFSRSEPAELQRLRER 682
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 564 RDDLKNKLKAEEEKGNDLLSRVNMLKNRL----QSLEAIEKDflKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKL 639
Cdd:TIGR02169 683 LEGLKRELSSLQSELRRIENRLDELSQELsdasRKIGEIEKE--IEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSEL 760
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 640 KDMKA----IEDDLMKTEDEYETLERRYANERdkAQFLSKELEHVKMELAKYKLAEKTETSHEQWLFKRLQEEEAKSGHL 715
Cdd:TIGR02169 761 KELEArieeLEEDLHKLEEALNDLEARLSHSR--IPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQEL 838
|
650 660 670 680 690
....*....|....*....|....*....|....*....|....*....|....*...
gi 109659847 716 SREVDALKEKIHEYMATEDLichLQGDHSVLQKKLNQQENRNRDLGREIENLTKELER 773
Cdd:TIGR02169 839 QEQRIDLKEQIKSIEKEIEN---LNGKKEELEEELEELEAALRDLESRLGDLKKERDE 893
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
306-772 |
2.89e-14 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 77.80 E-value: 2.89e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 306 QDTIMAKLTNEDSQNRQLQQKLaalsrQIDELEETNRSLRKAEEELQDIKEKISKgeygnagIMAEVEELRKRVLDMEgk 385
Cdd:PRK03918 134 QGEIDAILESDESREKVVRQIL-----GLDDYENAYKNLGEVIKEIKRRIERLEK-------FIKRTENIEELIKEKE-- 199
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 386 dEELIKMEEQCRDLNKRLEretlqskDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESL 465
Cdd:PRK03918 200 -KELEEVLREINEISSELP-------ELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEEL 271
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 466 KVRIKELEAIESRLEKTEftlkedltklktltvmfvdERKTMSEKLKKTEDKLQAASSQLQVEQNKVttvtEKLIEETKR 545
Cdd:PRK03918 272 KKEIEELEEKVKELKELK-------------------EKAEEYIKLSEFYEEYLDELREIEKRLSRL----EEEINGIEE 328
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 546 ALKSKTDVEEKMYSVTKERDDLKNKLkAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKSTTALHQENNKI 625
Cdd:PRK03918 329 RIKELEEKEERLEELKKKLKELEKRL-EELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEI 407
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 626 KELSQEVERLKLKLKDMKAIEDDLMK------------TEDEYETLERRY-------ANERDKAQFLSKELEHVKMELAK 686
Cdd:PRK03918 408 SKITARIGELKKEIKELKKAIEELKKakgkcpvcgrelTEEHRKELLEEYtaelkriEKELKEIEEKERKLRKELRELEK 487
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 687 yKLAEKTETSHEQWLFKRLQEEEAKSGHLSREvdALKEKIHEYMATEDLICHLQGDHSVLQ---KKLNQQENRNRDLGRE 763
Cdd:PRK03918 488 -VLKKESELIKLKELAEQLKELEEKLKKYNLE--ELEKKAEEYEKLKEKLIKLKGEIKSLKkelEKLEELKKKLAELEKK 564
|
....*....
gi 109659847 764 IENLTKELE 772
Cdd:PRK03918 565 LDELEEELA 573
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
142-598 |
3.06e-14 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 77.77 E-value: 3.06e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 142 ELDKVVEKHKESyRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLK---KLIDQEIKSQEEKE 218
Cdd:PRK02224 238 EADEVLEEHEER-REELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLaeaGLDDADAEAVEARR 316
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 219 QEKEKRVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQ 298
Cdd:PRK02224 317 EELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEEL 396
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 299 TTKFhqdqDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYG----NAGIMAEVEE 374
Cdd:PRK02224 397 RERF----GDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALLEAGKCPECGqpveGSPHVETIEE 472
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 375 LRKRVLDMEgkdEELIKMEEQCRDLNKRLER-----------ETLQSK---------DFKLEVEKLSKRIMAL----EKL 430
Cdd:PRK02224 473 DRERVEELE---AELEDLEEEVEEVEERLERaedlveaedriERLEERredleeliaERRETIEEKRERAEELreraAEL 549
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 431 EDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEfTLKEDLTKLKtltvmfvDERKTMSEK 510
Cdd:PRK02224 550 EAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLERIRTLLAAIA-DAEDEIERLR-------EKREALAEL 621
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 511 LKKTEDKLQAAS---SQL--QVEQNKVttvtEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNK---LKAEEEKGNDLL 582
Cdd:PRK02224 622 NDERRERLAEKRerkRELeaEFDEARI----EEAREDKERAEEYLEQVEEKLDELREERDDLQAEigaVENELEELEELR 697
|
490
....*....|....*.
gi 109659847 583 SRVNMLKNRLQSLEAI 598
Cdd:PRK02224 698 ERREALENRVEALEAL 713
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
276-691 |
3.23e-14 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 77.79 E-value: 3.23e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 276 LAEARVQEEEQKATRLEKELQTQTTKFHQDQDTI-----MAKLTNEDSQNRQ-LQQKLAALSRQIDELEETNRSLRKaee 349
Cdd:TIGR02168 622 LGGVLVVDDLDNALELAKKLRPGYRIVTLDGDLVrpggvITGGSAKTNSSILeRRREIEELEEKIEELEEKIAELEK--- 698
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 350 ELQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEgkdEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRimaLEK 429
Cdd:TIGR02168 699 ALAELRKELEELEEELEQLRKELEELSRQISALR---KDLARLEAEVEQLEERIAQLSKELTELEAEIEELEER---LEE 772
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 430 LEDAFNKSKQEcyslkcnLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSE 509
Cdd:TIGR02168 773 AEEELAEAEAE-------IEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEE 845
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 510 KLKKTEDKLQAASSqlqvEQNKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLL------- 582
Cdd:TIGR02168 846 QIEELSEDIESLAA----EIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRreleelr 921
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 583 SRVNMLKNRLQSLEAiEKDFLKNKLNQDsGKSTTALHQEN-----NKIKELSQEVERLKLKLKDMKAIEddlMKTEDEYE 657
Cdd:TIGR02168 922 EKLAQLELRLEGLEV-RIDNLQERLSEE-YSLTLEEAEALenkieDDEEEARRRLKRLENKIKELGPVN---LAAIEEYE 996
|
410 420 430
....*....|....*....|....*....|....
gi 109659847 658 TLERRYanerdkaQFLSKELEHVkmELAKYKLAE 691
Cdd:TIGR02168 997 ELKERY-------DFLTAQKEDL--TEAKETLEE 1021
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
164-799 |
1.37e-13 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 75.95 E-value: 1.37e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 164 AEKSRRQTILELEEEkRKHKEYMEKSDEFICLLEQECERLKKLIDQEIKSQEEKEQEKEKRVT-TLKEELTKLKSFALMV 242
Cdd:PTZ00121 1100 AEEAKKTETGKAEEA-RKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEdARKAEEARKAEDAKKA 1178
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 243 VDEQQRLTAQLTLQRQKIQEL--TTNAKETHTKLALAEARVQEEEQKATRLEKelqTQTTKFHQDQDTIMAKLTNEDSQN 320
Cdd:PTZ00121 1179 EAARKAEEVRKAEELRKAEDArkAEAARKAEEERKAEEARKAEDAKKAEAVKK---AEEAKKDAEEAKKAEEERNNEEIR 1255
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 321 RQLQQKLAALSRQIDELEETNRslRKAEEeLQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEGKDEELIKMEE---QCR 397
Cdd:PTZ00121 1256 KFEEARMAHFARRQAAIKAEEA--RKADE-LKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEakkKAD 1332
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 398 DLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEK---ERMTTKQLSQELESLKVRIKELEA 474
Cdd:PTZ00121 1333 AAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKkaeEKKKADEAKKKAEEDKKKADELKK 1412
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 475 IESRLEKTEFTLK--EDLTKLKTLTVMfVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTD 552
Cdd:PTZ00121 1413 AAAAKKKADEAKKkaEEKKKADEAKKK-AEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKK 1491
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 553 VEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKSTTALH--QENNKIKELSQ 630
Cdd:PTZ00121 1492 AEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKkaEEKKKAEEAKK 1571
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 631 EVERLKLKLK---DMKAIE----DDLMKTEDEYETLE----RRYANERDKAQFLSKElEHVKMELAKYKLAEKTETSHEQ 699
Cdd:PTZ00121 1572 AEEDKNMALRkaeEAKKAEeariEEVMKLYEEEKKMKaeeaKKAEEAKIKAEELKKA-EEEKKKVEQLKKKEAEEKKKAE 1650
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 700 WLFKRLQEEEAKSGHLSREVDALKEKIHEYMATEDlichlqgdhsvLQKKLNQQENRNRDLGREIENLTKELERYRHFSK 779
Cdd:PTZ00121 1651 ELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEE-----------DEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAE 1719
|
650 660
....*....|....*....|
gi 109659847 780 SLRPSLNGRRISDPQVFSKE 799
Cdd:PTZ00121 1720 ELKKAEEENKIKAEEAKKEA 1739
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
113-775 |
2.20e-13 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 75.09 E-value: 2.20e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 113 KKVLEALQRDAFQAKStpwQEDIYEKPMNELDKVVEKHKESYRRILGQLLVAeksrRQTILELEEEKRKHKEYMEKSDEF 192
Cdd:TIGR02168 245 QEELKEAEEELEELTA---ELQELEEKLEELRLEVSELEEEIEELQKELYAL----ANEISRLEQQKQILRERLANLERQ 317
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 193 ICLLEQECERL--KKLIDQEI-KSQEEKEQEKEKRVTTLKEELTKLKsfalmvvDEQQRLTAQLTLQRQKIQELTTNAKE 269
Cdd:TIGR02168 318 LEELEAQLEELesKLDELAEElAELEEKLEELKEELESLEAELEELE-------AELEELESRLEELEEQLETLRSKVAQ 390
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 270 THTKLALAEARVQEEEQKATRLEKELQtqttKFHQDQDTIMAKLtnEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEE 349
Cdd:TIGR02168 391 LELQIASLNNEIERLEARLERLEDRRE----RLQQEIEELLKKL--EEAELKELQAELEELEEELEELQEELERLEEALE 464
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 350 ELQDIKEKISkgeygnagimAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEK 429
Cdd:TIGR02168 465 ELREELEEAE----------QALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSELISVDEG 534
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 430 LEDA----------------FNKSKQECYSLKCNLEKERM--------------TTKQLSQELESLKVRIKELEAIESRL 479
Cdd:TIGR02168 535 YEAAieaalggrlqavvvenLNAAKKAIAFLKQNELGRVTflpldsikgteiqgNDREILKNIEGFLGVAKDLVKFDPKL 614
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 480 EK-----------------------------TEFTLKEDL-------TKLKTLTVMFVDERKTMSEKLKKTEDKLQAASS 523
Cdd:TIGR02168 615 RKalsyllggvlvvddldnalelakklrpgyRIVTLDGDLvrpggviTGGSAKTNSSILERRREIEELEEKIEELEEKIA 694
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 524 QLQVEQNKVTTVTEKLIEETKRALKSKTDVEekmysvtKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEkdfl 603
Cdd:TIGR02168 695 ELEKALAELRKELEELEEELEQLRKELEELS-------RQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEI---- 763
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 604 kNKLNQDSGKSTTALHQENNKIKELSQEVERLKlklKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKME 683
Cdd:TIGR02168 764 -EELEERLEEAEEELAEAEAEIEELEAQIEQLK---EELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERR 839
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 684 LAKYKLAEKTETSHEQWLFKRLQEEEAKSGHLSREVDALKEkihEYMATEDLICHLQGDHSVLQKKLNQQENRNRDLGRE 763
Cdd:TIGR02168 840 LEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLN---ERASLEEALALLRSELEELSEELRELESKRSELRRE 916
|
730
....*....|..
gi 109659847 764 IENLTKELERYR 775
Cdd:TIGR02168 917 LEELREKLAQLE 928
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
172-525 |
2.73e-13 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 74.72 E-value: 2.73e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 172 ILELEEEKRKHKEYMEKsdeficlLEQECERLKKLIDQEIKSQE--EKEQEKEKRVTTLKEELTKLKSFALM-----VVD 244
Cdd:TIGR02169 165 VAEFDRKKEKALEELEE-------VEENIERLDLIIDEKRQQLErlRREREKAERYQALLKEKREYEGYELLkekeaLER 237
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 245 EQQRLTAQLTLQRQKIQELTtnakethtklalaeARVQEEEQKATRLEKELQTQTTKfhqdqdtIMAKLTNEdsqNRQLQ 324
Cdd:TIGR02169 238 QKEAIERQLASLEEELEKLT--------------EEISELEKRLEEIEQLLEELNKK-------IKDLGEEE---QLRVK 293
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 325 QKLAALSRQI----DELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEgkdEELIKMEEQCRDLN 400
Cdd:TIGR02169 294 EKIGELEAEIasleRSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLT---EEYAELKEELEDLR 370
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 401 KRLEREtlqSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELeaiESRLE 480
Cdd:TIGR02169 371 AELEEV---DKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINEL---EEEKE 444
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 109659847 481 KTEFTLKEDLTKLKTLTVMFVDERKTM---SEKLKKTEDKLQAASSQL 525
Cdd:TIGR02169 445 DKALEIKKQEWKLEQLAADLSKYEQELydlKEEYDRVEKELSKLQREL 492
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
113-688 |
4.62e-13 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 74.02 E-value: 4.62e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 113 KKVLEAlqRDAFQAKSTPWQEDIYEKPMNELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEf 192
Cdd:PTZ00121 1237 KDAEEA--KKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEE- 1313
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 193 icllEQECERLKKLIDQEIKSQEEKEQekekrvttlKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTnakETHT 272
Cdd:PTZ00121 1314 ----AKKADEAKKKAEEAKKKADAAKK---------KAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK---EEAK 1377
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 273 KLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQ-------QKLAALSRQIDELEETNRSLR 345
Cdd:PTZ00121 1378 KKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEkkkadeaKKKAEEAKKADEAKKKAEEAK 1457
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 346 KAEEELQDIKEKISKGEYGNAGIMA-EVEELRKRVLDMEGKDEELIKMEEQCR--DLNKRLERETLQSKDFKLEVEKLSK 422
Cdd:PTZ00121 1458 KAEEAKKKAEEAKKADEAKKKAEEAkKADEAKKKAEEAKKKADEAKKAAEAKKkaDEAKKAEEAKKADEAKKAEEAKKAD 1537
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 423 RIMALEKLEDAFNKSKQEcySLKCNLEKERMTTKQLSQELESLKVR-IKELEAIESRLEKTEFTLKEDLTKLKTLTVMFV 501
Cdd:PTZ00121 1538 EAKKAEEKKKADELKKAE--ELKKAEEKKKAEEAKKAEEDKNMALRkAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKA 1615
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 502 DERKTMSEKLKKTEDklqaassqlqvEQNKVTTVTEKLIEETKRALKSKTDVEE-KMYSVTKERDDLKNKLKAEEEKGND 580
Cdd:PTZ00121 1616 EEAKIKAEELKKAEE-----------EKKKVEQLKKKEAEEKKKAEELKKAEEEnKIKAAEEAKKAEEDKKKAEEAKKAE 1684
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 581 LLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKSTTALH-QENNKIKelsqeVERLKLKLKDMKAIEDDLMKTEDEYETL 659
Cdd:PTZ00121 1685 EDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKaEEENKIK-----AEEAKKEAEEDKKKAEEAKKDEEEKKKI 1759
|
570 580
....*....|....*....|....*....
gi 109659847 660 ERRYANERDKAQFLSKELEHVKMELAKYK 688
Cdd:PTZ00121 1760 AHLKKEEEKKAEEIRKEKEAVIEEELDEE 1788
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
283-687 |
5.38e-13 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 73.56 E-value: 5.38e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 283 EEEQKATRLEKELQtqttKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDEL----EETNRSLRKAEEELQDIKEKI 358
Cdd:TIGR02169 671 SEPAELQRLRERLE----GLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIekeiEQLEQEEEKLKERLEELEEDL 746
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 359 SKGEYGNAGIMAEVEELRKRVLDMEgkdEELIKMEEQCRDLNKRLERETLQSKDFKL-EVEKLSKRIMA-LEKLEDAFNK 436
Cdd:TIGR02169 747 SSLEQEIENVKSELKELEARIEELE---EDLHKLEEALNDLEARLSHSRIPEIQAELsKLEEEVSRIEArLREIEQKLNR 823
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 437 SKQEcyslKCNLEKERMTTKQLSQELESLKVRI-KELEAIESRLEKTEFTLKEDLTKLKTLtvmfvDERKtmsEKLKKTE 515
Cdd:TIGR02169 824 LTLE----KEYLEKEIQELQEQRIDLKEQIKSIeKEIENLNGKKEELEEELEELEAALRDL-----ESRL---GDLKKER 891
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 516 DKLQAASSQLQVEQNKvttvteklieetkralksktdveekmysvtkerddlknkLKAEEEKGNDLLSRvnmLKNRLQSL 595
Cdd:TIGR02169 892 DELEAQLRELERKIEE---------------------------------------LEAQIEKKRKRLSE---LKAKLEAL 929
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 596 EAIEKDFLKNKlnqDSGKSTTALHQENNKIKELSQEVERlklklkDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSK 675
Cdd:TIGR02169 930 EEELSEIEDPK---GEDEEIPEEELSLEDVQAELQRVEE------EIRALEPVNMLAIQEYEEVLKRLDELKEKRAKLEE 1000
|
410
....*....|..
gi 109659847 676 ELEHVKMELAKY 687
Cdd:TIGR02169 1001 ERKAILERIEEY 1012
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
164-520 |
6.34e-13 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 73.55 E-value: 6.34e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 164 AEKSRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKL---IDQEIKSQEEKEQEKEKRVTTLKEELTKLKSFAL 240
Cdd:TIGR02168 664 GSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKEleeLEEELEQLRKELEELSRQISALRKDLARLEAEVE 743
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 241 MVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFhqdqDTIMAKLTNEDSQN 320
Cdd:TIGR02168 744 QLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREAL----DELRAELTLLNEEA 819
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 321 RQLQQKLAALSRQID----ELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRV-----------LDMEGK 385
Cdd:TIGR02168 820 ANLRERLESLERRIAaterRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERasleealallrSELEEL 899
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 386 DEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMAL-EKLEDAFNKSKQECYSLKCNLEKERMttkQLSQELES 464
Cdd:TIGR02168 900 SEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLqERLSEEYSLTLEEAEALENKIEDDEE---EARRRLKR 976
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 109659847 465 LKVRIKEL--------EAIESRLEKTEF--TLKEDLTK-LKTLTVMFVDERKTMSEKLKKTEDKLQA 520
Cdd:TIGR02168 977 LENKIKELgpvnlaaiEEYEELKERYDFltAQKEDLTEaKETLEEAIEEIDREARERFKDTFDQVNE 1043
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
142-728 |
9.15e-13 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 73.25 E-value: 9.15e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 142 ELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYM-----EKSDEFICLLEQECERLKKLIDQEIKSQEE 216
Cdd:PTZ00121 1198 DARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAkkaeeERNNEEIRKFEEARMAHFARRQAAIKAEEA 1277
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 217 KEQEKEKRVTTLKEELTKLKSFALMVVDEQQRLTAqltlQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQ 296
Cdd:PTZ00121 1278 RKADELKKAEEKKKADEAKKAEEKKKADEAKKKAE----EAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAE 1353
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 297 TQTTKFHQDQDTIMA--KLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYG--NAGIMAEV 372
Cdd:PTZ00121 1354 AAADEAEAAEEKAEAaeKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAkkKAEEKKKA 1433
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 373 EELRKRVLDMEGKDEELIKMEEqcrdlnKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAfnKSKQECYSLKCNLEKER 452
Cdd:PTZ00121 1434 DEAKKKAEEAKKADEAKKKAEE------AKKAEEAKKKAEEAKKADEAKKKAEEAKKADEA--KKKAEEAKKKADEAKKA 1505
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 453 MTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKV 532
Cdd:PTZ00121 1506 AEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEE 1585
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 533 TtvteKLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSG 612
Cdd:PTZ00121 1586 A----KKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKI 1661
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 613 KSTtalhQENNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDE---YETLERRYANERDKAQFLSKELE--HVKMELAKY 687
Cdd:PTZ00121 1662 KAA----EEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEakkAEELKKKEAEEKKKAEELKKAEEenKIKAEEAKK 1737
|
570 580 590 600
....*....|....*....|....*....|....*....|.
gi 109659847 688 KLAEKTETSHEqwlFKRLQEEEAKSGHLSREVDALKEKIHE 728
Cdd:PTZ00121 1738 EAEEDKKKAEE---AKKDEEEKKKIAHLKKEEEKKAEEIRK 1775
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
233-577 |
3.22e-12 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 71.25 E-value: 3.22e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 233 TKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQ---DQDTI 309
Cdd:TIGR02169 670 RSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEEleeDLSSL 749
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 310 MAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSL--RKAEEELQDIKEKISKgeygnagIMAEVEELRKRVLDMEGKDE 387
Cdd:TIGR02169 750 EQEIENVKSELKELEARIEELEEDLHKLEEALNDLeaRLSHSRIPEIQAELSK-------LEEEVSRIEARLREIEQKLN 822
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 388 ELIKMEEQCRDLNKRLERE----TLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERmttKQLSQELE 463
Cdd:TIGR02169 823 RLTLEKEYLEKEIQELQEQridlKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKER---DELEAQLR 899
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 464 SLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKT--EDKLQAASSQLQVEQNKVTTVTEKLIE 541
Cdd:TIGR02169 900 ELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEElsLEDVQAELQRVEEEIRALEPVNMLAIQ 979
|
330 340 350
....*....|....*....|....*....|....*.
gi 109659847 542 ETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEK 577
Cdd:TIGR02169 980 EYEEVLKRLDELKEKRAKLEEERKAILERIEEYEKK 1015
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
120-810 |
6.01e-12 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 70.15 E-value: 6.01e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 120 QRDAFQAKSTPWQEDIYEKPMNELDKVVEKHKesyRRILGQLLVAEKSRRQ-----TILELEEEKRKHKE--YMEKSDEF 192
Cdd:pfam15921 246 QLEALKSESQNKIELLLQQHQDRIEQLISEHE---VEITGLTEKASSARSQansiqSQLEIIQEQARNQNsmYMRQLSDL 322
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 193 ---ICLLEQECERLKKLIDQEIKSQEEKEQEKEKRVTTLKEELTKLKSFALMVVDEQQRLTA-------QLTLQRQKIQE 262
Cdd:pfam15921 323 estVSQLRSELREAKRMYEDKIEELEKQLVLANSELTEARTERDQFSQESGNLDDQLQKLLAdlhkrekELSLEKEQNKR 402
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 263 LTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNED-------SQNRQLQQKLAALSRQID 335
Cdd:pfam15921 403 LWDRDTGNSITIDHLRRELDDRNMEVQRLEALLKAMKSECQGQMERQMAAIQGKNeslekvsSLTAQLESTKEMLRKVVE 482
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 336 ELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVlDMEGKDEELIKME-EQCRDLNKRLERETLQSKDFK 414
Cdd:pfam15921 483 ELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRV-DLKLQELQHLKNEgDHLRNVQTECEALKLQMAEKD 561
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 415 LEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLK----VRIKELEAIESRLEKTEFTL---- 486
Cdd:pfam15921 562 KVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQLEKEINDRRLELQEFKILKdkkdAKIRELEARVSDLELEKVKLvnag 641
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 487 KEDLTKLKTLTvmfvDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLiEETKRALKSktdveeKMYSVTKERDD 566
Cdd:pfam15921 642 SERLRAVKDIK----QERDQLLNEVKTSRNELNSLSEDYEVLKRNFRNKSEEM-ETTTNKLKM------QLKSAQSELEQ 710
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 567 LKNKLKAEEEKGNDLLS--------------RVNMLKNRLQSLEAI------EKDFLKNKLNQDSGKSTTALHQENNKIK 626
Cdd:pfam15921 711 TRNTLKSMEGSDGHAMKvamgmqkqitakrgQIDALQSKIQFLEEAmtnankEKHFLKEEKNKLSQELSTVATEKNKMAG 790
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 627 EL----SQEvERLKLKLKDMKAIEDDLMKTEDEYETLERRYANE--RDKAQFL--SKELE---HVKMELAKYKLAEKTET 695
Cdd:pfam15921 791 ELevlrSQE-RRLKEKVANMEVALDKASLQFAECQDIIQRQEQEsvRLKLQHTldVKELQgpgYTSNSSMKPRLLQPASF 869
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 696 SHEQWLFKRLQEEEAKSGHLSREVDALKEKiheymATEDLICHLQGDHSVLQKKLNQQENRNRDLGRE---------IEN 766
Cdd:pfam15921 870 TRTHSNVPSSQSTASFLSHHSRKTNALKED-----PTRDLKQLLQELRSVINEEPTVQLSKAEDKGRApslgalddrVRD 944
|
730 740 750 760
....*....|....*....|....*....|....*....|....*
gi 109659847 767 LTKELERYRHFSKSLRPSLNGRRISDPQVFSKE-VQTEAVDNEPP 810
Cdd:pfam15921 945 CIIESSLRSDICHSSSNSLQTEGSKSSETCSREpVLLHAGELEDP 989
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
158-667 |
1.62e-11 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 68.91 E-value: 1.62e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 158 LGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLiDQEIKSQEEKEQEKEKRVTTLKEELTKLKS 237
Cdd:PRK02224 208 LNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETL-EAEIEDLRETIAETEREREELAEEVRDLRE 286
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 238 FALMVVDEQQRLTAQLTLQR-------QKIQELTTNAKETHTKLALAEARVQEEEQKATRLE---KELQTQTTKFHQDQD 307
Cdd:PRK02224 287 RLEELEEERDDLLAEAGLDDadaeaveARREELEDRDEELRDRLEECRVAAQAHNEEAESLRedaDDLEERAEELREEAA 366
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 308 TIMAKLTNEDSQNRQLQQKLAALSRQIDELEE----TNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRV---- 379
Cdd:PRK02224 367 ELESELEEAREAVEDRREEIEELEEEIEELRErfgdAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVeeae 446
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 380 -LDMEGK---------DEELIKMEEQCRDLNKRLERETLqskDFKLEVEKLSKRIMALEKLedafnkskqecyslkcnle 449
Cdd:PRK02224 447 aLLEAGKcpecgqpveGSPHVETIEEDRERVEELEAELE---DLEEEVEEVEERLERAEDL------------------- 504
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 450 kermttKQLSQELESLKVRikeLEAIESRLEKTEFTLKEDLTKLKTLtvmfvDERKT-MSEKLKKTEDKLQAASSQLQVE 528
Cdd:PRK02224 505 ------VEAEDRIERLEER---REDLEELIAERRETIEEKRERAEEL-----RERAAeLEAEAEEKREAAAEAEEEAEEA 570
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 529 QNKVTTVTEKLiEETKRALKSKTDVEEKMYSVTKERDD---LKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKN 605
Cdd:PRK02224 571 REEVAELNSKL-AELKERIESLERIRTLLAAIADAEDEierLREKREALAELNDERRERLAEKRERKRELEAEFDEARIE 649
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847 606 KLNQDSGKSTTALHQENNKIKELS-----------------QEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANER 667
Cdd:PRK02224 650 EAREDKERAEEYLEQVEEKLDELReerddlqaeigavenelEELEELRERREALENRVEALEALYDEAEELESMYGDLR 728
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
141-678 |
2.39e-11 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 68.42 E-value: 2.39e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 141 NELDKVVEKHKESYRRILGQLLVAEKSRR----------QTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLIDQE 210
Cdd:COG1196 277 EELELELEEAQAEEYELLAELARLEQDIArleerrreleERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEA 356
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 211 IKSQEEKEQEKEKRVTTLKEELTKLKSFAlmvvDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATR 290
Cdd:COG1196 357 EAELAEAEEALLEAEAELAEAEEELEELA----EELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAE 432
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 291 LEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMA 370
Cdd:COG1196 433 LEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKA 512
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 371 EVEELRKR----VLDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKR------IMALEKLEDAFNKSKQE 440
Cdd:COG1196 513 ALLLAGLRglagAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKAAkagratFLPLDKIRARAALAAAL 592
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 441 cysLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQA 520
Cdd:COG1196 593 ---ARGAIGAAVDLVASDLREADARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRE 669
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 521 ASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEKmysvTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEK 600
Cdd:COG1196 670 LLAALLEAEAELEELAERLAEEELELEEALLAEEEE----ERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEE 745
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 109659847 601 DFLKNKLNQDsgksttalHQENNKIKELSQEVERLKLKLKDMKAIEddlMKTEDEYETLERRYanerdkaQFLSKELE 678
Cdd:COG1196 746 ELLEEEALEE--------LPEPPDLEELERELERLEREIEALGPVN---LLAIEEYEELEERY-------DFLSEQRE 805
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
334-708 |
2.97e-11 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 68.17 E-value: 2.97e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 334 IDEL---EETNRSLRKAEEELQDIKEKISKgeygnagIMAEVEELRKRVLDMEGKDEELIKMeeqcRDLNKRLEretlqs 410
Cdd:TIGR02169 159 IDEIagvAEFDRKKEKALEELEEVEENIER-------LDLIIDEKRQQLERLRREREKAERY----QALLKEKR------ 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 411 kdfKLEVEKLSKRIMALEKledafnkskqecyslkcnlEKERmTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDL 490
Cdd:TIGR02169 222 ---EYEGYELLKEKEALER-------------------QKEA-IERQLASLEEELEKLTEEISELEKRLEEIEQLLEELN 278
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 491 TKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNK 570
Cdd:TIGR02169 279 KKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEE 358
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 571 LKAEEEKGNDLLSRVNMLKNRLQSL------EAIEKDFLKNKLN----------QDSGKSTTALHQENNKIKELSQEV-- 632
Cdd:TIGR02169 359 YAELKEELEDLRAELEEVDKEFAETrdelkdYREKLEKLKREINelkreldrlqEELQRLSEELADLNAAIAGIEAKIne 438
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 109659847 633 --ERLKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKyKLAEKTETSHEQWLFKRLQEE 708
Cdd:TIGR02169 439 leEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAE-AEAQARASEERVRGGRAVEEV 515
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
321-678 |
3.94e-11 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 67.77 E-value: 3.94e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 321 RQLQQKLAALSRQIDELEETNRSLRKAE------EELQDIKEKISKGEYGNAGimAEVEELRKRvldMEGKDEELIKMEE 394
Cdd:TIGR02168 179 RKLERTRENLDRLEDILNELERQLKSLErqaekaERYKELKAELRELELALLV--LRLEELREE---LEELQEELKEAEE 253
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 395 QCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKledafnkskqECYSLKCNLEKERMTTKQLSQELESLKVRIKELEA 474
Cdd:TIGR02168 254 ELEELTAELQELEEKLEELRLEVSELEEEIEELQK----------ELYALANEISRLEQQKQILRERLANLERQLEELEA 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 475 IESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEetkralksktdVE 554
Cdd:TIGR02168 324 QLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQ-----------LE 392
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 555 EKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVER 634
Cdd:TIGR02168 393 LQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEE 472
|
330 340 350 360
....*....|....*....|....*....|....*....|....
gi 109659847 635 LKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELE 678
Cdd:TIGR02168 473 AEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQS 516
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
113-672 |
4.85e-11 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 67.47 E-value: 4.85e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 113 KKVLEALQRDAFQAKStpwQEDIYEKPMNELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEf 192
Cdd:PTZ00121 1314 AKKADEAKKKAEEAKK---KADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEE- 1389
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 193 icllEQECERLKKLIDQEIKsqeekeqekekrvttlKEELTKLKSFALMVVDEQQRLTAqltlQRQKIQELTTNAKEthT 272
Cdd:PTZ00121 1390 ----KKKADEAKKKAEEDKK----------------KADELKKAAAAKKKADEAKKKAE----EKKKADEAKKKAEE--A 1443
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 273 KLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEE--- 349
Cdd:PTZ00121 1444 KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEakk 1523
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 350 --ELQDIKEKISKGEYGNAGIMAEVEELRKRvldmegkdEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLS--KRIM 425
Cdd:PTZ00121 1524 adEAKKAEEAKKADEAKKAEEKKKADELKKA--------EELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAeeARIE 1595
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 426 ALEKLEDAFNKSKQEcySLKcNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLK-EDLTKLKTLTVMF-VDE 503
Cdd:PTZ00121 1596 EVMKLYEEEKKMKAE--EAK-KAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKaEEENKIKAAEEAKkAEE 1672
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 504 RKTMSEKLKKTEDKLQAASSQLQV---EQNKVTTVTEKLIEETKRALKSKTDVEE---KMYSVTKERDDLKNK---LKAE 574
Cdd:PTZ00121 1673 DKKKAEEAKKAEEDEKKAAEALKKeaeEAKKAEELKKKEAEEKKKAEELKKAEEEnkiKAEEAKKEAEEDKKKaeeAKKD 1752
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 575 EEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEV----ERLKLKLKDMKAIEDDLM 650
Cdd:PTZ00121 1753 EEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIieggKEGNLVINDSKEMEDSAI 1832
|
570 580
....*....|....*....|..
gi 109659847 651 KTEDEYETLERRYANERDKAQF 672
Cdd:PTZ00121 1833 KEVADSKNMQLEEADAFEKHKF 1854
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
283-688 |
1.55e-10 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 65.17 E-value: 1.55e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 283 EEEQKATRLEKELQTQTTKFHQDQDTImakltnedsqnRQLQQKLAALSRQIDELEETNRSLRKAEE------ELQDIKE 356
Cdd:COG4717 71 KELKELEEELKEAEEKEEEYAELQEEL-----------EELEEELEELEAELEELREELEKLEKLLQllplyqELEALEA 139
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 357 KISKGEYGNAGIMAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQS-KDFKLEVEKLSKRImalEKLEDAFN 435
Cdd:COG4717 140 ELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEElQDLAEELEELQQRL---AELEEELE 216
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 436 KSKQECYSLKCNLE--KERMTTKQLSQELESLKVRIK------ELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTM 507
Cdd:COG4717 217 EAQEELEELEEELEqlENELEAAALEERLKEARLLLLiaaallALLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLARE 296
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 508 SEKLKKTEDKLQAASSQLQVEQNKVTTVTEKL-----------------IEETKRALKSKTDVEEKMY--SVTKERDDLK 568
Cdd:COG4717 297 KASLGKEAEELQALPALEELEEEELEELLAALglppdlspeellelldrIEELQELLREAEELEEELQleELEQEIAALL 376
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 569 NKLKAEEEKG-----------NDLLSRVNMLKNRLQSL----EAIEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVE 633
Cdd:COG4717 377 AEAGVEDEEElraaleqaeeyQELKEELEELEEQLEELlgelEELLEALDEEELEEELEELEEELEELEEELEELREELA 456
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*
gi 109659847 634 RLKLKLKDMKAiEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKYK 688
Cdd:COG4717 457 ELEAELEQLEE-DGELAELLQELEELKAELRELAEEWAALKLALELLEEAREEYR 510
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
278-694 |
2.75e-10 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 64.68 E-value: 2.75e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 278 EARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEK 357
Cdd:PRK02224 187 GSLDQLKAQIEEKEEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETLEAEIEDLRET 266
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 358 ISKGEYGNAGIMAEVEELRKRVLDMEGKD---------------------EELIKMEEQCRDlnkRLERETLQSKDFKLE 416
Cdd:PRK02224 267 IAETEREREELAEEVRDLRERLEELEEERddllaeaglddadaeavearrEELEDRDEELRD---RLEECRVAAQAHNEE 343
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 417 VEKLSKRIMALE----KLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIK----ELEAIESRLEKTEFTLKE 488
Cdd:PRK02224 344 AESLREDADDLEeraeELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGdapvDLGNAEDFLEELREERDE 423
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 489 DLTKLKTLTVmfvdERKTMSEKLKKTEDKLQAAS--------------SQLQVEQNKVTTVTEKL--IEETKRALKSKTD 552
Cdd:PRK02224 424 LREREAELEA----TLRTARERVEEAEALLEAGKcpecgqpvegsphvETIEEDRERVEELEAELedLEEEVEEVEERLE 499
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 553 VEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAiEKDFLKNKLnQDSGKSTTALHQENNK----IKEL 628
Cdd:PRK02224 500 RAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRE-RAAELEAEA-EEKREAAAEAEEEAEEareeVAEL 577
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 109659847 629 SQEVERLKLKLKDMKAIEDDLMKTEDEYETLERRyaNERDKAQflsKELEhvkmELAKYKLAEKTE 694
Cdd:PRK02224 578 NSKLAELKERIESLERIRTLLAAIADAEDEIERL--REKREAL---AELN----DERRERLAEKRE 634
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
156-709 |
3.65e-10 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 64.68 E-value: 3.65e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 156 RILGQLLVAEKSRRQTILELEEEKRKHKEYMEKS-DEFICLLEQECERLKKLIDQEIKSQEEKEQEKEKRVTTLKEELTK 234
Cdd:TIGR00606 358 RHQEHIRARDSLIQSLATRLELDGFERGPFSERQiKNFHTLVIERQEDEAKTAAQLCADLQSKERLKQEQADEIRDEKKG 437
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 235 LKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKlalaEARVQEEEQKATRLEKELQTQTTKfhQDQDTIMAKLT 314
Cdd:TIGR00606 438 LGRTIELKKEILEKKQEELKFVIKELQQLEGSSDRILEL----DQELRKAERELSKAEKNSLTETLK--KEVKSLQNEKA 511
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 315 NEDSQNRQLQQKLAALSRQIDELEETNRSLRK---AEEELQDIKEKISKGEYGNAGIMAEVEELRKRvldMEGKDEELIK 391
Cdd:TIGR00606 512 DLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDkmdKDEQIRKIKSRHSDELTSLLGYFPNKKQLEDW---LHSKSKEINQ 588
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 392 MEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALE-KLEDAFNKSKQECY--SLKCNLEKERMTTKQLSQELESLKVR 468
Cdd:TIGR00606 589 TRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEdKLFDVCGSQDEESDleRLKEEIEKSSKQRAMLAGATAVYSQF 668
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 469 IKELEAIES-------RLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIE 541
Cdd:TIGR00606 669 ITQLTDENQsccpvcqRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPE 748
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 542 ETKRALKSKTDVEEKMYSVTKERDDLKNkLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKSTTALHQE 621
Cdd:TIGR00606 749 LRNKLQKVNRDIQRLKNDIEEQETLLGT-IMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQGSDLDRTVQQV 827
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 622 NNKIKE-------LSQEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAK----YKLA 690
Cdd:TIGR00606 828 NQEKQEkqheldtVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSlireIKDA 907
|
570
....*....|....*....
gi 109659847 691 EKTETSHEQWLFKRLQEEE 709
Cdd:TIGR00606 908 KEQDSPLETFLEKDQQEKE 926
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
224-777 |
6.39e-10 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 63.70 E-value: 6.39e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 224 RVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEeeqKATRLEKELQTQTTKFH 303
Cdd:pfam12128 242 EFTKLQQEFNTLESAELRLSHLHFGYKSDETLIASRQEERQETSAELNQLLRTLDDQWKE---KRDELNGELSAADAAVA 318
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 304 QDQdtimAKLTNEDSQNRQLQQ----KLAALSRQID----ELEETNRSLRKAEEELQDIKEKIskgeygNAGIMAEVEEL 375
Cdd:pfam12128 319 KDR----SELEALEDQHGAFLDadieTAAADQEQLPswqsELENLEERLKALTGKHQDVTAKY------NRRRSKIKEQN 388
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 376 rkrVLDMEGKDEELIKMEEQcRDLNKRLERETLQSKDFKLEveklskrimalEKLEDAFNKSKQECYSLKCNLEKERMTT 455
Cdd:pfam12128 389 ---NRDIAGIKDKLAKIREA-RDRQLAVAEDDLQALESELR-----------EQLEAGKLEFNEEEYRLKSRLGELKLRL 453
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 456 KQL---SQELESLKVRIKELEAIESRLEKTeFTLKEDLTklktltvmfvDERKTMSEKLKKTEDKLQAASSQLQVEQNKV 532
Cdd:pfam12128 454 NQAtatPELLLQLENFDERIERAREEQEAA-NAEVERLQ----------SELRQARKRRDQASEALRQASRRLEERQSAL 522
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 533 TTVTEKLIEETKRAL----KSKTDVEEKMYSVTKE----RDDLKNKLKAEEEKGNDLLSRVNMlknRLQSLEAIEKDFLK 604
Cdd:pfam12128 523 DELELQLFPQAGTLLhflrKEAPDWEQSIGKVISPellhRTDLDPEVWDGSVGGELNLYGVKL---DLKRIDVPEWAASE 599
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 605 NKLNQDSGKSTTALHQENNKIKELSQ-------EVERLKLKLKDMKAI----EDDLMKTEDEYETLERR----YANERDK 669
Cdd:pfam12128 600 EELRERLDKAEEALQSAREKQAAAEEqlvqangELEKASREETFARTAlknaRLDLRRLFDEKQSEKDKknkaLAERKDS 679
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 670 AQFLSKELEHVKMELA---KYKLAEKTETSHE---QWLFKRLQEEEAKSGHLSReVDALKEKIHEYMATEDLICHLQGDH 743
Cdd:pfam12128 680 ANERLNSLEAQLKQLDkkhQAWLEEQKEQKREartEKQAYWQVVEGALDAQLAL-LKAAIAARRSGAKAELKALETWYKR 758
|
570 580 590
....*....|....*....|....*....|....
gi 109659847 744 SVlqKKLNQQENRNRDLGREIENLTKELERYRHF 777
Cdd:pfam12128 759 DL--ASLGVDPDVIAKLKREIRTLERKIERIAVR 790
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
169-772 |
1.85e-09 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 62.30 E-value: 1.85e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 169 RQTILELEEEKRKHKEYMEKSDEFICLLEqECERLKKLIDQEIKSQEEKEQEKEKRVTTLKeeltklksfalMVVDEQQR 248
Cdd:pfam02463 166 RLKRKKKEALKKLIEETENLAELIIDLEE-LKLQELKLKEQAKKALEYYQLKEKLELEEEY-----------LLYLDYLK 233
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 249 LTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKfhqDQDTIMAKLTNEDSQNRQLQQKLA 328
Cdd:pfam02463 234 LNEERIDLLQELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEE---ELKLLAKEEEELKSELLKLERRKV 310
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 329 ALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMA----EVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLE 404
Cdd:pfam02463 311 DDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREaeeeEEEELEKLQEKLEQLEEELLAKKKLESERLSSAA 390
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 405 RETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEF 484
Cdd:pfam02463 391 KLKEEELELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKS 470
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 485 TLKEDLTKLKTLTVMFV--------DERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEK 556
Cdd:pfam02463 471 EDLLKETQLVKLQEQLElllsrqklEERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVI 550
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 557 MYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEA------------IEKDFLKNKLNQDSGKSTTALHQENNK 624
Cdd:pfam02463 551 VEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLksiavleidpilNLAQLDKATLEADEDDKRAKVVEGILK 630
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 625 IKELSQEVERLKLKL----KDMKAIEDDLMKTEDEYETLE--RRYANERDKAQFLSKELEHVKMELAKYKLAEKTETSHE 698
Cdd:pfam02463 631 DTELTKLKESAKAKEsglrKGVSLEEGLAEKSEVKASLSEltKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKE 710
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 699 QWLFKRLQEEEAKSGHLSREVD-------ALKEKIHEYMATEDLICHLQGDHSVLQKKLNQQENRNRDLGREIENLTKEL 771
Cdd:pfam02463 711 ELKKLKLEAEELLADRVQEAQDkineelkLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEE 790
|
.
gi 109659847 772 E 772
Cdd:pfam02463 791 E 791
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
325-736 |
2.06e-09 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 61.71 E-value: 2.06e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 325 QKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEygnagimAEVEELRKRVLDMEgKDEELIKMEEQCRDLNKRLE 404
Cdd:COG4717 71 KELKELEEELKEAEEKEEEYAELQEELEELEEELEELE-------AELEELREELEKLE-KLLQLLPLYQELEALEAELA 142
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 405 RETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNL-EKERMTTKQLSQELESLKVRIKELEAIESRLEKTE 483
Cdd:COG4717 143 ELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLsLATEEELQDLAEELEELQQRLAELEEELEEAQEEL 222
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 484 FTLKEDLTKLKTLTvmfvdERKTMSEKLKKTEDKLQAASSQLQVE------------------------------QNKVT 533
Cdd:COG4717 223 EELEEELEQLENEL-----EAAALEERLKEARLLLLIAAALLALLglggsllsliltiagvlflvlgllallfllLAREK 297
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 534 TVTEKLIEETkRALKSKTDVEEKMYSVTKERDDLKNKLKAEE--------EKGNDLLSRVNMLKNRLQ-SLEAIEKDFLK 604
Cdd:COG4717 298 ASLGKEAEEL-QALPALEELEEEELEELLAALGLPPDLSPEEllelldriEELQELLREAEELEEELQlEELEQEIAALL 376
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 605 NKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEyETLERRYANERDKAQFLSKELEHVKMEL 684
Cdd:COG4717 377 AEAGVEDEEELRAALEQAEEYQELKEELEELEEQLEELLGELEELLEALDE-EELEEELEELEEELEELEEELEELREEL 455
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|...
gi 109659847 685 AKYKLA-EKTETSHEqwLFKRLQEEEaksgHLSREVDALKEKIHEYMATEDLI 736
Cdd:COG4717 456 AELEAElEQLEEDGE--LAELLQELE----ELKAELRELAEEWAALKLALELL 502
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
414-678 |
5.31e-09 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 60.33 E-value: 5.31e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 414 KLEVEKLSKRIMALEKLEDAFNKSKQecYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKteftLKEDLTKL 493
Cdd:PRK05771 39 ELSNERLRKLRSLLTKLSEALDKLRS--YLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKE----LEEEISEL 112
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 494 KtltvmfvDERKTMSEKLKKTEdKLQAASSQLQVEQN-KVTTVTEKLIEETKralksktdveekmYSVTKERDDLKNKLK 572
Cdd:PRK05771 113 E-------NEIKELEQEIERLE-PWGNFDLDLSLLLGfKYVSVFVGTVPEDK-------------LEELKLESDVENVEY 171
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 573 AEEEKGNDLLSRVNmLKNRLQSLEAI--EKDFLKNKLNqDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIEDDLM 650
Cdd:PRK05771 172 ISTDKGYVYVVVVV-LKELSDEVEEElkKLGFERLELE-EEGTPSELIREIKEELEEIEKERESLLEELKELAKKYLEEL 249
|
250 260
....*....|....*....|....*...
gi 109659847 651 KTEDEYetlerrYANERDKAQFLSKELE 678
Cdd:PRK05771 250 LALYEY------LEIELERAEALSKFLK 271
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
208-781 |
5.78e-09 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 60.57 E-value: 5.78e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 208 DQEIKSQEEKEQEKEKRVTTLKEELTKLKSFALMVVDEQQRLTAQLtlqrQKIQELTTNAKETHTKLALA---------- 277
Cdd:pfam01576 4 EEEMQAKEEELQKVKERQQKAESELKELEKKHQQLCEEKNALQEQL----QAETELCAEAEEMRARLAARkqeleeilhe 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 278 -EARVQEEEQKATrlekELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKE 356
Cdd:pfam01576 80 lESRLEEEEERSQ----QLQNEKKKMQQHIQDLEEQLDEEEAARQKLQLEKVTTEAKIKKLEEDILLLEDQNSKLSKERK 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 357 KISKGEYGNAGIMAEVEELRKRVLDMEGKDEELI-------KMEEQCR----DLNKRLERETL----QSKDFKLEVEKLS 421
Cdd:pfam01576 156 LLEERISEFTSNLAEEEEKAKSLSKLKNKHEAMIsdleerlKKEEKGRqeleKAKRKLEGESTdlqeQIAELQAQIAELR 235
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 422 KRIMALEK-LEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLkvrikELE-AIESRLEKTEFTLKEDLTKLKTLTVM 499
Cdd:pfam01576 236 AQLAKKEEeLQAALARLEEETAQKNNALKKIRELEAQISELQEDL-----ESErAARNKAEKQRRDLGEELEALKTELED 310
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 500 FVD----------ERKTMSEKLKKT-EDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKERDDLK 568
Cdd:pfam01576 311 TLDttaaqqelrsKREQEVTELKKAlEEETRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQ 390
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 569 NKLKAEEEKGNDLLSRVNMLKNRLQSLEA--IEKDFLKNKLNQDSGKS-------TTALHQENNKIKELSQEVERLKLKL 639
Cdd:pfam01576 391 AELRTLQQAKQDSEHKRKKLEGQLQELQArlSESERQRAELAEKLSKLqselesvSSLLNEAEGKNIKLSKDVSSLESQL 470
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 640 KDMK-----------AIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELA--KYKLAEKTETSHeqwlfkrlQ 706
Cdd:pfam01576 471 QDTQellqeetrqklNLSTRLRQLEDERNSLQEQLEEEEEAKRNVERQLSTLQAQLSdmKKKLEEDAGTLE--------A 542
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109659847 707 EEEAKSgHLSREVDALKEKIHEYMATEDlichlqgdhsvlqkKLNQQENRnrdLGREIENLTKELERYRHFSKSL 781
Cdd:pfam01576 543 LEEGKK-RLQRELEALTQQLEEKAAAYD--------------KLEKTKNR---LQQELDDLLVDLDHQRQLVSNL 599
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
170-609 |
1.24e-08 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 59.26 E-value: 1.24e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 170 QTILELEEEKRKHKEYMEKSDEficlLEQECERLKKLID---QEIKSQEEKEQEKEKRVTTLKEELTKLKsfalmvvDEQ 246
Cdd:TIGR04523 201 LLLSNLKKKIQKNKSLESQISE----LKKQNNQLKDNIEkkqQEINEKTTEISNTQTQLNQLKDEQNKIK-------KQL 269
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 247 QRLTAQLTLQRQKIQELTTNAKETHTKLAlaearvQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQK 326
Cdd:TIGR04523 270 SEKQKELEQNNKKIKELEKQLNQLKSEIS------DLNNQKEQDWNKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQ 343
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 327 LAALSRQIDELEETNRSLRKAEEELQDIKEKISKgeyGNAGIMAEVEELRKRVLDMEGK-----------DEELIKMEEQ 395
Cdd:TIGR04523 344 ISQLKKELTNSESENSEKQRELEEKQNEIEKLKK---ENQSYKQEIKNLESQINDLESKiqnqeklnqqkDEQIKKLQQE 420
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 396 CRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQ-----------ECYSLKCNLEKERMTTKQLSQELES 464
Cdd:TIGR04523 421 KELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTREsletqlkvlsrSINKIKQNLEQKQKELKSKEKELKK 500
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 465 LKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQaaSSQLQVEQNKVTTVTEKLIEETK 544
Cdd:TIGR04523 501 LNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDFELK--KENLEKEIDEKNKEIEELKQTQK 578
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 109659847 545 RALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEK--DFLKNKLNQ 609
Cdd:TIGR04523 579 SLKKKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEKLSSIIKniKSKKNKLKQ 645
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
70-774 |
1.67e-08 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 58.93 E-value: 1.67e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 70 DDLLFLLSILEGELQARDEVIGILKAEKMDLALLEAqygfVTPKKVLEALQRDAFQAKStpwQEDIYEKPMNELDKVVEK 149
Cdd:TIGR02169 254 EKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQ----LRVKEKIGELEAEIASLER---SIAEKERELEDAEERLAK 326
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 150 HKESYRRILGQLlvaEKSRRQtILELEEEKRKHKEYMEKSDEFICLLEQECERLkkliDQEIKSQEEKEQEKEKRVTTLK 229
Cdd:TIGR02169 327 LEAEIDKLLAEI---EELERE-IEEERKRRDKLTEEYAELKEELEDLRAELEEV----DKEFAETRDELKDYREKLEKLK 398
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 230 EELTKLKSFALMVVDEQQRLTAQLtlqrqkiqelttnaKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQdqdtI 309
Cdd:TIGR02169 399 REINELKRELDRLQEELQRLSEEL--------------ADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQ----L 460
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 310 MAKLTNEDSQNRQLQQKLAALSRQideleetnrsLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEGKDEEL 389
Cdd:TIGR02169 461 AADLSKYEQELYDLKEEYDRVEKE----------LSKLQRELAEAEAQARASEERVRGGRAVEEVLKASIQGVHGTVAQL 530
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 390 IKMEEQ--------------------------CRDLNKR--LERETL----QSKDFKLEVEKLSKR---------IMALE 428
Cdd:TIGR02169 531 GSVGERyataievaagnrlnnvvveddavakeAIELLKRrkAGRATFlplnKMRDERRDLSILSEDgvigfavdlVEFDP 610
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 429 KLEDAFNKSKQECY------SLKCNLEKERMTT--------------------------KQLSQELESLKVRIKELEAIE 476
Cdd:TIGR02169 611 KYEPAFKYVFGDTLvvedieAARRLMGKYRMVTlegelfeksgamtggsraprggilfsRSEPAELQRLRERLEGLKREL 690
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 477 SRlekteftLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKlIEETKRALkskTDVEEK 556
Cdd:TIGR02169 691 SS-------LQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEED-LSSLEQEI---ENVKSE 759
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 557 MYSVTKERDDLKNKLKAEEEKGNDLLSRVNM--LKNRLQSLEAIEKDFLK-----NKLNQDSGKSTTALHQENNKIKELS 629
Cdd:TIGR02169 760 LKELEARIEELEEDLHKLEEALNDLEARLSHsrIPEIQAELSKLEEEVSRiearlREIEQKLNRLTLEKEYLEKEIQELQ 839
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 630 QEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANE----RDKAQFLSKELEHVKMELAKYKLAEKTETSHEQWLFKRL 705
Cdd:TIGR02169 840 EQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAAlrdlESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRL 919
|
730 740 750 760 770 780 790
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109659847 706 QEEEAKSGHLSREVDALKEKIHE--YMATEDLichlqgDHSVLQKKLNQQENRNRDLG----REIENLTKELERY 774
Cdd:TIGR02169 920 SELKAKLEALEEELSEIEDPKGEdeEIPEEEL------SLEDVQAELQRVEEEIRALEpvnmLAIQEYEEVLKRL 988
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
227-577 |
2.67e-08 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 58.44 E-value: 2.67e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 227 TLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLE---KELQTQTTKFH 303
Cdd:pfam02463 663 EVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQeaqDKINEELKLLK 742
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 304 QDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDEL-EETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVLDM 382
Cdd:pfam02463 743 QKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELaEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQL 822
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 383 EGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQEL 462
Cdd:pfam02463 823 LIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKEL 902
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 463 ESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEE 542
Cdd:pfam02463 903 EEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPEELLLEEADEKEKEENNKEEEEERNKRLLLAKEELGKVNLMAIEE 982
|
330 340 350
....*....|....*....|....*....|....*
gi 109659847 543 TKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEK 577
Cdd:pfam02463 983 FEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQ 1017
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
174-473 |
3.52e-08 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 58.16 E-value: 3.52e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 174 ELEEEKRKHKEYMEKSDEFICLLEQECERLKklidQEIKSQEEKEQEKEKRVTTLKEELTKLK-----SFALMVVDEQQR 248
Cdd:TIGR02169 727 QLEQEEEKLKERLEELEEDLSSLEQEIENVK----SELKELEARIEELEEDLHKLEEALNDLEarlshSRIPEIQAELSK 802
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 249 LTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLE---KELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQ 325
Cdd:TIGR02169 803 LEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKeqiKSIEKEIENLNGKKEELEEELEELEAALRDLES 882
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 326 KLAALSRQIDELEETNRSLRKAEEELQdikekiskgeygnagimAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLE- 404
Cdd:TIGR02169 883 RLGDLKKERDELEAQLRELERKIEELE-----------------AQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEe 945
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109659847 405 --RETLQSKDFKLEVEKLSKRIMALE----KLEDAFNKSKQECYSLKCNLEKermttkqLSQELESLKVRIKELE 473
Cdd:TIGR02169 946 ipEEELSLEDVQAELQRVEEEIRALEpvnmLAIQEYEEVLKRLDELKEKRAK-------LEEERKAILERIEEYE 1013
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
142-440 |
3.94e-08 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 57.76 E-value: 3.94e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 142 ELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEficlLEQECERLKKLIDQEIKsqeekeqek 221
Cdd:TIGR02168 737 RLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEE----LEAQIEQLKEELKALRE--------- 803
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 222 ekRVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTK 301
Cdd:TIGR02168 804 --ALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNE 881
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 302 FHQ----------DQDTIMAKLTNEDSQNRQLQQKLAALSrqiDELEETNRSLRKAEEELQDIKEKISKGEYGNAGIM-- 369
Cdd:TIGR02168 882 RASleealallrsELEELSEELRELESKRSELRRELEELR---EKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAea 958
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 370 ------AEVEELRKRVLDMEGKDEEL--IKME--EQCRDLNKRLERETLQSKDFKLEVEKLSKRI-----MALEKLEDAF 434
Cdd:TIGR02168 959 lenkieDDEEEARRRLKRLENKIKELgpVNLAaiEEYEELKERYDFLTAQKEDLTEAKETLEEAIeeidrEARERFKDTF 1038
|
....*.
gi 109659847 435 NKSKQE 440
Cdd:TIGR02168 1039 DQVNEN 1044
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
330-657 |
4.35e-08 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 57.64 E-value: 4.35e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 330 LSRQIDELE-ETNRSLR----KAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLE 404
Cdd:COG1196 198 LERQLEPLErQAEKAERyrelKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELE 277
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 405 RETLQSKDFKLEVEKLSKRIMALEKledafnkskqecyslkcNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEF 484
Cdd:COG1196 278 ELELELEEAQAEEYELLAELARLEQ-----------------DIARLEERRRELEERLEELEEELAELEEELEELEEELE 340
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 485 TLKEDLTKLKTltvmfvdERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEKMysvTKER 564
Cdd:COG1196 341 ELEEELEEAEE-------ELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELE---EAEE 410
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 565 DDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLK-NKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMK 643
Cdd:COG1196 411 ALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEeAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAA 490
|
330
....*....|....
gi 109659847 644 AIEDDLMKTEDEYE 657
Cdd:COG1196 491 ARLLLLLEAEADYE 504
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
246-771 |
5.49e-08 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 57.04 E-value: 5.49e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 246 QQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQ 325
Cdd:pfam05483 228 EEEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKTKLQDENLKELIEKKDHLTKELEDIKMSLQR 307
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 326 KLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVldmegkdEELIKMEEQcrdlnkRLER 405
Cdd:pfam05483 308 SMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSL-------EELLRTEQQ------RLEK 374
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 406 ETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFT 485
Cdd:pfam05483 375 NEDQLKIITMELQKKSSELEEMTKFKNNKEVELEELKKILAEDEKLLDEKKQFEKIAEELKGKEQELIFLLQAREKEIHD 454
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 486 LKEDLTKLKTLTVMFVDERKTM-----SEKLKKTEdkLQAASSQLQVEQNKVTTVTEKLIEETK--------------RA 546
Cdd:pfam05483 455 LEIQLTAIKTSEEHYLKEVEDLkteleKEKLKNIE--LTAHCDKLLLENKELTQEASDMTLELKkhqediinckkqeeRM 532
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 547 LKSKTDVEEK-------MYSVTKE----RDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDsgKST 615
Cdd:pfam05483 533 LKQIENLEEKemnlrdeLESVREEfiqkGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKCNNLKKQIENKN--KNI 610
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 616 TALHQENNKIKELSQEverlklKLKDMKAIEDDLMKTEDEYETLERRYANERDKAQflsKELEHVKmeLAKYKLAEKTET 695
Cdd:pfam05483 611 EELHQENKALKKKGSA------ENKQLNAYEIKVNKLELELASAKQKFEEIIDNYQ---KEIEDKK--ISEEKLLEEVEK 679
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847 696 SH--EQWLFKRLQEEEAKSGHLSREVDALKEK-IHEYmatEDLICHLQGDHSVLQKKLNQQENRNRDLGREIENLTKEL 771
Cdd:pfam05483 680 AKaiADEAVKLQKEIDKRCQHKIAEMVALMEKhKHQY---DKIIEERDSELGLYKNKEQEQSSAKAALEIELSNIKAEL 755
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
164-775 |
6.12e-08 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 57.29 E-value: 6.12e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 164 AEKSRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLIDQEIKsQEEKEQEKEKRVTTLKEELTKLKSFALMVV 243
Cdd:pfam02463 210 LEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQ-EIEKEEEKLAQVLKENKEEEKEKKLQEEEL 288
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 244 DEQQRLTAQLTLQRQKIQELTT----NAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQ 319
Cdd:pfam02463 289 KLLAKEEEELKSELLKLERRKVddeeKLKESEKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKL 368
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 320 NRQLQQKLAALSRQIDELEETNRsLRKAEEELQDIKEKISKGEygnagimAEVEELRKRVLDMEGKDEELIKMEEQCRDL 399
Cdd:pfam02463 369 EQLEEELLAKKKLESERLSSAAK-LKEEELELKSEEEKEAQLL-------LELARQLEDLLKEEKKEELEILEEEEESIE 440
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 400 NKRLERETLQSKDFKLEVEKLSKRIMaLEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRL 479
Cdd:pfam02463 441 LKQGKLTEEKEELEKQELKLLKDELE-LKKSEDLLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDG 519
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 480 EKTEFTLKEDLTKLKTLTVMFVD---ERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEk 556
Cdd:pfam02463 520 VGGRIISAHGRLGDLGVAVENYKvaiSTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLE- 598
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 557 mysvTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKSTTA--LHQENNKIKELSQEVER 634
Cdd:pfam02463 599 ----IDPILNLAQLDKATLEADEDDKRAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEegLAEKSEVKASLSELTKE 674
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 635 LKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKYKLAE---KTETSHEQWLFKRLQEEEAK 711
Cdd:pfam02463 675 LLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELLADRVQEAQdkiNEELKLLKQKIDEEEEEEEK 754
|
570 580 590 600 610 620
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 109659847 712 SGHLSREVDALKEKIHEYMATEDLICHLQGDHSVLQKKLNQQENRNRDLGREIENLTKELERYR 775
Cdd:pfam02463 755 SRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLE 818
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
113-653 |
6.13e-08 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 57.46 E-value: 6.13e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 113 KKVLEALQRDAFQAKSTPWQEDIYEKPMNELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEf 192
Cdd:PTZ00121 1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEE- 1455
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 193 icllEQECERLKKLIDQEIKSQEEKEQEKEKRVT---TLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQelTTNAKE 269
Cdd:PTZ00121 1456 ----AKKAEEAKKKAEEAKKADEAKKKAEEAKKAdeaKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKK--ADEAKK 1529
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 270 THTKLALAEARVQEEEQKATRLEK--------ELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETN 341
Cdd:PTZ00121 1530 AEEAKKADEAKKAEEKKKADELKKaeelkkaeEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKA 1609
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 342 RSLRKAEEE---------LQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEGKDEELIKMEEQcrdlnkrlerETLQSKD 412
Cdd:PTZ00121 1610 EEAKKAEEAkikaeelkkAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEE----------DKKKAEE 1679
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 413 FKLEVEKLSKRIMALEKLEDAFNKSKQecysLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTK 492
Cdd:PTZ00121 1680 AKKAEEDEKKAAEALKKEAEEAKKAEE----LKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE 1755
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 493 LKTLTVMFVDERKTMSEKLKKTEdklqaassqlQVEQNKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLK 572
Cdd:PTZ00121 1756 KKKIAHLKKEEEKKAEEIRKEKE----------AVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSK 1825
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 573 -AEEEKGNDLLSRVNMLKNRLQSLEaiEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVErlklKLKDMKAIEDDLMK 651
Cdd:PTZ00121 1826 eMEDSAIKEVADSKNMQLEEADAFE--KHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEIE----EADEIEKIDKDDIE 1899
|
..
gi 109659847 652 TE 653
Cdd:PTZ00121 1900 RE 1901
|
|
| MAD |
pfam05557 |
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ... |
245-797 |
1.32e-07 |
|
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
Pssm-ID: 461677 [Multi-domain] Cd Length: 660 Bit Score: 55.90 E-value: 1.32e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 245 EQQRLTAQLT-LQRQKIQ----------ELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKL 313
Cdd:pfam05557 3 ELIESKARLSqLQNEKKQmelehkrariELEKKASALKRQLDRESDRNQELQKRIRLLEKREAEAEEALREQAELNRLKK 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 314 TNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEGKDEELIKME 393
Cdd:pfam05557 83 KYLEALNKKLNEKESQLADAREVISCLKNELSELRRQIQRAELELQSTNSELEELQERLDLLKAKASEAEQLRQNLEKQQ 162
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 394 EQCRDLNKR---LERETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMttkqLSQELESLKVRIK 470
Cdd:pfam05557 163 SSLAEAEQRikeLEFEIQSQEQDSEIVKNSKSELARIPELEKELERLREHNKHLNENIENKLL----LKEEVEDLKRKLE 238
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 471 ELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMseklkKTEDKLQAASSQLQveQNKVTTVTEK--LIEETKRALK 548
Cdd:pfam05557 239 REEKYREEAATLELEKEKLEQELQSWVKLAQDTGLNL-----RSPEDLSRRIEQLQ--QREIVLKEENssLTSSARQLEK 311
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 549 SKTDVEEKMYSVTKERDDLKNKLKAEEEkgndllsrvnmLKNRLQ---SLEAIEKDFLKNKL-NQDSGKSTT-ALHQENN 623
Cdd:pfam05557 312 ARRELEQELAQYLKKIEDLNKKLKRHKA-----------LVRRLQrrvLLLTKERDGYRAILeSYDKELTMSnYSPQLLE 380
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 624 KIKELSQEVERLKLKLKDMKAieddlmktedEYETLERRYANERDKAQFLSKELEhvkmelakyklaektetsheqwlFK 703
Cdd:pfam05557 381 RIEEAEDMTQKMQAHNEEMEA----------QLSVAEEELGGYKQQAQTLERELQ-----------------------AL 427
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 704 RLQEEEAKSGHLSREVDALKEKIHEYMAT-----------EDLICH--LQGD-----HSVLQKKLNQQENRNRDLGREIE 765
Cdd:pfam05557 428 RQQESLADPSYSKEEVDSLRRKLETLELErqrlreqknelEMELERrcLQGDydpkkTKVLHLSMNPAAEAYQQRKNQLE 507
|
570 580 590
....*....|....*....|....*....|..
gi 109659847 766 NLTKELERYRHFSKSLRPSLNGRRISDPQVFS 797
Cdd:pfam05557 508 KLQAEIERLKRLLKKLEDDLEQVLRLPETTST 539
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
228-433 |
1.38e-07 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 55.16 E-value: 1.38e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 228 LKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKF----- 302
Cdd:COG4942 32 LQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELaellr 111
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 303 ----HQDQDTIMAKLTNEDSQ------------NRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKekiskgeygna 366
Cdd:COG4942 112 alyrLGRQPPLALLLSPEDFLdavrrlqylkylAPARREQAEELRADLAELAALRAELEAERAELEALL----------- 180
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 109659847 367 gimAEVEELRKRVLDMEGKDEELIKmeeqcrDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDA 433
Cdd:COG4942 181 ---AELEEERAALEALKAERQKLLA------RLEKELAELAAELAELQQEAEELEALIARLEAEAAA 238
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
205-683 |
1.50e-07 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 55.80 E-value: 1.50e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 205 KLIDQEIKSQEEKEQEKEKRVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEE 284
Cdd:TIGR04523 106 SKINSEIKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNI 185
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 285 E--------------------QKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQI----DELEET 340
Cdd:TIGR04523 186 QknidkiknkllklelllsnlKKKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLnqlkDEQNKI 265
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 341 NRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEGKD--EELIKMEEQCRDLNKRLERETLQSKDFKLEVE 418
Cdd:TIGR04523 266 KKQLSEKQKELEQNNKKIKELEKQLNQLKSEISDLNNQKEQDWNKElkSELKNQEKKLEEIQNQISQNNKIISQLNEQIS 345
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 419 KLSKRIM------------------ALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLE 480
Cdd:TIGR04523 346 QLKKELTnsesensekqreleekqnEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLE 425
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 481 KTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLiEETKRALKSKTDveeKMYSV 560
Cdd:TIGR04523 426 KEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNL-EQKQKELKSKEK---ELKKL 501
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 561 TKERDDLKNKLKAEEEKGNDLLSRVNMLKNRL----QSLEAIEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVERLK 636
Cdd:TIGR04523 502 NEEKKELEEKVKDLTKKISSLKEKIEKLESEKkekeSKISDLEDELNKDDFELKKENLEKEIDEKNKEIEELKQTQKSLK 581
|
490 500 510 520
....*....|....*....|....*....|....*....|....*..
gi 109659847 637 lklKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKME 683
Cdd:TIGR04523 582 ---KKQEEKQELIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKE 625
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
143-577 |
1.82e-07 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 55.54 E-value: 1.82e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 143 LDKVVEKHKESYRRILGQLLVAEKSR-RQTILELEEEKRKHKEYMEKSDEfICLLEQECERLKKlidqeiksqeekeqek 221
Cdd:COG4717 47 LLERLEKEADELFKPQGRKPELNLKElKELEEELKEAEEKEEEYAELQEE-LEELEEELEELEA---------------- 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 222 ekRVTTLKEELTKLKSF--ALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQT 299
Cdd:COG4717 110 --ELEELREELEKLEKLlqLLPLYQELEALEAELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLS 187
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 300 TKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEEtNRSLRKAEEELQDIKEKISKGEYG--------------- 364
Cdd:COG4717 188 LATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEE-ELEQLENELEAAALEERLKEARLLlliaaallallglgg 266
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 365 ---------------NAGIMAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFK--LEVEKLSKRIMAL 427
Cdd:COG4717 267 sllsliltiagvlflVLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPpdLSPEELLELLDRI 346
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 428 EKLEDAFNKSKQecyslkcnlEKERMTTKQLSQELESL--KVRIKELEAIESRLEKTEfTLKEDLTKLKTLTVMFVDERK 505
Cdd:COG4717 347 EELQELLREAEE---------LEEELQLEELEQEIAALlaEAGVEDEEELRAALEQAE-EYQELKEELEELEEQLEELLG 416
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847 506 TMSEKLKKT-----EDKLQAASSQLQVEQNKVTTVTEKL--IEETKRALKSKTDVEEKMYsvtkERDDLKNKLKAEEEK 577
Cdd:COG4717 417 ELEELLEALdeeelEEELEELEEELEELEEELEELREELaeLEAELEQLEEDGELAELLQ----ELEELKAELRELAEE 491
|
|
| COG5022 |
COG5022 |
Myosin heavy chain [General function prediction only]; |
318-837 |
3.02e-07 |
|
Myosin heavy chain [General function prediction only];
Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 55.08 E-value: 3.02e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 318 SQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGnagimaEVEELRKRVLDMEgKDEELIKMEEQCR 397
Cdd:COG5022 806 LGSRKEYRSYLACIIKLQKTIKREKKLRETEEVEFSLKAEVLIQKFG------RSLKAKKRFSLLK-KETIYLQSAQRVE 878
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 398 DLNKRLERETLQSKdfklEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTT-----------KQLSQELESLK 466
Cdd:COG5022 879 LAERQLQELKIDVK----SISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARlkkllnnidleEGPSIEYVKLP 954
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 467 VRIKELEAiESRLEKTEFTLkEDLTKLKTLTVmfvDERKTMSEKLKKTEDKLQAASSQLQVEQNKVttvteKLIEETKRA 546
Cdd:COG5022 955 ELNKLHEV-ESKLKETSEEY-EDLLKKSTILV---REGNKANSELKNFKKELAELSKQYGALQEST-----KQLKELPVE 1024
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 547 LKSKTDVEEKMYSVTKErddLKNKLKAEEEKGNDLLSrVNMLKNRLQSLEaIEKDFLKNKLNQDSGKSTTalhqeNNKIK 626
Cdd:COG5022 1025 VAELQSASKIISSESTE---LSILKPLQKLKGLLLLE-NNQLQARYKALK-LRRENSLLDDKQLYQLEST-----ENLLK 1094
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 627 ElsqeverlkLKLKDMKAIEDDLMKTEDEYETL---ERRYANERDKAQFLSKELEHVKMELAKYKLAEKT---------- 693
Cdd:COG5022 1095 T---------INVKDLEVTNRNLVKPANVLQFIvaqMIKLNLLQEISKFLSQLVNTLEPVFQKLSVLQLEldglfweanl 1165
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 694 -ETSHEQWlFKRLQEEEAKSGH---------------LSREVDALKEKIHEYMATEDLICHL--QGDHSVLQKKLNQQEN 755
Cdd:COG5022 1166 eALPSPPP-FAALSEKRLYQSAlydeksklsssevndLKNELIALFSKIFSGWPRGDKLKKLisEGWVPTEYSTSLKGFN 1244
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 756 RNRDLGREIENLTKelERYRHFSKSLRPSLNGRRISdPQVFSKEVQTEAVDNEPPDYKSLIPLERAVINGQLYEESENQD 835
Cdd:COG5022 1245 NLNKKFDTPASMSN--EKLLSLLNSIDNLLSSYKLE-EEVLPATINSLLQYINVGLFNALRTKASSLRWKSATEVNYNSE 1321
|
..
gi 109659847 836 ED 837
Cdd:COG5022 1322 EL 1323
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
251-819 |
5.40e-07 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 54.20 E-value: 5.40e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 251 AQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAAL 330
Cdd:TIGR00618 166 KELLMNLFPLDQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKELKHLREALQQTQQSHAYL 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 331 SRQIDELEETNRSLRKAEEELQDIKEKISKgeygnagiMAEVEELRKRvLDMEGKDEELIKMEEQCRDLNKRLER--ETL 408
Cdd:TIGR00618 246 TQKREAQEEQLKKQQLLKQLRARIEELRAQ--------EAVLEETQER-INRARKAAPLAAHIKAVTQIEQQAQRihTEL 316
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 409 QSKDFKLEVE-----KLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKvRIKELEAIESRLEKTE 483
Cdd:TIGR00618 317 QSKMRSRAKLlmkraAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQ-HIHTLQQQKTTLTQKL 395
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 484 FTLKEDLTKLKTL--TVMFVDERKtmseklKKTEDKLQAASSQLQVEQnKVTTVTEKLIEETKRALKSKTDVEEKMYSVT 561
Cdd:TIGR00618 396 QSLCKELDILQREqaTIDTRTSAF------RDLQGQLAHAKKQQELQQ-RYAELCAAAITCTAQCEKLEKIHLQESAQSL 468
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 562 KERD----DLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEaiekdflknklnqdsgKSTTALHQENNKIKELSQEVERLKL 637
Cdd:TIGR00618 469 KEREqqlqTKEQIHLQETRKKAVVLARLLELQEEPCPLC----------------GSCIHPNPARQDIDNPGPLTRRMQR 532
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 638 KLKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKYKLAEKTETSHEQWLFKRLQEEEAKSGHLSR 717
Cdd:TIGR00618 533 GEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLAC 612
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 718 EVDALKEKIHEYMATEDLICHLQGDHSVLQKKLNQqenrnrdLGREIENLTKELERYRHFSKSLRPSLNG-RRISDPQVF 796
Cdd:TIGR00618 613 EQHALLRKLQPEQDLQDVRLHLQQCSQELALKLTA-------LHALQLTLTQERVREHALSIRVLPKELLaSRQLALQKM 685
|
570 580
....*....|....*....|...
gi 109659847 797 SKEVQTEAVDNEPPDYKSLIPLE 819
Cdd:TIGR00618 686 QSEKEQLTYWKEMLAQCQTLLRE 708
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
131-606 |
8.29e-07 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 53.49 E-value: 8.29e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 131 WQEDIYEKPMNELDKVVEKHKESYRRILgQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLID-- 208
Cdd:TIGR04523 194 NKLLKLELLLSNLKKKIQKNKSLESQIS-ELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSek 272
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 209 -QEIKSQEEKEQEKEKRVTTLKEELTKLKSfalmvvDEQQRLTAQLTLQRQKIQELTTNAKethTKLALAEARVQEEEQK 287
Cdd:TIGR04523 273 qKELEQNNKKIKELEKQLNQLKSEISDLNN------QKEQDWNKELKSELKNQEKKLEEIQ---NQISQNNKIISQLNEQ 343
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 288 ATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGNAG 367
Cdd:TIGR04523 344 ISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKEL 423
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 368 IMAEVEELRKRVLDMEGKDEELIK----MEEQCRDLNKRLERETLQSKDFKLEVEKLSKRimaLEKLEDAFNKSKQECYS 443
Cdd:TIGR04523 424 LEKEIERLKETIIKNNSEIKDLTNqdsvKELIIKNLDNTRESLETQLKVLSRSINKIKQN---LEQKQKELKSKEKELKK 500
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 444 LKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKT-----LTVMFVDERKTMSEKLKKTEDKL 518
Cdd:TIGR04523 501 LNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDFelkkeNLEKEIDEKNKEIEELKQTQKSL 580
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 519 QAASSQLQveqnkvttvtEKLIEETKRALKSKTDVEEKMYSVTKERDDLKnKLKAEEEKGNDLLSRVNMLKNRL-QSLEA 597
Cdd:TIGR04523 581 KKKQEEKQ----------ELIDQKEKEKKDLIKEIEEKEKKISSLEKELE-KAKKENEKLSSIIKNIKSKKNKLkQEVKQ 649
|
....*....
gi 109659847 598 IEKDFLKNK 606
Cdd:TIGR04523 650 IKETIKEIR 658
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
274-546 |
8.40e-07 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 52.46 E-value: 8.40e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 274 LALAEARVQEEEQKAtrLEKELQtqttKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEetnRSLRKAEEELQD 353
Cdd:COG4942 10 LLALAAAAQADAAAE--AEAELE----QLQQEIAELEKELAALKKEEKALLKQLAALERRIAALA---RRIRALEQELAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 354 IKEKISKGEygnagimAEVEELRKRvldmegKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDA 433
Cdd:COG4942 81 LEAELAELE-------KEIAELRAE------LEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPA 147
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 434 fnkskqecyslkcnlEKERMttKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLtvmfVDERKTMSEKLKK 513
Cdd:COG4942 148 ---------------RREQA--EELRADLAELAALRAELEAERAELEALLAELEEERAALEAL----KAERQKLLARLEK 206
|
250 260 270
....*....|....*....|....*....|...
gi 109659847 514 TEDKLQAASSQLQVEQNKVTTVTEKLIEETKRA 546
Cdd:COG4942 207 ELAELAAELAELQQEAEELEALIARLEAEAAAA 239
|
|
| PRK04778 |
PRK04778 |
septation ring formation regulator EzrA; Provisional |
273-777 |
1.12e-06 |
|
septation ring formation regulator EzrA; Provisional
Pssm-ID: 179877 [Multi-domain] Cd Length: 569 Bit Score: 52.53 E-value: 1.12e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 273 KLALAEARVQEEEQKATRLEKELQTQTT--KFHQDQDTIMAKltnedsqnrqlqqKLAALSRQIDELEETNRSLR--KAE 348
Cdd:PRK04778 38 KQELENLPVNDELEKVKKLNLTGQSEEKfeEWRQKWDEIVTN-------------SLPDIEEQLFEAEELNDKFRfrKAK 104
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 349 EELQDIKEKISKGEygnagimAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLeRETLQSKDFKL--EVEKLSKRima 426
Cdd:PRK04778 105 HEINEIESLLDLIE-------EDIEQILEELQELLESEEKNREEVEQLKDLYREL-RKSLLANRFSFgpALDELEKQ--- 173
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 427 LEKLEDAFNKSKQEcySLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTeftLKEDLTKLKT----------- 495
Cdd:PRK04778 174 LENLEEEFSQFVEL--TESGDYVEAREILDQLEEELAALEQIMEEIPELLKELQTE---LPDQLQELKAgyrelveegyh 248
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 496 LTVMFVDER-KTMSEKLKKTEDKLqaasSQLQVEqnkvttVTEKLIEETKRALKSKTDVEEKMY----SVTKERDDLKNK 570
Cdd:PRK04778 249 LDHLDIEKEiQDLKEQIDENLALL----EELDLD------EAEEKNEEIQERIDQLYDILEREVkarkYVEKNSDTLPDF 318
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 571 LKAEEEKGNDL-----------------LSRVNMLKNRLQSleaIEKDFLKNKLNQDSGKSTtalhqennkikeLSQEVE 633
Cdd:PRK04778 319 LEHAKEQNKELkeeidrvkqsytlneseLESVRQLEKQLES---LEKQYDEITERIAEQEIA------------YSELQE 383
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 634 RLKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKYKL--------AEKTETSHE-QWLFKR 704
Cdd:PRK04778 384 ELEEILKQLEEIEKEQEKLSEMLQGLRKDELEAREKLERYRNKLHEIKRYLEKSNLpglpedylEMFFEVSDEiEALAEE 463
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109659847 705 LQEEEAKSGHLSREVDALKEKIHE-YMATEDLIchlqgDHSVLQKKLNQQENRNRDLGREIEN-LTKELERYRHF 777
Cdd:PRK04778 464 LEEKPINMEAVNRLLEEATEDVETlEEETEELV-----ENATLTEQLIQYANRYRSDNEEVAEaLNEAERLFREY 533
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
447-786 |
1.26e-06 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 52.75 E-value: 1.26e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 447 NLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFtLKEDLTKL-KTLTVMFVDERKtmsEKLKKTEDKLQAASSQL 525
Cdd:TIGR02168 180 KLERTRENLDRLEDILNELERQLKSLERQAEKAERYKE-LKAELRELeLALLVLRLEELR---EELEELQEELKEAEEEL 255
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 526 QVEQNKVTTVTEKLieETKRALKSKtdVEEKMYSVTKERDDLKNKLkaeeekgNDLLSRVNMLKNRLQSLEaiekdflkN 605
Cdd:TIGR02168 256 EELTAELQELEEKL--EELRLEVSE--LEEEIEELQKELYALANEI-------SRLEQQKQILRERLANLE--------R 316
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 606 KLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANerdkaqfLSKELEHVKMELA 685
Cdd:TIGR02168 317 QLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEE-------LEEQLETLRSKVA 389
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 686 KYKLAEKTETSHEQWLFKRLQEEEAKSGHLSREVDALKEKIHEYMATEdlichLQGDHSVLQKKLNQQENRNRDLGREIE 765
Cdd:TIGR02168 390 QLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKE-----LQAELEELEEELEELQEELERLEEALE 464
|
330 340
....*....|....*....|.
gi 109659847 766 NLTKELERYRHFSKSLRPSLN 786
Cdd:TIGR02168 465 ELREELEEAEQALDAAERELA 485
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
166-490 |
1.29e-06 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 52.72 E-value: 1.29e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 166 KSRRQTILELEEEKRKHKEYMEKSDEFICLLE---QECERLKKLIDQEIKSQEEKEQEKEKRVTTLKEELTKLKSfalmv 242
Cdd:TIGR04523 366 EEKQNEIEKLKKENQSYKQEIKNLESQINDLEskiQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNS----- 440
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 243 vdEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTkfhqdqdtimaKLTNEDSQNRQ 322
Cdd:TIGR04523 441 --EIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKEK-----------ELKKLNEEKKE 507
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 323 LQQKLAALSRQIDELEETNRSL----RKAEEELQDIKEKISKGEYG--NAGIMAEVEELRKRVLDMEGKDEELIKMEEQC 396
Cdd:TIGR04523 508 LEEKVKDLTKKISSLKEKIEKLesekKEKESKISDLEDELNKDDFElkKENLEKEIDEKNKEIEELKQTQKSLKKKQEEK 587
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 397 RDLNKRLEretlqsKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIE 476
Cdd:TIGR04523 588 QELIDQKE------KEKKDLIKEIEEKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIKETIKEIRNKW 661
|
330
....*....|....
gi 109659847 477 SRLEKTEFTLKEDL 490
Cdd:TIGR04523 662 PEIIKKIKESKTKI 675
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
257-405 |
1.40e-06 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 52.61 E-value: 1.40e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 257 RQKIQELTTnakethtKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDtiMAKLTNEDSQNRQLQQKLAALSRQIDE 336
Cdd:COG4913 609 RAKLAALEA-------ELAELEEELAEAEERLEALEAELDALQERREALQR--LAEYSWDEIDVASAEREIAELEAELER 679
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847 337 LEETNRSLRKAEEELQDIKEKISKGEygnagimaevEELRKRVLDMEGKDEELIKMEEQCRDLNKRLER 405
Cdd:COG4913 680 LDASSDDLAALEEQLEELEAELEELE----------EELDELKGEIGRLEKELEQAEEELDELQDRLEA 738
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
197-781 |
1.83e-06 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 52.48 E-value: 1.83e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 197 EQECERLKKLIDQEIKSQEEKEQEKEKR----VTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHT 272
Cdd:pfam01576 326 EQEVTELKKALEEETRSHEAQLQEMRQKhtqaLEELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQAKQDSEH 405
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 273 KLALAEARVQE--------EEQKATRLEK----------------ELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLA 328
Cdd:pfam01576 406 KRKKLEGQLQElqarlsesERQRAELAEKlsklqselesvssllnEAEGKNIKLSKDVSSLESQLQDTQELLQEETRQKL 485
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 329 ALSRQIDELEETNRSLRKAEEELQDIKEKISKgeygnagimaEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLEREtL 408
Cdd:pfam01576 486 NLSTRLRQLEDERNSLQEQLEEEEEAKRNVER----------QLSTLQAQLSDMKKKLEEDAGTLEALEEGKKRLQRE-L 554
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 409 QSKDFKLEveklsKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRL----EKTEF 484
Cdd:pfam01576 555 EALTQQLE-----EKAAAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSNLEKKQKKFDQMLAEEKAISARYaeerDRAEA 629
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 485 TLKEDLTKLKTLTVMfVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLiEETKRALksktdvEEKMYSVTKER 564
Cdd:pfam01576 630 EAREKETRALSLARA-LEEALEAKEELERTNKQLRAEMEDLVSSKDDVGKNVHEL-ERSKRAL------EQQVEEMKTQL 701
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 565 DDLKNKLKAEEEKgnDLLSRVNMlknrlQSLEAIEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKA 644
Cdd:pfam01576 702 EELEDELQATEDA--KLRLEVNM-----QALKAQFERDLQARDEQGEEKRRQLVKQVRELEAELEDERKQRAQAVAAKKK 774
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 645 IEDDLMKTEDEYETLERRYAN-----ERDKAQF--LSKELEHVKME----LAKYKLAEKTETSHEQWLFkRLQEEEAKSG 713
Cdd:pfam01576 775 LELDLKELEAQIDAANKGREEavkqlKKLQAQMkdLQRELEEARASrdeiLAQSKESEKKLKNLEAELL-QLQEDLAASE 853
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 714 HLSREVDALKEKIHEYMAT---------------EDLICHLQGDHSVLQKKLNQQENRNRDLGREIENLTKELERYRHFS 778
Cdd:pfam01576 854 RARRQAQQERDELADEIASgasgksalqdekrrlEARIAQLEEELEEEQSNTELLNDRLRKSTLQVEQLTTELAAERSTS 933
|
...
gi 109659847 779 KSL 781
Cdd:pfam01576 934 QKS 936
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
348-773 |
2.85e-06 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 51.31 E-value: 2.85e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 348 EEELQDIKEKISKGEYGNAGI-MAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMA 426
Cdd:COG4717 48 LERLEKEADELFKPQGRKPELnLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQL 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 427 LEKLEDafnkskqecyslkcnLEKERMTTKQLSQELESLKVRIKELEAIESRLEKteftLKEDLTKLKT-LTVMFVDERK 505
Cdd:COG4717 128 LPLYQE---------------LEALEAELAELPERLEELEERLEELRELEEELEE----LEAELAELQEeLEELLEQLSL 188
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 506 TMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLiEETKRALKSktdveekmYSVTKERDDLKNKLKAEEE--------- 576
Cdd:COG4717 189 ATEEELQDLAEELEELQQRLAELEEELEEAQEEL-EELEEELEQ--------LENELEAAALEERLKEARLllliaaall 259
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 577 ----KGNDLLSRVNMLKNRLQSLEAI------EKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIE 646
Cdd:COG4717 260 allgLGGSLLSLILTIAGVLFLVLGLlallflLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEEL 339
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 647 DDLMKTEDEYETLERRYANERDKAQFlsKELEHVKMELAKYKLAEkTETSHEQWL--FKRLQEEEAKSGHLSREVDALKE 724
Cdd:COG4717 340 LELLDRIEELQELLREAEELEEELQL--EELEQEIAALLAEAGVE-DEEELRAALeqAEEYQELKEELEELEEQLEELLG 416
|
410 420 430 440
....*....|....*....|....*....|....*....|....*....
gi 109659847 725 KIHEYMATEDLIcHLQGDHSVLQKKLNQQENRNRDLGREIENLTKELER 773
Cdd:COG4717 417 ELEELLEALDEE-ELEEELEELEEELEELEEELEELREELAELEAELEQ 464
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
280-711 |
4.00e-06 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 51.20 E-value: 4.00e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 280 RVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKL----AALSRQIDELEETNRSLRKAEEELQDIK 355
Cdd:TIGR00606 685 RVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLApgrqSIIDLKEKEIPELRNKLQKVNRDIQRLK 764
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 356 EKISKGEYGNAGIMAEvEELRKRVLDMEGKDEELikmEEQCRDLNKRLERET--LQSKDFKLEVEKLSKRIMALEKLEDA 433
Cdd:TIGR00606 765 NDIEEQETLLGTIMPE-EESAKVCLTDVTIMERF---QMELKDVERKIAQQAakLQGSDLDRTVQQVNQEKQEKQHELDT 840
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 434 FNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLtvmfVDERKTMSEKLKK 513
Cdd:TIGR00606 841 VVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIRE----IKDAKEQDSPLET 916
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 514 TEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKE-RDDLKNKLKAEEEKGNDLLSRVNMLKNRL 592
Cdd:TIGR00606 917 FLEKDQQEKEELISSKETSNKKAQDKVNDIKEKVKNIHGYMKDIENKIQDgKDDYLKQKETELNTVNAQLEECEKHQEKI 996
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 593 -QSLEAIEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIE--DDLMKTEDEYETLERRYANERDK 669
Cdd:TIGR00606 997 nEDMRLMRQDIDTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQMQVLQmkQEHQKLEENIDLIKRNHVLALGR 1076
|
410 420 430 440
....*....|....*....|....*....|....*....|..
gi 109659847 670 AQFLSKELEHVKMELAKYKLAEKTETSHEQWLFKRLQEEEAK 711
Cdd:TIGR00606 1077 QKGYEKEIKHFKKELREPQFRDAEEKYREMMIVMRTTELVNK 1118
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
282-771 |
4.44e-06 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 50.94 E-value: 4.44e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 282 QEEEQKATRLE----KELQTQTTKFHQDQDTIMAKLTNEDSQnrqLQQKLAALSRQIDELEETNRSL--RKAEEE--LQD 353
Cdd:pfam01576 3 QEEEMQAKEEElqkvKERQQKAESELKELEKKHQQLCEEKNA---LQEQLQAETELCAEAEEMRARLaaRKQELEeiLHE 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 354 IKEKISKGEYGNAGIMAEVEELRKRVLDMEGKDEElikmEEQCRDlNKRLERETLQSKdfkleVEKLSKRIMALEKLEDA 433
Cdd:pfam01576 80 LESRLEEEEERSQQLQNEKKKMQQHIQDLEEQLDE----EEAARQ-KLQLEKVTTEAK-----IKKLEEDILLLEDQNSK 149
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 434 FNKSK----QECYSLKCNLEKERMTTKQLSQ-------ELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVD 502
Cdd:pfam01576 150 LSKERklleERISEFTSNLAEEEEKAKSLSKlknkheaMISDLEERLKKEEKGRQELEKAKRKLEGESTDLQEQIAELQA 229
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 503 ERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVE-EKMY--SVTKERDDLKNKLKAEEEKGN 579
Cdd:pfam01576 230 QIAELRAQLAKKEEELQAALARLEEETAQKNNALKKIRELEAQISELQEDLEsERAArnKAEKQRRDLGEELEALKTELE 309
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 580 DLLSRVNMlKNRLQSLEAIEKDFLKNKLnQDSGKSTTALHQE-----NNKIKELSQEVE---RLKLKL-KDMKAIEDDLM 650
Cdd:pfam01576 310 DTLDTTAA-QQELRSKREQEVTELKKAL-EEETRSHEAQLQEmrqkhTQALEELTEQLEqakRNKANLeKAKQALESENA 387
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 651 KTEDEYETL-----ERRYANERDKAQFLSKELEHVKMELAKYKLAEKTE--TSHEQWLFKRLQEEEAKSGHLSREVDALK 723
Cdd:pfam01576 388 ELQAELRTLqqakqDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSklQSELESVSSLLNEAEGKNIKLSKDVSSLE 467
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847 724 EKIHEY-----------MATEDLICHLQGDHSVLQKKLNQQENRNRDLGREIENLTKEL 771
Cdd:pfam01576 468 SQLQDTqellqeetrqkLNLSTRLRQLEDERNSLQEQLEEEEEAKRNVERQLSTLQAQL 526
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
471-767 |
5.81e-06 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 50.84 E-value: 5.81e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 471 ELEAIESRLEKTEFTLKEDLTKLKTLTvmfvDERKTmSEKLKKTEDKLQAASSQLQVEQNKVTtvtEKLIEETKRALKSK 550
Cdd:TIGR02169 178 ELEEVEENIERLDLIIDEKRQQLERLR----REREK-AERYQALLKEKREYEGYELLKEKEAL---ERQKEAIERQLASL 249
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 551 TDVEEKmysVTKERDDLKNKLKAEEEKGNDLLSRVNmlknRLQSLEAIEkdfLKNKLnqdsGKSTTALHQENNKIKELSQ 630
Cdd:TIGR02169 250 EEELEK---LTEEISELEKRLEEIEQLLEELNKKIK----DLGEEEQLR---VKEKI----GELEAEIASLERSIAEKER 315
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 631 EVERLKLKLKDmkaIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKyklaektetsheqwLFKRLQEEEA 710
Cdd:TIGR02169 316 ELEDAEERLAK---LEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELED--------------LRAELEEVDK 378
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*..
gi 109659847 711 KSGHLSREVDALKEKIHEYmatEDLICHLQGDHSVLQKKLNQQENRNRDLGREIENL 767
Cdd:TIGR02169 379 EFAETRDELKDYREKLEKL---KREINELKRELDRLQEELQRLSEELADLNAAIAGI 432
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
205-774 |
6.03e-06 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 50.40 E-value: 6.03e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 205 KLIDQEIKSQEEKEQEKEKRVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEE 284
Cdd:TIGR04523 78 KILEQQIKDLNDKLKKNKDKINKLNSDLSKINSEIKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKL 157
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 285 EQKATRLEK---ELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAAlsrqIDELEETNRSLRKAEEELQDIKEKISKG 361
Cdd:TIGR04523 158 NNKYNDLKKqkeELENELNLLEKEKLNIQKNIDKIKNKLLKLELLLSN----LKKKIQKNKSLESQISELKKQNNQLKDN 233
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 362 eygnagIMAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQEC 441
Cdd:TIGR04523 234 ------IEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSEKQKELEQNNKKIKELEKQLNQLKSEISDLNNQKEQD 307
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 442 YS------LKCNLEKERMTTKQLSQELE---SLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLK 512
Cdd:TIGR04523 308 WNkelkseLKNQEKKLEEIQNQISQNNKiisQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIK 387
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 513 KTEDKLQAASSQLQveqnKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRL 592
Cdd:TIGR04523 388 NLESQINDLESKIQ----NQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTR 463
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 593 QSLEAIEKDFLK--NKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKa 670
Cdd:TIGR04523 464 ESLETQLKVLSRsiNKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISD- 542
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 671 qfLSKELEHVKMELAKYKLAEktetsheqwlfkrlqeeeaksghlsrEVDALKEKIHEymatedlichLQGDHSVLQKKL 750
Cdd:TIGR04523 543 --LEDELNKDDFELKKENLEK--------------------------EIDEKNKEIEE----------LKQTQKSLKKKQ 584
|
570 580
....*....|....*....|....
gi 109659847 751 NQQENRNRDLGREIENLTKELERY 774
Cdd:TIGR04523 585 EEKQELIDQKEKEKKDLIKEIEEK 608
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
312-531 |
7.64e-06 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 49.44 E-value: 7.64e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 312 KLTNEDSQNRQLQQKLAALSRQID----ELEETNRSLRKAEEELQDIKEKISKGEygnagimAEVEELRKRVldmegkDE 387
Cdd:COG3883 17 QIQAKQKELSELQAELEAAQAELDalqaELEELNEEYNELQAELEALQAEIDKLQ-------AEIAEAEAEI------EE 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 388 ELIKMEEQCRDLNKRLERET-----LQSKDFklevEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERmttKQLSQEL 462
Cdd:COG3883 84 RREELGERARALYRSGGSVSyldvlLGSESF----SDFLDRLSALSKIADADADLLEELKADKAELEAKK---AELEAKL 156
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847 463 ESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNK 531
Cdd:COG3883 157 AELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAA 225
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
356-685 |
8.01e-06 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 50.02 E-value: 8.01e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 356 EKISKGEYGNAGIMAEVEELR-----KRVLDMEGKDEELI----KMEEQCRDLNKRLE-RETLQSKDFKLEVE----KLS 421
Cdd:COG3206 71 SGLSSLSASDSPLETQIEILKsrpvlERVVDKLNLDEDPLgeeaSREAAIERLRKNLTvEPVKGSNVIEISYTspdpELA 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 422 KRImaLEKLEDAFNKskqecYSLKCNLEKERMTTKQLSQELESLKvriKELEAIESRLEktEFTLKEDLTKLKtltvmfv 501
Cdd:COG3206 151 AAV--ANALAEAYLE-----QNLELRREEARKALEFLEEQLPELR---KELEEAEAALE--EFRQKNGLVDLS------- 211
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 502 DERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLieETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDL 581
Cdd:COG3206 212 EEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQL--GSGPDALPELLQSPVIQQLRAQLAELEAELAELSARYTPN 289
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 582 LSRVNMLKNRLQSLEAiekdflknKLNQDSGKSTTALHQEnnkIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLER 661
Cdd:COG3206 290 HPDVIALRAQIAALRA--------QLQQEAQRILASLEAE---LEALQAREASLQAQLAQLEARLAELPELEAELRRLER 358
|
330 340
....*....|....*....|....
gi 109659847 662 RYANERDKAQFLSKELEHVKMELA 685
Cdd:COG3206 359 EVEVARELYESLLQRLEEARLAEA 382
|
|
| COG5022 |
COG5022 |
Myosin heavy chain [General function prediction only]; |
148-755 |
9.10e-06 |
|
Myosin heavy chain [General function prediction only];
Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 50.08 E-value: 9.10e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 148 EKHKESYR-RILGQLLVAEKSRRQTIL--ELEEEKRKHKEY-MEKSDEficLLEQECERLKKLIDQEIksqeeKEQEKEK 223
Cdd:COG5022 821 KLQKTIKReKKLRETEEVEFSLKAEVLiqKFGRSLKAKKRFsLLKKET---IYLQSAQRVELAERQLQ-----ELKIDVK 892
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 224 RVTTLKEELTKLKSFALMVV-DEQQRLTAQLTLQRQKIQELTTNAKETHTKLALA-EARVQEEEQKatrlekeLQTQTTK 301
Cdd:COG5022 893 SISSLKLVNLELESEIIELKkSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSiEYVKLPELNK-------LHEVESK 965
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 302 FHQDQDTIMAKLTNEDSQNRQLQ----------QKLAALSRQIDELEETNRSLRKAEEELQDIK--EKISKGEYGNAGIM 369
Cdd:COG5022 966 LKETSEEYEDLLKKSTILVREGNkanselknfkKELAELSKQYGALQESTKQLKELPVEVAELQsaSKIISSESTELSIL 1045
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 370 AEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQ-------SKDFKLEVEKLSKRIMALEKLEDAFNKSKQecy 442
Cdd:COG5022 1046 KPLQKLKGLLLLENNQLQARYKALKLRRENSLLDDKQLYQlestenlLKTINVKDLEVTNRNLVKPANVLQFIVAQM--- 1122
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 443 sLKCNLEKERMttkqlsqelESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAAS 522
Cdd:COG5022 1123 -IKLNLLQEIS---------KFLSQLVNTLEPVFQKLSVLQLELDGLFWEANLEALPSPPPFAALSEKRLYQSALYDEKS 1192
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 523 SQLQVEqnkVTTVTEKLIEETKR-------------ALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLK 589
Cdd:COG5022 1193 KLSSSE---VNDLKNELIALFSKifsgwprgdklkkLISEGWVPTEYSTSLKGFNNLNKKFDTPASMSNEKLLSLLNSID 1269
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 590 NRLQS-------LEAIEKDFLKN-----------KLNQDSGKSTTALHQ--ENNKIKELSQEVERLKLKLKDMKAIEDDL 649
Cdd:COG5022 1270 NLLSSykleeevLPATINSLLQYinvglfnalrtKASSLRWKSATEVNYnsEELDDWCREFEISDVDEELEELIQAVKVL 1349
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 650 MKTEDEYETLERRYanerdKAQFLSKELEHVKMeLAKYKLAEKtETSHEQWLFKRLQEEEAKS--GHLSREVDALKEKIH 727
Cdd:COG5022 1350 QLLKDDLNKLDELL-----DACYSLNPAEIQNL-KSRYDPADK-ENNLPKEILKKIEALLIKQelQLSLEGKDETEVHLS 1422
|
650 660
....*....|....*....|....*...
gi 109659847 728 EYMATEDLICHLQGDHSVLQKKLNQQEN 755
Cdd:COG5022 1423 EIFSEEKSLISLDRNSIYKEEVLSSLSA 1450
|
|
| COG2433 |
COG2433 |
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only]; |
284-434 |
9.26e-06 |
|
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain] Cd Length: 644 Bit Score: 49.86 E-value: 9.26e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 284 EEQKATRLEKELQT-QTTKFHQDQDtimakLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGE 362
Cdd:COG2433 383 EELIEKELPEEEPEaEREKEHEERE-----LTEEEEEIRRLEEQVERLEAEVEELEAELEEKDERIERLERELSEARSEE 457
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847 363 YGNAGIMAEVEELRKRVLDMEGkdeELIKMEEQCRDLNKRLER-ETLQSKDFK------LEVEKLSKRimALEKLEDAF 434
Cdd:COG2433 458 RREIRKDREISRLDREIERLER---ELEEERERIEELKRKLERlKELWKLEHSgelvpvKVVEKFTKE--AIRRLEEEY 531
|
|
| EzrA |
pfam06160 |
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ... |
310-662 |
1.41e-05 |
|
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.
Pssm-ID: 428797 [Multi-domain] Cd Length: 542 Bit Score: 49.08 E-value: 1.41e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 310 MAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISK------GEYGNAgimaeVEELRKRVLDME 383
Cdd:pfam06160 85 KKALDEIEELLDDIEEDIKQILEELDELLESEEKNREEVEELKDKYRELRKtllanrFSYGPA-----IDELEKQLAEIE 159
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 384 GKDEEL---------IKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRI-MALEKLEDAFNKSKQECYSLK-CNLEKER 452
Cdd:pfam06160 160 EEFSQFeeltesgdyLEAREVLEKLEEETDALEELMEDIPPLYEELKTELpDQLEELKEGYREMEEEGYALEhLNVDKEI 239
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 453 mttKQLSQELESLKVRIKELEaiesrLEKTEFTLKEDLTKLKTLTVMF---VDERKTMSEKLKKTEDKLQAASSQLQVEQ 529
Cdd:pfam06160 240 ---QQLEEQLEENLALLENLE-----LDEAEEALEEIEERIDQLYDLLekeVDAKKYVEKNLPEIEDYLEHAEEQNKELK 311
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 530 NKVTTVTEKLI---EETKRALKSKTDVE--EKMYSVTKERDD--------LKNKLKAEEEKGNDLLSRVNMLKNRLQSLE 596
Cdd:pfam06160 312 EELERVQQSYTlneNELERVRGLEKQLEelEKRYDEIVERLEekevayseLQEELEEILEQLEEIEEEQEEFKESLQSLR 391
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 597 AIEKDFLKNKLNQDSGKSTTALHQEN------------------NKIKELSQEVERLKLklkDMKAIEDDLMKTEDEYET 658
Cdd:pfam06160 392 KDELEAREKLDEFKLELREIKRLVEKsnlpglpesyldyffdvsDEIEDLADELNEVPL---NMDEVNRLLDEAQDDVDT 468
|
....
gi 109659847 659 LERR 662
Cdd:pfam06160 469 LYEK 472
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
160-723 |
1.44e-05 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 49.66 E-value: 1.44e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 160 QLLVAEKSRRQTILELEEEKRKHKEyMEKSDEFICLLEQECERLKKLIDQEIKSQEEKEQEKEKRVTTLKEELTKLKSFA 239
Cdd:TIGR00606 232 QLESSREIVKSYENELDPLKNRLKE-IEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLNDLYHNH 310
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 240 LMVVDEQ-------QRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQT--------------- 297
Cdd:TIGR00606 311 QRTVREKerelvdcQRELEKLNKERRLLNQEKTELLVEQGRLQLQADRHQEHIRARDSLIQSLATrleldgfergpfser 390
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 298 QTTKFHQDQDTIMAKLTNEDSQN-RQLQQKLAALSRQIDELEETNRS-----------LRKAEEELQDIKEKISKGEYGN 365
Cdd:TIGR00606 391 QIKNFHTLVIERQEDEAKTAAQLcADLQSKERLKQEQADEIRDEKKGlgrtielkkeiLEKKQEELKFVIKELQQLEGSS 470
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 366 AGIMAEVEELRKRVLDM---------EGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAF-- 434
Cdd:TIGR00606 471 DRILELDQELRKAERELskaeknsltETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDeq 550
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 435 ---NKSKQECY------------SLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKL--KTLT 497
Cdd:TIGR00606 551 irkIKSRHSDEltsllgyfpnkkQLEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYedKLFD 630
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 498 VMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETK-------RALKSK-------TDVEEKMYSVTKE 563
Cdd:TIGR00606 631 VCGSQDEESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQLTDENQsccpvcqRVFQTEaelqefiSDLQSKLRLAPDK 710
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 564 RDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAiEKDFLKNKLNqdsgKSTTALHQENNKIKELSQEVERLKLKLKDMK 643
Cdd:TIGR00606 711 LKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEK-EIPELRNKLQ----KVNRDIQRLKNDIEEQETLLGTIMPEEESAK 785
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 644 AIEDDLMKTEDEYETL---ERRYANERDKAQF--LSKELEHVKMELAKYKLAEKTETSHEQWLFKRLQEEEAKSGHLSRE 718
Cdd:TIGR00606 786 VCLTDVTIMERFQMELkdvERKIAQQAAKLQGsdLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSK 865
|
....*
gi 109659847 719 VDALK 723
Cdd:TIGR00606 866 TNELK 870
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
144-353 |
1.45e-05 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 49.53 E-value: 1.45e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 144 DKVVEkHKESYRRILGQLLVAEKSRR---------QTILELEEEKRKHKEYMEKSD-----EFICLLEQECERLkkliDQ 209
Cdd:COG4913 228 DALVE-HFDDLERAHEALEDAREQIEllepirelaERYAAARERLAELEYLRAALRlwfaqRRLELLEAELEEL----RA 302
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 210 EIKSQEEKEQEKEKRVTTLKEELTKLKSfALMVVDEQQ--RLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQK 287
Cdd:COG4913 303 ELARLEAELERLEARLDALREELDELEA-QIRGNGGDRleQLEREIERLERELEERERRRARLEALLAALGLPLPASAEE 381
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 109659847 288 ATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQD 353
Cdd:COG4913 382 FAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRELEAEIASLERRKSNIPARLLALRD 447
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
273-512 |
1.86e-05 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 48.86 E-value: 1.86e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 273 KLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQ-QKLAALSRQID-ELEETNRSLRKAEEE 350
Cdd:PHA02562 170 KLNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKyDELVEEAKTIKaEIEELTDELLNLVMD 249
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 351 LQDIKEKISKGEYGNAGIMAEVEELRKrVLDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKL 430
Cdd:PHA02562 250 IEDPSAALNKLNTAAAKIKSKIEQFQK-VIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEI 328
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 431 EDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAiESRLEKTEF-TLKEDLTKL---KTLTVMFVDERKT 506
Cdd:PHA02562 329 MDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQA-EFVDNAEELaKLQDELDKIvktKSELVKEKYHRGI 407
|
....*.
gi 109659847 507 MSEKLK 512
Cdd:PHA02562 408 VTDLLK 413
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
140-343 |
1.96e-05 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 48.22 E-value: 1.96e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 140 MNELDKVVEKHKESYRRILGQLLVAEK---SRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLIDQEIKSQEE 216
Cdd:COG4942 36 IAELEKELAALKKEEKALLKQLAALERriaALARRIRALEQELAALEAELAELEKEIAELRAELEAQKEELAELLRALYR 115
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 217 KEQEKEKRVTTLKEELTKL-KSFALM--VVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEK 293
Cdd:COG4942 116 LGRQPPLALLLSPEDFLDAvRRLQYLkyLAPARREQAEELRADLAELAALRAELEAERAELEALLAELEEERAALEALKA 195
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 109659847 294 ELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRS 343
Cdd:COG4942 196 ERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPA 245
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
242-884 |
2.82e-05 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 48.50 E-value: 2.82e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 242 VVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLE----------KELQTQTTKFHQDQDTIMA 311
Cdd:TIGR00606 194 VRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLESSREIVKSYEneldplknrlKEIEHNLSKIMKLDNEIKA 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 312 KLTNE---DSQNRQLQQKLA----ALSRQIDELEETN-RSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRV---- 379
Cdd:TIGR00606 274 LKSRKkqmEKDNSELELKMEkvfqGTDEQLNDLYHNHqRTVREKERELVDCQRELEKLNKERRLLNQEKTELLVEQgrlq 353
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 380 LDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFkLEVEKLSKRIMALEKLEDAFNKSKQECYSLKcnlEKERMTTKQLS 459
Cdd:TIGR00606 354 LQADRHQEHIRARDSLIQSLATRLELDGFERGPF-SERQIKNFHTLVIERQEDEAKTAAQLCADLQ---SKERLKQEQAD 429
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 460 QELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMfVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKL 539
Cdd:TIGR00606 430 EIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGS-SDRILELDQELRKAERELSKAEKNSLTETLKKEVKSLQN 508
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 540 ----IEETKRALKSK-------TDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRV-------NMLKNRLQSLEA---I 598
Cdd:TIGR00606 509 ekadLDRKLRKLDQEmeqlnhhTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLlgyfpnkKQLEDWLHSKSKeinQ 588
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 599 EKDFLKnKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDM---KAIEDDLMKTEDEYETLERRYANERDKAQFLSK 675
Cdd:TIGR00606 589 TRDRLA-KLNKELASLEQNKNHINNELESKEEQLSSYEDKLFDVcgsQDEESDLERLKEEIEKSSKQRAMLAGATAVYSQ 667
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 676 ELEHVKME------LAKYKLAEKTETsheQWLFKRLQEEEAKSGHLSREVDALKEKIHEymATEDLICHLQGDHSVLQ-- 747
Cdd:TIGR00606 668 FITQLTDEnqsccpVCQRVFQTEAEL---QEFISDLQSKLRLAPDKLKSTESELKKKEK--RRDEMLGLAPGRQSIIDlk 742
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 748 -KKLNQQENRNRDLGREIENLTKELERYRHFSKSLRPSLNGRRISDPQVFSKEVQTEAVDNEPPDYKSLIPLERAVINGQ 826
Cdd:TIGR00606 743 eKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQGSDLDR 822
|
650 660 670 680 690
....*....|....*....|....*....|....*....|....*....|....*...
gi 109659847 827 LYEESENQDEDPNDEGSVLSFKCSQSTPCPVNRKLWIPWMKSKEGHLQNGKMQTKPNA 884
Cdd:TIGR00606 823 TVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNL 880
|
|
| PTZ00440 |
PTZ00440 |
reticulocyte binding protein 2-like protein; Provisional |
197-770 |
2.98e-05 |
|
reticulocyte binding protein 2-like protein; Provisional
Pssm-ID: 240419 [Multi-domain] Cd Length: 2722 Bit Score: 48.67 E-value: 2.98e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 197 EQECERLKKL----IDQEIKSqeekeqekekrvttLKEELTKLKSFALMVVDEQqrltaqLTLQRQKIQELTTNAKETHT 272
Cdd:PTZ00440 672 KNEYEKLEFMksdnIDNIIKN--------------LKKELQNLLSLKENIIKKQ------LNNIEQDISNSLNQYTIKYN 731
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 273 KLALAEARVQEEEQKATRLEKELQTQTTKF-----HQDQDTIMAKLTNED--SQNRQLQQKLAALSRQIDELEEtnrSLR 345
Cdd:PTZ00440 732 DLKSSIEEYKEEEEKLEVYKHQIINRKNEFilhlyENDKDLPDGKNTYEEflQYKDTILNKENKISNDINILKE---NKK 808
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 346 KAEEELQDIKEKISKGEYGNAGIMAEVEELRKRvLDMEGKDEELIKMEEQCRDLNKRLErETLQskdfklEVEKLSKRIM 425
Cdd:PTZ00440 809 NNQDLLNSYNILIQKLEAHTEKNDEELKQLLQK-FPTEDENLNLKELEKEFNENNQIVD-NIIK------DIENMNKNIN 880
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 426 ALEKLEDAFNKS---KQECYSLKCNLEKERMTTKQLSQELES------------LKVRIKELEAIESRLEKTEFT----- 485
Cdd:PTZ00440 881 IIKTLNIAINRSnsnKQLVEHLLNNKIDLKNKLEQHMKIINTdniiqkneklnlLNNLNKEKEKIEKQLSDTKINnlkmq 960
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 486 LKEDLTKLKTLTVMFVDERKTMSEKLKKTED-------KLQAASSQLQVEQNKVTTVT----EKLIEETKRALKSK-TDV 553
Cdd:PTZ00440 961 IEKTLEYYDKSKENINGNDGTHLEKLDKEKDewehfksEIDKLNVNYNILNKKIDDLIkkqhDDIIELIDKLIKEKgKEI 1040
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 554 EEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNklnqdsgksttalhqennkIKELSQEVE 633
Cdd:PTZ00440 1041 EEKVDQYISLLEKMKTKLSSFHFNIDIKKYKNPKIKEEIKLLEEKVEALLKK-------------------IDENKNKLI 1101
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 634 RLKLKLKDMKAIEDDLM-KTEDEYET----LERRYANERDkaqfLSKELEHVKMELAKYKLAEKTETSHEQWLF----KR 704
Cdd:PTZ00440 1102 EIKNKSHEHVVNADKEKnKQTEHYNKkkksLEKIYKQMEK----TLKELENMNLEDITLNEVNEIEIEYERILIdhivEQ 1177
|
570 580 590 600 610 620
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 109659847 705 LQEEEAKSGHLSREVDALKEKIHEYMAteDLICHLQGDHSVLqkKLNQQENRNRDLGREIENLTKE 770
Cdd:PTZ00440 1178 INNEAKKSKTIMEEIESYKKDIDQVKK--NMSKERNDHLTTF--EYNAYYDKATASYENIEELTTE 1239
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
247-409 |
3.77e-05 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 46.46 E-value: 3.77e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 247 QRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNrqLQQK 326
Cdd:COG1579 20 DRLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLGNVRNNKEYEA--LQKE 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 327 LAALSRQIDELEEtnrSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRV-LDMEGKDEELIKMEEQCRDLNKRLER 405
Cdd:COG1579 98 IESLKRRISDLED---EILELMERIEELEEELAELEAELAELEAELEEKKAELdEELAELEAELEELEAEREELAAKIPP 174
|
....
gi 109659847 406 ETLQ 409
Cdd:COG1579 175 ELLA 178
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
592-775 |
4.68e-05 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 47.45 E-value: 4.68e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 592 LQSLEAIEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKAQ 671
Cdd:COG4717 40 LAFIRAMLLERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELE 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 672 FLSKELEHVKMELAKYKLAEKTETSHEQWlfKRLQEEEAKSGHLSREVDALKEKIHEY------------MATEDLICHL 739
Cdd:COG4717 120 KLEKLLQLLPLYQELEALEAELAELPERL--EELEERLEELRELEEELEELEAELAELqeeleelleqlsLATEEELQDL 197
|
170 180 190
....*....|....*....|....*....|....*.
gi 109659847 740 QGDHSVLQKKLNQQENRNRDLGREIENLTKELERYR 775
Cdd:COG4717 198 AEELEELQQRLAELEEELEEAQEELEELEEELEQLE 233
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
456-712 |
5.68e-05 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 46.68 E-value: 5.68e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 456 KQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLktltvmfvderktmSEKLKKTEDKLQAASSQLQVEQNKVTTV 535
Cdd:COG4942 23 AEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAAL--------------ERRIAALARRIRALEQELAALEAELAEL 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 536 TEKlIEETKRALKsktdveekmysvtKERDDLKNKLKAEEEKGNdlLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKST 615
Cdd:COG4942 89 EKE-IAELRAELE-------------AQKEELAELLRALYRLGR--QPPLALLLSPEDFLDAVRRLQYLKYLAPARREQA 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 616 TALHQENNKIKELSQEVERLKLKLKDMKAieddlmKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKYKLAEKTET 695
Cdd:COG4942 153 EELRADLAELAALRAELEAERAELEALLA------ELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELE 226
|
250
....*....|....*..
gi 109659847 696 SheqwLFKRLQEEEAKS 712
Cdd:COG4942 227 A----LIARLEAEAAAA 239
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
426-782 |
7.58e-05 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 47.27 E-value: 7.58e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 426 ALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERK 505
Cdd:pfam02463 174 ALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEI 253
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 506 TMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKtdvEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRV 585
Cdd:pfam02463 254 ESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEEL---KSELLKLERRKVDDEEKLKESEKEKKKAEKEL 330
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 586 NMLKNRLQSLEAIEKDFLKNKLNQDSGKSTTALHQENNKIKElsqevERLKLKLKDMKAIEDDLMKTEDEYETLERRYAN 665
Cdd:pfam02463 331 KKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLE-----EELLAKKKLESERLSSAAKLKEEELELKSEEEK 405
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 666 ERDKAQFLSKELEHVKMELAKYKLAEKTETSHEQWLFKR--------LQEEEAKSGHLSREVDALKEKIHEYMATEDLIC 737
Cdd:pfam02463 406 EAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGklteekeeLEKQELKLLKDELELKKSEDLLKETQLVKLQEQ 485
|
330 340 350 360
....*....|....*....|....*....|....*....|....*
gi 109659847 738 HLQGDHSVLQKKLNQQENRNRDLGREIENLTKELERYRHFSKSLR 782
Cdd:pfam02463 486 LELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGGRIISAHGR 530
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
224-355 |
9.09e-05 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 46.55 E-value: 9.09e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 224 RVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKAT---------RLEKE 294
Cdd:COG3206 220 QLSELESQLAEARAELAEAEARLAALRAQLGSGPDALPELLQSPVIQQLRAQLAELEAELAELSARytpnhpdviALRAQ 299
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 109659847 295 LQTQTTKFHQDQDTIMAKLtneDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIK 355
Cdd:COG3206 300 IAALRAQLQQEAQRILASL---EAELEALQAREASLQAQLAQLEARLAELPELEAELRRLE 357
|
|
| COG5283 |
COG5283 |
Phage-related tail protein [Mobilome: prophages, transposons]; |
237-403 |
9.52e-05 |
|
Phage-related tail protein [Mobilome: prophages, transposons];
Pssm-ID: 444094 [Multi-domain] Cd Length: 747 Bit Score: 46.77 E-value: 9.52e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 237 SFALMVVDeqQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMA---KL 313
Cdd:COG5283 2 QVILGAVD--KPFKSALESAKQRVAALAQALKALEAPTRALARALERAKQAAARLQTKYNKLRQSLQRLRQALDQagiDT 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 314 TNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISK-GEYGNAGI-------------------MAEVe 373
Cdd:COG5283 80 RQLSAAQRRLRSSLEQTNRQLERQQQRLARLGARQDRLKAARARLQRlAGAGAAAAaigaalaasvkpaidfedaMADV- 158
|
170 180 190
....*....|....*....|....*....|
gi 109659847 374 elrKRVLDMEGKDEELIKMEEQCRDLNKRL 403
Cdd:COG5283 159 ---AATVDLDKSSEQFKALGKQARELSAQT 185
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
383-586 |
1.29e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 46.45 E-value: 1.29e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 383 EGKDEELIKMEEQCRDLNKRLEretlqskDFKLEVEKLSKRIMALEKLEDAFN-----KSKQECYSlkcNLEKERMTTKQ 457
Cdd:COG4913 613 AALEAELAELEEELAEAEERLE-------ALEAELDALQERREALQRLAEYSWdeidvASAEREIA---ELEAELERLDA 682
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 458 LSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKtltvmfvDERKTMSEKLKKTEDKLQAASSQLQVEQnkvttvtE 537
Cdd:COG4913 683 SSDDLAALEEQLEELEAELEELEEELDELKGEIGRLE-------KELEQAEEELDELQDRLEAAEDLARLEL-------R 748
|
170 180 190 200
....*....|....*....|....*....|....*....|....*....
gi 109659847 538 KLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVN 586
Cdd:COG4913 749 ALLEERFAAALGDAVERELRENLEERIDALRARLNRAEEELERAMRAFN 797
|
|
| PRK06975 |
PRK06975 |
bifunctional uroporphyrinogen-III synthetase/uroporphyrin-III C-methyltransferase; Reviewed |
246-352 |
1.30e-04 |
|
bifunctional uroporphyrinogen-III synthetase/uroporphyrin-III C-methyltransferase; Reviewed
Pssm-ID: 235899 [Multi-domain] Cd Length: 656 Bit Score: 46.25 E-value: 1.30e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 246 QQRL--TAQLTLQRQkiqelttNAKETHTklalAEARVQEEEqkatrlekeLQTQTTKFHQDQDTIMAKLTNEDSQNRQL 323
Cdd:PRK06975 345 NRKVdrLDQELVQRQ-------QANDAQT----AELRVKTEQ---------AQASVHQLDSQFAQLDGKLADAQSAQQAL 404
|
90 100 110
....*....|....*....|....*....|.
gi 109659847 324 QQKLAALSRQID--ELEETNRSLRKAEEELQ 352
Cdd:PRK06975 405 EQQYQDLSRNRDdwMIAEVEQMLSSASQQLQ 435
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
344-493 |
1.41e-04 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 45.92 E-value: 1.41e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 344 LRKAEEELQDIKEKISKgeygnagimaEVEELRKRVLdMEGKDEELIKMEEQCRDLNKRleRETLQSKDFKLE--VEKLS 421
Cdd:PRK12704 33 IKEAEEEAKRILEEAKK----------EAEAIKKEAL-LEAKEEIHKLRNEFEKELRER--RNELQKLEKRLLqkEENLD 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 422 KRIMALEKLEDAFNKSKQECYSLKCNLEK-----ERMTTKQLsQELEslkvRIKEL---EAIESRLEKTEFTLKEDLTKL 493
Cdd:PRK12704 100 RKLELLEKREEELEKKEKELEQKQQELEKkeeelEELIEEQL-QELE----RISGLtaeEAKEILLEKVEEEARHEAAVL 174
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
198-401 |
1.60e-04 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 45.78 E-value: 1.60e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 198 QECERLKKLIDQEIKS----QEEKEQEKEKRVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTK 273
Cdd:PHA02562 184 QTLDMKIDHIQQQIKTynknIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTA 263
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 274 LALAEARVQEEEQKATRLEK--ELQTQTTKFHQDQDtIMAKLTNedsQNRQLQQKLAALSRQIDELEETNRSLRKAEEEL 351
Cdd:PHA02562 264 AAKIKSKIEQFQKVIKMYEKggVCPTCTQQISEGPD-RITKIKD---KLKELQHSLEKLDTAIDELEEIMDEFNEQSKKL 339
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*..
gi 109659847 352 QDIKEKISKGEYG-------NAGIMAEVEELRKRVLDmegKDEELIKMEEQCRDLNK 401
Cdd:PHA02562 340 LELKNKISTNKQSlitlvdkAKKVKAAIEELQAEFVD---NAEELAKLQDELDKIVK 393
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
329-785 |
1.82e-04 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 45.81 E-value: 1.82e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 329 ALSRQIDELEETNRSLrKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETL 408
Cdd:TIGR00606 170 ALKQKFDEIFSATRYI-KALETLRQVRQTQGQKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLESSREIVKSYENELD 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 409 QSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQEleslkvrikELEAIESRLEKTEFTLKE 488
Cdd:TIGR00606 249 PLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDE---------QLNDLYHNHQRTVREKER 319
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 489 DLTKLKTLTVMFVDERKTMSEklKKTEDKLQAASSQLQVEQNKVTTVTEKLiEETKRALKSKTDVEEKMYSVTKErddLK 568
Cdd:TIGR00606 320 ELVDCQRELEKLNKERRLLNQ--EKTELLVEQGRLQLQADRHQEHIRARDS-LIQSLATRLELDGFERGPFSERQ---IK 393
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 569 NKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLNQDSGKSTT------ALHQENNKIKELSQEVERLKLKLKDM 642
Cdd:TIGR00606 394 NFHTLVIERQEDEAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTielkkeILEKKQEELKFVIKELQQLEGSSDRI 473
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 643 KAIEDDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKYKLAEKTE------TSHEQWLF---KRLQEEEAKSG 713
Cdd:TIGR00606 474 LELDQELRKAERELSKAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEqlnhhtTTRTQMEMltkDKMDKDEQIRK 553
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 109659847 714 HLSREVDALKEKIHEYMATEDLICHLQGdhsvLQKKLNQQENRNRDLGREIENLTKELERYRHFSKSLRPSL 785
Cdd:TIGR00606 554 IKSRHSDELTSLLGYFPNKKQLEDWLHS----KSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQL 621
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
256-427 |
1.91e-04 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 45.54 E-value: 1.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 256 QRQKIQELTTNAKETHTKLALAEARvQEEEQKATRLEKE-------LQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLA 328
Cdd:PRK12704 39 EAKRILEEAKKEAEAIKKEALLEAK-EEIHKLRNEFEKElrerrneLQKLEKRLLQKEENLDRKLELLEKREEELEKKEK 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 329 ALSRQIDELEETNRSLRKAEEELQDIKEKISkgeygnaGIMAevEELRKRVLDmegkdeeliKMEEQCRD----LNKRLE 404
Cdd:PRK12704 118 ELEQKQQELEKKEEELEELIEEQLQELERIS-------GLTA--EEAKEILLE---------KVEEEARHeaavLIKEIE 179
|
170 180
....*....|....*....|...
gi 109659847 405 RETlqskdfKLEVEKLSKRIMAL 427
Cdd:PRK12704 180 EEA------KEEADKKAKEILAQ 196
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
166-792 |
1.93e-04 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 45.66 E-value: 1.93e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 166 KSRRQTILELEEEKRKHKEYMEKSDEFICLLEQEcERLKKLIDQEIKSQEEKEQEKEKRVTTLKEELTKLKSFAlmvvDE 245
Cdd:PRK01156 176 DMLRAEISNIDYLEEKLKSSNLELENIKKQIADD-EKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLE----DM 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 246 QQRLTAQLtlqrqkiqelttnaKETHTKLALAEARVQEEEQKATRLEKelqTQTTKFHQDQDTIMAKLTNEdSQNRQLQQ 325
Cdd:PRK01156 251 KNRYESEI--------------KTAESDLSMELEKNNYYKELEERHMK---IINDPVYKNRNYINDYFKYK-NDIENKKQ 312
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 326 KLAALSRQIDELEETNRSLRKAEEELQDIKEKISkgeygnagimaEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLER 405
Cdd:PRK01156 313 ILSNIDAEINKYHAIIKKLSVLQKDYNDYIKKKS-----------RYDDLNNQILELEGYEMDYNSYLKSIESLKKKIEE 381
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 406 ETLQSKDFKLEVEKLSKRIMALEKledafnkskqecySLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTE-- 483
Cdd:PRK01156 382 YSKNIERMSAFISEILKIQEIDPD-------------AIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRNMem 448
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 484 -----------FTLKEDltKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQlQVEQNKVTTVTEKliEETKRALKSKTD 552
Cdd:PRK01156 449 lngqsvcpvcgTTLGEE--KSNHIINHYNEKKSRLEEKIREIEIEVKDIDEK-IVDLKKRKEYLES--EEINKSINEYNK 523
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 553 VEEKMYSVTKERDDLkNKLKAEEEKGNDLLSRVNMLKnrLQSLEAIEKDFLkNKLNQDSGKSTTALHQE----NNKIKEL 628
Cdd:PRK01156 524 IESARADLEDIKIKI-NELKDKHDKYEEIKNRYKSLK--LEDLDSKRTSWL-NALAVISLIDIETNRSRsneiKKQLNDL 599
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 629 SQEVERLKLKLKDMKA-IEDDLMKTEDEYETLERRYANERDKA---QFLSKELEHVKMELAKYKLAEKTETSheqwLFKR 704
Cdd:PRK01156 600 ESRLQEIEIGFPDDKSyIDKSIREIENEANNLNNKYNEIQENKiliEKLRGKIDNYKKQIAEIDSIIPDLKE----ITSR 675
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 705 LQEEEAKSGHLSREVDALKEKIHEYMATedlichlqgdHSVLQKKLNQQENRNRDLGREIENLTK------ELERYRH-F 777
Cdd:PRK01156 676 INDIEDNLKKSRKALDDAKANRARLEST----------IEILRTRINELSDRINDINETLESMKKikkaigDLKRLREaF 745
|
650
....*....|....*
gi 109659847 778 SKSLRPSLNGRRISD 792
Cdd:PRK01156 746 DKSGVPAMIRKSASQ 760
|
|
| CCCAP |
pfam15964 |
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in ... |
224-607 |
2.09e-04 |
|
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in eukaryotes. CCCAP is also known as SDCCAG8, serologically defined colon cancer antigen 8. It is associated with the centrosome.
Pssm-ID: 435040 [Multi-domain] Cd Length: 703 Bit Score: 45.28 E-value: 2.09e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 224 RVTTLKEEL-TKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKF 302
Cdd:pfam15964 304 RLTKERDDLmSALVSVRSSLAEAQQRESSAYEQVKQAVQMTEEANFEKTKALIQCEQLKSELERQKERLEKELASQQEKR 383
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 303 HQDQDTIMAKLTNEDSQNRQ----LQQKLAALSRQIDELEETNRSLRKAEEELQ---------------DIKEKISKGEY 363
Cdd:pfam15964 384 AQEKEALRKEMKKEREELGAtmlaLSQNVAQLEAQVEKVTREKNSLVSQLEEAQkqlasqemdvtkvcgEMRYQLNQTKM 463
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 364 GNAGIMAEVEELRKRVL-DMEGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKrimALEKLEDAFNKSKQECY 442
Cdd:pfam15964 464 KKDEAEKEHREYRTKTGrQLEIKDQEIEKLGLELSESKQRLEQAQQDAARAREECLKLTE---LLGESEHQLHLTRLEKE 540
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 443 SLKCNLEKE-RMTTKQLSQELESLKVRIKELEAiesRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAA 521
Cdd:pfam15964 541 SIQQSFSNEaKAQALQAQQREQELTQKMQQMEA---QHDKTVNEQYSLLTSQNTFIAKLKEECCTLAKKLEEITQKSRSE 617
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 522 SSQLQVEQNKVTTVTEKLIEETKRaLKSKTDVEEKMYSVTKER-DDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLeAIEK 600
Cdd:pfam15964 618 VEQLSQEKEYLQDRLEKLQKRNEE-LEEQCVQHGRMHERMKQRlRQLDKHCQATAQQLVQLLSKQNQLFKERQNL-TEEV 695
|
....*..
gi 109659847 601 DFLKNKL 607
Cdd:pfam15964 696 QSLRSQV 702
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
224-409 |
2.40e-04 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 45.39 E-value: 2.40e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 224 RVTTLKEEL----TKLKSFA----LMVVDEQ-QRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLE-- 292
Cdd:COG3206 183 QLPELRKELeeaeAALEEFRqkngLVDLSEEaKLLLQQLSELESQLAEARAELAEAEARLAALRAQLGSGPDALPELLqs 262
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 293 ---KELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDelEETNRSLRKAEEELQDIKEKIskgeygnAGIM 369
Cdd:COG3206 263 pviQQLRAQLAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQ--QEAQRILASLEAELEALQARE-------ASLQ 333
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 109659847 370 AEVEELRKRVLDMEGKDEELIKMEEQcRDLNKRLERETLQ 409
Cdd:COG3206 334 AQLAQLEARLAELPELEAELRRLERE-VEVARELYESLLQ 372
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
63-409 |
3.47e-04 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 44.93 E-value: 3.47e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 63 QAEDLSRDDLLFLLSILEGELQARDEVIGILKAEKMDLALLEAQY----GFVTPKKVLEALQRDAFQAKSTPWQEDIYEK 138
Cdd:COG1196 459 EALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYegflEGVKAALLLAGLRGLAGAVAVLIGVEAAYEA 538
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 139 PMNE------LDKVVEKHKESYRRIlgQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEFI--------CLLEQECERLK 204
Cdd:COG1196 539 ALEAalaaalQNIVVEDDEVAAAAI--EYLKAAKAGRATFLPLDKIRARAALAAALARGAIgaavdlvaSDLREADARYY 616
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 205 KLIDQE---------IKSQEEKEQEKEKRVTTLKEELTKLKSFALMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLA 275
Cdd:COG1196 617 VLGDTLlgrtlvaarLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELE 696
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 276 LAEARVQEEEQKATRLEKELQTQTtkfHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIK 355
Cdd:COG1196 697 EALLAEEEEERELAEAEEERLEEE---LEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLE 773
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....
gi 109659847 356 EKISKGEYGNAGIMAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQ 409
Cdd:COG1196 774 REIEALGPVNLLAIEEYEELEERYDFLSEQREDLEEARETLEEAIEEIDRETRE 827
|
|
| COG2433 |
COG2433 |
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only]; |
328-496 |
3.54e-04 |
|
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain] Cd Length: 644 Bit Score: 44.85 E-value: 3.54e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 328 AALSRQI-----DELEETNRSLRKAEEELQDIKEKISKGEygnagimAEVEELRKRVLDMEG----KDEELIKMEEQCRD 398
Cdd:COG2433 380 EALEELIekelpEEEPEAEREKEHEERELTEEEEEIRRLE-------EQVERLEAEVEELEAeleeKDERIERLERELSE 452
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 399 LNKRLERETLQSKdfklEVEKLSKRIMALEKledafnkskqecyslkcnlekermttkqlsqELESLKVRIKELEAIESR 478
Cdd:COG2433 453 ARSEERREIRKDR----EISRLDREIERLER-------------------------------ELEEERERIEELKRKLER 497
|
170
....*....|....*....
gi 109659847 479 LEKTEFTL-KEDLTKLKTL 496
Cdd:COG2433 498 LKELWKLEhSGELVPVKVV 516
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
232-440 |
5.95e-04 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 43.99 E-value: 5.95e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 232 LTKLKSFALMVVDEQQRLTAQLTLQRQKIQEL-------TTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQ 304
Cdd:COG4717 293 LAREKASLGKEAEELQALPALEELEEEELEELlaalglpPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEI 372
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 305 DQDTIMAKLTNEDS---------QNRQLQQKLAALSRQIDELEETNRSLRKA------EEELQDIKEKISKGEygnagim 369
Cdd:COG4717 373 AALLAEAGVEDEEElraaleqaeEYQELKEELEELEEQLEELLGELEELLEAldeeelEEELEELEEELEELE------- 445
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 109659847 370 AEVEELRKRVLDMEgkdEELIKMEEQCRDLNKRLERETLQSKdFKLEVEKLSKRIMALEKLEDAFNKSKQE 440
Cdd:COG4717 446 EELEELREELAELE---AELEQLEEDGELAELLQELEELKAE-LRELAEEWAALKLALELLEEAREEYREE 512
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
132-360 |
6.17e-04 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 43.85 E-value: 6.17e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 132 QEDIYEKPMNELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLIDQei 211
Cdd:PHA02562 196 QIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQ-- 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 212 ksqeekeqekekrvttlkeeLTKLKSFaLMVVDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKatrl 291
Cdd:PHA02562 274 --------------------FQKVIKM-YEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEI---- 328
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 109659847 292 EKELQTQTTKFHQDQDTIMAK---LTNEDSQNRQLQqklAALSRQIDELEETNRSLRKAEEELQDIKEKISK 360
Cdd:PHA02562 329 MDEFNEQSKKLLELKNKISTNkqsLITLVDKAKKVK---AAIEELQAEFVDNAEELAKLQDELDKIVKTKSE 397
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
310-700 |
9.38e-04 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 43.50 E-value: 9.38e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 310 MAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEE-LQDIKEKISKGEYGN--AGIMAEVEEL---RKRVLD-- 381
Cdd:TIGR01612 485 IDENSKQDNTVKLILMRMKDFKDIIDFMELYKPDEVPSKNIiGFDIDQNIKAKLYKEieAGLKESYELAknwKKLIHEik 564
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 382 --MEGKDEELIKMEEQCRDL-NKRLE--RETLQSKDFKLEVEklskrimalEKLEDAFNKSK--QECYSLKCNLEKERMT 454
Cdd:TIGR01612 565 keLEEENEDSIHLEKEIKDLfDKYLEidDEIIYINKLKLELK---------EKIKNISDKNEyiKKAIDLKKIIENNNAY 635
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 455 TKQLS-----QELESLKVRIKELEAIESRLEKTeftLKEDLTKLKTLTVMFVDErktmsEKLKKTEDK--LQAASSQLQV 527
Cdd:TIGR01612 636 IDELAkispyQVPEHLKNKDKIYSTIKSELSKI---YEDDIDALYNELSSIVKE-----NAIDNTEDKakLDDLKSKIDK 707
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 528 EQNKVTTVTEKLIE------ETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNrlqslEAIEKD 601
Cdd:TIGR01612 708 EYDKIQNMETATVElhlsniENKKNELLDIIVEIKKHIHGEINKDLNKILEDFKNKEKELSNKINDYAK-----EKDELN 782
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 602 FLKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLerryanerdKAQFLSKELEHVK 681
Cdd:TIGR01612 783 KYKSKISEIKNHYNDQINIDNIKDEDAKQNYDKSKEYIKTISIKEDEIFKIINEMKFM---------KDDFLNKVDKFIN 853
|
410
....*....|....*....
gi 109659847 682 MElakYKLAEKTETSHEQW 700
Cdd:TIGR01612 854 FE---NNCKEKIDSEHEQF 869
|
|
| HOOK |
pfam05622 |
HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from ... |
322-801 |
9.87e-04 |
|
HOOK protein coiled-coil region; This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organizms. The different members of the human gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three human HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas this central coiled-coil motif mediates homodimerization and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes, whereas both HOOK1 and HOOK2 are localized to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head. This entry includes the central coiled-coiled domain and the divergent C-terminal domain.
Pssm-ID: 461694 [Multi-domain] Cd Length: 528 Bit Score: 43.14 E-value: 9.87e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 322 QLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGNAGIMAEVEELRKRVldmEGKDEELIKMeEQCRDlnk 401
Cdd:pfam05622 11 ELAQRCHELDQQVSLLQEEKNSLQQENKKLQERLDQLESGDDSGTPGGKKYLLLQKQL---EQLQEENFRL-ETARD--- 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 402 rleretlqskDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMT---TKQLSQELESLKVRIKELEaiesr 478
Cdd:pfam05622 84 ----------DYRIKCEELEKEVLELQHRNEELTSLAEEAQALKDEMDILRESsdkVKKLEATVETYKKKLEDLG----- 148
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 479 lektefTLKEDLTKLKTLTVMFVDERKTMSEKLKKTedklQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTD---VEE 555
Cdd:pfam05622 149 ------DLRRQVKLLEERNAEYMQRTLQLEEELKKA----NALRGQLETYKRQVQELHGKLSEESKKADKLEFEykkLEE 218
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 556 KMYSVTKERDDLKNK----------LKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKLnqdsgKSTTALHQENNKI 625
Cdd:pfam05622 219 KLEALQKEKERLIIErdtlretneeLRCAQLQQAELSQADALLSPSSDPGDNLAAEIMPAEI-----REKLIRLQHENKM 293
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 626 KELSQEvERLKLKLKDMKAIEDDLMKTEDEYETlERRYANERdkaqflSKELEHVKMELAKYKLAEKTETSHEQWLFKRL 705
Cdd:pfam05622 294 LRLGQE-GSYRERLTELQQLLEDANRRKNELET-QNRLANQR------ILELQQQVEELQKALQEQGSKAEDSSLLKQKL 365
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 706 QEEEAKSGHLSREVDALKEKIHEYMATEDLICHLQGDHsvLQKKLNQQENRNRDLGreiENLTKELERYRHFSKSLRPSL 785
Cdd:pfam05622 366 EEHLEKLHEAQSELQKKKEQIEELEPKQDSNLAQKIDE--LQEALRKKDEDMKAME---ERYKKYVEKAKSVIKTLDPKQ 440
|
490
....*....|....*.
gi 109659847 786 NGRRISDPQVFSKEVQ 801
Cdd:pfam05622 441 NPASPPEIQALKNQLL 456
|
|
| COG2433 |
COG2433 |
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only]; |
535-722 |
1.00e-03 |
|
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain] Cd Length: 644 Bit Score: 43.31 E-value: 1.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 535 VTEKLIEEtkralksKTDVEEKMYSVTKERDDlkNKLKAEEEKGNDLLSRVNMLKNRLQSLEaiekdflknklnqdsgks 614
Cdd:COG2433 381 ALEELIEK-------ELPEEEPEAEREKEHEE--RELTEEEEEIRRLEEQVERLEAEVEELE------------------ 433
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 615 ttalhqenNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANerdkaqfLSKELEHVKMELAkyKLAEKTE 694
Cdd:COG2433 434 --------AELEEKDERIERLERELSEARSEERREIRKDREISRLDREIER-------LERELEEERERIE--ELKRKLE 496
|
170 180
....*....|....*....|....*....
gi 109659847 695 tsheqwLFKRLQEEEAKSGHLS-REVDAL 722
Cdd:COG2433 497 ------RLKELWKLEHSGELVPvKVVEKF 519
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
507-726 |
1.15e-03 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 42.99 E-value: 1.15e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 507 MSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKERddlKNKLKAEEEKGNDLLSRVN 586
Cdd:PRK05771 51 LLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEE---ISELENEIKELEQEIERLE 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 587 MLKN------RLQSLE-------AIEKDFLKNKLNQDSGKSTTALHQENNKI-------KELSQEVERLkLKLKDMKAIE 646
Cdd:PRK05771 128 PWGNfdldlsLLLGFKyvsvfvgTVPEDKLEELKLESDVENVEYISTDKGYVyvvvvvlKELSDEVEEE-LKKLGFERLE 206
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 647 -DDLMKTEDEYETLERRYANERDKAQFLSKELEHVKMELAKYKLAEKTETSHEqwlfkrLQEEE-----AKSGHL----- 715
Cdd:PRK05771 207 lEEEGTPSELIREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIE------LERAEalskfLKTDKTfaieg 280
|
250
....*....|....
gi 109659847 716 ---SREVDALKEKI 726
Cdd:PRK05771 281 wvpEDRVKKLKELI 294
|
|
| Spc7 |
smart00787 |
Spc7 kinetochore protein; This domain is found in cell division proteins which are required ... |
291-430 |
1.30e-03 |
|
Spc7 kinetochore protein; This domain is found in cell division proteins which are required for kinetochore-spindle association.
Pssm-ID: 197874 [Multi-domain] Cd Length: 312 Bit Score: 42.31 E-value: 1.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 291 LEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKA-EEELQDIKEKISKGEYGNAGIM 369
Cdd:smart00787 145 LKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCdPTELDRAKEKLKKLLQEIMIKV 224
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 109659847 370 AEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFKL-EVEKLSKRIMALEKL 430
Cdd:smart00787 225 KKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQCRGFTFkEIEKLKEQLKLLQSL 286
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
219-798 |
1.35e-03 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 42.70 E-value: 1.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 219 QEKEKRVTTLKEEL----TKLKSFALMVVD-----------------EQQRLTAQLTLQRQKIQELTTNAKETHTKLALA 277
Cdd:TIGR04523 36 KQLEKKLKTIKNELknkeKELKNLDKNLNKdeekinnsnnkikileqQIKDLNDKLKKNKDKINKLNSDLSKINSEIKND 115
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 278 EARVQEEEQKATRLEKELQtqttKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELE----ETNRSLRKAEEELQD 353
Cdd:TIGR04523 116 KEQKNKLEVELNKLEKQKK----ENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELEnelnLLEKEKLNIQKNIDK 191
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 354 IKEKISKGEY----------GNAGIMAEVEELRKRVL-----------DMEGKDEELIKMEEQCRDLNKRLERETLQSKD 412
Cdd:TIGR04523 192 IKNKLLKLELllsnlkkkiqKNKSLESQISELKKQNNqlkdniekkqqEINEKTTEISNTQTQLNQLKDEQNKIKKQLSE 271
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 413 FKLEVEKLSKRImalEKLEDAFNKSKQECYSLkcNLEKERMTTKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTK 492
Cdd:TIGR04523 272 KQKELEQNNKKI---KELEKQLNQLKSEISDL--NNQKEQDWNKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQ 346
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 493 LKTLTVMFVDERKTMSEKLKKTEDKLQ-----------------AASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEE 555
Cdd:TIGR04523 347 LKKELTNSESENSEKQRELEEKQNEIEklkkenqsykqeiknleSQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEK 426
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 556 KMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLK--NKLNQDSGKSTTALHQENNKIKELSQEVE 633
Cdd:TIGR04523 427 EIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRsiNKIKQNLEQKQKELKSKEKELKKLNEEKK 506
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 634 RLKLKLKDMKAIEDDLMKTEDEYETLERRYANERDKaqfLSKELEHVKMELAKYKLAEK--------TETSHEQWLFKRL 705
Cdd:TIGR04523 507 ELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISD---LEDELNKDDFELKKENLEKEideknkeiEELKQTQKSLKKK 583
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 706 QEE-EAKSGHLSREVDALKEKIHEYMATEdlichlqgdhSVLQKKLNQQENRNRDLGREIENLTKELERYRHFSKSLRPS 784
Cdd:TIGR04523 584 QEEkQELIDQKEKEKKDLIKEIEEKEKKI----------SSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIKET 653
|
650
....*....|....
gi 109659847 785 LNGRRISDPQVFSK 798
Cdd:TIGR04523 654 IKEIRNKWPEIIKK 667
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
350-550 |
1.44e-03 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 41.45 E-value: 1.44e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 350 ELQDIKEKISKgeygnagIMAEVEELRKRVLDMEgkdEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEK 429
Cdd:COG1579 11 DLQELDSELDR-------LEHRLKELPAELAELE---DELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEE 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 430 LEDAFNKSKQecYslkcnlekermttKQLSQELESLKVRIKELEAIESRLEKTEFTLKEDLTKLKtltvmfvDERKTMSE 509
Cdd:COG1579 81 QLGNVRNNKE--Y-------------EALQKEIESLKRRISDLEDEILELMERIEELEEELAELE-------AELAELEA 138
|
170 180 190 200
....*....|....*....|....*....|....*....|.
gi 109659847 510 KLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSK 550
Cdd:COG1579 139 ELEEKKAELDEELAELEAELEELEAEREELAAKIPPELLAL 179
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
137-660 |
1.83e-03 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 42.47 E-value: 1.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 137 EKPMNELDKVVEKHKESYRRILGQLLVAEKSRRqtilELEEEKRKHKEYMEKSDEFICLLEQECERL------------- 203
Cdd:pfam01576 172 EEKAKSLSKLKNKHEAMISDLEERLKKEEKGRQ----ELEKAKRKLEGESTDLQEQIAELQAQIAELraqlakkeeelqa 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 204 ----------------KKL---------IDQEIKSQEEKEQEKEKRVTTLKEELTKLKSfalMVVDEQQRLTAQLTLQRQ 258
Cdd:pfam01576 248 alarleeetaqknnalKKIreleaqiseLQEDLESERAARNKAEKQRRDLGEELEALKT---ELEDTLDTTAAQQELRSK 324
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 259 KIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELE 338
Cdd:pfam01576 325 REQEVTELKKALEEETRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEKAKQALESENAELQAELRTLQQAKQDSE 404
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 339 etnRSLRKAEEELQDIKEKISKGEYGNA-------GIMAEVEELRKRVLDMEGKD----EELIKMEEQCRDLNKRLERET 407
Cdd:pfam01576 405 ---HKRKKLEGQLQELQARLSESERQRAelaeklsKLQSELESVSSLLNEAEGKNiklsKDVSSLESQLQDTQELLQEET 481
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 408 LQ-----SKDFKLEVEKLSKRIMALEKLEDAFNKSKQ------ECYSLKCNLEKERMTT-------KQLSQELESLKVRI 469
Cdd:pfam01576 482 RQklnlsTRLRQLEDERNSLQEQLEEEEEAKRNVERQlstlqaQLSDMKKKLEEDAGTLealeegkKRLQRELEALTQQL 561
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 470 KELEAIESRLEKTEFTLKEDLTKLktltVMFVDERKTMSEKLKKTEDKLqaasSQLQVEQNkvtTVTEKLIEETKRALKS 549
Cdd:pfam01576 562 EEKAAAYDKLEKTKNRLQQELDDL----LVDLDHQRQLVSNLEKKQKKF----DQMLAEEK---AISARYAEERDRAEAE 630
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 550 KTDVEEKMYSVTKERDDLKNKlKAEEEKGNdllsrvnmlknrlQSLEAIEKDFLKNKlnQDSGKSttaLHQENNKIKELS 629
Cdd:pfam01576 631 AREKETRALSLARALEEALEA-KEELERTN-------------KQLRAEMEDLVSSK--DDVGKN---VHELERSKRALE 691
|
570 580 590
....*....|....*....|....*....|.
gi 109659847 630 QEVERLKLKLKDMkaiEDDLMKTEDEYETLE 660
Cdd:pfam01576 692 QQVEEMKTQLEEL---EDELQATEDAKLRLE 719
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
132-636 |
1.84e-03 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 42.73 E-value: 1.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 132 QEDIYE---KPMNELDKVvEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSdeficlLEQECERLkklid 208
Cdd:TIGR01612 1185 KKNIYDeikKLLNEIAEI-EKDKTSLEEVKGINLSYGKNLGKLFLEKIDEEKKKSEHMIKA------MEAYIEDL----- 1252
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 209 QEIKSQEEKEQEKEKRVTTLKEELtklKSFALMVVDEQQRLTAQltlqrQKIQELTTNAKETHTKLALAEARvqeeEQKA 288
Cdd:TIGR01612 1253 DEIKEKSPEIENEMGIEMDIKAEM---ETFNISHDDDKDHHIIS-----KKHDENISDIREKSLKIIEDFSE----ESDI 1320
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 289 TRLEKELQT---QTTKFHQDQDTIMAKLTN-----EDSQNRQLQQKLAALSRQIDELEET-NRSLRKAEEELQDIKEKIS 359
Cdd:TIGR01612 1321 NDIKKELQKnllDAQKHNSDINLYLNEIANiynilKLNKIKKIIDEVKEYTKEIEENNKNiKDELDKSEKLIKKIKDDIN 1400
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 360 KGEYGNA-----------GIMAEVEELRKRVLDMEGKDEELIKMEEQCRDlNKRLERETLQSKDFKLE-VEKLSK----- 422
Cdd:TIGR01612 1401 LEECKSKiestlddkdidECIKKIKELKNHILSEESNIDTYFKNADENNE-NVLLLFKNIEMADNKSQhILKIKKdnatn 1479
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 423 ----RIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEaIESRLEKTEFTLKEDLTKLKTLTV 498
Cdd:TIGR01612 1480 dhdfNINELKEHIDKSKGCKDEADKNAKAIEKNKELFEQYKKDVTELLNKYSALA-IKNKFAKTKKDSEIIIKEIKDAHK 1558
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 499 MFVDERKTMSEKLKKTEDKlqaassQLQVE--------QNKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNK 570
Cdd:TIGR01612 1559 KFILEAEKSEQKIKEIKKE------KFRIEddaakndkSNKAAIDIQLSLENFENKFLKISDIKKKINDCLKETESIEKK 1632
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 109659847 571 LK-----AEEEKGNDLLSRVNMLKNRLQSLEAIEKDFLKNKlnqdsgkstTALHQENNKIKELSQEVERLK 636
Cdd:TIGR01612 1633 ISsfsidSQDTELKENGDNLNSLQEFLESLKDQKKNIEDKK---------KELDELDSEIEKIEIDVDQHK 1694
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
244-532 |
1.97e-03 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 42.63 E-value: 1.97e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 244 DEQQRLTAQLTLQRQKIQELttnaKETHTKLA--LAEARVQEEEQKATRLEkELQTQTTKFHQDQDTIMAKltneDSQNR 321
Cdd:COG3096 850 RELAQHRAQEQQLRQQLDQL----KEQLQLLNklLPQANLLADETLADRLE-ELREELDAAQEAQAFIQQH----GKALA 920
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 322 QLQQKLAALSRQIDELEETNRSLRKAEEELQDIKekiskgeygnAGIMAeVEELRKRVLDMEGKDEEliKMEEQCRDLNK 401
Cdd:COG3096 921 QLEPLVAVLQSDPEQFEQLQADYLQAKEQQRRLK----------QQIFA-LSEVVQRRPHFSYEDAV--GLLGENSDLNE 987
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 402 RLERETLQSKdfklevEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKVRIKELEAIESRLEK 481
Cdd:COG3096 988 KLRARLEQAE------EARREAREQLRQAQAQYSQYNQVLASLKSSRDAKQQTLQELEQELEELGVQADAEAEERARIRR 1061
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847 482 TEftLKEDLTKL--------KTLTVmFVDERKTMSEKLKKTEDKLQAASSqlQVEQNKV 532
Cdd:COG3096 1062 DE--LHEELSQNrsrrsqleKQLTR-CEAEMDSLQKRLRKAERDYKQERE--QVVQAKA 1115
|
|
| CagA_N |
pfam18971 |
CagA protein; The Helicobacter pylori type IV secretion effector CagA is a major bacterial ... |
246-532 |
2.05e-03 |
|
CagA protein; The Helicobacter pylori type IV secretion effector CagA is a major bacterial virulence determinant and critical for gastric carcinogenesis. X-ray crystallographic analysis of the N-terminal CagA fragment (residues 1-876) revealed that the region has a structure comprised of three discrete domains. Domain I constitutes a mobile CagA N terminus, while Domain II tethers CagA to the plasma membrane by interacting with membrane phosphatidylserine. Domain III interacts intramolecularly with the intrinsically disordered C-terminal region, and this interaction potentiates the pathogenic scaffold/hub function of CagA.
Pssm-ID: 408741 [Multi-domain] Cd Length: 876 Bit Score: 42.45 E-value: 2.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 246 QQRLTAQ-LTLQRQK--IQELTTNAKETHTKL-----ALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNED 317
Cdd:pfam18971 562 ENKLTAKgLSLQEANklIKDFLSSNKELAGKAlnfnkAVAEAKSTGNYDEVKKAQKDLEKSLRKREHLEKEVEKKLESKS 641
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 318 SQNRQLQQKLAALSrQIDEL-----EETNR---------SLRKAEEELQDIKEKISKgeygnagimaEVEELRKRVLDME 383
Cdd:pfam18971 642 GNKNKMEAKAQANS-QKDEIfalinKEANRdaraiaytqNLKGIKRELSDKLEKISK----------DLKDFSKSFDEFK 710
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 384 -GKDEELIKMEEQCRDLNKRLeretlqsKDFKLEVEKLSK---RIMALEKLEDAFNKSKQECYSLKCNLE---KERMTTK 456
Cdd:pfam18971 711 nGKNKDFSKAEETLKALKGSV-------KDLGINPEWISKvenLNAALNEFKNGKNKDFSKVTQAKSDLEnsvKDVIINQ 783
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847 457 QLSQELESLKVRIKELEAIE--SRLEKTeftlkedLTKLKTLTvmfvdeRKTMSEKLKKTEDKLQAASSQL-QVEQNKV 532
Cdd:pfam18971 784 KVTDKVDNLNQAVSVAKAMGdfSRVEQV-------LADLKNFS------KEQLAQQAQKNEDFNTGKNSELyQSVKNSV 849
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
321-576 |
2.29e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 42.21 E-value: 2.29e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 321 RQLQQKLAALSRQIDELEETNRSLRKAEEE---LQDIKEKisKGEYGNAGIMAEVEELRKRVLDMEGKDEELIKMEEQCR 397
Cdd:COG4913 221 PDTFEAADALVEHFDDLERAHEALEDAREQielLEPIREL--AERYAAARERLAELEYLRAALRLWFAQRRLELLEAELE 298
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 398 DLNKRLERETLQSKDFKLEVEKLSKRIMALEK-LEDAFNKSKQEcyslkcnLEKERmttKQLSQELESLKVRIKELEAIE 476
Cdd:COG4913 299 ELRAELARLEAELERLEARLDALREELDELEAqIRGNGGDRLEQ-------LEREI---ERLERELEERERRRARLEALL 368
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 477 SRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQnkvttvtEKLIEETKRALKSKTDVEEK 556
Cdd:COG4913 369 AALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRREL-------RELEAEIASLERRKSNIPAR 441
|
250 260
....*....|....*....|
gi 109659847 557 MYSVtkeRDDLKNKLKAEEE 576
Cdd:COG4913 442 LLAL---RDALAEALGLDEA 458
|
|
| PRK10246 |
PRK10246 |
exonuclease subunit SbcC; Provisional |
244-480 |
2.56e-03 |
|
exonuclease subunit SbcC; Provisional
Pssm-ID: 182330 [Multi-domain] Cd Length: 1047 Bit Score: 42.09 E-value: 2.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 244 DEQQRLTAQLTLQRQKIQELTTNAKEThtkLALAEARVQEEEQKATRLEKELQTQTTKFHQDQD------------TIMA 311
Cdd:PRK10246 297 ERIQEQSAALAHTRQQIEEVNTRLQST---MALRARIRHHAAKQSAELQAQQQSLNTWLAEHDRfrqwnnelagwrAQFS 373
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 312 KLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKaeeelQDIKEKISKgeygnagiMAEVEELRKRvldmegkdeeLIK 391
Cdd:PRK10246 374 QQTSDREQLRQWQQQLTHAEQKLNALPAITLTLTA-----DEVAAALAQ--------HAEQRPLRQR----------LVA 430
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 392 MEEQCRDLNKRLERetlqskdfkleveklskrimalekLEDAFNKSKQECYSLKCNLEKERMTTKQLSQELESLKV---- 467
Cdd:PRK10246 431 LHGQIVPQQKRLAQ------------------------LQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVKTiceq 486
|
250
....*....|....*
gi 109659847 468 --RIKELEAIESRLE 480
Cdd:PRK10246 487 eaRIKDLEAQRAQLQ 501
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
198-728 |
2.62e-03 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 42.08 E-value: 2.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 198 QECERLKKLIDQEIksqeekeqekekrvTTLKEELTklksfalmvvdEQQRLTAQLTLQRQKIQElttnakethtKLALA 277
Cdd:pfam01576 204 QELEKAKRKLEGES--------------TDLQEQIA-----------ELQAQIAELRAQLAKKEE----------ELQAA 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 278 EARVQEEEQKATRLEK---ELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQI-DELEETNrslrkAEEELQD 353
Cdd:pfam01576 249 LARLEEETAQKNNALKkirELEAQISELQEDLESERAARNKAEKQRRDLGEELEALKTELeDTLDTTA-----AQQELRS 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 354 IKEkiskgeygnagimAEVEELRKRVldmegkDEELIKMEEQCRDLNKRlERETLQSKDFKLEVEKLSKriMALEKLEDA 433
Cdd:pfam01576 324 KRE-------------QEVTELKKAL------EEETRSHEAQLQEMRQK-HTQALEELTEQLEQAKRNK--ANLEKAKQA 381
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 434 FNKSKQEcyslkcnLEKERMTTKQLSQELE----SLKVRIKELEAIESRLEKTEFTLKEDLTKLKTltvmfvdERKTMSE 509
Cdd:pfam01576 382 LESENAE-------LQAELRTLQQAKQDSEhkrkKLEGQLQELQARLSESERQRAELAEKLSKLQS-------ELESVSS 447
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 510 KLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTdveeKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLK 589
Cdd:pfam01576 448 LLNEAEGKNIKLSKDVSSLESQLQDTQELLQEETRQKLNLST----RLRQLEDERNSLQEQLEEEEEAKRNVERQLSTLQ 523
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 590 NRLQSleaiekdfLKNKLNQDSGksttALHQENNKIKELSQEVERLKLKLKDMKAIEDDLMKT----EDEYETLERRYAN 665
Cdd:pfam01576 524 AQLSD--------MKKKLEEDAG----TLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEKTknrlQQELDDLLVDLDH 591
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 109659847 666 ERDKAQFLSKELEHVKMELAKYKLAEKTETSHEQWLFKRLQEEEAKSGHLSREVDALKEKIHE 728
Cdd:pfam01576 592 QRQLVSNLEKKQKKFDQMLAEEKAISARYAEERDRAEAEAREKETRALSLARALEEALEAKEE 654
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
486-724 |
2.66e-03 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 41.92 E-value: 2.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 486 LKEDLTKLKTLTVMfvderktmsEKLKKteDKLQAASSQLQVEQNKVTTVTEKL------IEETKRalKSKTDVEEKMYS 559
Cdd:PHA02562 155 LVEDLLDISVLSEM---------DKLNK--DKIRELNQQIQTLDMKIDHIQQQIktynknIEEQRK--KNGENIARKQNK 221
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 560 VTKERDDLKNkLKAEEEKGNDLLSRVNM------------------LKNRLQSLEAIEKDFLKNklnQDSGKSTTALHQE 621
Cdd:PHA02562 222 YDELVEEAKT-IKAEIEELTDELLNLVMdiedpsaalnklntaaakIKSKIEQFQKVIKMYEKG---GVCPTCTQQISEG 297
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 622 NNKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEY--------------ETLERRYANERDKAQFLSKELEHVKMELAKY 687
Cdd:PHA02562 298 PDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFneqskkllelknkiSTNKQSLITLVDKAKKVKAAIEELQAEFVDN 377
|
250 260 270
....*....|....*....|....*....|....*..
gi 109659847 688 KLAEKTETSHEQWLFKRLQEEEAKSGHLSREVDALKE 724
Cdd:PHA02562 378 AEELAKLQDELDKIVKTKSELVKEKYHRGIVTDLLKD 414
|
|
| DUF4407 |
pfam14362 |
Domain of unknown function (DUF4407); This family of proteins is found in bacteria. Proteins ... |
292-433 |
3.19e-03 |
|
Domain of unknown function (DUF4407); This family of proteins is found in bacteria. Proteins in this family are typically between 366 and 597 amino acids in length. There is a single completely conserved residue R that may be functionally important.
Pssm-ID: 464151 [Multi-domain] Cd Length: 295 Bit Score: 41.08 E-value: 3.19e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 292 EKELQTQ-TTKFHQDQDTIMAKLTNE-DSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGNAGI- 368
Cdd:pfam14362 105 EKEIDRElLEIQQEEADAAKAQLAAAyRARLAELEAQIAALDAEIDAAEARLDALQAEARCELDGTPGTGTGVPGDGPVa 184
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847 369 ---MAEVEELRKRVLDMEGK-DEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDA 433
Cdd:pfam14362 185 ktkQAQLDAAQAELAALQAQnDARLAALRAELARLTAERAAARARSQAAIDGDDGLLARLEALNRLTTE 253
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
138-660 |
3.38e-03 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 41.96 E-value: 3.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 138 KPMNELDKVVEKHKESYRRILGQLLVAEKSRRQTILELEEEKRKHKEYMEKSDEfiCLLEQECERLKKLIDQEIKSQEEK 217
Cdd:TIGR01612 917 KKVDEYIKICENTKESIEKFHNKQNILKEILNKNIDTIKESNLIEKSYKDKFDN--TLIDKINELDKAFKDASLNDYEAK 994
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 218 EQEKEKRVTTLKEELTKLKSFAL-MVVDEQQRLTAQLTlqrQKIQELTTNAKETHTKLALAEARVQEEEQKA-------- 288
Cdd:TIGR01612 995 NNELIKYFNDLKANLGKNKENMLyHQFDEKEKATNDIE---QKIEDANKNIPNIEIAIHTSIYNIIDEIEKEigkniell 1071
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 289 -TRLEKELQTQTTKFHQDQDTImaKLTNEDSQNRQLQQKLAALSRQI-DELEETNRSLRKAEEELQDIKEKisKGEYGNA 366
Cdd:TIGR01612 1072 nKEILEEAEINITNFNEIKEKL--KHYNFDDFGKEENIKYADEINKIkDDIKNLDQKIDHHIKALEEIKKK--SENYIDE 1147
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 367 gIMAEVEELRKrVLDMEGKDEELIKMEEQCRDLNKRLEREtlqsKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKC 446
Cdd:TIGR01612 1148 -IKAQINDLED-VADKAISNDDPEEIEKKIENIVTKIDKK----KNIYDEIKKLLNEIAEIEKDKTSLEEVKGINLSYGK 1221
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 447 NLEKerMTTKQLSQELESLKVRIKELEAIESRLEKteftLKEDLTKLKTLTVMFVDERKTMsEKLKKTEDKLQAASSQLQ 526
Cdd:TIGR01612 1222 NLGK--LFLEKIDEEKKKSEHMIKAMEAYIEDLDE----IKEKSPEIENEMGIEMDIKAEM-ETFNISHDDDKDHHIISK 1294
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 527 VEQNKVTTVTEKlieetkrALKSKTDVEEKmysvtKERDDLKNKLK---AEEEKGNdllSRVNMLKNRLQSLEAIEK-DF 602
Cdd:TIGR01612 1295 KHDENISDIREK-------SLKIIEDFSEE-----SDINDIKKELQknlLDAQKHN---SDINLYLNEIANIYNILKlNK 1359
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|....*...
gi 109659847 603 LKNKLNqdsgksttalhqennKIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLE 660
Cdd:TIGR01612 1360 IKKIID---------------EVKEYTKEIEENNKNIKDELDKSEKLIKKIKDDINLE 1402
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
169-406 |
3.69e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 41.44 E-value: 3.69e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 169 RQTILELEEEKRKHKEYMEKSDEFICLLEQECERLKKLidQEIKSQEEKEQEKEKRVTTLKEELTKLKSFAlmvvDEQQR 248
Cdd:COG4913 616 EAELAELEEELAEAEERLEALEAELDALQERREALQRL--AEYSWDEIDVASAEREIAELEAELERLDASS----DDLAA 689
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 249 LTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTTKFHQDQDTIMAKLTNEDSQNRQLQQKLA 328
Cdd:COG4913 690 LEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRE 769
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 329 ALSRQIDELEETnrsLRKAEEELQDIKEKIsKGEYGN--AGIMAEVEELRK--RVLDmEGKDEELIKMEEQCRDLNKRLE 404
Cdd:COG4913 770 NLEERIDALRAR---LNRAEEELERAMRAF-NREWPAetADLDADLESLPEylALLD-RLEEDGLPEYEERFKELLNENS 844
|
..
gi 109659847 405 RE 406
Cdd:COG4913 845 IE 846
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
372-816 |
3.72e-03 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 41.43 E-value: 3.72e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 372 VEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETLQSKDFKLEVEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKE 451
Cdd:PRK01156 175 IDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMKNRY 254
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 452 RMTTKQLSQELESLKVRIKELEAIESRLEKTE--------------FTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDk 517
Cdd:PRK01156 255 ESEIKTAESDLSMELEKNNYYKELEERHMKIIndpvyknrnyindyFKYKNDIENKKQILSNIDAEINKYHAIIKKLSV- 333
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 518 LQAASSQLQVEQNKvttvteklIEETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVN-MLKNRLQSLE 596
Cdd:PRK01156 334 LQKDYNDYIKKKSR--------YDDLNNQILELEGYEMDYNSYLKSIESLKKKIEEYSKNIERMSAFISeILKIQEIDPD 405
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 597 AIEKDF--LKNKLNQDSGKsTTALHQENNKIKELSQEVERLKLKL--------------------------KDMKAIEDD 648
Cdd:PRK01156 406 AIKKELneINVKLQDISSK-VSSLNQRIRALRENLDELSRNMEMLngqsvcpvcgttlgeeksnhiinhynEKKSRLEEK 484
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 649 LMKTEDEYETLERRYANERDKAQFLSKE------LEHVKMELAKYKLaEKTETSHEQWLFKRLQEEEAKSGHLSREVDAL 722
Cdd:PRK01156 485 IREIEIEVKDIDEKIVDLKKRKEYLESEeinksiNEYNKIESARADL-EDIKIKINELKDKHDKYEEIKNRYKSLKLEDL 563
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 723 KEKIHEY---MATEDLIC--HLQGDHSVLQKKLNQQENRNRDLGREIENLTKELERYrhfSKSLRPSLNGRRISDPQVFS 797
Cdd:PRK01156 564 DSKRTSWlnaLAVISLIDieTNRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYIDKS---IREIENEANNLNNKYNEIQE 640
|
490
....*....|....*....
gi 109659847 798 KEVQTEAVDNEPPDYKSLI 816
Cdd:PRK01156 641 NKILIEKLRGKIDNYKKQI 659
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
257-479 |
4.14e-03 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 41.48 E-value: 4.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 257 RQKIQELTTNAKETHTKLALAEARVQEEEQkATRLEKELQTQTTKfHQDQDTIMAKLTNEDSQnRQLQQKLAALSRQIDE 336
Cdd:PRK04863 448 QAKEQEATEELLSLEQKLSVAQAAHSQFEQ-AYQLVRKIAGEVSR-SEAWDVARELLRRLREQ-RHLAEQLQQLRMRLSE 524
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 337 LEETNRSLRKAEEELQDIKEKISKGeYGNAgimAEVEELRkrvldmEGKDEELIKMEEQCRDLNKRLERETLQSKDFKLE 416
Cdd:PRK04863 525 LEQRLRQQQRAERLLAEFCKRLGKN-LDDE---DELEQLQ------EELEARLESLSESVSEARERRMALRQQLEQLQAR 594
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 109659847 417 VEKLSKRIM-------ALEKLEDAFN---KSKQECYSLKCN-LEKERmttkQLSQELESLKVRIKELEAIESRL 479
Cdd:PRK04863 595 IQRLAARAPawlaaqdALARLREQSGeefEDSQDVTEYMQQlLERER----ELTVERDELAARKQALDEEIERL 664
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
247-352 |
4.34e-03 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 41.16 E-value: 4.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 247 QRLTAQLTLQRQKIQELTTNAKETH-------TKLALAEARVQEEEQKA-TRLEKELQTQTTKFHQDQDTImAKLTNEDS 318
Cdd:COG3206 266 QQLRAQLAELEAELAELSARYTPNHpdvialrAQIAALRAQLQQEAQRIlASLEAELEALQAREASLQAQL-AQLEARLA 344
|
90 100 110
....*....|....*....|....*....|....
gi 109659847 319 QNRQLQQKLAALSRQIDELEETNRSLRKAEEELQ 352
Cdd:COG3206 345 ELPELEAELRRLEREVEVARELYESLLQRLEEAR 378
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
509-726 |
4.55e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 40.90 E-value: 4.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 509 EKLKKTEDKLQAASSQLQVEQNKvttvTEKLIEETKRALKsktDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNML 588
Cdd:COG4942 23 AEAEAELEQLQQEIAELEKELAA----LKKEEKALLKQLA---ALERRIAALARRIRALEQELAALEAELAELEKEIAEL 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 589 KNRLQSLEAIEKDFLKN-KLNQDSGKSTTALHQENnkIKELSQEVERLKLKLKDMKAIEDDLMKTEDEYETLERRYANER 667
Cdd:COG4942 96 RAELEAQKEELAELLRAlYRLGRQPPLALLLSPED--FLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAER 173
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 109659847 668 DKAQFLSKELEHVKMELAKYKlAEKTETSHEqwLFKRLQEEEAKSGHLSREVDALKEKI 726
Cdd:COG4942 174 AELEALLAELEEERAALEALK-AERQKLLAR--LEKELAELAAELAELQQEAEELEALI 229
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
148-365 |
4.56e-03 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 41.11 E-value: 4.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 148 EKHKESYRRILGQLLVAEKSRRQTILELEEEKRK---HKEYMEKSDEFICLLEQECERLKKLIDQEIKSQEEKEQEKEKR 224
Cdd:TIGR00618 658 ERVREHALSIRVLPKELLASRQLALQKMQSEKEQltyWKEMLAQCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAR 737
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 225 VTTLKEELTKLKSFALMV----VDEQQRLTAQLTLQRQKIQELTTNAKETHTKLALAEARVQEEEQKATRLEKELQTQTT 300
Cdd:TIGR00618 738 EDALNQSLKELMHQARTVlkarTEAHFNNNEEVTAALQTGAELSHLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDED 817
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109659847 301 KFHQDQDTIMAKLTNEDSQNRQLQQKLAALSRQIDELEETNRSLRKAEEELQDIKEKISKGEYGN 365
Cdd:TIGR00618 818 ILNLQCETLVQEEEQFLSRLEEKSATLGEITHQLLKYEECSKQLAQLTQEQAKIIQLSDKLNGIN 882
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
95-301 |
5.32e-03 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 40.88 E-value: 5.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 95 AEKMDLALLEAQYGFVTPKKVLEALQRDAFQAKSTPWQEdIYEKPMNELDKVVEKHKESYRRIlGQLLVAEKSRRQTILE 174
Cdd:pfam17380 401 ARKVKILEEERQRKIQQQKVEMEQIRAEQEEARQREVRR-LEEERAREMERVRLEEQERQQQV-ERLRQQEEERKRKKLE 478
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 175 LEEEKRKHKEYMEKSDEficLLEQECERLKKLIDQEIKSQEEKEQEKEKRVTTLKEELTKLKSfalmvvdEQQRLTAQLT 254
Cdd:pfam17380 479 LEKEKRDRKRAEEQRRK---ILEKELEERKQAMIEEERKRKLLEKEMEERQKAIYEEERRREA-------EEERRKQQEM 548
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|...
gi 109659847 255 LQRQKIQELTTNAKETHTKLALAEA------RVQEEEQKATRLEKELQTQTTK 301
Cdd:pfam17380 549 EERRRIQEQMRKATEERSRLEAMEReremmrQIVESEKARAEYEATTPITTIK 601
|
|
| RecN |
COG0497 |
DNA repair ATPase RecN [Replication, recombination and repair]; |
227-433 |
6.24e-03 |
|
DNA repair ATPase RecN [Replication, recombination and repair];
Pssm-ID: 440263 [Multi-domain] Cd Length: 555 Bit Score: 40.44 E-value: 6.24e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 227 TLKEELTKLKSFALMVVDEQQRLTAQLT------LQRQKIQELTT------NAKETHTKLALAEARVQEEEQKATRLEKE 294
Cdd:COG0497 169 ALKKELEELRADEAERARELDLLRFQLEeleaaaLQPGEEEELEEerrrlsNAEKLREALQEALEALSGGEGGALDLLGQ 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 295 LQTQttkfhqdqdtiMAKLTNEDSQnrqLQQKLAALSRQIDELEETNRSLRKA-------EEELQDIKEKIS-----KGE 362
Cdd:COG0497 249 ALRA-----------LERLAEYDPS---LAELAERLESALIELEEAASELRRYldslefdPERLEEVEERLAllrrlARK 314
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 109659847 363 YGN--AGIMAEVEELRKRVLDMEGKDEELIKMEEQCRDLNKRLERETlqskdfklevEKLSK-RIMALEKLEDA 433
Cdd:COG0497 315 YGVtvEELLAYAEELRAELAELENSDERLEELEAELAEAEAELLEAA----------EKLSAaRKKAAKKLEKA 378
|
|
| PTZ00108 |
PTZ00108 |
DNA topoisomerase 2-like protein; Provisional |
372-669 |
6.30e-03 |
|
DNA topoisomerase 2-like protein; Provisional
Pssm-ID: 240271 [Multi-domain] Cd Length: 1388 Bit Score: 40.80 E-value: 6.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 372 VEELRKRVL----DMEGKDEELIKMEEqcrDLNKRLERETLQSKDFKLEVEK------LSKRI--MALEKLEdafnkskq 439
Cdd:PTZ00108 1037 VKELKKLGYvrfkDIIKKKSEKITAEE---EEGAEEDDEADDEDDEEELGAAvsydylLSMPIwsLTKEKVE-------- 1105
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 440 ecyslkcNLEKERMTTKQLSQELESLKVR---IKELEAIESRLEKTEFTLKEDLTKLKTLTVM-FVDERKTMSEKLKKTE 515
Cdd:PTZ00108 1106 -------KLNAELEKKEKELEKLKNTTPKdmwLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKtKGKASKLRKPKLKKKE 1178
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 516 DKLQAASSQLQVEQNKVTTVTEKLIEETKRALKSKTDVEEKMYSVTKERDDLKNKLKAEEEKGNDLLSRVNMLKNRLQSL 595
Cdd:PTZ00108 1179 KKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDND 1258
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 109659847 596 EAIEKDFLKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKD--MKAIEDDLMKTEDEYE--TLERRYANERDK 669
Cdd:PTZ00108 1259 EFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSptKKKVKKRLEGSLAALKkkKKSEKKTARKKK 1336
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
507-671 |
6.55e-03 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 40.20 E-value: 6.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 507 MSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKlIEETKRALKS-KTDVEEKMYSVTKERDDLKNKLKAEEEKGN------ 579
Cdd:COG3883 28 LQAELEAAQAELDALQAELEELNEEYNELQAE-LEALQAEIDKlQAEIAEAEAEIEERREELGERARALYRSGGsvsyld 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 580 ---------DLLSRVNMLK----NRLQSLEAIEKDflKNKLNQDSGKSTTALHQENNKIKELSQEVERLKLKLKDMKAIE 646
Cdd:COG3883 107 vllgsesfsDFLDRLSALSkiadADADLLEELKAD--KAELEAKKAELEAKLAELEALKAELEAAKAELEAQQAEQEALL 184
|
170 180
....*....|....*....|....*
gi 109659847 647 DDLMKTEDEYETLERRYANERDKAQ 671
Cdd:COG3883 185 AQLSAEEAAAEAQLAELEAELAAAE 209
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
624-775 |
7.30e-03 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 40.44 E-value: 7.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 624 KIKELSQEVERL---KLKLKDMKAIEDDLMKTEdEYETLERRYANERDKAQfLSKELEHVKMELAKY--KLAEKTETSHE 698
Cdd:TIGR02169 192 IIDEKRQQLERLrreREKAERYQALLKEKREYE-GYELLKEKEALERQKEA-IERQLASLEEELEKLteEISELEKRLEE 269
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 699 qwLFKRLQEEEAKSGHL-SREVDALKEKIHEYMA----TEDLICHLQGDHSVLQKKLNQQENRNRDLGREIENLTKELER 773
Cdd:TIGR02169 270 --IEQLLEELNKKIKDLgEEEQLRVKEKIGELEAeiasLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEE 347
|
..
gi 109659847 774 YR 775
Cdd:TIGR02169 348 ER 349
|
|
| ClyA_Cry6Aa-like |
cd22656 |
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ... |
247-451 |
7.80e-03 |
|
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.
Pssm-ID: 439154 [Multi-domain] Cd Length: 309 Bit Score: 39.66 E-value: 7.80e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 247 QRLTAQLTLQRQKIQELTTNAKETHTKLalaearvqeeeqkaTRLEKELQTQTTKFHQDQDTIMAKLTNEDSqnRQLQQK 326
Cdd:cd22656 117 KTIKALLDDLLKEAKKYQDKAAKVVDKL--------------TDFENQTEKDQTALETLEKALKDLLTDEGG--AIARKE 180
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 327 LAALSRQIDELEETNrsLRKAEEELQDIKEKISKGEygnagimAEVEELRKRVLDMEgkdeeliKMEEQCRDLNkrlere 406
Cdd:cd22656 181 IKDLQKELEKLNEEY--AAKLKAKIDELKALIADDE-------AKLAAALRLIADLT-------AADTDLDNLL------ 238
|
170 180 190 200
....*....|....*....|....*....|....*....|....*
gi 109659847 407 tlqskdfklevEKLSKRIMALEKLEDAFNKSKQECYSLKCNLEKE 451
Cdd:cd22656 239 -----------ALIGPAIPALEKLQGAWQAIATDLDSLKDLLEDD 272
|
|
| CortBP2 |
pfam09727 |
Cortactin-binding protein-2; This entry is the first approximately 250 residues of ... |
387-505 |
8.88e-03 |
|
Cortactin-binding protein-2; This entry is the first approximately 250 residues of cortactin-binding protein 2. In addition to being a positional candidate for autism this protein is expressed at highest levels in the brain in humans. The human protein has six associated ankyrin repeat domains pfam00023 towards the C-terminus which act as protein-protein interaction domains.
Pssm-ID: 462860 [Multi-domain] Cd Length: 187 Bit Score: 38.74 E-value: 8.88e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 387 EELIKMEEQCRDLNKRLERETLqskdfklEVEKLSKRimALEKLEDafNKSKQECYSLK-----CNLEKERmttKQLSQE 461
Cdd:pfam09727 80 AELEKLVEKQRETQRRMLEQLA-------AAEKRHRR--VIRELEE--EKRKHARDTAQgddftYLLEKER---ERLKQE 145
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 109659847 462 LESLKVRIKELEAiesRLEKTEFTLKEDLTKLKTLTVMFVDERK 505
Cdd:pfam09727 146 LEQEKAQQKRLEK---ELKKLLEKLEEELSKQKQIALLLVKERK 186
|
|
| PspC_subgroup_1 |
NF033838 |
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... |
316-671 |
9.67e-03 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 40.00 E-value: 9.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 316 EDSQNRQLQQKLAALSRQIdeLEETNRSLRKAEEEL-QDIKEKISkgeygnagimAEVEELRKRVLDMEGKDEELIKMEE 394
Cdd:NF033838 82 KHTQNVALNKKLSDIKTEY--LYELNVLKEKSEAELtSKTKKELD----------AAFEQFKKDTLEPGKKVAEATKKVE 149
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 395 QCRDLNKRLERE------TLQSKDFKLEVEKLSKRImalekledafNKSKQECYSLKCNLEKERMTTKQLSQELESLKVR 468
Cdd:NF033838 150 EAEKKAKDQKEEdrrnypTNTYKTLELEIAESDVEV----------KKAELELVKEEAKEPRDEEKIKQAKAKVESKKAE 219
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 469 IKELEAIESRLEKTEFTLKEDLTKLKTLTVMFVDERKTMSEKLKKTEDKLQAASSQLQVEQNKVTTVTEKLIEETKRALK 548
Cdd:NF033838 220 ATRLEKIKTDREKAEEEAKRRADAKLKEAVEKNVATSEQDKPKRRAKRGVLGEPATPDKKENDAKSSDSSVGEETLPSPS 299
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109659847 549 SKTdvEEKMYSVTKERDDLKNKLKAEEEKgndllSRVNMLKNRLQSL--EAIEKDFLKNKLNQDSGKSTTALHQENNKIK 626
Cdd:NF033838 300 LKP--EKKVAEAEKKVEEAKKKAKDQKEE-----DRRNYPTNTYKTLelEIAESDVKVKEAELELVKEEAKEPRNEEKIK 372
|
330 340 350 360
....*....|....*....|....*....|....*....|....*
gi 109659847 627 ELSQEVERLKLKLKDMKAIEDDLMKTEDEyetlERRYANERDKAQ 671
Cdd:NF033838 373 QAKAKVESKKAEATRLEKIKTDRKKAEEE----AKRKAAEEDKVK 413
|
|
|