|
Name |
Accession |
Description |
Interval |
E-value |
| ANKYR |
COG0666 |
Ankyrin repeat [Signal transduction mechanisms]; |
11-256 |
2.69e-38 |
|
Ankyrin repeat [Signal transduction mechanisms];
Pssm-ID: 440430 [Multi-domain] Cd Length: 289 Bit Score: 145.48 E-value: 2.69e-38
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 11 KNRHAADWNKYDDRLMKAAERGDVEKVTSILAKKGVNPGKLDVEGRSVFHVVTSKGNLECLNAILIHGVDITTSDTAGRN 90
Cdd:COG0666 43 ALLALALADALGALLLLAAALAGDLLVALLLLAAGADINAKDDGGNTLLHAAARNGDLEIVKLLLEAGADVNARDKDGET 122
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 91 ALHLAAKYGHALCLQKLLQYNCPTEHADLQGRTALHDAAMADCPSSIQLLCDHGASVNAKDVDGRTPLVLATQMSRPTIC 170
Cdd:COG0666 123 PLHLAAYNGNLEIVKLLLEAGADVNAQDNDGNTPLHLAAANGNLEIVKLLLEAGADVNARDNDGETPLHLAAENGHLEIV 202
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 171 QLLIDRGADVNSRDKQNRTALMLGCEYGCRDAVEVLIKNGADISLLDALGHDSSYYARIGDNLDILTLLKTASENTNKGR 250
Cdd:COG0666 203 KLLLEAGADVNAKDNDGKTALDLAAENGNLEIVKLLLEAGADLNAKDKDGLTALLLAAAAGAALIVKLLLLALLLLAAAL 282
|
....*.
gi 530406156 251 ELWKKG 256
Cdd:COG0666 283 LDLLTL 288
|
|
| ANKYR |
COG0666 |
Ankyrin repeat [Signal transduction mechanisms]; |
11-248 |
1.60e-37 |
|
Ankyrin repeat [Signal transduction mechanisms];
Pssm-ID: 440430 [Multi-domain] Cd Length: 289 Bit Score: 143.17 E-value: 1.60e-37
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 11 KNRHAADWNKYDDRLMKAAERGDVEKVTSILAKKGVNPGKLDVEGRSVFHVVTSKGNLECLNAILIHGVDITTSDTAGRN 90
Cdd:COG0666 10 LLLAALLLLLLLALLLLAAALLLLLLLLLLLLLALLALALADALGALLLLAAALAGDLLVALLLLAAGADINAKDDGGNT 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 91 ALHLAAKYGHALCLQKLLQYNCPTEHADLQGRTALHDAAMADCPSSIQLLCDHGASVNAKDVDGRTPLVLATQMSRPTIC 170
Cdd:COG0666 90 LLHAAARNGDLEIVKLLLEAGADVNARDKDGETPLHLAAYNGNLEIVKLLLEAGADVNAQDNDGNTPLHLAAANGNLEIV 169
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 530406156 171 QLLIDRGADVNSRDKQNRTALMLGCEYGCRDAVEVLIKNGADISLLDALGHDSSYYARIGDNLDILTLLKTASENTNK 248
Cdd:COG0666 170 KLLLEAGADVNARDNDGETPLHLAAENGHLEIVKLLLEAGADVNAKDNDGKTALDLAAENGNLEIVKLLLEAGADLNA 247
|
|
| ANKYR |
COG0666 |
Ankyrin repeat [Signal transduction mechanisms]; |
40-239 |
2.58e-29 |
|
Ankyrin repeat [Signal transduction mechanisms];
Pssm-ID: 440430 [Multi-domain] Cd Length: 289 Bit Score: 119.29 E-value: 2.58e-29
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 40 ILAKKGVNPGKLDVEGRSVFHVVTSKGNLECLNAILIHGVDITTSDTAGRNALHLAAKYGHALCLQKLLQYNCPTEHADL 119
Cdd:COG0666 6 LLLLLLLAALLLLLLLALLLLAAALLLLLLLLLLLLLALLALALADALGALLLLAAALAGDLLVALLLLAAGADINAKDD 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 120 QGRTALHDAAMADCPSSIQLLCDHGASVNAKDVDGRTPLVLATQMSRPTICQLLIDRGADVNSRDKQNRTALMLGCEYGC 199
Cdd:COG0666 86 GGNTLLHAAARNGDLEIVKLLLEAGADVNARDKDGETPLHLAAYNGNLEIVKLLLEAGADVNAQDNDGNTPLHLAAANGN 165
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 530406156 200 RDAVEVLIKNGADISLLDALGHDSSYYARIGDNLDILTLL 239
Cdd:COG0666 166 LEIVKLLLEAGADVNARDNDGETPLHLAAENGHLEIVKLL 205
|
|
| ANKYR |
COG0666 |
Ankyrin repeat [Signal transduction mechanisms]; |
13-191 |
6.53e-21 |
|
Ankyrin repeat [Signal transduction mechanisms];
Pssm-ID: 440430 [Multi-domain] Cd Length: 289 Bit Score: 94.64 E-value: 6.53e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 13 RHAADWNKYDDR----LMKAAERGDVEKVTsILAKKGVNPGKLDVEGRSVFHVVTSKGNLECLNAILIHGVDITTSDTAG 88
Cdd:COG0666 108 EAGADVNARDKDgetpLHLAAYNGNLEIVK-LLLEAGADVNAQDNDGNTPLHLAAANGNLEIVKLLLEAGADVNARDNDG 186
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 89 RNALHLAAKYGHALCLQKLLQYNCPTEHADLQGRTALHDAAMADCPSSIQLLCDHGASVNAKDVDGRTPLVLATQMSRPT 168
Cdd:COG0666 187 ETPLHLAAENGHLEIVKLLLEAGADVNAKDNDGKTALDLAAENGNLEIVKLLLEAGADLNAKDKDGLTALLLAAAAGAAL 266
|
170 180
....*....|....*....|...
gi 530406156 169 ICQLLIDRGADVNSRDKQNRTAL 191
Cdd:COG0666 267 IVKLLLLALLLLAAALLDLLTLL 289
|
|
| PHA03095 |
PHA03095 |
ankyrin-like protein; Provisional |
35-218 |
1.58e-19 |
|
ankyrin-like protein; Provisional
Pssm-ID: 222980 [Multi-domain] Cd Length: 471 Bit Score: 93.55 E-value: 1.58e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 35 EKVTSILAKKGVNPGKLDVEGRSVFHV-VTSKG-NLECLNAILIHGVDITTSDTAGRNALHLAAKYGH--ALCLQKLLQY 110
Cdd:PHA03095 132 PKVIRLLLRKGADVNALDLYGMTPLAVlLKSRNaNVELLRLLIDAGADVYAVDDRFRSLLHHHLQSFKprARIVRELIRA 211
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 111 NCPTEHADLQGRTALHDAAM-ADCPSS-IQLLCDHGASVNAKDVDGRTPLVLATQMSRPTICQLLIDRGADVNSRDKQNR 188
Cdd:PHA03095 212 GCDPAATDMLGNTPLHSMATgSSCKRSlVLPLLIAGISINARNRYGQTPLHYAAVFNNPRACRRLIALGADINAVSSDGN 291
|
170 180 190
....*....|....*....|....*....|
gi 530406156 189 TALMLGCEYGCRDAVEVLIKNGADISLLDA 218
Cdd:PHA03095 292 TPLSLMVRNNNGRAVRAALAKNPSAETVAA 321
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
521-1322 |
1.58e-19 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 95.51 E-value: 1.58e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 521 QLKDLKVKYEGASAEVGKLRNQIKQNEMivEEFKRDEGKLIEENKRLQKELSMCEMEREKKGRKVTEMEGQAKELSAKL- 599
Cdd:TIGR02168 214 RYKELKAELRELELALLVLRLEELREEL--EELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELy 291
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 600 ALSIPAEKFENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKRELENVKAKLAQhvKPEEHEQVKSRLEqksgELGK 679
Cdd:TIGR02168 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEE--LKEELESLEAELE----ELEA 365
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 680 KITELTLKNQTLQKEIEKVYLDNKLLKEQAHNLTIEMKNHYVPLKVSEDMKKSHDAIIDDLNRKLLDVTQKYTEKKLEME 759
Cdd:TIGR02168 366 ELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEEL 445
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 760 KLLLEndslsKDVSRLETVFVPPEKHEKEIIALKSNIVELKKQLSELKKKCGEDQEKIHALTSENTNLKKMMSNQYVP-- 837
Cdd:TIGR02168 446 EEELE-----ELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLsg 520
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 838 --------VKTHEEVKMTLNDTLAKTNRELLDVKKKFEDINQEFVKIKD---------------KNEILKRNLENTQNQI 894
Cdd:TIGR02168 521 ilgvlselISVDEGYEAAIEAALGGRLQAVVVENLNAAKKAIAFLKQNElgrvtflpldsikgtEIQGNDREILKNIEGF 600
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 895 KAEYISLAEHEAK----MSSLSQSMRKVQD-SNAEILANYRKGQEEIVTL------------------HAEIKAQKKELD 951
Cdd:TIGR02168 601 LGVAKDLVKFDPKlrkaLSYLLGGVLVVDDlDNALELAKKLRPGYRIVTLdgdlvrpggvitggsaktNSSILERRREIE 680
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 952 TIQECIkvkyapivsfEECERKFKATEKELKDQLSEQTQKysvsEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSH 1031
Cdd:TIGR02168 681 ELEEKI----------EELEEKIAELEKALAELRKELEEL----EEELEQLRKELEELSRQISALRKDLARLEAEVEQLE 746
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1032 EMERALSRKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQTSEILAVQNLLQKQHVPLEQVEALKKSLNGTIENLKEEL 1111
Cdd:TIGR02168 747 ERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERL 826
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1112 KSMQRCYEKEQQTVTKLHQLLE-----------NQKNSSVPLAEHLQIKEAFEKEVGIIKASLREKEEESQNKMEEVSKL 1180
Cdd:TIGR02168 827 ESLERRIAATERRLEDLEEQIEelsedieslaaEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELREL 906
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1181 QSEVQNTKQALKKLETRevvdLSKYKATKSDLETQISSLNEKLANL-NRKYEEVCEEVLHAKKKEISAKDEKELLHFSIE 1259
Cdd:TIGR02168 907 ESKRSELRRELEELREK----LAQLELRLEGLEVRIDNLQERLSEEySLTLEEAEALENKIEDDEEEARRRLKRLENKIK 982
|
810 820 830 840 850 860 870
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 530406156 1260 ----------QEIKDQKERCDKSLTTITELQRRIQESAKQIEAKDNKITELLNDV-ERLKQALNGLSQLTYTSG 1322
Cdd:TIGR02168 983 elgpvnlaaiEEYEELKERYDFLTAQKEDLTEAKETLEEAIEEIDREARERFKDTfDQVNENFQRVFPKLFGGG 1056
|
|
| ANKYR |
COG0666 |
Ankyrin repeat [Signal transduction mechanisms]; |
71-239 |
7.36e-19 |
|
Ankyrin repeat [Signal transduction mechanisms];
Pssm-ID: 440430 [Multi-domain] Cd Length: 289 Bit Score: 88.86 E-value: 7.36e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 71 LNAILIHGVDITTSDTAGRNALHLAAKYGHALCLQKLLQYNCPTEHADLQGRTALHDAAMADCPSSIQLLCDHGASVNAK 150
Cdd:COG0666 4 LLLLLLLLLAALLLLLLLALLLLAAALLLLLLLLLLLLLALLALALADALGALLLLAAALAGDLLVALLLLAAGADINAK 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 151 DVDGRTPLVLATQMSRPTICQLLIDRGADVNSRDKQNRTALMLGCEYGCRDAVEVLIKNGADISLLDALGHDSSYYARIG 230
Cdd:COG0666 84 DDGGNTLLHAAARNGDLEIVKLLLEAGADVNARDKDGETPLHLAAYNGNLEIVKLLLEAGADVNAQDNDGNTPLHLAAAN 163
|
....*....
gi 530406156 231 DNLDILTLL 239
Cdd:COG0666 164 GNLEIVKLL 172
|
|
| Ank_2 |
pfam12796 |
Ankyrin repeats (3 copies); |
92-184 |
2.37e-18 |
|
Ankyrin repeats (3 copies);
Pssm-ID: 463710 [Multi-domain] Cd Length: 91 Bit Score: 81.32 E-value: 2.37e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 92 LHLAAKYGHALCLQKLLQYNCPTEHADLQGRTALHDAAMADCPSSIQLLCDHgASVNAKDvDGRTPLVLATQMSRPTICQ 171
Cdd:pfam12796 1 LHLAAKNGNLELVKLLLENGADANLQDKNGRTALHLAAKNGHLEIVKLLLEH-ADVNLKD-NGRTALHYAARSGHLEIVK 78
|
90
....*....|...
gi 530406156 172 LLIDRGADVNSRD 184
Cdd:pfam12796 79 LLLEKGADINVKD 91
|
|
| Ank_2 |
pfam12796 |
Ankyrin repeats (3 copies); |
59-151 |
2.60e-17 |
|
Ankyrin repeats (3 copies);
Pssm-ID: 463710 [Multi-domain] Cd Length: 91 Bit Score: 78.23 E-value: 2.60e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 59 FHVVTSKGNLECLNAILIHGVDITTSDTAGRNALHLAAKYGHALCLQKLLQYNCPTEhaDLQGRTALHDAAMADCPSSIQ 138
Cdd:pfam12796 1 LHLAAKNGNLELVKLLLENGADANLQDKNGRTALHLAAKNGHLEIVKLLLEHADVNL--KDNGRTALHYAARSGHLEIVK 78
|
90
....*....|...
gi 530406156 139 LLCDHGASVNAKD 151
Cdd:pfam12796 79 LLLEKGADINVKD 91
|
|
| Ank_2 |
pfam12796 |
Ankyrin repeats (3 copies); |
125-217 |
1.53e-16 |
|
Ankyrin repeats (3 copies);
Pssm-ID: 463710 [Multi-domain] Cd Length: 91 Bit Score: 75.92 E-value: 1.53e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 125 LHDAAMADCPSSIQLLCDHGASVNAKDVDGRTPLVLATQMSRPTICQLLIDRgADVNSRDkQNRTALMLGCEYGCRDAVE 204
Cdd:pfam12796 1 LHLAAKNGNLELVKLLLENGADANLQDKNGRTALHLAAKNGHLEIVKLLLEH-ADVNLKD-NGRTALHYAARSGHLEIVK 78
|
90
....*....|...
gi 530406156 205 VLIKNGADISLLD 217
Cdd:pfam12796 79 LLLEKGADINVKD 91
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
748-1390 |
1.81e-15 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 82.41 E-value: 1.81e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 748 TQKYTEKKLEMEKLLLENDSLSKD--VSRLETVFVPPEKHEKEIIALKSNIVELKKQLSELKKKCGEDQEKIHALTSENT 825
Cdd:TIGR02168 212 AERYKELKAELRELELALLVLRLEelREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELY 291
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 826 NLKKMMSNQYVPVKTHEEVKMTLNDTLAKTNRELLDVKKKFEDINQEFVKIKDKNEILKRNLENTQNQIKAEYISLAEHE 905
Cdd:TIGR02168 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELE 371
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 906 AKMSSLSQSMRKVQDSNAEILANYRKGQEEIVTLHAEIKAQKKELDTIQECIKVKYAPIVS--FEECERKFKATEKELKD 983
Cdd:TIGR02168 372 SRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEaeLKELQAELEELEEELEE 451
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 984 QLSEQTQKysvsEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTDELNKQLKDLSQKYTEVKNVKE 1063
Cdd:TIGR02168 452 LQEELERL----EEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEGFSEGVKALLKNQSGLSGILGVLSE 527
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1064 KL-VEENAKQTSEIL---AVQNLL-QKQHVPLEQVEALKKSLNG-------------TIENLKEELKSMQRCYEKEQ--- 1122
Cdd:TIGR02168 528 LIsVDEGYEAAIEAAlggRLQAVVvENLNAAKKAIAFLKQNELGrvtflpldsikgtEIQGNDREILKNIEGFLGVAkdl 607
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1123 -QTVTKLHQLLENQKNSSVP---LAEHLQIKEAFEKEVGII--------------------KASLREKEEESQNKMEEVS 1178
Cdd:TIGR02168 608 vKFDPKLRKALSYLLGGVLVvddLDNALELAKKLRPGYRIVtldgdlvrpggvitggsaktNSSILERRREIEELEEKIE 687
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1179 KLQSEVQNTKQALKKLETREV---VDLSKYKATKSDLETQISSLNEKLANLNRKYEEVCEEVLHAKKKEISAKDEKELLH 1255
Cdd:TIGR02168 688 ELEEKIAELEKALAELRKELEeleEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELE 767
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1256 FSI-----------------EQEIKDQKERCDKSLTTITELQRRIQESAKQIEAKDNKITELLNDVERLKQALNGLSQLt 1318
Cdd:TIGR02168 768 ERLeeaeeelaeaeaeieelEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQ- 846
|
650 660 670 680 690 700 710
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 530406156 1319 ytsgnpTKRQSQLIDTLQHQVKSLEQQLADADRQHQEVIAIYRT-----HLLSAAQGHMDEDVQEALLQIIQMRQGL 1390
Cdd:TIGR02168 847 ------IEELSEDIESLAAEIEELEELIEELESELEALLNERASleealALLRSELEELSEELRELESKRSELRREL 917
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
665-1303 |
4.99e-15 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 80.45 E-value: 4.99e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 665 QVKSRLEQKSGELGKKITELTLKNQTLQKEIEKVYLDNKLLKEQAHNLTIEMKNHYVPLKVSEDMKKSHDAIIDDLNRKL 744
Cdd:TIGR04523 117 EQKNKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKL 196
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 745 LDVTQKYTEKKLEMEKllleNDSLSKDVSRLEtvfvppekheKEIIALKSNIVELKKQLSELKKKCGEDQEKIHALTSEN 824
Cdd:TIGR04523 197 LKLELLLSNLKKKIQK----NKSLESQISELK----------KQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQ 262
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 825 TNLKKMMSNQYVPVKTHEEVKMTLNDTLAKTNRELLDVKKKfedinqefvKIKDKNEILKRNLENTQNQIKAEYISLAEH 904
Cdd:TIGR04523 263 NKIKKQLSEKQKELEQNNKKIKELEKQLNQLKSEISDLNNQ---------KEQDWNKELKSELKNQEKKLEEIQNQISQN 333
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 905 EAKMSSLSQSMRKVQDSNAEILANYRKGQEEIVTLHAEIKAQKKELDTIQECIKVKYAPIVSFEECERKFKATEKELKDQ 984
Cdd:TIGR04523 334 NKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQ 413
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 985 LSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKshemeraLSRKTDELNKQLKDLSQKYTEVKNVKEK 1064
Cdd:TIGR04523 414 IKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKN-------LDNTRESLETQLKVLSRSINKIKQNLEQ 486
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1065 LVEENAKQTSEILAVQNLlqkqhvpleqvealKKSLNGTIENLKEELKSMQRCYEKEQQTVTKLHQLLENQKnssvplae 1144
Cdd:TIGR04523 487 KQKELKSKEKELKKLNEE--------------KKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLE-------- 544
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1145 hlqiKEAFEKEVGIIKASLrekEEESQNKMEEVSKLQSEVQNTKQALKKLETrevvDLSKYKATKSDLETQISSLNEKLA 1224
Cdd:TIGR04523 545 ----DELNKDDFELKKENL---EKEIDEKNKEIEELKQTQKSLKKKQEEKQE----LIDQKEKEKKDLIKEIEEKEKKIS 613
|
570 580 590 600 610 620 630
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 530406156 1225 NLNRKYEEVCEEvlhaKKKEISAKDEKELLHFSIEQEIKDQKERCDKSLTTITELQRRIQESAKQIeakdNKITELLND 1303
Cdd:TIGR04523 614 SLEKELEKAKKE----NEKLSSIIKNIKSKKNKLKQEVKQIKETIKEIRNKWPEIIKKIKESKTKI----DDIIELMKD 684
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
606-1195 |
2.12e-14 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 78.57 E-value: 2.12e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 606 EKFENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKRELENVKAKLAqhvKPEEHEQVKSRLEQKSGELGKKITELT 685
Cdd:PRK03918 182 EKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVK---ELEELKEEIEELEKELESLEGSKRKLE 258
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 686 LKNQTLQKEIEKVYLDNKLLKEQAhnltiemknhyvplKVSEDMKKSHDAIIddlnrKLLDVTQKYTEKKLEMEKLLlen 765
Cdd:PRK03918 259 EKIRELEERIEELKKEIEELEEKV--------------KELKELKEKAEEYI-----KLSEFYEEYLDELREIEKRL--- 316
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 766 DSLSKDVSRLETVFVPPEKHEKEIIALKSNIVELKKQLSELKKKCgEDQEKIHALTSENTNLKKMMSNQyvpvktheevk 845
Cdd:PRK03918 317 SRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERH-ELYEEAKAKKEELERLKKRLTGL----------- 384
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 846 mtlndTLAKTNRELLDVKKKFEDINQEFVKIKDKneilKRNLENTQNQIKAEYISLAEHEAKMSSLSQSMrkVQDSNAEI 925
Cdd:PRK03918 385 -----TPEKLEKELEELEKAKEEIEEEISKITAR----IGELKKEIKELKKAIEELKKAKGKCPVCGREL--TEEHRKEL 453
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 926 LANYRKGQEEIVTLHAEIKAQKKELdtiqECIKVKYAPIVSFEECERKFKATEKELKDqLSEQTQKYSVSE-----EEVK 1000
Cdd:PRK03918 454 LEEYTAELKRIEKELKEIEEKERKL----RKELRELEKVLKKESELIKLKELAEQLKE-LEEKLKKYNLEElekkaEEYE 528
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1001 KNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTDELNKQLKDLSQK-YTEVKNVKEKLVE---------ENA 1070
Cdd:PRK03918 529 KLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELgFESVEELEERLKElepfyneylELK 608
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1071 KQTSEILAVQNLLQKQHVPLEQVEALKKSLNGTIENLKEELKSMQRCYEKEQqtvtklHQLLENQKNSsvplaehlqike 1150
Cdd:PRK03918 609 DAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEE------YEELREEYLE------------ 670
|
570 580 590 600
....*....|....*....|....*....|....*....|....*
gi 530406156 1151 aFEKEVGIIKASLREKEEESQNKMEEVSKLQSEVQNTKQALKKLE 1195
Cdd:PRK03918 671 -LSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELE 714
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
442-1249 |
3.40e-14 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 78.26 E-value: 3.40e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 442 ELAHKVAECKALALECERVKEDSDEQIKQLEDALKDVQKRMYESEGKVKQMQTHFLALKehltseaASGNHRLTEELKDQ 521
Cdd:PTZ00121 1167 EEARKAEDAKKAEAARKAEEVRKAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKK-------AEAVKKAEEAKKDA 1239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 522 LKDLKVKYEGASAEVGKLRN-QIKQNEMIVEEFKRDEGKLIEENKRLQKELSMCEMEREKKGRKVTEMEGQAKELSAKLA 600
Cdd:PTZ00121 1240 EEAKKAEEERNNEEIRKFEEaRMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADE 1319
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 601 LSIPAEKFENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKRELENVKAKLAQHVKPEEHEQVKSRLEQKSGELGKK 680
Cdd:PTZ00121 1320 AKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 681 ITELTLKNQTLQKEIEKvyldnKLLKEQAHNLTIEMKnhyvplKVSEDMKKSHDAIIDDLNRKLLDVTQKYTE--KKLEM 758
Cdd:PTZ00121 1400 AEEDKKKADELKKAAAA-----KKKADEAKKKAEEKK------KADEAKKKAEEAKKADEAKKKAEEAKKAEEakKKAEE 1468
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 759 EKLLLENDSLSKDVSRLETVFVPPEKHEKEIIALKSNiVELKKQLSELKKkcGEDQEKIHALTSENTNLKKMMSNQYVPV 838
Cdd:PTZ00121 1469 AKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKA-AEAKKKADEAKK--AEEAKKADEAKKAEEAKKADEAKKAEEK 1545
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 839 KTHEEVKMTLNDTLAKTNRELLDVKKKFEDINQEFVKIKDKNEILKRNLENTQNQIKAEYISLAEHEAKMSSLSQSMRKV 918
Cdd:PTZ00121 1546 KKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEEL 1625
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 919 QDSNAEilanyRKGQEEIVTLHAEIKAQKKELDTIQECIKVKYAPIVSFEECERKfkaTEKELKDQLSEQTQKysvsEEE 998
Cdd:PTZ00121 1626 KKAEEE-----KKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKK---KAEEAKKAEEDEKKA----AEA 1693
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 999 VKKNKQEndklKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTDELNKQLKDLSQKYTEVKnvkeklVEENAKQTSEILA 1078
Cdd:PTZ00121 1694 LKKEAEE----AKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAK------KDEEEKKKIAHLK 1763
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1079 VQNLLQKQHVPLEQVEALKKSLNgtienlKEELKSMQRCYEKEQQTVTKLHQLLENQKNSSVPLAEHLQIKEAFEKEVGI 1158
Cdd:PTZ00121 1764 KEEEKKAEEIRKEKEAVIEEELD------EEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKEVAD 1837
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1159 IKASLREKEEESQNKMEEVSKLQSEVQNTKQALKK--------LETREVVDLSKyKATKSDLETQISSLNEKLAN----- 1225
Cdd:PTZ00121 1838 SKNMQLEEADAFEKHKFNKNNENGEDGNKEADFNKekdlkeddEEEIEEADEIE-KIDKDDIEREIPNNNMAGKNndiid 1916
|
810 820 830
....*....|....*....|....*....|..
gi 530406156 1226 --------LNRKYEEVCEEVLHAKKKEISAKD 1249
Cdd:PTZ00121 1917 dkldkdeyIKRDAEETREEIIKISKKDMCIND 1948
|
|
| PLN03192 |
PLN03192 |
Voltage-dependent potassium channel; Provisional |
62-213 |
1.04e-13 |
|
Voltage-dependent potassium channel; Provisional
Pssm-ID: 215625 [Multi-domain] Cd Length: 823 Bit Score: 76.06 E-value: 1.04e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 62 VTSKGNLECLNAILIHGVDITTSDTAGRNALHLAAKYGHALCLQKLLQYNCPTEHADLQGRTALHDAAMADCPSSIQLLC 141
Cdd:PLN03192 532 VASTGNAALLEELLKAKLDPDIGDSKGRTPLHIAASKGYEDCVLVLLKHACNVHIRDANGNTALWNAISAKHHKIFRILY 611
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 530406156 142 dHGASVNAKDVDGRTpLVLATQMSRPTICQLLIDRGADVNSRDKQNRTALMLGCEYGCRDAVEVLIKNGADI 213
Cdd:PLN03192 612 -HFASISDPHAAGDL-LCTAAKRNDLTAMKELLKQGLNVDSEDHQGATALQVAMAEDHVDMVRLLIMNGADV 681
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
516-1116 |
1.18e-13 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 76.26 E-value: 1.18e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 516 EELKDQLKDLKVKYEGASAEVGKLRNQIKQNEMIVEEF---KRDEGKLIEENKRLQKELSMCEMEREKKGRKVTEMEGQA 592
Cdd:PRK03918 203 EEVLREINEISSELPELREELEKLEKEVKELEELKEEIeelEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKV 282
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 593 KELsaklalsipaekfenmksslsNEVNEKAKKLVEMEREHEKSLSEIRQLKRELENVKAKLaqhvkpEEHEQVKSRLEQ 672
Cdd:PRK03918 283 KEL---------------------KELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEI------NGIEERIKELEE 335
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 673 KSGELGkkitELTLKNQTLQKEIEKVYLDNKLLkEQAHNLTIEMKNHyvplkvsedMKKSHDAIIDDLNRKLLDVTQKYT 752
Cdd:PRK03918 336 KEERLE----ELKKKLKELEKRLEELEERHELY-EEAKAKKEELERL---------KKRLTGLTPEKLEKELEELEKAKE 401
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 753 EKKLEMEKLllendslskdvsrletvfvppekhEKEIIALKSNIVELKKQLSELKKKCGEDQEKIHALTSENTnlKKMMS 832
Cdd:PRK03918 402 EIEEEISKI------------------------TARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHR--KELLE 455
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 833 NQYVPVKTHEEVKMTLNDTLAKTNRELLDVKKKFEDiNQEFVKIKDKNEILKrNLENTQNQIKAEYISLAEHEakmssls 912
Cdd:PRK03918 456 EYTAELKRIEKELKEIEEKERKLRKELRELEKVLKK-ESELIKLKELAEQLK-ELEEKLKKYNLEELEKKAEE------- 526
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 913 qsmrkvqdsnaeilanYRKGQEEIVTLHAEIKAQKKELDTIQECIKVKyapivsfEECERKFKATEKELKDQLSEQTQKY 992
Cdd:PRK03918 527 ----------------YEKLKEKLIKLKGEIKSLKKELEKLEELKKKL-------AELEKKLDELEEELAELLKELEELG 583
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 993 SVSEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTDELNKQLKDLSQKYTEVKNVKEKL-VEENAK 1071
Cdd:PRK03918 584 FESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYsEEEYEE 663
|
570 580 590 600
....*....|....*....|....*....|....*....|....*
gi 530406156 1072 QTSEILAVQNLLQKQHVPLEQVEALKKSLNGTIENLKEELKSMQR 1116
Cdd:PRK03918 664 LREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREK 708
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
862-1356 |
2.81e-13 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 74.72 E-value: 2.81e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 862 VKKKFEDINQEFVKIKDKNEILKRNLENtqnqIKAEYISLAEHEAKMSSLSQSMRKVQDSNAEILANYRKGQEEIVTLHA 941
Cdd:PRK03918 198 KEKELEEVLREINEISSELPELREELEK----LEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKK 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 942 EIKAQKKELDTIQEcIKVKYAPIVSFEECERKFKATEKELKDQLSEQTQKYSVSEEEVKKNKQENDKLKkEIFTLQKDLR 1021
Cdd:PRK03918 274 EIEELEEKVKELKE-LKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLE-ELKKKLKELE 351
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1022 DKTVLIEKSHE--------MERALSRKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQTSEILAVQNLLQKQHVPLEQV 1093
Cdd:PRK03918 352 KRLEELEERHElyeeakakKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEEL 431
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1094 EA-------------------LKKSLNGTIENLKEELKSMQRCYEKEQQTVTKLHQLLENQKNSSVPLAEHLQIKEAFEK 1154
Cdd:PRK03918 432 KKakgkcpvcgrelteehrkeLLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEK 511
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1155 EVGIIKASLREKEEESQNKMEEVSKLQSEVQNTKQALKKLEtrevvdlsKYKATKSDLETQISSLNEKLANLNRKYEEV- 1233
Cdd:PRK03918 512 LKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLE--------ELKKKLAELEKKLDELEEELAELLKELEELg 583
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1234 --CEEVLHAKKKEISAKDEKELLHFSIEQEIKDQKERCDKSLTTITELQRRIQESAKQIEAKDNKITELL-----NDVER 1306
Cdd:PRK03918 584 feSVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEkkyseEEYEE 663
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|
gi 530406156 1307 LKQALNGLSQLTYTSGNPTKRQSQLIDTLQHQVKSLEQQLADADRQHQEV 1356
Cdd:PRK03918 664 LREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKEL 713
|
|
| PHA03095 |
PHA03095 |
ankyrin-like protein; Provisional |
32-232 |
3.41e-13 |
|
ankyrin-like protein; Provisional
Pssm-ID: 222980 [Multi-domain] Cd Length: 471 Bit Score: 73.52 E-value: 3.41e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 32 GDVEKVTSILAKKGVNPGKLDVEGRSVFHVVTSKGNLECLNAILI-HGVDITTSDTAGRNALH--LAAKYGHALCLQKLL 108
Cdd:PHA03095 60 EKVKDIVRLLLEAGADVNAPERCGFTPLHLYLYNATTLDVIKLLIkAGADVNAKDKVGRTPLHvyLSGFNINPKVIRLLL 139
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 109 QYNCPTEHADLQGRTALH------DAamadCPSSIQLLCDHGASVNAKDVDGRTPL--VLATQMSRPTICQLLIDRGADV 180
Cdd:PHA03095 140 RKGADVNALDLYGMTPLAvllksrNA----NVELLRLLIDAGADVYAVDDRFRSLLhhHLQSFKPRARIVRELIRAGCDP 215
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 530406156 181 NSRDKQNRTALMLGCEYG-CRDA-VEVLIKNGADISLLDALGHDSSYYARIGDN 232
Cdd:PHA03095 216 AATDMLGNTPLHSMATGSsCKRSlVLPLLIAGISINARNRYGQTPLHYAAVFNN 269
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
458-1233 |
3.97e-13 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 74.72 E-value: 3.97e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 458 ERVKEDSDEQIKQL-------EDALK--DVQKRMYESEGKVkqmqtHFLALKEHLTSEAASGNHRltEELKDQLKDLKVK 528
Cdd:TIGR02169 187 ERLDLIIDEKRQQLerlrrerEKAERyqALLKEKREYEGYE-----LLKEKEALERQKEAIERQL--ASLEEELEKLTEE 259
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 529 YEGASAEVGKLRNQIKqnemivEEFKRDEGKLIEENKRLQKELSMCEMEREKKGRKVTEMEGQAKELSAKLA-LSIPAEK 607
Cdd:TIGR02169 260 ISELEKRLEEIEQLLE------ELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAkLEAEIDK 333
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 608 FENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKRELENVKAKLAQHVkpEEHEQVKSRLEQKSGELGKKITELTLK 687
Cdd:TIGR02169 334 LLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETR--DELKDYREKLEKLKREINELKRELDRL 411
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 688 NQTLQK-EIEKVYLDNKL--LKEQAHNLTIEMKNHYVPLKVSEDMKKSHDAIIDDLNRKLLDVTQKYTEKKLEMEKLLLE 764
Cdd:TIGR02169 412 QEELQRlSEELADLNAAIagIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRE 491
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 765 NDSLSKDVSRLETVFVPPEKHEKEIIALKSNIVELKKQLSELKKKC--------------------GEDQEKIHALTSEN 824
Cdd:TIGR02169 492 LAEAEAQARASEERVRGGRAVEEVLKASIQGVHGTVAQLGSVGERYataievaagnrlnnvvveddAVAKEAIELLKRRK 571
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 825 TN------LKKMMSNQYVPVKTHEE-------------------VKMTLNDTLA----KTNRELLD-------------- 861
Cdd:TIGR02169 572 AGratflpLNKMRDERRDLSILSEDgvigfavdlvefdpkyepaFKYVFGDTLVvediEAARRLMGkyrmvtlegelfek 651
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 862 --------VKKKFEDINQefVKIKDKNEILKRNLENTQNQIKAEYISLAEHEAKMSSLSQSMRKVQDSNAEILANYRKGQ 933
Cdd:TIGR02169 652 sgamtggsRAPRGGILFS--RSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLE 729
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 934 EEIVTLHAEIKAQKKELDTIQECIKVKYAPIVSFEECERKFKATEKELKDQLSEQTQKYsvSEEEVKKNKQENDKLKKEI 1013
Cdd:TIGR02169 730 QEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARL--SHSRIPEIQAELSKLEEEV 807
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1014 FTLQKDLRDKTVLIEKSHEMERALSRKTDELNKQLKDLsqkyTEVKNVKEKLVEENAKQTSEILAVqnlLQKQHVPLEQV 1093
Cdd:TIGR02169 808 SRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDL----KEQIKSIEKEIENLNGKKEELEEE---LEELEAALRDL 880
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1094 EALKKSLNGTIENLKEELKSMQRCYEKEQQTVTKLHQLLENQKNSsvplaehlqiKEAFEKEVGIIKASLREKEEESQNK 1173
Cdd:TIGR02169 881 ESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAK----------LEALEEELSEIEDPKGEDEEIPEEE 950
|
810 820 830 840 850 860 870
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1174 MEEvSKLQSEVQNTKQALKKLET----------REVVDLSKYKATKSDLETQISSLNEKLANLNRKYEEV 1233
Cdd:TIGR02169 951 LSL-EDVQAELQRVEEEIRALEPvnmlaiqeyeEVLKRLDELKEKRAKLEEERKAILERIEEYEKKKREV 1019
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
608-1306 |
6.23e-13 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 74.02 E-value: 6.23e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 608 FENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKRELENVKAKLAQHVKPEEHEQVKSRLEQKSGELGKKITELTLK 687
Cdd:PTZ00121 1081 FDAKEDNRADEATEEAFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKA 1160
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 688 NQTLQKEIEKVYLDNKllKEQAHNLTIEMKNHYvPLKVSEDMKKSHDAIIDDLNRKLLDVTQKYTEKKLEMEKLLLENDS 767
Cdd:PTZ00121 1161 EDARKAEEARKAEDAK--KAEAARKAEEVRKAE-ELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKK 1237
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 768 LSKDVSRLETVFVPPEKHEKEIIALKSNIVELKKQLSELKKKCGEDQEKIHALTSENTNlkkmmsnqyvpvKTHEEVKMT 847
Cdd:PTZ00121 1238 DAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAK------------KAEEKKKAD 1305
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 848 LNDTLAKTNRELLDVKKKFEDINQEFVKIKDKNEILKRNLENTQNQIKAeyislAEHEAKMSSLSQSMRKVQDSNAEILA 927
Cdd:PTZ00121 1306 EAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEA-----AADEAEAAEEKAEAAEKKKEEAKKKA 1380
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 928 NYRKGQEEIVTLHAEIKAQKKELDTIQECIKVKYAPIVSFEECERKFKATEK--ELKDQLSEQTQKYSVSEEEVKKNKQE 1005
Cdd:PTZ00121 1381 DAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKadEAKKKAEEAKKADEAKKKAEEAKKAE 1460
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1006 NDKLKKEIFTLQKDLRDKTVLIEKSHEMERA---LSRKTDELNKQlKDLSQKYTEVKNVKEKLVEENAKQTSEILAVQNL 1082
Cdd:PTZ00121 1461 EAKKKAEEAKKADEAKKKAEEAKKADEAKKKaeeAKKKADEAKKA-AEAKKKADEAKKAEEAKKADEAKKAEEAKKADEA 1539
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1083 LQKQHV----PLEQVEALKKSLNGTIENLKEELKSMQRCYEKEQQTVTKLHQLLENQKNSSVPLAEHLQIKEAFEKEVGI 1158
Cdd:PTZ00121 1540 KKAEEKkkadELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAK 1619
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1159 IKASLREKEEESQNKMEEVSKLQSEVQNTKQALKKLETREVVDLSKYKATKSDLETQISSLNEKLANLNRKYEEVCEEVL 1238
Cdd:PTZ00121 1620 IKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAE 1699
|
650 660 670 680 690 700 710
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1239 HAKKKEISAKDEKELLHfSIEQEIKDQKERCDKS--LTTITELQRRIQESAKQIEAKDNKITELLNDVER 1306
Cdd:PTZ00121 1700 EAKKAEELKKKEAEEKK-KAEELKKAEEENKIKAeeAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEK 1768
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
403-1113 |
7.79e-13 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 73.55 E-value: 7.79e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 403 ELSLPSQTSYSENEILKKELEAMrtfcESAKQDRLKLQNELAHKVAECKALALECERVKEDSDEQIKQLEDALKDVQKRM 482
Cdd:TIGR02168 257 ELTAELQELEEKLEELRLEVSEL----EEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKL 332
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 483 YESEGKVKQMQTHFLALKEHLTSEAASGN--HRLTEELKDQLKDLKVKYEGASAEVGKLRNQIKQNEMIVEEFKRDEGKL 560
Cdd:TIGR02168 333 DELAEELAELEEKLEELKEELESLEAELEelEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERL 412
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 561 IEENKRLQKELSMCEMEREKKGRKVTEME-GQAKELSAKLALSIpaEKFENMKSSLSNEVNEKAKKLVEMEREHEKSLSE 639
Cdd:TIGR02168 413 EDRRERLQQEIEELLKKLEEAELKELQAElEELEEELEELQEEL--ERLEEALEELREELEEAEQALDAAERELAQLQAR 490
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 640 IRQLKRELENVK--------AKLAQHVKPEEHEQVKSRLEQKSG-------ELGKKITELTLKNQTLQKEIekVYLDNKL 704
Cdd:TIGR02168 491 LDSLERLQENLEgfsegvkaLLKNQSGLSGILGVLSELISVDEGyeaaieaALGGRLQAVVVENLNAAKKA--IAFLKQN 568
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 705 LKEQAHNLTIEMKNHYVPLKVSEDMKKSHD---------------------------AIIDDLN-----RKLLD-----V 747
Cdd:TIGR02168 569 ELGRVTFLPLDSIKGTEIQGNDREILKNIEgflgvakdlvkfdpklrkalsyllggvLVVDDLDnalelAKKLRpgyriV 648
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 748 T--------------------QKYTEKKLEMEKLLLENDSLSKDVSRLETVFvppEKHEKEIIALKSNIVELKKQLSELK 807
Cdd:TIGR02168 649 TldgdlvrpggvitggsaktnSSILERRREIEELEEKIEELEEKIAELEKAL---AELRKELEELEEELEQLRKELEELS 725
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 808 KKCGEDQEKIHALTSENTNLKKMMSNQYVPVKTHEEVKMTLNDTLAKTNRELLDVKKKFEDINQEFVKIKDKNEILKRNL 887
Cdd:TIGR02168 726 RQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREAL 805
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 888 ENTQNQIKAEYISLAEHEAKMSSLSQSMRKVQDSNAEILANYRKGQEEIVTLHAEIKAQKKELDTIQECIKVKY------ 961
Cdd:TIGR02168 806 DELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLnerasl 885
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 962 -APIVSFEECERKFKATEKELKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLRDK-TVLIEKSHEMERALSR 1039
Cdd:TIGR02168 886 eEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEySLTLEEAEALENKIED 965
|
730 740 750 760 770 780 790
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 530406156 1040 KTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQTSEilavQNLLQKQHvplEQVEALKKSLNGTIENLKEELKS 1113
Cdd:TIGR02168 966 DEEEARRRLKRLENKIKELGPVNLAAIEEYEELKER----YDFLTAQK---EDLTEAKETLEEAIEEIDREARE 1032
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
496-1293 |
9.26e-13 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 73.64 E-value: 9.26e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 496 FLALKEHLTSEAASGNHRLTEELKdQLKDLKVKYEGASAEVGKLRNQIKQNEmivEEFKRDEGKLIEENKRLQKELSMCE 575
Cdd:PTZ00121 1081 FDAKEDNRADEATEEAFGKAEEAK-KTETGKAEEARKAEEAKKKAEDARKAE---EARKAEDARKAEEARKAEDAKRVEI 1156
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 576 MEREKKGRKVTEMEGQAKELSAKlalsiPAEKFENMKSSLSNEVNEKAKKLVEMER-EHEKSLSEIRQLKRELENVKAKL 654
Cdd:PTZ00121 1157 ARKAEDARKAEEARKAEDAKKAE-----AARKAEEVRKAEELRKAEDARKAEAARKaEEERKAEEARKAEDAKKAEAVKK 1231
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 655 AQHVKPEEHEQVKSRLEQKSGELGKKITELTLKNQTLQKEI--EKVYLDNKLLKEQAHNLTIEMKNHYVPLKVSEDMKKS 732
Cdd:PTZ00121 1232 AEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIkaEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKA 1311
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 733 HDAiiddlnRKLLDVTQKYTEKKLEMEKLLLENDSLSKdvsRLETVFVPPEKHEKEIIALKSNIVELKKQLSELKKKCGE 812
Cdd:PTZ00121 1312 EEA------KKADEAKKKAEEAKKKADAAKKKAEEAKK---AAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADA 1382
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 813 DQEKIHALTSENTNLKKMMSNQyvpvKTHEEVKMTlndtlAKTNRELLDVKKKFEDinqefvkiKDKNEILKRNLENTQn 892
Cdd:PTZ00121 1383 AKKKAEEKKKADEAKKKAEEDK----KKADELKKA-----AAAKKKADEAKKKAEE--------KKKADEAKKKAEEAK- 1444
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 893 qiKAEYISLAEHEAKMSSlsqsmrkvqdsNAEILANYRKGQEEIVTlHAEIKAQKKELDTIQECIKVKYApivsfeecER 972
Cdd:PTZ00121 1445 --KADEAKKKAEEAKKAE-----------EAKKKAEEAKKADEAKK-KAEEAKKADEAKKKAEEAKKKAD--------EA 1502
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 973 KFKATEKELKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTDELNKQLKdlS 1052
Cdd:PTZ00121 1503 KKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMA--L 1580
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1053 QKYTEVKNVKEKLVEENAKQTSEILAVQ-NLLQKQHVPLEQVEALKKSlngtienlKEELKSMQRCYEKEQQTVTKLHQL 1131
Cdd:PTZ00121 1581 RKAEEAKKAEEARIEEVMKLYEEEKKMKaEEAKKAEEAKIKAEELKKA--------EEEKKKVEQLKKKEAEEKKKAEEL 1652
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1132 LENQKNSSVPlAEHLQIKEAFEKEvgiiKASLREKEEESQNKMEEVSKLQSEVQNTKQALKKLETREVVDLSKYKATKSD 1211
Cdd:PTZ00121 1653 KKAEEENKIK-AAEEAKKAEEDKK----KAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEE 1727
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1212 LETQISSLNEKLANLNRKYEEvceevlhAKKKEisaKDEKELLHFSIEQEIKDQKERCDKSLTTITELQRRIQESAKQIE 1291
Cdd:PTZ00121 1728 NKIKAEEAKKEAEEDKKKAEE-------AKKDE---EEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVD 1797
|
..
gi 530406156 1292 AK 1293
Cdd:PTZ00121 1798 KK 1799
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
527-1305 |
1.07e-12 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 73.18 E-value: 1.07e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 527 VKYEGASAEVGKLRNQIKQNEMIVEEFKRDEGKLIEENKRLQKELSMCEMEREKKG----RKVTEMEGQAKELSAKLA-L 601
Cdd:TIGR02169 170 RKKEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGyellKEKEALERQKEAIERQLAsL 249
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 602 SIPAEKFENMKSSLSNEVNEKAKKLVEMERE-HEKSLSEIRQLKRELENVKAKLAQHV-KPEEHEQVKSRLEQKSGELGK 679
Cdd:TIGR02169 250 EEELEKLTEEISELEKRLEEIEQLLEELNKKiKDLGEEEQLRVKEKIGELEAEIASLErSIAEKERELEDAEERLAKLEA 329
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 680 KITELTLKNQTLQKEIEKVYLDNKLLKEQAHNLTIEMKNHYVPLK-VSEDMKKSHDAIiDDLNRKLLDVTQKYTEKKLEM 758
Cdd:TIGR02169 330 EIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEeVDKEFAETRDEL-KDYREKLEKLKREINELKREL 408
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 759 EKLLLENDSLSKDVSRletvfvppekHEKEIIALKSNIVELKKQLSELkkkcgedQEKIHALTSENTNLKKMMSNQYVPV 838
Cdd:TIGR02169 409 DRLQEELQRLSEELAD----------LNAAIAGIEAKINELEEEKEDK-------ALEIKKQEWKLEQLAADLSKYEQEL 471
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 839 KTHEEVKMTLNDTLAKTNRELLDVKKKFEDINQEFVKIKDKNEILKRNLENTQNQIkAEYISLAEHEAKMSSLSQSMR-- 916
Cdd:TIGR02169 472 YDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAVEEVLKASIQGVHGTV-AQLGSVGERYATAIEVAAGNRln 550
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 917 -------KVQDSNAEILANYRKGQEEIVTLHaEIKAQKKELDTIQECIKVKYApiVSFEECERKFkatekelkdqlsEQT 989
Cdd:TIGR02169 551 nvvveddAVAKEAIELLKRRKAGRATFLPLN-KMRDERRDLSILSEDGVIGFA--VDLVEFDPKY------------EPA 615
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 990 QKYSVSEEEVKKNKQENDKL--KKEIFTLQKDLRDKTVLIEKSH--------------EMERALSRKTDELNKQLKDLSQ 1053
Cdd:TIGR02169 616 FKYVFGDTLVVEDIEAARRLmgKYRMVTLEGELFEKSGAMTGGSraprggilfsrsepAELQRLRERLEGLKRELSSLQS 695
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1054 KYTEVKN-VKEKL--VEENAKQTSEILAVQNLLQKQHVP----LEQVEALKKSLNGTIENLKEELKSMQRCYEKEQQTVT 1126
Cdd:TIGR02169 696 ELRRIENrLDELSqeLSDASRKIGEIEKEIEQLEQEEEKlkerLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLH 775
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1127 KLHQLLEN--QKNSSVPLAEHLQIKEAFEKEVGIIKASLREKEEESQNKMEEVSKLQSEVQNTKQALKKLETREvvdlSK 1204
Cdd:TIGR02169 776 KLEEALNDleARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQI----KS 851
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1205 YKATKSDLETQISSLNEKLANLNRKYEEVCEEVLHAKKKEISAKDEKEllhfSIEQEIKDQKERCDKSLTTITELQRRIQ 1284
Cdd:TIGR02169 852 IEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLR----ELERKIEELEAQIEKKRKRLSELKAKLE 927
|
810 820
....*....|....*....|.
gi 530406156 1285 ESAKQIEAKDNKITELLNDVE 1305
Cdd:TIGR02169 928 ALEEELSEIEDPKGEDEEIPE 948
|
|
| PHA02874 |
PHA02874 |
ankyrin repeat protein; Provisional |
35-222 |
2.22e-12 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165205 [Multi-domain] Cd Length: 434 Bit Score: 70.76 E-value: 2.22e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 35 EKVTSILaKKGVNPGKLDVEGRSVFHVVTSKGNLECLNAILIHGVDITTSDTAGRNALHLAAKYGHALCLQKLLQYNCPT 114
Cdd:PHA02874 105 DMIKTIL-DCGIDVNIKDAELKTFLHYAIKKGDLESIKMLFEYGADVNIEDDNGCYPIHIAIKHNFFDIIKLLLEKGAYA 183
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 115 EHADLQGRTALHDAAMADCPSSIQLLCDHGASVNAKDVDGRTPLVLATQMSRPTIcQLLIDrGADVNSRDKQNRTALMLG 194
Cdd:PHA02874 184 NVKDNNGESPLHNAAEYGDYACIKLLIDHGNHIMNKCKNGFTPLHNAIIHNRSAI-ELLIN-NASINDQDIDGSTPLHHA 261
|
170 180
....*....|....*....|....*....
gi 530406156 195 CEYGC-RDAVEVLIKNGADISLLDALGHD 222
Cdd:PHA02874 262 INPPCdIDIIDILLYHKADISIKDNKGEN 290
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
436-958 |
3.04e-12 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 71.63 E-value: 3.04e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 436 RLKLQNELAHKVAECKALALECERVKEDSDEQIKQLEDALKDVQKRMYESEGKVKQMQTHFLALKEHLTSEaasgnhRLT 515
Cdd:PRK03918 229 EVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLS------EFY 302
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 516 EELKDQLKDLKVKYEGASAEVGKLRNQIKQnemiVEEFKRDEGKLIEENKRLQKELSMCEmEREKKGRKVTEMEGQAKEL 595
Cdd:PRK03918 303 EEYLDELREIEKRLSRLEEEINGIEERIKE----LEEKEERLEELKKKLKELEKRLEELE-ERHELYEEAKAKKEELERL 377
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 596 SAKLAlSIPAEKFENMKSSLSN---EVNEKAKKLVEMEREHEKSLSEIRQLKRELENVKAK---LAQHVKPEEHEQVKSR 669
Cdd:PRK03918 378 KKRLT-GLTPEKLEKELEELEKakeEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKcpvCGRELTEEHRKELLEE 456
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 670 LEQKSGELGKKITELTLKNQTLQKEIEKVylDNKLLKEQAhnlTIEMKNHYVPLKVSEDMKKSHDAiiddlnrklldvtQ 749
Cdd:PRK03918 457 YTAELKRIEKELKEIEEKERKLRKELREL--EKVLKKESE---LIKLKELAEQLKELEEKLKKYNL-------------E 518
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 750 KYTEKKLEMEKLLLENDSLSKDVSRLETVFVPPEKHEKEIIALKSNIVELKKQLSELKKKCGEDQ-EKIHALTSENTNLK 828
Cdd:PRK03918 519 ELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGfESVEELEERLKELE 598
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 829 KMMsNQYVPVKTHEEVKMTLNDTLAKTNRELLDVKKKFEDINQEFVKIKDKNEILKRNL-ENTQNQIKAEYISLAEHEAK 907
Cdd:PRK03918 599 PFY-NEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYsEEEYEELREEYLELSRELAG 677
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|.
gi 530406156 908 MSSLSQSMRKVQDSNAEILANYRKGQEEIVTLHAEIKAQKKELDTIQECIK 958
Cdd:PRK03918 678 LRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELRE 728
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
466-1262 |
1.45e-11 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 69.70 E-value: 1.45e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 466 EQIKQLEDALKDVQKRMYESegKVKQMQTHFLALKEHLTSEAASGNHRLTEELKDQLKDLKVKYEGASAEVGKLRNQIKQ 545
Cdd:TIGR01612 974 DKINELDKAFKDASLNDYEA--KNNELIKYFNDLKANLGKNKENMLYHQFDEKEKATNDIEQKIEDANKNIPNIEIAIHT 1051
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 546 NEM-IVEEFKRDEGKLIEE-NKRLQKE--LSMCEMEREKKGRKVTEMEGQAKELSAKLALSIpaEKFENMKSSLSNEVNE 621
Cdd:TIGR01612 1052 SIYnIIDEIEKEIGKNIELlNKEILEEaeINITNFNEIKEKLKHYNFDDFGKEENIKYADEI--NKIKDDIKNLDQKIDH 1129
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 622 KAKKLVEMEREHEKSLSEIRQLKRELENVKAKLAQHVKPEEHEQ----VKSRLEQKSG---ELGKKITELTL--KNQTLQ 692
Cdd:TIGR01612 1130 HIKALEEIKKKSENYIDEIKAQINDLEDVADKAISNDDPEEIEKkienIVTKIDKKKNiydEIKKLLNEIAEieKDKTSL 1209
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 693 KEIEKVYLD-----NKLLKEQAHnltiEMKnhyvplKVSEDMKKSHDAIIDDLNrkllDVTQKYTEKKLEMEKLLLENds 767
Cdd:TIGR01612 1210 EEVKGINLSygknlGKLFLEKID----EEK------KKSEHMIKAMEAYIEDLD----EIKEKSPEIENEMGIEMDIK-- 1273
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 768 lskdvSRLETVFVPPEKHEKEIIALKSNivelKKQLSELKKKCGEDQEKIHALTSENTNLKKMMSNQYVPVKTHEEVKMT 847
Cdd:TIGR01612 1274 -----AEMETFNISHDDDKDHHIISKKH----DENISDIREKSLKIIEDFSEESDINDIKKELQKNLLDAQKHNSDINLY 1344
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 848 LNDTLAKTN-RELLDVKKKFEDINQEFVKIKDKNEILKRNLENTQNQIKA--EYISLAEHEAKMSSLSQSmrkvQDSNaE 924
Cdd:TIGR01612 1345 LNEIANIYNiLKLNKIKKIIDEVKEYTKEIEENNKNIKDELDKSEKLIKKikDDINLEECKSKIESTLDD----KDID-E 1419
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 925 ILANYRKGQEEIVTLHAEIKAQKKELDTIQECIKVKYAPIvsfEECERKFKATEKELKD-----------QLSEQTQKYS 993
Cdd:TIGR01612 1420 CIKKIKELKNHILSEESNIDTYFKNADENNENVLLLFKNI---EMADNKSQHILKIKKDnatndhdfninELKEHIDKSK 1496
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 994 VSEEEVKKNKQENDKlKKEIFTLQKdlRDKTVLIEKSHEMEralsrKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQT 1073
Cdd:TIGR01612 1497 GCKDEADKNAKAIEK-NKELFEQYK--KDVTELLNKYSALA-----IKNKFAKTKKDSEIIIKEIKDAHKKFILEAEKSE 1568
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1074 SEIlavqNLLQKQHVPLEQVEALKKSLNGTIENLKEELKSMQrcyekeqqtvTKLHQLLENQKNSSVPLAEhlqiKEAFE 1153
Cdd:TIGR01612 1569 QKI----KEIKKEKFRIEDDAAKNDKSNKAAIDIQLSLENFE----------NKFLKISDIKKKINDCLKE----TESIE 1630
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1154 KEVGIIkaSLREKEEESQNKMEEVSKLQSEVQNTKQALKKLETRevvdlskyKATKSDLETQISSLNEKLANLNRKYE-- 1231
Cdd:TIGR01612 1631 KKISSF--SIDSQDTELKENGDNLNSLQEFLESLKDQKKNIEDK--------KKELDELDSEIEKIEIDVDQHKKNYEig 1700
|
810 820 830
....*....|....*....|....*....|...
gi 530406156 1232 --EVCEEVLHAKKKEISAKdeKELLHFSIEQEI 1262
Cdd:TIGR01612 1701 iiEKIKEIAIANKEEIESI--KELIEPTIENLI 1731
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
630-1309 |
1.47e-11 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 69.28 E-value: 1.47e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 630 EREHEKSLSEIRQLKRELENVKAKLAQHVKPE-EHEQVKSRLEQKSGELGKKITELTLKNQTLQKEIEKVYLDNKLLKEQ 708
Cdd:TIGR04523 32 DTEEKQLEKKLKTIKNELKNKEKELKNLDKNLnKDEEKINNSNNKIKILEQQIKDLNDKLKKNKDKINKLNSDLSKINSE 111
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 709 AHNLTIEMKNHYVPLKVSEDMKKSHDAIIDDLNRKLldvtqkyTEKKLEMEKLLLENDSLSKDVSRLETVFvppEKHEKE 788
Cdd:TIGR04523 112 IKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLTEI-------KKKEKELEKLNNKYNDLKKQKEELENEL---NLLEKE 181
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 789 IIALKSNIVELKKQLSELKKKCGEDQEKIH---ALTSENTNLKKMMSNqyvpvktheevkmtLNDTLAKTNRELLDVKKK 865
Cdd:TIGR04523 182 KLNIQKNIDKIKNKLLKLELLLSNLKKKIQknkSLESQISELKKQNNQ--------------LKDNIEKKQQEINEKTTE 247
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 866 FEDINQEFVKIKDKNEILKRNLENTQNQIKaeyislaEHEAKMSSLSQsmrKVQDSNAEILA-NYRKGQEEIVTLHAEIK 944
Cdd:TIGR04523 248 ISNTQTQLNQLKDEQNKIKKQLSEKQKELE-------QNNKKIKELEK---QLNQLKSEISDlNNQKEQDWNKELKSELK 317
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 945 AQKKELDTIQECIKVKYAPIVSFEECERKFKATEK-------ELKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQ 1017
Cdd:TIGR04523 318 NQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTnsesensEKQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLE 397
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1018 KDLRDKTVLiekshemeralsrkTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQTSEIlavQNLLQKQHVpleqVEALK 1097
Cdd:TIGR04523 398 SKIQNQEKL--------------NQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEI---KDLTNQDSV----KELII 456
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1098 KSLNGTIENLKEELKSMQRCYEKEQQTVTKLHQLLENQKNSSVPLAEHLQIKEAFEKEVGIIKASLREKEEESQNkmeEV 1177
Cdd:TIGR04523 457 KNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLES---EK 533
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1178 SKLQSEVQNTKQALKKLETRevvdlskykATKSDLETQISSLNEKLanlnrkyeevcEEVLHAKKKEISAKDEKELLHFS 1257
Cdd:TIGR04523 534 KEKESKISDLEDELNKDDFE---------LKKENLEKEIDEKNKEI-----------EELKQTQKSLKKKQEEKQELIDQ 593
|
650 660 670 680 690
....*....|....*....|....*....|....*....|....*....|..
gi 530406156 1258 IEQEIKDQKERCDKSLTTITELQRRIQESAKQIEAKDNKITELLNDVERLKQ 1309
Cdd:TIGR04523 594 KEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQ 645
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
283-1116 |
3.89e-11 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 68.16 E-value: 3.89e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 283 IQDLEIENEDLKERLRKIQQEQRILLDKVNGLQLQLNErekLKSLLAAKEKQHEESLRTIEALKNRFKYFESDHLgsgsH 362
Cdd:TIGR02168 234 LEELREELEELQEELKEAEEELEELTAELQELEEKLEE---LRLEVSELEEEIEELQKELYALANEISRLEQQKQ----I 306
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 363 FSNRKEDMLLKQgqmymadsqctspgipahmqsrsmlrpLELSLPSQTSYSENEILKKELEAMRTFCESAKQDRLKLQNE 442
Cdd:TIGR02168 307 LRERLANLERQL---------------------------EELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAE 359
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 443 LAHKVAECKALalecERVKEDSDEQIKQLEDALKDVQKRMYESEGKVKQMQTHFLALKEHLTseaasgnhRLTEELKDQL 522
Cdd:TIGR02168 360 LEELEAELEEL----ESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRE--------RLQQEIEELL 427
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 523 KDLKvkyegasaevgklRNQIKQNEMIVEEFKRDEGKLIEENKRLQKELSMCEMEREKKGRKVTEMEGQAKELSAKLALs 602
Cdd:TIGR02168 428 KKLE-------------EAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDS- 493
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 603 ipAEKFENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIR---QLKRELENVKAKLAQHVKPEEHEQVKSRLE-QKSGELG 678
Cdd:TIGR02168 494 --LERLQENLEGFSEGVKALLKNQSGLSGILGVLSELISvdeGYEAAIEAALGGRLQAVVVENLNAAKKAIAfLKQNELG 571
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 679 KkiteLTLKNQTLQKEIEKVYLDNKLLKEQAHNLTIEMKNHYVPLKVSEDMKK--SHDAIIDDLnrklldvtqkytEKKL 756
Cdd:TIGR02168 572 R----VTFLPLDSIKGTEIQGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYllGGVLVVDDL------------DNAL 635
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 757 EMEKLLLENDSL-SKDVSRLETVFVPPEKHEKEiialKSNIVELKKQLSELKKKCGEDQEKIHALTSENTNLKKMMSNQy 835
Cdd:TIGR02168 636 ELAKKLRPGYRIvTLDGDLVRPGGVITGGSAKT----NSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEEL- 710
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 836 vpvkthEEVKMTLNDTLAKTNRELLDVKKKFEDINQEFVKIKDKNEILKRNLENTQNQIKA-------EYISLAEHEAKM 908
Cdd:TIGR02168 711 ------EEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEEleerleeAEEELAEAEAEI 784
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 909 SSLSQSMRKVQDSNAEILANYRKGQEEIVTLHAEIKAQKKELDTIQECIKVKYAPIVSFEECERKFKATEKELKDQLSEQ 988
Cdd:TIGR02168 785 EELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEEL 864
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 989 TQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTDELNKQLKDLSQKYTEVK----NVKEK 1064
Cdd:TIGR02168 865 EELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEvridNLQER 944
|
810 820 830 840 850
....*....|....*....|....*....|....*....|....*....|..
gi 530406156 1065 LVEEnakqtseilavqnllqkQHVPLEQVEALKKSLNGTIENLKEELKSMQR 1116
Cdd:TIGR02168 945 LSEE-----------------YSLTLEEAEALENKIEDDEEEARRRLKRLEN 979
|
|
| PHA03100 |
PHA03100 |
ankyrin repeat protein; Provisional |
33-181 |
3.90e-11 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 222984 [Multi-domain] Cd Length: 422 Bit Score: 67.00 E-value: 3.90e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 33 DVEKVTSILAKKGVNPGKLDVEGRSVFHV--VTSKGNLECLNAILIHGVDITTSDTAGRNALHLAAKYGHA------LCL 104
Cdd:PHA03100 84 DVKEIVKLLLEYGANVNAPDNNGITPLLYaiSKKSNSYSIVEYLLDNGANVNIKNSDGENLLHLYLESNKIdlkilkLLI 163
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 105 QK------------LLQYNCPTEHADLQGRTALHDAAMADCPSSIQLLCDHGASVNAKDVDGRTPLVLATQMSRPTICQL 172
Cdd:PHA03100 164 DKgvdinaknrvnyLLSYGVPINIKDVYGFTPLHYAVYNNNPEFVKYLLDLGANPNLVNKYGDTPLHIAILNNNKEIFKL 243
|
....*....
gi 530406156 173 LIDRGADVN 181
Cdd:PHA03100 244 LLNNGPSIK 252
|
|
| PHA02876 |
PHA02876 |
ankyrin repeat protein; Provisional |
24-257 |
5.08e-11 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165207 [Multi-domain] Cd Length: 682 Bit Score: 67.40 E-value: 5.08e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 24 RLMKAAERGDVEKVTSILAKKGVNPGKLDVEGRSVFHVVTSKGNLECLNAILIHGVDITTSDTAGRNALHLAAKYGHALC 103
Cdd:PHA02876 147 KLIKERIQQDELLIAEMLLEGGADVNAKDIYCITPIHYAAERGNAKMVNLLLSYGADVNIIALDDLSVLECAVDSKNIDT 226
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 104 LQKLLQYNCPTEHADLQGRTALHDaamADCPSSIqLLCDHGASVNAKDVDGRTPLVLATQMsrPTICQL---LIDRGADV 180
Cdd:PHA02876 227 IKAIIDNRSNINKNDLSLLKAIRN---EDLETSL-LLYDAGFSVNSIDDCKNTPLHHASQA--PSLSRLvpkLLERGADV 300
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 530406156 181 NSRDKQNRTALMLGCEYGC-RDAVEVLIKNGADISLLDALGHDSSYYARIGD-NLDILTLLKTASENTNkGRELWKKGP 257
Cdd:PHA02876 301 NAKNIKGETPLYLMAKNGYdTENIRTLIMLGADVNAADRLYITPLHQASTLDrNKDIVITLLELGANVN-ARDYCDKTP 378
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
418-1187 |
7.04e-11 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 67.07 E-value: 7.04e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 418 LKKELEAMRTFCESAKQDRLKLQNELAHKVAECKALALECERVKEDSDEQIKQLedalkdvQKRMYESEGKVKQMQTHFL 497
Cdd:pfam15921 122 MQMERDAMADIRRRESQSQEDLRNQLQNTVHELEAAKCLKEDMLEDSNTQIEQL-------RKMMLSHEGVLQEIRSILV 194
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 498 ALKEHLTSEAASGNHRLTEELKDQLKDLKVKYEGASAEVGKLRNQIKQNEMIVEEFKRDEGKLIE-----ENKRLQKELS 572
Cdd:pfam15921 195 DFEEASGKKIYEHDSMSTMHFRSLGSAISKILRELDTEISYLKGRIFPVEDQLEALKSESQNKIElllqqHQDRIEQLIS 274
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 573 MCEMEREKKGRKVTEMEGQAKELSAKLalsipaekfENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKRELENVKA 652
Cdd:pfam15921 275 EHEVEITGLTEKASSARSQANSIQSQL---------EIIQEQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEDKIE 345
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 653 KLAQHVKPEEHEQVKSRLE-----QKSGELGKKITELTLKNQTLQKEIEkvyldnkLLKEQAHNLTiemknhyvplkvse 727
Cdd:pfam15921 346 ELEKQLVLANSELTEARTErdqfsQESGNLDDQLQKLLADLHKREKELS-------LEKEQNKRLW-------------- 404
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 728 DMKKSHDAIIDDLNRKLldvtqkytekklemeklllenDSLSKDVSRLETVfvppekhekeIIALKSNIV-ELKKQLSEL 806
Cdd:pfam15921 405 DRDTGNSITIDHLRREL---------------------DDRNMEVQRLEAL----------LKAMKSECQgQMERQMAAI 453
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 807 KKKcGEDQEKIHALTSENTNLKKMMSnqyvpvKTHEEV---KMTLND---TLAKTNRELLDVKKKFEDINQEFVKIKDKN 880
Cdd:pfam15921 454 QGK-NESLEKVSSLTAQLESTKEMLR------KVVEELtakKMTLESserTVSDLTASLQEKERAIEATNAEITKLRSRV 526
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 881 EILKRNLENTQNQikAEYISLAEHEAKMSSLSQSMRkvqDSNAEILANYRKGQEEIVTLHAEIK-AQKKELDTIQECIKV 959
Cdd:pfam15921 527 DLKLQELQHLKNE--GDHLRNVQTECEALKLQMAEK---DKVIEILRQQIENMTQLVGQHGRTAgAMQVEKAQLEKEIND 601
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 960 KYAPIVSFEECERKFKATEKELKDQLS----EQTQKYSVSEEE---VKKNKQENDKLKKEIFTLQKDLRDKTVLIE---- 1028
Cdd:pfam15921 602 RRLELQEFKILKDKKDAKIRELEARVSdlelEKVKLVNAGSERlraVKDIKQERDQLLNEVKTSRNELNSLSEDYEvlkr 681
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1029 ----KSHEMERAlsrkTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQTSEILAVQNLLQKQHvplEQVEALKKSlngtI 1104
Cdd:pfam15921 682 nfrnKSEEMETT----TNKLKMQLKSAQSELEQTRNTLKSMEGSDGHAMKVAMGMQKQITAKR---GQIDALQSK----I 750
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1105 ENLKEELKSMQRCYEKEQQTVTKLHQLLENQKNSSVPLAEHLQIKEAFEK---------EVGIIKASLREKEEESQNKME 1175
Cdd:pfam15921 751 QFLEEAMTNANKEKHFLKEEKNKLSQELSTVATEKNKMAGELEVLRSQERrlkekvanmEVALDKASLQFAECQDIIQRQ 830
|
810
....*....|..
gi 530406156 1176 EVSKLQSEVQNT 1187
Cdd:pfam15921 831 EQESVRLKLQHT 842
|
|
| PHA02876 |
PHA02876 |
ankyrin repeat protein; Provisional |
25-223 |
1.17e-10 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165207 [Multi-domain] Cd Length: 682 Bit Score: 66.24 E-value: 1.17e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 25 LMKAAERGDVEKVTSILAKKGVNPGKLDVEGRSVFHVVTSKG-NLECLNAILIHGVDITTSDTAGRNALHLAAKYG-HAL 102
Cdd:PHA02876 277 LHHASQAPSLSRLVPKLLERGADVNAKNIKGETPLYLMAKNGyDTENIRTLIMLGADVNAADRLYITPLHQASTLDrNKD 356
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 103 CLQKLLQYNCPTEHADLQGRTALHDAAMADCPSSIQLLCDHGASVNAKDVDGRTPLVLATQMSRP-TICQLLIDRGADVN 181
Cdd:PHA02876 357 IVITLLELGANVNARDYCDKTPIHYAAVRNNVVIINTLLDYGADIEALSQKIGTALHFALCGTNPyMSVKTLIDRGANVN 436
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 530406156 182 SRDKQNRTALMLGCEYGCR-DAVEVLIKNGADIS---------LLDALGHDS 223
Cdd:PHA02876 437 SKNKDLSTPLHYACKKNCKlDVIEMLLDNGADVNainiqnqypLLIALEYHG 488
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
458-1295 |
1.77e-10 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 65.91 E-value: 1.77e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 458 ERVKEDSDEQIKqledalkDVQKRMYES------------------EGKVKQMQTHFLALKEHLTSEAASgnhrlTEELK 519
Cdd:pfam15921 77 ERVLEEYSHQVK-------DLQRRLNESnelhekqkfylrqsvidlQTKLQEMQMERDAMADIRRRESQS-----QEDLR 144
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 520 DQLKDLKVKYEGA-SAEVGKLRNQIKQNEMIVEEFKRDEGKlieenkrLQKELSMCEMEREKKGRKVTEMEgqakelsak 598
Cdd:pfam15921 145 NQLQNTVHELEAAkCLKEDMLEDSNTQIEQLRKMMLSHEGV-------LQEIRSILVDFEEASGKKIYEHD--------- 208
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 599 lalSIPAEKFENMKSSLSnevnekaKKLVEMEREHEKSLSEIRQLKRELENVKAKlAQHVKPEEHEQVKSRLEQKSGELG 678
Cdd:pfam15921 209 ---SMSTMHFRSLGSAIS-------KILRELDTEISYLKGRIFPVEDQLEALKSE-SQNKIELLLQQHQDRIEQLISEHE 277
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 679 KKITELTLKNQTLQKEIEKVYLDNKLLKEQAHNLTIEMKNHYVPLKVS--------EDMKKSHDAIIDDLNRKLLDVTQK 750
Cdd:pfam15921 278 VEITGLTEKASSARSQANSIQSQLEIIQEQARNQNSMYMRQLSDLESTvsqlrselREAKRMYEDKIEELEKQLVLANSE 357
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 751 YTEKKLEMEKLLLENDSLSkdvSRLETVFVPPEKHEKEIialksnivelkkQLSELKKKCGEDQEKIHALTSEntNLKKM 830
Cdd:pfam15921 358 LTEARTERDQFSQESGNLD---DQLQKLLADLHKREKEL------------SLEKEQNKRLWDRDTGNSITID--HLRRE 420
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 831 MSNQYVPVKTHEEVKMTL-NDTLAKTNRELLDVKKKFEDInQEFVKIKDKNEILKRNLENTQNQIKAEYISLAEHEAKMS 909
Cdd:pfam15921 421 LDDRNMEVQRLEALLKAMkSECQGQMERQMAAIQGKNESL-EKVSSLTAQLESTKEMLRKVVEELTAKKMTLESSERTVS 499
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 910 SLSQSM----RKVQDSNAEIlanyRKGQEEIVTLHAEIKAQKKELDTIQECIKvkyapivsfeECER-KFKATEKE---- 980
Cdd:pfam15921 500 DLTASLqekeRAIEATNAEI----TKLRSRVDLKLQELQHLKNEGDHLRNVQT----------ECEAlKLQMAEKDkvie 565
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 981 -LKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTDELNKQLKDL----SQKY 1055
Cdd:pfam15921 566 iLRQQIENMTQLVGQHGRTAGAMQVEKAQLEKEINDRRLELQEFKILKDKKDAKIRELEARVSDLELEKVKLvnagSERL 645
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1056 TEVKNVKE---KLVEENAKQTSEIlavQNLLQKQHVPLEQVEALKKSLNGTIENLKEELKSMQRCYEKEQQTVTKlhqlL 1132
Cdd:pfam15921 646 RAVKDIKQerdQLLNEVKTSRNEL---NSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQTRNTLKS----M 718
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1133 ENQKNSSVPLAEHLQIK-EAFEKEVGIIKASLREKEEESQNKMEEVSKLQSEVQNTKQALKKLETREvvdlSKYKATKSD 1211
Cdd:pfam15921 719 EGSDGHAMKVAMGMQKQiTAKRGQIDALQSKIQFLEEAMTNANKEKHFLKEEKNKLSQELSTVATEK----NKMAGELEV 794
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1212 LETQISSLNEKLANLNRKYEEVCEEVlhAKKKEISAKDEKELLHFSIEQEIkDQKERCDKSLTTITELQRRIQESAKQIE 1291
Cdd:pfam15921 795 LRSQERRLKEKVANMEVALDKASLQF--AECQDIIQRQEQESVRLKLQHTL-DVKELQGPGYTSNSSMKPRLLQPASFTR 871
|
....
gi 530406156 1292 AKDN 1295
Cdd:pfam15921 872 THSN 875
|
|
| PHA02875 |
PHA02875 |
ankyrin repeat protein; Provisional |
40-216 |
2.28e-10 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165206 [Multi-domain] Cd Length: 413 Bit Score: 64.63 E-value: 2.28e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 40 ILAKKGVNPGKLDVEGRSVFHVVTSKGNLECLNAILIHGV---DITTSDtaGRNALHLAAKYGHALCLQKLLQYNCPTEH 116
Cdd:PHA02875 53 LLMKHGAIPDVKYPDIESELHDAVEEGDVKAVEELLDLGKfadDVFYKD--GMTPLHLATILKKLDIMKLLIARGADPDI 130
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 117 ADLQGRTALHDAAMADCPSSIQLLCDHGASVNAKDVDGRTPLVLATQMSRPTICQLLIDRGADVNSRDKQNRTALM-LGC 195
Cdd:PHA02875 131 PNTDKFSPLHLAVMMGDIKGIELLIDHKACLDIEDCCGCTPLIIAMAKGDIAICKMLLDSGANIDYFGKNGCVAALcYAI 210
|
170 180
....*....|....*....|.
gi 530406156 196 EYGCRDAVEVLIKNGADISLL 216
Cdd:PHA02875 211 ENNKIDIVRLFIKRGADCNIM 231
|
|
| PHA03095 |
PHA03095 |
ankyrin-like protein; Provisional |
106-221 |
2.29e-10 |
|
ankyrin-like protein; Provisional
Pssm-ID: 222980 [Multi-domain] Cd Length: 471 Bit Score: 64.66 E-value: 2.29e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 106 KLLQYNCPTEHADLQGRTALH---DAAMADCPSSIQLLCDHGASVNAKDVDGRTPL-VLATQMSRPTICQLLIDRGADVN 181
Cdd:PHA03095 32 RLLAAGADVNFRGEYGKTPLHlylHYSSEKVKDIVRLLLEAGADVNAPERCGFTPLhLYLYNATTLDVIKLLIKAGADVN 111
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 530406156 182 SRDKQNRTALMLGCEYGCRDA--VEVLIKNGADISLLDALGH 221
Cdd:PHA03095 112 AKDKVGRTPLHVYLSGFNINPkvIRLLLRKGADVNALDLYGM 153
|
|
| PHA02874 |
PHA02874 |
ankyrin repeat protein; Provisional |
25-248 |
2.49e-10 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165205 [Multi-domain] Cd Length: 434 Bit Score: 64.60 E-value: 2.49e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 25 LMKAAERGDVEKVTSILaKKGVNPGKLDVEGRSVFHVVTSKGNLECLNAILIHGVDITTSDTAGRNALHLAAKYGHALCL 104
Cdd:PHA02874 128 LHYAIKKGDLESIKMLF-EYGADVNIEDDNGCYPIHIAIKHNFFDIIKLLLEKGAYANVKDNNGESPLHNAAEYGDYACI 206
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 105 QKLLQYNCPTEHADLQGRTALHDAAMADcPSSIQLLCDHgASVNAKDVDGRTPLVLATQMS-RPTICQLLIDRGADVNSR 183
Cdd:PHA02874 207 KLLIDHGNHIMNKCKNGFTPLHNAIIHN-RSAIELLINN-ASINDQDIDGSTPLHHAINPPcDIDIIDILLYHKADISIK 284
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 530406156 184 DKQNRTALMLGCEYGCRDAV-------EVLIKNGADISLLDALGHDSsyyarIGDNLDILTLLKTASENTNK 248
Cdd:PHA02874 285 DNKGENPIDTAFKYINKDPVikdiianAVLIKEADKLKDSDFLEHIE-----IKDNKEFSDFIKECNEEIED 351
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
465-1278 |
3.44e-10 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 64.66 E-value: 3.44e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 465 DEQIKQLEDALKDVQKRMYESEGKVKQMQTHFLALKEHLTSeaasgnhrlteeLKDQLKDLKvkyegasAEVGKLRNQIK 544
Cdd:TIGR04523 32 DTEEKQLEKKLKTIKNELKNKEKELKNLDKNLNKDEEKINN------------SNNKIKILE-------QQIKDLNDKLK 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 545 QNEMIVEEFKRDEGKLIEENKRLQKELSMCEMEREKKGRKVTEMEGQAKELSAKLalsipaEKFENMKSSLSNEVNEKAK 624
Cdd:TIGR04523 93 KNKDKINKLNSDLSKINSEIKNDKEQKNKLEVELNKLEKQKKENKKNIDKFLTEI------KKKEKELEKLNNKYNDLKK 166
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 625 KLVEMEREHEKSLSEIRQLKRELENVKAKLAqhvKPEEHEQVKSRLEQKSGELGKKITELTLKNQTLQKEIEKvyldnkl 704
Cdd:TIGR04523 167 QKEELENELNLLEKEKLNIQKNIDKIKNKLL---KLELLLSNLKKKIQKNKSLESQISELKKQNNQLKDNIEK------- 236
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 705 LKEQAHNLTIEMKNHYVPLKvseDMKKSHDAIIDDLNRKLLDVTQkyTEKKLemeklllenDSLSKDVSRLETVF--VPP 782
Cdd:TIGR04523 237 KQQEINEKTTEISNTQTQLN---QLKDEQNKIKKQLSEKQKELEQ--NNKKI---------KELEKQLNQLKSEIsdLNN 302
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 783 EKHEKEIIALKSNIVELKKQLSELKKKCGEDQEKIHALTSENTNLKKMMSNqyvpvktheevkmtLNDTLAKTNRELLDV 862
Cdd:TIGR04523 303 QKEQDWNKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTN--------------SESENSEKQRELEEK 368
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 863 KKKFEDINQEFVKIKDKNEILKRNLENTQNQIKAEYISLAEHEAKMSSLSQsmrkvqdsnaeilaNYRKGQEEIVTLHAE 942
Cdd:TIGR04523 369 QNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQ--------------EKELLEKEIERLKET 434
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 943 IKAQKKELDTIQECIKVKyapivsfeecerkfKATEKELKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLrd 1022
Cdd:TIGR04523 435 IIKNNSEIKDLTNQDSVK--------------ELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKEKEL-- 498
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1023 ktvliekshemeRALSRKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQTSEILAVQNLLQKQHVPL--EQVEALKKSL 1100
Cdd:TIGR04523 499 ------------KKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDFELkkENLEKEIDEK 566
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1101 NGTIENLKEELKSMQRCYEKEQQtvtKLHQLLENQKNSSVPLAEHLQIKEAFEKEVGIIKASLREKEEESQNKMEEVSKL 1180
Cdd:TIGR04523 567 NKEIEELKQTQKSLKKKQEEKQE---LIDQKEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKNKL 643
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1181 QSEVQNTKQALKKLetrevvdlskyKATKSDLETQISSLNEKLANLNRKYEEVCEEVLHAKKKEISAKDEKELL------ 1254
Cdd:TIGR04523 644 KQEVKQIKETIKEI-----------RNKWPEIIKKIKESKTKIDDIIELMKDWLKELSLHYKKYITRMIRIKDLpkleek 712
|
810 820
....*....|....*....|....
gi 530406156 1255 HFSIEQEIKDQKErCDKSLTTITE 1278
Cdd:TIGR04523 713 YKEIEKELKKLDE-FSKELENIIK 735
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
934-1344 |
5.27e-10 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 63.89 E-value: 5.27e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 934 EEIVTLHAEIKAQKKELDTIQECIKVKYAPIVSFEECERKFKATEKELKDQLSEQTQK-------YSVSEEEVKKNKQEN 1006
Cdd:TIGR04523 40 KKLKTIKNELKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIKDLNDKLKKNKDKinklnsdLSKINSEIKNDKEQK 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1007 DKLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQTSEILAVQNLLQKQ 1086
Cdd:TIGR04523 120 NKLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIKNKLLKL 199
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1087 HVPLEQVEALK---KSLNGTIENLKEELKSMQRCYEKEQQTVTKLHQLLENQKNssvplaEHLQIKEAFEKevgiIKASL 1163
Cdd:TIGR04523 200 ELLLSNLKKKIqknKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQT------QLNQLKDEQNK----IKKQL 269
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1164 REKEEESQNKMEEVSKLQSEVQNTKQALKKLETREVVDLSKY--------KATKSDLETQISSLNEKLANLNRKYEEVCE 1235
Cdd:TIGR04523 270 SEKQKELEQNNKKIKELEKQLNQLKSEISDLNNQKEQDWNKElkselknqEKKLEEIQNQISQNNKIISQLNEQISQLKK 349
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1236 EVLHAKKKEISAKDEKELLHFSIEQEIKDQKERCDKSL---TTITELQRRIQESAKQIEAKDNKITELLNDVERLKQALN 1312
Cdd:TIGR04523 350 ELTNSESENSEKQRELEEKQNEIEKLKKENQSYKQEIKnleSQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIE 429
|
410 420 430
....*....|....*....|....*....|..
gi 530406156 1313 GLSQLTYTSGNPTKRQSQLIDTLQHQVKSLEQ 1344
Cdd:TIGR04523 430 RLKETIIKNNSEIKDLTNQDSVKELIIKNLDN 461
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
606-1352 |
5.37e-10 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 64.37 E-value: 5.37e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 606 EKFENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQlkrELENVKAKLAQHVKPEEHEQVKSRLEQKSGElgkkitelT 685
Cdd:pfam15921 74 EHIERVLEEYSHQVKDLQRRLNESNELHEKQKFYLRQ---SVIDLQTKLQEMQMERDAMADIRRRESQSQE--------D 142
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 686 LKNQtLQKEIEKVYLdNKLLKEQAhnltieMKNHYVPLKVSEDMKKSHDAIIDDLNRKLLDVTQKYTEKklemeklLLEN 765
Cdd:pfam15921 143 LRNQ-LQNTVHELEA-AKCLKEDM------LEDSNTQIEQLRKMMLSHEGVLQEIRSILVDFEEASGKK-------IYEH 207
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 766 DSLSKdvsrletvfvppeKHEKEIIALKSNIV-ELKKQLSELKKKCGEDQEKIHALTSENTNLKKMMSNQyvpvktHEEv 844
Cdd:pfam15921 208 DSMST-------------MHFRSLGSAISKILrELDTEISYLKGRIFPVEDQLEALKSESQNKIELLLQQ------HQD- 267
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 845 kmTLNDTLAKTNRELLDVKKKFEDINQEFVKIKDKNEILKrnlENTQNQIKAEYISLAEHEAKMSSLSQSMRKVQDSNAE 924
Cdd:pfam15921 268 --RIEQLISEHEVEITGLTEKASSARSQANSIQSQLEIIQ---EQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYED 342
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 925 ILANYRKgqeEIVTLHAEIKAQKKELDtiqecikvkyapivsfeecerKFKATEKELKDQLSEQTQKYSVSEEEVKKNKQ 1004
Cdd:pfam15921 343 KIEELEK---QLVLANSELTEARTERD---------------------QFSQESGNLDDQLQKLLADLHKREKELSLEKE 398
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1005 ENDKL-------KKEIFTLQKDLRDKTVLIEKsheMERALSRKTDELNKQLKdlsQKYTEVKNVKEKLvEENAKQTSEIL 1077
Cdd:pfam15921 399 QNKRLwdrdtgnSITIDHLRRELDDRNMEVQR---LEALLKAMKSECQGQME---RQMAAIQGKNESL-EKVSSLTAQLE 471
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1078 AVQNLLQKQhvpLEQVEALKKSLNG---TIENLKEELKSMQRCYEKEQQTVTKLH-------QLLENQKNSsvplAEHLQ 1147
Cdd:pfam15921 472 STKEMLRKV---VEELTAKKMTLESserTVSDLTASLQEKERAIEATNAEITKLRsrvdlklQELQHLKNE----GDHLR 544
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1148 IKEAfekEVGIIKASLREKE---EESQNKMEEVSKLQSE-------VQNTKQALKKLETREVVDLSKYKATKSDLETQIS 1217
Cdd:pfam15921 545 NVQT---ECEALKLQMAEKDkviEILRQQIENMTQLVGQhgrtagaMQVEKAQLEKEINDRRLELQEFKILKDKKDAKIR 621
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1218 SLNEKLANLNRKyeevceevlhaKKKEISAKDEKELLHFSIEQE---IKDQKERCDKSLTTITE----LQRRIQESAKQI 1290
Cdd:pfam15921 622 ELEARVSDLELE-----------KVKLVNAGSERLRAVKDIKQErdqLLNEVKTSRNELNSLSEdyevLKRNFRNKSEEM 690
|
730 740 750 760 770 780 790
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 530406156 1291 EAKDNKITELLNDVE-RLKQALNGLSQLTYTSGNPTK----RQSQL------IDTLQHQVKSLEQQLADADRQ 1352
Cdd:pfam15921 691 ETTTNKLKMQLKSAQsELEQTRNTLKSMEGSDGHAMKvamgMQKQItakrgqIDALQSKIQFLEEAMTNANKE 763
|
|
| PHA03100 |
PHA03100 |
ankyrin repeat protein; Provisional |
59-239 |
6.74e-10 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 222984 [Multi-domain] Cd Length: 422 Bit Score: 63.15 E-value: 6.74e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 59 FHVVTSKGNLECLNAILIHGVDITTSDTAGRNALHLAAKYGHALCLQK-----LLQYNCPTEHADLQGRTALHDAAMADC 133
Cdd:PHA03100 39 LYLAKEARNIDVVKILLDNGADINSSTKNNSTPLHYLSNIKYNLTDVKeivklLLEYGANVNAPDNNGITPLLYAISKKS 118
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 134 --PSSIQLLCDHGASVNAKDVDGRTPLVLATQMSRPT--ICQLLIDRGADVN----------------SRDKQNRTALML 193
Cdd:PHA03100 119 nsYSIVEYLLDNGANVNIKNSDGENLLHLYLESNKIDlkILKLLIDKGVDINaknrvnyllsygvpinIKDVYGFTPLHY 198
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 530406156 194 GCEYGCRDAVEVLIKNGADISLLDALGHDSSYYARIGDNLDILTLL 239
Cdd:PHA03100 199 AVYNNNPEFVKYLLDLGANPNLVNKYGDTPLHIAILNNNKEIFKLL 244
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
755-1346 |
7.16e-10 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 63.98 E-value: 7.16e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 755 KLEMEKLLLENDSLSKDVSRleTVFVPPEKHEKEIIALKSNIVELKKQLSELK-------------KKCGED-----QEK 816
Cdd:pfam15921 73 KEHIERVLEEYSHQVKDLQR--RLNESNELHEKQKFYLRQSVIDLQTKLQEMQmerdamadirrreSQSQEDlrnqlQNT 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 817 IHALTSENTNLKKMMSNQYVPVKTHEEVKMTLNDTLAKTNRELLDV-----KKKFEDINQEFVKIKDKNEILKRNLENTQ 891
Cdd:pfam15921 151 VHELEAAKCLKEDMLEDSNTQIEQLRKMMLSHEGVLQEIRSILVDFeeasgKKIYEHDSMSTMHFRSLGSAISKILRELD 230
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 892 NQI---KAEYISLAEHEAKMSSLSQS-----MRKVQDSNAEILANYrkgQEEIVTLHAEIKAQKKELDTIQ---ECIKVK 960
Cdd:pfam15921 231 TEIsylKGRIFPVEDQLEALKSESQNkiellLQQHQDRIEQLISEH---EVEITGLTEKASSARSQANSIQsqlEIIQEQ 307
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 961 YAPIVSFEECE-RKFKATEKELKDQLSEQTQKYSVSEEEVKKNKQendkLKKEIFTLQKDLRDKtvLIEKSHEMERALSR 1039
Cdd:pfam15921 308 ARNQNSMYMRQlSDLESTVSQLRSELREAKRMYEDKIEELEKQLV----LANSELTEARTERDQ--FSQESGNLDDQLQK 381
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1040 KTDELNKQLKDLSQKytevKNVKEKLVEENAKQTSEILAVQNLLQKQHVPLEQVEALKKSLNGTIENLKEELKSMQRCYE 1119
Cdd:pfam15921 382 LLADLHKREKELSLE----KEQNKRLWDRDTGNSITIDHLRRELDDRNMEVQRLEALLKAMKSECQGQMERQMAAIQGKN 457
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1120 KEQQTVTKLHQLLENQKNSSVPLAEHLQIK----EAFEKEVGIIKASLREKEEESQNKMEEVSKLQSEVQNTKQALKKLE 1195
Cdd:pfam15921 458 ESLEKVSSLTAQLESTKEMLRKVVEELTAKkmtlESSERTVSDLTASLQEKERAIEATNAEITKLRSRVDLKLQELQHLK 537
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1196 TREvVDLSKYKATKSDLETQISSLNEKLANLNRKYEEVCEEVLHAKKKEISAKDEKELLHFSIEQ---EIKDQKERCDKS 1272
Cdd:pfam15921 538 NEG-DHLRNVQTECEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQLEKEINDrrlELQEFKILKDKK 616
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1273 LTTITELQRRIQ----ESAKQIEAKDNKI----------TELLNDVERLKQALNGLSQ----LTYTSGNPTKRQSQLIDT 1334
Cdd:pfam15921 617 DAKIRELEARVSdlelEKVKLVNAGSERLravkdikqerDQLLNEVKTSRNELNSLSEdyevLKRNFRNKSEEMETTTNK 696
|
650
....*....|..
gi 530406156 1335 LQHQVKSLEQQL 1346
Cdd:pfam15921 697 LKMQLKSAQSEL 708
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
903-1254 |
8.68e-10 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 63.54 E-value: 8.68e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 903 EHEAKMSSLSQSMRKVQDSNAEILANYRK--GQEEIVTLHAEIKAQKKELDtiqecIKVKYAPIVSFEECERKFKATEKE 980
Cdd:TIGR02168 176 ETERKLERTRENLDRLEDILNELERQLKSleRQAEKAERYKELKAELRELE-----LALLVLRLEELREELEELQEELKE 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 981 LKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTDELNKQLKDLSQKYTEVKN 1060
Cdd:TIGR02168 251 AEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELES 330
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1061 VKEKLVEEnakqtseilavqnllqkqhvpLEQVEALKKSLNGTIENLKEELKSMQRCYEKEQQTVTKLHQLLENQKNSsv 1140
Cdd:TIGR02168 331 KLDELAEE---------------------LAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSK-- 387
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1141 pLAEHLQIKEAFEKEVGIIKASLREKEEESQNKMEEVSKLQSEVQNtkQALKKLETRevvdLSKYKATKSDLETQISSLN 1220
Cdd:TIGR02168 388 -VAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEE--AELKELQAE----LEELEEELEELQEELERLE 460
|
330 340 350
....*....|....*....|....*....|....
gi 530406156 1221 EKLANLNRKYEEVCEEVLHAKKKEISAKDEKELL 1254
Cdd:TIGR02168 461 EALEELREELEEAEQALDAAERELAQLQARLDSL 494
|
|
| PHA02874 |
PHA02874 |
ankyrin repeat protein; Provisional |
67-227 |
9.73e-10 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165205 [Multi-domain] Cd Length: 434 Bit Score: 62.67 E-value: 9.73e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 67 NLECLNAILIHGVDITTSDTAGRNALHLAAKYGHALCLQKLLQYNCPTEHADLQGRTALHDAAMADCPSSIQLLCDHGAS 146
Cdd:PHA02874 103 EKDMIKTILDCGIDVNIKDAELKTFLHYAIKKGDLESIKMLFEYGADVNIEDDNGCYPIHIAIKHNFFDIIKLLLEKGAY 182
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 147 VNAKDVDGRTPLVLATQMSRPTICQLLIDRGADVNSRDKQNRTALMLGCEYGcRDAVEVLIKNgADISLLDALGHDSSYY 226
Cdd:PHA02874 183 ANVKDNNGESPLHNAAEYGDYACIKLLIDHGNHIMNKCKNGFTPLHNAIIHN-RSAIELLINN-ASINDQDIDGSTPLHH 260
|
.
gi 530406156 227 A 227
Cdd:PHA02874 261 A 261
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1017-1390 |
1.02e-09 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 63.42 E-value: 1.02e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1017 QKDLRDKTVLIEKSHEMERALSRKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQTSEILAVQNLLQKQHVPLEQVEAL 1096
Cdd:COG1196 221 ELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARL 300
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1097 KKSLNGTIENLKEELKSMQRcYEKEQQTVTKLHQLLENQknssvpLAEHLQIKEAFEKEVGIIKASLREKEEESQNKMEE 1176
Cdd:COG1196 301 EQDIARLEERRRELEERLEE-LEEELAELEEELEELEEE------LEELEEELEEAEEELEEAEAELAEAEEALLEAEAE 373
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1177 VSKLQSEVQNTKQALKKLETREVVDLSKYKAtksdLETQISSLNEKLANLNRKYEEVCEEVLHAKKKEISAkdekellhf 1256
Cdd:COG1196 374 LAEAEEELEELAEELLEALRAAAELAAQLEE----LEEAEEALLERLERLEEELEELEEALAELEEEEEEE--------- 440
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1257 siEQEIKDQKERCDKSLTTITELQRRIQESAKQIEAKDNKITELLNDVERLKQALNGLSQLTytsgnpTKRQSQLIDTLQ 1336
Cdd:COG1196 441 --EEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEAE------ADYEGFLEGVKA 512
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....
gi 530406156 1337 HQVKSLEQQLADADRQHQEVIAIYRTHLLSAAQGHMDEDVQEALLQIIQMRQGL 1390
Cdd:COG1196 513 ALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYL 566
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
438-1115 |
1.66e-09 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 62.35 E-value: 1.66e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 438 KLQNELAHKVAECKAL---ALECERVKEDSDEQIKQLEDALKDVQKRMYESEGKVKQMQTHFLALKEHLTSEAASGNhrl 514
Cdd:TIGR04523 44 TIKNELKNKEKELKNLdknLNKDEEKINNSNNKIKILEQQIKDLNDKLKKNKDKINKLNSDLSKINSEIKNDKEQKN--- 120
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 515 teELKDQLKDLKVKYEGASAEVGKLRNQIKQNEMIVEEFKRDEGKLIEENKRLQKELSMCEMEREKKGRKVTEMEgqaKE 594
Cdd:TIGR04523 121 --KLEVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLNIQKNIDKIK---NK 195
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 595 LSAKLALSIPAEKFENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKRELENVKAKLAQHVkpEEHEQVKSRLEQKS 674
Cdd:TIGR04523 196 LLKLELLLSNLKKKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLK--DEQNKIKKQLSEKQ 273
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 675 GEL---GKKITELTLKNQTLQKEIEKvyLDNKLLKEQAHNLTIEMKNHYVPLKVSEDMKKSHDAIIDDLNRKLLDVTQKY 751
Cdd:TIGR04523 274 KELeqnNKKIKELEKQLNQLKSEISD--LNNQKEQDWNKELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKEL 351
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 752 TEKKLEMEKLlleNDSLSKDVSRLETVFVPPEKHEKEIIALKSNIVELKKQLSELKKKCGEDQEKIHALTSENTNLKKMM 831
Cdd:TIGR04523 352 TNSESENSEK---QRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEI 428
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 832 SNQYVPVKTHEEVKMTLNDTLAKTNRELLDVKKKFEDINQEFVKIKDKNEILKRNLENTQNQIKAEYISLAEHEAKMSSL 911
Cdd:TIGR04523 429 ERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKKEL 508
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 912 SQSMRKVQDSNAEILANYRKGQEEIVTLHAEIKAQKKELDTIqecikvkyapivsfeecerkfkatekelkdqlsEQTQK 991
Cdd:TIGR04523 509 EEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKD---------------------------------DFELK 555
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 992 YSVSEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKshemeraLSRKTDELNKQLKDLSQKYTEVKNVKEKLVEENAK 1071
Cdd:TIGR04523 556 KENLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQ-------KEKEKKDLIKEIEEKEKKISSLEKELEKAKKENEK 628
|
650 660 670 680
....*....|....*....|....*....|....*....|....
gi 530406156 1072 QTSEIlavQNLLQKQHVPLEQVEALKKSLNGTIENLKEELKSMQ 1115
Cdd:TIGR04523 629 LSSII---KNIKSKKNKLKQEVKQIKETIKEIRNKWPEIIKKIK 669
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
888-1364 |
1.81e-09 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 62.43 E-value: 1.81e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 888 ENTQNQIKAEYISLAEHEAKMSSLSQSMRkVQDSNAEILANYRKGQEEIVTLHAEiKAQKKELDTIQECIKVKYAPIVSF 967
Cdd:pfam05483 175 EYEREETRQVYMDLNNNIEKMILAFEELR-VQAENARLEMHFKLKEDHEKIQHLE-EEYKKEINDKEKQVSLLLIQITEK 252
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 968 EECERKFKATEKELKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEI----FTLQKDLRDKTVLIEKSHEMERALSRKTDE 1043
Cdd:pfam05483 253 ENKMKDLTFLLEESRDKANQLEEKTKLQDENLKELIEKKDHLTKELedikMSLQRSMSTQKALEEDLQIATKTICQLTEE 332
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1044 LNKQLKDLSQKYTEVKNVKEKLveenakqTSEILAVQNLLQKQHVPLEQVEALKKSLNGTIENLKEELKSMQRCYE---- 1119
Cdd:pfam05483 333 KEAQMEELNKAKAAHSFVVTEF-------EATTCSLEELLRTEQQRLEKNEDQLKIITMELQKKSSELEEMTKFKNnkev 405
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1120 --KEQQTVTKLHQLLENQKNSSVPLAEHLQIKE--------AFEKEVGIIKASLREKEEESQNKMEEVSKLQSEVQNTKq 1189
Cdd:pfam05483 406 elEELKKILAEDEKLLDEKKQFEKIAEELKGKEqelifllqAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEK- 484
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1190 aLKKLETREVVDL-----SKYKATKSDLETQISSLNEKLANLNRKYEEVCEEVLHAKKKEISAKDEKELLHFSIEQEIKD 1264
Cdd:pfam05483 485 -LKNIELTAHCDKlllenKELTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDELESVREEFIQKGDE 563
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1265 QKERCDKSLTTITELQRRIQESAKQIEAKDNKITELLNDVERLKQALNGLSQLTYTSGNPTKRQSQLIDTLQHQVKSLEQ 1344
Cdd:pfam05483 564 VKCKLDKSEENARSIEYEVLKKEKQMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAENKQLNAYEIKVNKLEL 643
|
490 500
....*....|....*....|
gi 530406156 1345 QLADADRQHQEVIAIYRTHL 1364
Cdd:pfam05483 644 ELASAKQKFEEIIDNYQKEI 663
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
864-1309 |
3.97e-09 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 61.70 E-value: 3.97e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 864 KKFEDINQEFVKIKDKNEILKRNLENTQNQIKAEYISLAEHEAKMSSLSQSMRKVQDsNAEILANYRKGQEEIVTLHAEI 943
Cdd:PTZ00121 1194 RKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERN-NEEIRKFEEARMAHFARRQAAI 1272
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 944 KAQ-KKELDTIQECIKVKYAPIVSFEECERKF-----KATEKELKDQLSEQTQKYSVSEEEVKKNKQENdklKKEIFTLQ 1017
Cdd:PTZ00121 1273 KAEeARKADELKKAEEKKKADEAKKAEEKKKAdeakkKAEEAKKADEAKKKAEEAKKKADAAKKKAEEA---KKAAEAAK 1349
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1018 KDLRDKTVLIEKSHEMERALSRKTDELNKQLKDLSQKYTEVKNVKE--KLVEENAKQTSEILAVQNLLQKQHVPLEQVEA 1095
Cdd:PTZ00121 1350 AEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEakKKAEEDKKKADELKKAAAAKKKADEAKKKAEE 1429
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1096 LKKSlngtiENLKEELKSMQRCYEKEQQTVTKLHQLLENQKNSSVPLAEHLQiKEAFEKEVGiikASLREKEEESQNKME 1175
Cdd:PTZ00121 1430 KKKA-----DEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAK-KKAEEAKKA---DEAKKKAEEAKKKAD 1500
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1176 EVSKLQSEVQNTKQALKKLETREVVDLSKYKATKSDLETQISSLNEKLANLnRKYEEV--CEEVLHAKKKEiSAKDEKEL 1253
Cdd:PTZ00121 1501 EAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADEL-KKAEELkkAEEKKKAEEAK-KAEEDKNM 1578
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*.
gi 530406156 1254 LHFSIEQEIKDQKERCDKSLTTITELQRRIQESAKQIEAKDNKITELLNDVERLKQ 1309
Cdd:PTZ00121 1579 ALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKK 1634
|
|
| PTZ00322 |
PTZ00322 |
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional |
137-234 |
5.04e-09 |
|
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
Pssm-ID: 140343 [Multi-domain] Cd Length: 664 Bit Score: 60.68 E-value: 5.04e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 137 IQLLCDHGASVNAKDVDGRTPLVLATQMSRPTICQLLIDRGADVNSRDKQNRTALMLGCEYGCRDAVEVLIKNGADISLL 216
Cdd:PTZ00322 98 ARILLTGGADPNCRDYDGRTPLHIACANGHVQVVRVLLEFGADPTLLDKDGKTPLELAEENGFREVVQLLSRHSQCHFEL 177
|
90
....*....|....*...
gi 530406156 217 DALGHDSSYYARIGDNLD 234
Cdd:PTZ00322 178 GANAKPDSFTGKPPSLED 195
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
288-1178 |
6.66e-09 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 60.76 E-value: 6.66e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 288 IENEDLKERLRKIQQEQRILLDKVNGLQLQLNEREKLKSLLAAKEKQHEESLRTIEALKNRFKYFESDHLGSGSHFSNRK 367
Cdd:pfam02463 166 RLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQEL 245
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 368 EDMLLKQGQMYMADSQCTSPGIPAHMQSRSMLRPLELSLPSQTSYSENEILKKELEAmrtfcESAKQDRLKLQNELAHKV 447
Cdd:pfam02463 246 LRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSEL-----LKLERRKVDDEEKLKESE 320
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 448 AECKALALECERVKEDSDEQIKQLEDALKDVQKRMYESEGKVKQMQTHFLALKEHLTSeaasgNHRLTEELKDQLKDLKV 527
Cdd:pfam02463 321 KEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAK-----KKLESERLSSAAKLKEE 395
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 528 KYEGASAEVGKLRNQIkqnemivEEFKRDEGKLIEENKRLQKELSMCEMEREKKGRKVTEMEGQAKELSAKLAlsipaek 607
Cdd:pfam02463 396 ELELKSEEEKEAQLLL-------ELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLL------- 461
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 608 fENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKRE-LENVKAKLAQHVKPEEHEQVKSRLEQKSGELGKKITELTL 686
Cdd:pfam02463 462 -KDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEERSqKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVEN 540
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 687 -KNQTLQKEIEKVYLDNKLLKEQAHNLTIEMKNHYVPLKVSEDMKKS----------HDAIIDDLNRKLLDVTQKYTEKK 755
Cdd:pfam02463 541 yKVAISTAVIVEVSATADEVEERQKLVRALTELPLGARKLRLLIPKLklplksiavlEIDPILNLAQLDKATLEADEDDK 620
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 756 LEMEKLLLENDSLSKDVSRLETVFVPPEKHEKEIIALKSNIVELKKQLSELKKKCGEDQEKIHALTSENTNLKKMMSNQY 835
Cdd:pfam02463 621 RAKVVEGILKDTELTKLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLE 700
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 836 VPVKTHEEVKMTLNDTLAKTNRELLDVKKKFEDINQEFVKIKDKNEILKRNLENTQNQ--IKAEYISLAEHEAKMSSLSQ 913
Cdd:pfam02463 701 IKKKEQREKEELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKkeEKEEEKSELSLKEKELAEER 780
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 914 SMRKVQDSNAEILANYRKGQEEIVTLHAEIKAQKKELDTIQECIKVKYAPIVsfEECERKFKATEKELKDQLSEQTQKYS 993
Cdd:pfam02463 781 EKTEKLKVEEEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKE--EELEELALELKEEQKLEKLAEEELER 858
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 994 VSEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQT 1073
Cdd:pfam02463 859 LEEEITKEELLQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPE 938
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1074 SEILAVQNLLQKQHVPLEQVEALKKSLNGTIENLKEELKSMQRCYEKEQQTVTKLHQLLENQKNSSVPLAEHLQIKEAFE 1153
Cdd:pfam02463 939 ELLLEEADEKEKEENNKEEEEERNKRLLLAKEELGKVNLMAIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQR 1018
|
890 900
....*....|....*....|....*
gi 530406156 1154 KEVGIIKASLREKEEESQNKMEEVS 1178
Cdd:pfam02463 1019 LKEFLELFVSINKGWNKVFFYLELG 1043
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
624-1317 |
2.07e-08 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 59.21 E-value: 2.07e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 624 KKLVEMEREHEKSLSEIRQLKRELENVKAKLAQHVKPEEHEQVKSRLEQKsgELGKKITELTLKNQTLQKEIEKVYLDNK 703
Cdd:pfam02463 169 RKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEK--LELEEEYLLYLDYLKLNEERIDLLQELL 246
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 704 LLKEQAHNLTI-EMKNHYVPLKVSEDMKKSHDAIIDDLNRKLLDVTQKYTEKKLEMEKLLLENDSLSKDVSRLETVfvpP 782
Cdd:pfam02463 247 RDEQEEIESSKqEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKE---K 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 783 EKHEKEIIALKSNIVELKKQLSELKKKCGEDQEKIHALTSENTNLKKMMSNQYVPVKTHEE----VKMTLNDTLAKTNRE 858
Cdd:pfam02463 324 KKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESErlssAAKLKEEELELKSEE 403
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 859 LLDVKKKFEDINQEFVKIKDKNEILKRNLENTQ--NQIKAEYISLAEHEAKMSSLSQSMRKVQDSNAEILANYRKGQEEI 936
Cdd:pfam02463 404 EKEAQLLLELARQLEDLLKEEKKEELEILEEEEesIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQ 483
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 937 VTLHAEIKAQKKELDTIQEC----------IKVKYAPIVSFEECERKFKATEKELKDQLSEQTQKYSVSEEEVKKNKQEN 1006
Cdd:pfam02463 484 EQLELLLSRQKLEERSQKESkarsglkvllALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEER 563
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1007 DKLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKT--DELNKQLKDLSQKYTEVKNVKEKLVEENAKQTSEI-------- 1076
Cdd:pfam02463 564 QKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEidPILNLAQLDKATLEADEDDKRAKVVEGILKDTELTklkesaka 643
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1077 --------------------LAVQNLLQKQHVPLEQVEALKKSLNGTIENLKEELKSMQRCYEKEQQTVTKLHQLLENQK 1136
Cdd:pfam02463 644 kesglrkgvsleeglaekseVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKKLKLEAEELL 723
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1137 NSSVPLAEHLQIKEAFEKEVGIIKASLREKEEESQNKMEEVSKLQSEVQNTKQALKKLETREV---VDLSKYKATKSDLE 1213
Cdd:pfam02463 724 ADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLkveEEKEEKLKAQEEEL 803
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1214 TQISSLNEKLANLNRKYEEVCEEVLHAKKKEISAKDEKELLHFSIEQEIKDQKERCDKSLTTITELQRR--IQESAKQIE 1291
Cdd:pfam02463 804 RALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEELLQELllKEEELEEQK 883
|
730 740
....*....|....*....|....*.
gi 530406156 1292 AKDNKITELLNDVERLKQALNGLSQL 1317
Cdd:pfam02463 884 LKDELESKEEKEKEEKKELEEESQKL 909
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
540-1366 |
2.24e-08 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 58.90 E-value: 2.24e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 540 RNQIKQNEMIVEEFKRDEGKLIEENKRLQKELS-MCEMEREKKGRKVT--EMEGQAKELSAKLALSIPA--EKFENMKSS 614
Cdd:TIGR00606 230 EAQLESSREIVKSYENELDPLKNRLKEIEHNLSkIMKLDNEIKALKSRkkQMEKDNSELELKMEKVFQGtdEQLNDLYHN 309
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 615 LSNEVNEKAKKLVEMEREHEKSLSEIRQL---KRELENVKAKLAQHVKPEEHEQVKSRLEQKSGELGKKI--------TE 683
Cdd:TIGR00606 310 HQRTVREKERELVDCQRELEKLNKERRLLnqeKTELLVEQGRLQLQADRHQEHIRARDSLIQSLATRLELdgfergpfSE 389
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 684 LTLKN-QTLQKE--IEKVYLDNKLLKEQAHNLTIEMKNhyvpLKVSEDMKKSHDAIIDDLNRKLLDVTQKYTEKKLEMEK 760
Cdd:TIGR00606 390 RQIKNfHTLVIErqEDEAKTAAQLCADLQSKERLKQEQ----ADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQ 465
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 761 LLLENDSLSKDVSRLETVFVPPEKHEKE--IIALKSNIVELK-KQLSELKKKCGEDQEkIHALTSENTNLKKMMSNQYVP 837
Cdd:TIGR00606 466 LEGSSDRILELDQELRKAERELSKAEKNslTETLKKEVKSLQnEKADLDRKLRKLDQE-MEQLNHHTTTRTQMEMLTKDK 544
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 838 VKTHEEV--------------------KMTLNDTLAKTNRELLDVKKKFEDINQEFVKikdkneilkrnLENTQNQIKAE 897
Cdd:TIGR00606 545 MDKDEQIrkiksrhsdeltsllgyfpnKKQLEDWLHSKSKEINQTRDRLAKLNKELAS-----------LEQNKNHINNE 613
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 898 YISLaehEAKMSSLSQSMRKVQDSNAEilanyrkgQEEIVTLHAEIKAQKKELDTIQECIKVKYAPIVSFEE-------- 969
Cdd:TIGR00606 614 LESK---EEQLSSYEDKLFDVCGSQDE--------ESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQLTDenqsccpv 682
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 970 CERKFKaTEKELKDQLSEQTQKYSV-------SEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTD 1042
Cdd:TIGR00606 683 CQRVFQ-TEAELQEFISDLQSKLRLapdklksTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEIPELRNKLQKVNRDIQ 761
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1043 ELNKQLKDlSQKYTEVKNVKEKLVEENAKQTSEILAVQNLLQKQHVPLEQVEALKKSLNGT----------------IEN 1106
Cdd:TIGR00606 762 RLKNDIEE-QETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQGSDLDrtvqqvnqekqekqheLDT 840
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1107 LKEELKSMQRCYEKEQQTVTKLHQLLENQKNSSVPLAEHLQIKEAFEKEvgiikasLREKEEESQNKMEEVSKLQSEVQN 1186
Cdd:TIGR00606 841 VVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQ-------LVELSTEVQSLIREIKDAKEQDSP 913
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1187 TKQALKKLETREVVDLSKYKATKSDLETQISSLNEKLAN-------LNRKYEEVCEEVLHAKKKEISA------------ 1247
Cdd:TIGR00606 914 LETFLEKDQQEKEELISSKETSNKKAQDKVNDIKEKVKNihgymkdIENKIQDGKDDYLKQKETELNTvnaqleecekhq 993
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1248 -KDEKELLHFSIEQEIKDQKERCDKSLTTITELQRRIQESAKQIEAKDNKITELlnDVERLKQALNGLSQLTYTSGNPTK 1326
Cdd:TIGR00606 994 eKINEDMRLMRQDIDTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQM--QVLQMKQEHQKLEENIDLIKRNHV 1071
|
890 900 910 920
....*....|....*....|....*....|....*....|....*
gi 530406156 1327 ----RQSQLIDTLQHQVKSL-EQQLADADRQHQEVIAIYRTHLLS 1366
Cdd:TIGR00606 1072 lalgRQKGYEKEIKHFKKELrEPQFRDAEEKYREMMIVMRTTELV 1116
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
609-1196 |
2.48e-08 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 58.76 E-value: 2.48e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 609 ENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKRELENVKAKLaQHVKPEEHE-----QVKSRLEQKSGELGKKITE 683
Cdd:PRK01156 189 EEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDY-NNLKSALNElssleDMKNRYESEIKTAESDLSM 267
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 684 LTLKNQTLqKEIEKVYldNKLLKEQAHNLTIEMKNHYVPLKVSEDMKKSHDAIIDDLNR-----KLLDVTQKYTEKKLEM 758
Cdd:PRK01156 268 ELEKNNYY-KELEERH--MKIINDPVYKNRNYINDYFKYKNDIENKKQILSNIDAEINKyhaiiKKLSVLQKDYNDYIKK 344
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 759 EKlllENDSLSKDVSRLETvfvppekHEKEIIALKSNIVELKKQLSELKKKcgedQEKIHALTSENtnLKKMMSNQYVPV 838
Cdd:PRK01156 345 KS---RYDDLNNQILELEG-------YEMDYNSYLKSIESLKKKIEEYSKN----IERMSAFISEI--LKIQEIDPDAIK 408
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 839 KTHEEVkmtlndtlaktNRELLDVKKKFEDINQEFVKIKDKNEILKRNLENTQNQIKAEYISLAEHEAKMSSLsqsMRKV 918
Cdd:PRK01156 409 KELNEI-----------NVKLQDISSKVSSLNQRIRALRENLDELSRNMEMLNGQSVCPVCGTTLGEEKSNHI---INHY 474
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 919 QDSNAEILANYRKGQEEIVTLHAEIKAQKKELDTIQEcikvkyAPIVSFEECERKFKATEKELKD------QLSEQTQKY 992
Cdd:PRK01156 475 NEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLES------EEINKSINEYNKIESARADLEDikikinELKDKHDKY 548
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 993 SVSEEEVKKNKQENDKLKKEIFTLQKDLRDkTVLIEkshemerALSRKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQ 1072
Cdd:PRK01156 549 EEIKNRYKSLKLEDLDSKRTSWLNALAVIS-LIDIE-------TNRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYIDKS 620
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1073 TSEILAVQNLLQKQHVPLEQVEALKKSLNGTIENLKEELKSMQRCYEKEQQTVTKLHQLLENQKNSsvplaehlqikeaf 1152
Cdd:PRK01156 621 IREIENEANNLNNKYNEIQENKILIEKLRGKIDNYKKQIAEIDSIIPDLKEITSRINDIEDNLKKS-------------- 686
|
570 580 590 600
....*....|....*....|....*....|....*....|....
gi 530406156 1153 EKEVGIIKASLREKEEESQNKMEEVSKLQSEVQNTKQALKKLET 1196
Cdd:PRK01156 687 RKALDDAKANRARLESTIEILRTRINELSDRINDINETLESMKK 730
|
|
| PHA02878 |
PHA02878 |
ankyrin repeat protein; Provisional |
36-209 |
3.25e-08 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 222939 [Multi-domain] Cd Length: 477 Bit Score: 57.97 E-value: 3.25e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 36 KVTSILAKKGVNPGKLDVE-GRSVFHVVTSKGNLECLNAILIHGVDITTSDTAGRNALHLAAKYGHALCLQKLLQYNCPT 114
Cdd:PHA02878 148 EITKLLLSYGADINMKDRHkGNTALHYATENKDQRLTELLLSYGANVNIPDKTNNSPLHHAVKHYNKPIVHILLENGAST 227
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 115 EHADLQGRTALHdAAMADCPSS--IQLLCDHGASVNAKD-VDGRTPLVLATQMSRPTicQLLIDRGADVNSRDKQNRTAL 191
Cdd:PHA02878 228 DARDKCGNTPLH-ISVGYCKDYdiLKLLLEHGVDVNAKSyILGLTALHSSIKSERKL--KLLLEYGADINSLNSYKLTPL 304
|
170
....*....|....*....
gi 530406156 192 -MLGCEYGCRDAVEVLIKN 209
Cdd:PHA02878 305 sSAVKQYLCINIGRILISN 323
|
|
| PHA02876 |
PHA02876 |
ankyrin repeat protein; Provisional |
28-244 |
5.06e-08 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165207 [Multi-domain] Cd Length: 682 Bit Score: 57.38 E-value: 5.06e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 28 AAERGDVeKVTSILAKKGVNPGKLDVEGRSVFHVVTSKGNLECLNAILIHGVDITTSDTAGRNALHlaakygHALCLQKL 107
Cdd:PHA02876 185 AAERGNA-KMVNLLLSYGADVNIIALDDLSVLECAVDSKNIDTIKAIIDNRSNINKNDLSLLKAIR------NEDLETSL 257
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 108 LQYNC--PTEHADLQGRTALHDAAMAdcPSSIQL---LCDHGASVNAKDVDGRTPLVLATQMSRPTI-CQLLIDRGADVN 181
Cdd:PHA02876 258 LLYDAgfSVNSIDDCKNTPLHHASQA--PSLSRLvpkLLERGADVNAKNIKGETPLYLMAKNGYDTEnIRTLIMLGADVN 335
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 530406156 182 SRDKQNRTALMLGCEYG-CRDAVEVLIKNGADISLLDALGHDSSYYARIGDNLDIL-TLLKTASE 244
Cdd:PHA02876 336 AADRLYITPLHQASTLDrNKDIVITLLELGANVNARDYCDKTPIHYAAVRNNVVIInTLLDYGAD 400
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
586-1014 |
7.40e-08 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 57.39 E-value: 7.40e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 586 TEMEGQAKELSAKL-ALSIPAEKFENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKRELENVKAKLAQ-HVKPEEH 663
Cdd:TIGR02169 670 RSEPAELQRLRERLeGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEElEEDLSSL 749
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 664 EQVKSRLEQKSGELGKKITELTLKNQTLQKEIEKVYldNKLLKEQAHNLTIEMKNhyvplkvSEDMKKSHDAIIDDLNRK 743
Cdd:TIGR02169 750 EQEIENVKSELKELEARIEELEEDLHKLEEALNDLE--ARLSHSRIPEIQAELSK-------LEEEVSRIEARLREIEQK 820
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 744 LldvtqkyteKKLEMEKLLLENDSlskdvsrletvfvppEKHEKEIIALKSNIVELKKQLSELKKKCGEDQEKihaltse 823
Cdd:TIGR02169 821 L---------NRLTLEKEYLEKEI---------------QELQEQRIDLKEQIKSIEKEIENLNGKKEELEEE------- 869
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 824 ntnlkkmmsnqyvpvktheevkmtlndtLAKTNRELLDVKKKFEDINQEFVKIKDKneilKRNLENTQNQIKAEYISLAE 903
Cdd:TIGR02169 870 ----------------------------LEELEAALRDLESRLGDLKKERDELEAQ----LRELERKIEELEAQIEKKRK 917
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 904 HeakMSSLSQSMRKVQDSNAEILANYRKGQEE------IVTLHAEIKAQKKELDTIQEcikVKYAPIVSFEECERKFkat 977
Cdd:TIGR02169 918 R---LSELKAKLEALEEELSEIEDPKGEDEEIpeeelsLEDVQAELQRVEEEIRALEP---VNMLAIQEYEEVLKRL--- 988
|
410 420 430
....*....|....*....|....*....|....*..
gi 530406156 978 ekelkDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIF 1014
Cdd:TIGR02169 989 -----DELKEKRAKLEEERKAILERIEEYEKKKREVF 1020
|
|
| Ank_4 |
pfam13637 |
Ankyrin repeats (many copies); |
121-174 |
8.72e-08 |
|
Ankyrin repeats (many copies);
Pssm-ID: 372654 [Multi-domain] Cd Length: 54 Bit Score: 49.97 E-value: 8.72e-08
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 530406156 121 GRTALHDAAMADCPSSIQLLCDHGASVNAKDVDGRTPLVLATQMSRPTICQLLI 174
Cdd:pfam13637 1 ELTALHAAAASGHLELLRLLLEKGADINAVDGNGETALHFAASNGNVEVLKLLL 54
|
|
| PHA02875 |
PHA02875 |
ankyrin repeat protein; Provisional |
66-247 |
9.53e-08 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165206 [Multi-domain] Cd Length: 413 Bit Score: 56.15 E-value: 9.53e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 66 GNLECLNAILIHGVDITTSDTAGRNALHLAAKYGHALCLQKLLQYNCPTEHADLQGRTALHDAAMADCPSSIQLLCDHGA 145
Cdd:PHA02875 13 GELDIARRLLDIGINPNFEIYDGISPIKLAMKFRDSEAIKLLMKHGAIPDVKYPDIESELHDAVEEGDVKAVEELLDLGK 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 146 SVNakDV---DGRTPLVLATQMSRPTICQLLIDRGADVNSRDKQNRTALMLGCEYGCRDAVEVLIKNGADISLLDALGHD 222
Cdd:PHA02875 93 FAD--DVfykDGMTPLHLATILKKLDIMKLLIARGADPDIPNTDKFSPLHLAVMMGDIKGIELLIDHKACLDIEDCCGCT 170
|
170 180
....*....|....*....|....*
gi 530406156 223 SSYYARIGDNLDILTLLKTASENTN 247
Cdd:PHA02875 171 PLIIAMAKGDIAICKMLLDSGANID 195
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
608-876 |
1.06e-07 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 56.47 E-value: 1.06e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 608 FENMKSSLSNEVNEKAKKLVEMEREHeksLSEIRQLkRELENVKAKLAQHVKPEEHEQVKSRLEQKSGELGKKITELTLK 687
Cdd:PRK05771 33 IEDLKEELSNERLRKLRSLLTKLSEA---LDKLRSY-LPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEE 108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 688 NQTLQKEIEKvYLDNKLLKEQAHNLTIEMKNHY-------VPLKVSEDMKKSHDAIIDDLNRKLLDVTQKY------TEK 754
Cdd:PRK05771 109 ISELENEIKE-LEQEIERLEPWGNFDLDLSLLLgfkyvsvFVGTVPEDKLEELKLESDVENVEYISTDKGYvyvvvvVLK 187
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 755 KL--EMEKLLLENDSLSKDVSRLETVFVPPEKHEKEIIALKSNIVELKKQLSELKKKCGEDQEKIHALTSENTNLKKMMS 832
Cdd:PRK05771 188 ELsdEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLEIELERAEALS 267
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 530406156 833 NQ-----------YVPVKTHEEVKMTLNDTlakTNR----ELLDVKKKFEDI-----NQEFVKI 876
Cdd:PRK05771 268 KFlktdktfaiegWVPEDRVKKLKELIDKA---TGGsayvEFVEPDEEEEEVptklkNPKFIKP 328
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
848-1347 |
1.19e-07 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 56.48 E-value: 1.19e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 848 LNDTLAKTNRELLDVKKKFEDINQEFVKIKDKNEILKRNLENTQNQIKAEYISLAEHEAKMSSLSQSMRKVQDSNAEILA 927
Cdd:COG1196 237 LEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEE 316
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 928 NYRKGQEEIVTLHAEIKAQKKELDTIQECIKVKYAPIVSFEECERKFKATEKELKDQLSEQTQKYSVSEEEVKKNKQEND 1007
Cdd:COG1196 317 RLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAA 396
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1008 KLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQTSEILAVQNLLQKqh 1087
Cdd:COG1196 397 ELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAAL-- 474
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1088 vpLEQVEALKKSLNGTIENLKEELKSMQRCYEKEQQTVTKLHqLLENQKNSSVPLAEHLQIKEAFEKEVGIIKASL---- 1163
Cdd:COG1196 475 --LEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAAL-LLAGLRGLAGAVAVLIGVEAAYEAALEAALAAAlqni 551
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1164 ---------REKEEESQNKMEEVSKLQSEVQNTKQALKKLETREVVD-------------------LSKYKATKSDLETQ 1215
Cdd:COG1196 552 vveddevaaAAIEYLKAAKAGRATFLPLDKIRARAALAAALARGAIGaavdlvasdlreadaryyvLGDTLLGRTLVAAR 631
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1216 ISSLNEKLANLNRKYEEV---CEEVLHAKKKEISAKDEKELLHFSIEQEIKDQKERCDKSLTTITELQRRIQESAKQIEA 1292
Cdd:COG1196 632 LEAALRRAVTLAGRLREVtleGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAE 711
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 530406156 1293 KDNKITELLNDVERLKQALNGLSQ-----------LTYTSGNPTKRQSQLIDTLQHQVKSLEQQLA 1347
Cdd:COG1196 712 AEEERLEEELEEEALEEQLEAEREelleelleeeeLLEEEALEELPEPPDLEELERELERLEREIE 777
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
414-807 |
1.32e-07 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 56.28 E-value: 1.32e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 414 ENEILKKeLEAMRTFCESAKQDRLKLQNELAHKvaeckalalecERVKEDSDEQIKQLEDALKDVQKRMYESEGKVKQMQ 493
Cdd:pfam15921 456 KNESLEK-VSSLTAQLESTKEMLRKVVEELTAK-----------KMTLESSERTVSDLTASLQEKERAIEATNAEITKLR 523
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 494 THF-LALKE--HLTSEAASGNHRLTEelkdqLKDLKVKYEGASAEVGKLRNQIKQNEMIVEEFKRDEGKLIEENKRLQKE 570
Cdd:pfam15921 524 SRVdLKLQElqHLKNEGDHLRNVQTE-----CEALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGAMQVEKAQLEKE 598
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 571 LSMCEMEREKKGRKVTEMEGQAKELSAKLAlsipaeKFENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKRELENV 650
Cdd:pfam15921 599 INDRRLELQEFKILKDKKDAKIRELEARVS------DLELEKVKLVNAGSERLRAVKDIKQERDQLLNEVKTSRNELNSL 672
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 651 KaklaqhvkpEEHEQVKSRLEQKSGELGKKITELTLKNQTLQKEIekvyldnkllkEQAHNLTIEMK-NHYVPLKVSEDM 729
Cdd:pfam15921 673 S---------EDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSEL-----------EQTRNTLKSMEgSDGHAMKVAMGM 732
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 730 KKSHDAI---IDDLNRKLLDVTQKYTEKKLEMEKLLLENDSLSKDVSrleTVFVPPEKHEKEIIALKSNIVELKKQLSEL 806
Cdd:pfam15921 733 QKQITAKrgqIDALQSKIQFLEEAMTNANKEKHFLKEEKNKLSQELS---TVATEKNKMAGELEVLRSQERRLKEKVANM 809
|
.
gi 530406156 807 K 807
Cdd:pfam15921 810 E 810
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
881-1309 |
2.00e-07 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 55.82 E-value: 2.00e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 881 EILKRNLENTQNQIKAEYISLAEHEAKMSSLSQSMRKVQDSNAEILANYRKGQEEIVTLHAEIKAQKKELDTIQECIKVK 960
Cdd:PRK02224 254 ETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEEC 333
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 961 YAPIVSF-EECERKFKATE------KELKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEM 1033
Cdd:PRK02224 334 RVAAQAHnEEAESLREDADdleeraEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDF 413
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1034 ERALSRKTDELNKQLKDLSQKYTEVKNVKEK---LVE-----------ENAKQTSEILAVQNLLQKQHVPLEQVEALKKS 1099
Cdd:PRK02224 414 LEELREERDELREREAELEATLRTARERVEEaeaLLEagkcpecgqpvEGSPHVETIEEDRERVEELEAELEDLEEEVEE 493
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1100 LNGTIENLkEELKSMQRCYEKEQQTVTKLHQLLENQKNS----SVPLAEHLQIKEAFEKEVGIIKASLREKEEESQNKME 1175
Cdd:PRK02224 494 VEERLERA-EDLVEAEDRIERLEERREDLEELIAERRETieekRERAEELRERAAELEAEAEEKREAAAEAEEEAEEARE 572
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1176 EVSKLQSEVQNTKQALKKLETREVVDlskykATKSDLETQISSLNEKLANLNRKYEEVCEEVlhAKKKEISAKDEKELLH 1255
Cdd:PRK02224 573 EVAELNSKLAELKERIESLERIRTLL-----AAIADAEDEIERLREKREALAELNDERRERL--AEKRERKRELEAEFDE 645
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....*...
gi 530406156 1256 FSIEqEIKDQKERCDKSLTTITELQRRIQES----AKQIEAKDNKITELLNDVERLKQ 1309
Cdd:PRK02224 646 ARIE-EAREDKERAEEYLEQVEEKLDELREErddlQAEIGAVENELEELEELRERREA 702
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
274-1345 |
2.68e-07 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 55.83 E-value: 2.68e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 274 KSHQREHQNIQDLEIENEDLK------ERLRKIQQEQRILLDKVN--------GLQLQLNEREKLKSLLAAKEKQHEE-- 337
Cdd:TIGR01612 904 KSIEEEYQNINTLKKVDEYIKicentkESIEKFHNKQNILKEILNknidtikeSNLIEKSYKDKFDNTLIDKINELDKaf 983
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 338 ---SLRTIEALKNRF-KYFESDHLGSGSHfsnrKEDMLLKQ------------GQMYMADSQCTSPGIPAHMQSRSMLRP 401
Cdd:TIGR01612 984 kdaSLNDYEAKNNELiKYFNDLKANLGKN----KENMLYHQfdekekatndieQKIEDANKNIPNIEIAIHTSIYNIIDE 1059
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 402 LELSLPSQTSYSENEILKKELEAMRTFCESakQDRLKLQNELAHKVAECKALALECERVKEDSDEQIKQLEDALKDVQKR 481
Cdd:TIGR01612 1060 IEKEIGKNIELLNKEILEEAEINITNFNEI--KEKLKHYNFDDFGKEENIKYADEINKIKDDIKNLDQKIDHHIKALEEI 1137
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 482 MYESEGKVKQMQTHFLALkEHLTSEAASGNHrlTEELKDQLKDLKVKYEGAS---AEVGKLRNQIKQNEMIVEEFKRDEG 558
Cdd:TIGR01612 1138 KKKSENYIDEIKAQINDL-EDVADKAISNDD--PEEIEKKIENIVTKIDKKKniyDEIKKLLNEIAEIEKDKTSLEEVKG 1214
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 559 KLIEENKRLQKE-LSMCEMEREKKGRKVTEMEGQAKEL------SAKLALSIPAEKFENMKSSLSNEVNEKAKKLVEMER 631
Cdd:TIGR01612 1215 INLSYGKNLGKLfLEKIDEEKKKSEHMIKAMEAYIEDLdeikekSPEIENEMGIEMDIKAEMETFNISHDDDKDHHIISK 1294
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 632 EHEKSLSEIRQ--LKRELENVKAKLAQHVKPEEHEQVkSRLEQKSGELGKKITELT-----LKNQTLQKEIEKVYLDNKL 704
Cdd:TIGR01612 1295 KHDENISDIREksLKIIEDFSEESDINDIKKELQKNL-LDAQKHNSDINLYLNEIAniyniLKLNKIKKIIDEVKEYTKE 1373
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 705 LKEQAHNLTIEMKNHYVPLKVSED---MKKSHDAIIDDLNRKLLD-VTQKYTEKKLEMEKLLLENDSL-------SKDVS 773
Cdd:TIGR01612 1374 IEENNKNIKDELDKSEKLIKKIKDdinLEECKSKIESTLDDKDIDeCIKKIKELKNHILSEESNIDTYfknadenNENVL 1453
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 774 RLETVFVPPEKHEKEIIALKS---------NIVELKKQLSElKKKCGEDQEKIHALTSENTNLKK-------MMSNQYVP 837
Cdd:TIGR01612 1454 LLFKNIEMADNKSQHILKIKKdnatndhdfNINELKEHIDK-SKGCKDEADKNAKAIEKNKELFEqykkdvtELLNKYSA 1532
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 838 VKTHEEVKMTLNDTLAKTNrELLDVKKKF----EDINQEFVKIKDKNEILKRNLENTQNQIKAEY---ISLAEHEAKMSS 910
Cdd:TIGR01612 1533 LAIKNKFAKTKKDSEIIIK-EIKDAHKKFileaEKSEQKIKEIKKEKFRIEDDAAKNDKSNKAAIdiqLSLENFENKFLK 1611
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 911 LSQSMRKVQDSNAEILANYRK-GQEEIVTLHAEIKAQKKELDTIQECIKvkyapivSFEECERKFKATEKELkDQLSEQT 989
Cdd:TIGR01612 1612 ISDIKKKINDCLKETESIEKKiSSFSIDSQDTELKENGDNLNSLQEFLE-------SLKDQKKNIEDKKKEL-DELDSEI 1683
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 990 QKYSVSEEEVKKNKQEN--DKLKKEIFTLQKDLRDKTVLIEKSheMERALS-------------RKTDELNKQLKDLSQK 1054
Cdd:TIGR01612 1684 EKIEIDVDQHKKNYEIGiiEKIKEIAIANKEEIESIKELIEPT--IENLISsfntndlegidpnEKLEEYNTEIGDIYEE 1761
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1055 YTEVKNV---------KEKLVEENAKQT-----SEILAVQNLLQKQHVPLEQVEAlkKSLNGTIENLKEELKSMQRCYEK 1120
Cdd:TIGR01612 1762 FIELYNIiagcletvsKEPITYDEIKNTrinaqNEFLKIIEIEKKSKSYLDDIEA--KEFDRIINHFKKKLDHVNDKFTK 1839
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1121 EQQTVTK----LHQLLENQKNSSVPLAEHLQIKEAFEKEVGIIKASLREKEEESQNKMEEVSKLQS----EVQNTkQALK 1192
Cdd:TIGR01612 1840 EYSKINEgfddISKSIENVKNSTDENLLFDILNKTKDAYAGIIGKKYYSYKDEAEKIFINISKLANsiniQIQNN-SGID 1918
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1193 KLETREVVDLSKYKATKSDLETQISSlNEKLANLNRKYEEVCEEVLHA-KKKEISAKDEKELLHFSIEQEIKDQKERCDK 1271
Cdd:TIGR01612 1919 LFDNINIAILSSLDSEKEDTLKFIPS-PEKEPEIYTKIRDSYDTLLDIfKKSQDLHKKEQDTLNIIFENQQLYEKIQASN 1997
|
1130 1140 1150 1160 1170 1180 1190
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 530406156 1272 SLttitelqrriQESAKQIEAKDNKItelLNDVERLKQALNGLSQLTYTSGN-PTKRQSQLIDTLQHQVKSLEQQ 1345
Cdd:TIGR01612 1998 EL----------KDTLSDLKYKKEKI---LNDVKLLLHKFDELNKLSCDSQNyDTILELSKQDKIKEKIDNYEKE 2059
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
783-1023 |
4.39e-07 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 54.00 E-value: 4.39e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 783 EKHEKEIIALKSNIVELKKQLSELKKKCGEDQEKIHALTSENTNLKKMMSNqyvpvktheevkmtLNDTLAKTNRELLDV 862
Cdd:COG4942 23 AEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRA--------------LEQELAALEAELAEL 88
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 863 KKKFEDINQEFVKIKDKNEILKRNLENTQNQIKAEYISLAEHEAKMSSLSQSMRKVQDSNAEILANYRKGQEEIVTLHAE 942
Cdd:COG4942 89 EKEIAELRAELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAE 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 943 IKAQKKELDTIQEcikvkyapivSFEECERKFKATEKELKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLRD 1022
Cdd:COG4942 169 LEAERAELEALLA----------ELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAA 238
|
.
gi 530406156 1023 K 1023
Cdd:COG4942 239 A 239
|
|
| PHA02876 |
PHA02876 |
ankyrin repeat protein; Provisional |
21-213 |
4.96e-07 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165207 [Multi-domain] Cd Length: 682 Bit Score: 54.30 E-value: 4.96e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 21 YDDRLMKAAERGDVEKVTSILAKKGVNPGKLDVEGRSVFHVVTSKGNLECLNAILIHGVDITTsdtagrnalhLAAKYGh 100
Cdd:PHA02876 341 YITPLHQASTLDRNKDIVITLLELGANVNARDYCDKTPIHYAAVRNNVVIINTLLDYGADIEA----------LSQKIG- 409
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 101 alclqkllqyncptehadlqgrTALHDAAMADCP-SSIQLLCDHGASVNAKDVDGRTPLVLATQMS-RPTICQLLIDRGA 178
Cdd:PHA02876 410 ----------------------TALHFALCGTNPyMSVKTLIDRGANVNSKNKDLSTPLHYACKKNcKLDVIEMLLDNGA 467
|
170 180 190
....*....|....*....|....*....|....*
gi 530406156 179 DVNSRDKQNRTALMLGCEYgcRDAVEVLIKNGADI 213
Cdd:PHA02876 468 DVNAINIQNQYPLLIALEY--HGIVNILLHYGAEL 500
|
|
| Ank_4 |
pfam13637 |
Ankyrin repeats (many copies); |
55-108 |
6.26e-07 |
|
Ankyrin repeats (many copies);
Pssm-ID: 372654 [Multi-domain] Cd Length: 54 Bit Score: 47.65 E-value: 6.26e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 530406156 55 GRSVFHVVTSKGNLECLNAILIHGVDITTSDTAGRNALHLAAKYGHALCLQKLL 108
Cdd:pfam13637 1 ELTALHAAAASGHLELLRLLLEKGADINAVDGNGETALHFAASNGNVEVLKLLL 54
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
286-708 |
9.71e-07 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 53.53 E-value: 9.71e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 286 LEIENEDLKERLRKIQQEQRILLDKVNGLQLQLNEREKLKSLLAAKEKQHEESLRTIEALKNRFKYFEsdhlgsgshfsn 365
Cdd:PRK03918 298 LSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHELYE------------ 365
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 366 rKEDMLLKQGQMYMADSQCTSPGIPAHMQSRSMLRPLELslpsqtsysENEIlkKELEAMRTFCESAKQDRLKLQNELah 445
Cdd:PRK03918 366 -EAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEI---------EEEI--SKITARIGELKKEIKELKKAIEEL-- 431
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 446 KVAECKALALECERVKEDSDEQIKQLEDALKDVQKRMYESEGKVKQMQTHFLALKEHLTSEAA-SGNHRLTEELKDQLKD 524
Cdd:PRK03918 432 KKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESElIKLKELAEQLKELEEK 511
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 525 LKV-----------KYEGASAEVGKLRNQIKQNEMIVEEFKRDEGKLIEENKRLQ---KELSMCEMEREKKGRK-VTEME 589
Cdd:PRK03918 512 LKKynleelekkaeEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDeleEELAELLKELEELGFEsVEELE 591
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 590 GQAKELsaklalsipaEKFENMKSSLSN---EVNEKAKKLVEMEREHEKSLSEIRQLKRELENVKAKLAQHVK---PEEH 663
Cdd:PRK03918 592 ERLKEL----------EPFYNEYLELKDaekELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKkysEEEY 661
|
410 420 430 440
....*....|....*....|....*....|....*....|....*
gi 530406156 664 EQVKSRLEQKSGELGKKITELtlknQTLQKEIEKVYLDNKLLKEQ 708
Cdd:PRK03918 662 EELREEYLELSRELAGLRAEL----EELEKRREEIKKTLEKLKEE 702
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
755-1312 |
1.71e-06 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 52.60 E-value: 1.71e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 755 KLEMEKLLLEN--DSLSKDVSRLETVFVPPEKHEKEIIALKSNIVELKKQLSELKKKCGEDQEKIHALTSENTNLKKMMS 832
Cdd:PRK01156 163 SLERNYDKLKDviDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALN 242
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 833 NQyvpvktheevkMTLNDTLAKTNRELLDVKKKFEDINQEFVKIKDKNEILKRNLENTQNQIKAEYISLAEHEAKMSSLS 912
Cdd:PRK01156 243 EL-----------SSLEDMKNRYESEIKTAESDLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKK 311
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 913 QSMRKVQDSNAEILANYRKgQEEIVTLHAEIKAQKKELDTIQECI------KVKYAPIV-SFEECERKFKATEKELKDQL 985
Cdd:PRK01156 312 QILSNIDAEINKYHAIIKK-LSVLQKDYNDYIKKKSRYDDLNNQIlelegyEMDYNSYLkSIESLKKKIEEYSKNIERMS 390
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 986 SEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTDELNKQLK------DLSQKYTEvk 1059
Cdd:PRK01156 391 AFISEILKIQEIDPDAIKKELNEINVKLQDISSKVSSLNQRIRALRENLDELSRNMEMLNGQSVcpvcgtTLGEEKSN-- 468
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1060 NVKEKLVEENAKQTSEILAVQNLLQKQHVPLEQVEALKKSLNGtiENLKEelksmqrcYEKEQQTVTKLHQLLENQKNSS 1139
Cdd:PRK01156 469 HIINHYNEKKSRLEEKIREIEIEVKDIDEKIVDLKKRKEYLES--EEINK--------SINEYNKIESARADLEDIKIKI 538
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1140 VPLAE-HLQIKEAFEKEVGIIKASLREKEEESQNKMEEVS-----KLQSEVQNTKQALKKLETREVVDLSKYKATKSDLE 1213
Cdd:PRK01156 539 NELKDkHDKYEEIKNRYKSLKLEDLDSKRTSWLNALAVISlidieTNRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYID 618
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1214 TQISSLNEKLANLNRKYEEVCE-----EVLHAK----KKEISAKDEkellhfsIEQEIKDQKERCDKSLTTITELQRRIQ 1284
Cdd:PRK01156 619 KSIREIENEANNLNNKYNEIQEnkiliEKLRGKidnyKKQIAEIDS-------IIPDLKEITSRINDIEDNLKKSRKALD 691
|
570 580
....*....|....*....|....*...
gi 530406156 1285 ESAKQIEAKDNKITELLNDVERLKQALN 1312
Cdd:PRK01156 692 DAKANRARLESTIEILRTRINELSDRIN 719
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
864-1078 |
1.92e-06 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 51.69 E-value: 1.92e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 864 KKFEDINQEFVKIKDKNEILKRNLENTQNQIKAEYISLAEHEAKMSSLSQSMRKVQDSNAEILANYRKGQEEIVTLHAEI 943
Cdd:COG4942 20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAEL 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 944 KAQKKE----LDTIQECIKVKY-APIVSFEECERKFKATE--KELKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTL 1016
Cdd:COG4942 100 EAQKEElaelLRALYRLGRQPPlALLLSPEDFLDAVRRLQylKYLAPARREQAEELRADLAELAALRAELEAERAELEAL 179
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 530406156 1017 QKDLRDK----TVLIEKSHEMERALSRKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQTSEILA 1078
Cdd:COG4942 180 LAELEEEraalEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPA 245
|
|
| PHA02878 |
PHA02878 |
ankyrin repeat protein; Provisional |
137-239 |
2.22e-06 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 222939 [Multi-domain] Cd Length: 477 Bit Score: 51.80 E-value: 2.22e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 137 IQLLCDHGASVNAKDVD-GRTPLVLATQMSRPTICQLLIDRGADVNSRDKQNRTALMLGCEYGCRDAVEVLIKNGADISL 215
Cdd:PHA02878 150 TKLLLSYGADINMKDRHkGNTALHYATENKDQRLTELLLSYGANVNIPDKTNNSPLHHAVKHYNKPIVHILLENGASTDA 229
|
90 100
....*....|....*....|....*...
gi 530406156 216 LDALG----HDSSYYARigdNLDILTLL 239
Cdd:PHA02878 230 RDKCGntplHISVGYCK---DYDILKLL 254
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
419-894 |
2.30e-06 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 52.45 E-value: 2.30e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 419 KKELEAMRTFCESAKQDRLKLQNELahKVAECKALALECERVKEDSD------EQIKQLEDALKDVQKRMYESEGKVKQM 492
Cdd:PTZ00121 1533 AKKADEAKKAEEKKKADELKKAEEL--KKAEEKKKAEEAKKAEEDKNmalrkaEEAKKAEEARIEEVMKLYEEEKKMKAE 1610
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 493 QthflaLKEHLTSEAASGNHRLTEELKDQLKDLKVKYEGASAEVGKLRNQIKQNEMIVEEFKRDEgkliEENKRLQKELS 572
Cdd:PTZ00121 1611 E-----AKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKA----EEDKKKAEEAK 1681
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 573 MCEMEREKKGRKVTEMEgqakelsaklalsipaekfenmksslsnevnEKAKKLVEMEREHEKSLSEIRQLKRELENVKA 652
Cdd:PTZ00121 1682 KAEEDEKKAAEALKKEA-------------------------------EEAKKAEELKKKEAEEKKKAEELKKAEEENKI 1730
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 653 KLAQHVKPEEHEQVKSRLEQKSGELGKKIteltlknQTLQKEIEKVYLDNKLLKEQAHNLTIEMKNHYVPLKVSEDMK-- 730
Cdd:PTZ00121 1731 KAEEAKKEAEEDKKKAEEAKKDEEEKKKI-------AHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKdi 1803
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 731 KSHDAIIDDLNRKlldvTQKYTEKKLEMEKLLLENDSLSKDVSRLETVFVPPEKHEKEIIALKSNIvelKKQLSELKKKC 810
Cdd:PTZ00121 1804 FDNFANIIEGGKE----GNLVINDSKEMEDSAIKEVADSKNMQLEEADAFEKHKFNKNNENGEDGN---KEADFNKEKDL 1876
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 811 GEDQEKIHALTSENTNLKKMMSNQYVPvktheevkmtlNDTLAKTNRELLDVKkkfedinqefvkiKDKNEILKRNLENT 890
Cdd:PTZ00121 1877 KEDDEEEIEEADEIEKIDKDDIEREIP-----------NNNMAGKNNDIIDDK-------------LDKDEYIKRDAEET 1932
|
....
gi 530406156 891 QNQI 894
Cdd:PTZ00121 1933 REEI 1936
|
|
| TRPV5-6 |
cd22192 |
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 5 and 6; TRPV5 and ... |
92-220 |
2.73e-06 |
|
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 5 and 6; TRPV5 and TRPV6 (TRPV5/6) are two homologous members within the vanilloid subfamily of the transient receptor potential (TRP) family. TRPV5 and TRPV6 show only 30-40% homology with other members of the TRP family and have unique properties that differentiates them from other TRP channels. They mediate calcium uptake in epithelia and their expression is dramatically increased in numerous types of cancer. The structure of TRPV5/6 shows the typical topology features of all TRP family members, such as six transmembrane regions, a short hydrophobic stretch between transmembrane segments 5 and 6, which is predicted to form the Ca2+ pore, and large intracellular N- and C-terminal domains. The N-terminal domain of TRPV5/6 contains three ankyrin repeats. This structural element is present in several proteins and plays a role in protein-protein interactions. The N- and C-terminal tails of TRPV5/6 each contain an internal PDZ motif which can function as part of a molecular scaffold via interaction with PDZ-domain containing proteins. A major difference between the properties of TRPV5 and TRPV6 is in their tissue distribution: TRPV5 is predominantly expressed in the distal convoluted tubules (DCT) and connecting tubules (CNT) of the kidney, with limited expression in extrarenal tissues. In contrast, TRPV6 has a broader expression pattern such as expression in the intestine, kidney, placenta, epididymis, exocrine tissues, and a few other tissues.
Pssm-ID: 411976 [Multi-domain] Cd Length: 609 Bit Score: 51.94 E-value: 2.73e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 92 LHLAAKYGHALCLQKLLQYNcpteHADLQ-----GRTALHDAAMADCPSSIQLLCDhgasvNAKDV----------DGRT 156
Cdd:cd22192 21 LLLAAKENDVQAIKKLLKCP----SCDLFqrgalGETALHVAALYDNLEAAVVLME-----AAPELvnepmtsdlyQGET 91
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 530406156 157 PLVLATQMSRPTICQLLIDRGADVNSRD------KQNRTALMLGCEY--------GCRDAVEVLIKNGADISLLDALG 220
Cdd:cd22192 92 ALHIAVVNQNLNLVRELIARGADVVSPRatgtffRPGPKNLIYYGEHplsfaacvGNEEIVRLLIEHGADIRAQDSLG 169
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
266-1195 |
3.53e-06 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 51.97 E-value: 3.53e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 266 HMQDEVNVKSHQREHQN--IQDLEIENEDLKERLRKIQQEQRILL---DKVNGLQLQLNEREKLKSLLaaKEKQHEESLR 340
Cdd:TIGR00606 221 EIRDQITSKEAQLESSReiVKSYENELDPLKNRLKEIEHNLSKIMkldNEIKALKSRKKQMEKDNSEL--ELKMEKVFQG 298
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 341 TIEALKNRFKYFESDHLGSGSHFS--------NRKEDMLLKQGQMYM----ADSQCTSPGIPAHMQSRSMLRpLELSLPS 408
Cdd:TIGR00606 299 TDEQLNDLYHNHQRTVREKERELVdcqrelekLNKERRLLNQEKTELlveqGRLQLQADRHQEHIRARDSLI-QSLATRL 377
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 409 QTSYSENE-ILKKELEAMRTFCESAKQDRLKLQNELAHKVAECKALAlecervkedsDEQIKQLEDALKDVQKRMYESEG 487
Cdd:TIGR00606 378 ELDGFERGpFSERQIKNFHTLVIERQEDEAKTAAQLCADLQSKERLK----------QEQADEIRDEKKGLGRTIELKKE 447
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 488 KVKQMQTHFLALKEHLTSEAASGNH--RLTEELKDQLKDLKVKYEGASAEVGKLRNQIKQNEMI-VEEFKRDEGKLIEEN 564
Cdd:TIGR00606 448 ILEKKQEELKFVIKELQQLEGSSDRilELDQELRKAERELSKAEKNSLTETLKKEVKSLQNEKAdLDRKLRKLDQEMEQL 527
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 565 KRLQKELSMCEMEREKKGRKVTEMEGQAKELSAKLALSIP----AEKFENMKSSLSNEVNEKAKKLVEMEREHEKSLSEI 640
Cdd:TIGR00606 528 NHHTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLGyfpnKKQLEDWLHSKSKEINQTRDRLAKLNKELASLEQNK 607
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 641 RQLKRELENVKAKLaqhvkpeeheqvkSRLEQKSGEL-GKKITELTLKNqtLQKEIEKVYLDNKLLKEQAH-------NL 712
Cdd:TIGR00606 608 NHINNELESKEEQL-------------SSYEDKLFDVcGSQDEESDLER--LKEEIEKSSKQRAMLAGATAvysqfitQL 672
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 713 TIEMKNHYVPLKVSEDMKKSHDAIIDDLNRKLLDVTQKYTEKKLEMEKLLLENDSLSKDVSRLETVFvppEKHEKEIIAL 792
Cdd:TIGR00606 673 TDENQSCCPVCQRVFQTEAELQEFISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSII---DLKEKEIPEL 749
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 793 KSNIVELKKQLSELKKKCGEDQEKIHALTSENTNLKKMMSNQYVPVKTHEEVK--------MTLNDTLAKTNRELLDVKK 864
Cdd:TIGR00606 750 RNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKdverkiaqQAAKLQGSDLDRTVQQVNQ 829
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 865 KFEDINQEFVKIKDKNEILKRNLENTQNQIKAEYISLAEHEAKMSSLSQSMRKVQdsnaeilanyrKGQEEIVTLHAEik 944
Cdd:TIGR00606 830 EKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQ-----------QFEEQLVELSTE-- 896
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 945 aqkkeldtIQECIKvkyapivSFEECERKFKATEKELKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLRDKT 1024
Cdd:TIGR00606 897 --------VQSLIR-------EIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKKAQDKVNDIKEKVKNIHGYMKDIE 961
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1025 VLIEKSheMERALSRKTDELNKQLKDLSQKYTEVKNVKEKLveENAKQTSEILAVQNLLQKQHVPLEQVEALKKSLNGTI 1104
Cdd:TIGR00606 962 NKIQDG--KDDYLKQKETELNTVNAQLEECEKHQEKINEDM--RLMRQDIDTQKIQERWLQDNLTLRKRENELKEVEEEL 1037
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1105 ENLKEELKSMQRCYEKeqQTVTKLHQLLENQKNSSVPLAEHLQikeAFEKEVGIIKASLREKeeESQNKMEEVSKLQSEV 1184
Cdd:TIGR00606 1038 KQHLKEMGQMQVLQMK--QEHQKLEENIDLIKRNHVLALGRQK---GYEKEIKHFKKELREP--QFRDAEEKYREMMIVM 1110
|
970
....*....|.
gi 530406156 1185 QNTKQALKKLE 1195
Cdd:TIGR00606 1111 RTTELVNKDLD 1121
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
873-1345 |
3.64e-06 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 51.89 E-value: 3.64e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 873 FVKIKDKNEILKRNLENTQNQIKAeyisLAEHEAKMSSLSQSMRKVQDSNAEILANYRKGQEEIVTLHAEIKAQKKELDT 952
Cdd:TIGR00618 367 IREISCQQHTLTQHIHTLQQQKTT----LTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAEL 442
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 953 ----IQECIKVKYAPIVSFEECERKFKATEKELKD--QLSEQTQKYSVSEEEVKKNKQENDK-LKKEIFTLQKDLRDKTV 1025
Cdd:TIGR00618 443 caaaITCTAQCEKLEKIHLQESAQSLKEREQQLQTkeQIHLQETRKKAVVLARLLELQEEPCpLCGSCIHPNPARQDIDN 522
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1026 LIEKSHEMERALSRKT-------------DELNKQLKDLSQKYTEVKNVKEKLVEENAKQTSEILAVQNLLQKQHVPLEQ 1092
Cdd:TIGR00618 523 PGPLTRRMQRGEQTYAqletseedvyhqlTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEK 602
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1093 VEALKKSLNGTIENLKEELK----------SMQRCYEKEQQTVTKLHQLLENQKNSSVplAEHLQIKEAFEKEVGiikAS 1162
Cdd:TIGR00618 603 LSEAEDMLACEQHALLRKLQpeqdlqdvrlHLQQCSQELALKLTALHALQLTLTQERV--REHALSIRVLPKELL---AS 677
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1163 LREKEEESQNKMEEVSKLQSEVQNTKQALKKLET------REVVDLSKYKAT-KSDLETQISSLNEKLANLNRKYEEVCE 1235
Cdd:TIGR00618 678 RQLALQKMQSEKEQLTYWKEMLAQCQTLLRELEThieeydREFNEIENASSSlGSDLAAREDALNQSLKELMHQARTVLK 757
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1236 E--VLHAKKKEISAKDEKELLHFS-IEQEIKDQKERCDKSLTTITELQRRIQESAKQIEAKDNKITELLndVERLKQALN 1312
Cdd:TIGR00618 758 ArtEAHFNNNEEVTAALQTGAELShLAAEIQFFNRLREEDTHLLKTLEAEIGQEIPSDEDILNLQCETL--VQEEEQFLS 835
|
490 500 510
....*....|....*....|....*....|...
gi 530406156 1313 GLSQLTYTSGNPTKRQSQLIDTLQHQVKSLEQQ 1345
Cdd:TIGR00618 836 RLEEKSATLGEITHQLLKYEECSKQLAQLTQEQ 868
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
923-1374 |
3.89e-06 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 51.31 E-value: 3.89e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 923 AEILANYRKGQEEIVTLHAEIKAQKKELDTIQECIKvKYAPIVSFEECERKFKATEKELKdQLSEQTQKYSVSEEEVKKN 1002
Cdd:COG4717 84 EEKEEEYAELQEELEELEEELEELEAELEELREELE-KLEKLLQLLPLYQELEALEAELA-ELPERLEELEERLEELREL 161
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1003 KQENDKLKKEIFTLQKDLRDKtvLIEKSHEMERALSRKTDELNKQLKDLSQKYTEVKNVKEKL--VEENAKQTSEILAVQ 1080
Cdd:COG4717 162 EEELEELEAELAELQEELEEL--LEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQEELeeLEEELEQLENELEAA 239
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1081 NLLQKQHVPLEQ---------VEALKKSLNGTIENLKEELKSMQRCYEKEQQTVTKLHQLLENQKNSSVPLAEHLQIKEA 1151
Cdd:COG4717 240 ALEERLKEARLLlliaaallaLLGLGGSLLSLILTIAGVLFLVLGLLALLFLLLAREKASLGKEAEELQALPALEELEEE 319
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1152 fekEVGIIKASLREKEEESQnkmEEVSKLQSEVQNTKQALKKLETREvvdlskYKATKSDLETQISSLnekLANLNRKYE 1231
Cdd:COG4717 320 ---ELEELLAALGLPPDLSP---EELLELLDRIEELQELLREAEELE------EELQLEELEQEIAAL---LAEAGVEDE 384
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1232 EVCEEVLHAKKKEISAKDEKELLHFSIEQEIKDQKERCDKslTTITELQRRIQESAKQIEAKDNKITELLNDVERLKQAL 1311
Cdd:COG4717 385 EELRAALEQAEEYQELKEELEELEEQLEELLGELEELLEA--LDEEELEEELEELEEELEELEEELEELREELAELEAEL 462
|
410 420 430 440 450 460
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 530406156 1312 NGLsqltytsgnptkRQSQLIDTLQHQVKSLEQQLADADRQHQeVIAIYRTHLLSAAQGHMDE 1374
Cdd:COG4717 463 EQL------------EEDGELAELLQELEELKAELRELAEEWA-ALKLALELLEEAREEYREE 512
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
461-1229 |
4.46e-06 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 51.51 E-value: 4.46e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 461 KEDSDEQIKQLEDALKDVQKRMYESEGKVKQMQTHFLALKEHLTSEAasgnhrLTEELKDQLKDLKVKYEGASAEVGKLR 540
Cdd:TIGR00618 159 KAKSKEKKELLMNLFPLDQYTQLALMEFAKKKSLHGKAELLTLRSQL------LTLCTPCMPDTYHERKQVLEKELKHLR 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 541 NQIKQnEMIVEEFKRDEGKLIEENKRLQKELSMCEMEREKkgrkvtemegqAKELSAKLALSIPAEKFENMKSSLSnevn 620
Cdd:TIGR00618 233 EALQQ-TQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEE-----------LRAQEAVLEETQERINRARKAAPLA---- 296
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 621 EKAKKLVEMEREHEKSLSEIRQLKRELENVKAKLAQHVKPEEHEQVKSRLEQksgelgkkitelTLKNQTLQKEIE-KVY 699
Cdd:TIGR00618 297 AHIKAVTQIEQQAQRIHTELQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQ------------TLHSQEIHIRDAhEVA 364
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 700 LDNKLLKEQAHNLTIEMKNHYVPLKVSEDMKKSHDAIIDDLNRKLLDVTQKYTEKKLEMEKLLLENDSLSKDVSRLETVF 779
Cdd:TIGR00618 365 TSIREISCQQHTLTQHIHTLQQQKTTLTQKLQSLCKELDILQREQATIDTRTSAFRDLQGQLAHAKKQQELQQRYAELCA 444
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 780 VPPEKHEKEIIALKSNIVELKKQLSELKKKCGEDQ------EKIHALTSENTNLKKMMSNQYVPVKTHEEVKMTLNDTLA 853
Cdd:TIGR00618 445 AAITCTAQCEKLEKIHLQESAQSLKEREQQLQTKEqihlqeTRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDNPG 524
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 854 KTNRELLdvkkkfedinqefvKIKDKNEILKRNLENTQNQIKAEYISLAEHEAKMSSLSQSMRKVQDSNAEILANYRKGQ 933
Cdd:TIGR00618 525 PLTRRMQ--------------RGEQTYAQLETSEEDVYHQLTSERKQRASLKEQMQEIQQSFSILTQCDNRSKEDIPNLQ 590
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 934 EEIVTLHAEIKAQKKEldtiqecikvkyapivsfeecERKFKATEKELKDQLSEQTQKYSVSEEEVKKNKQENDK---LK 1010
Cdd:TIGR00618 591 NITVRLQDLTEKLSEA---------------------EDMLACEQHALLRKLQPEQDLQDVRLHLQQCSQELALKltaLH 649
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1011 KEIFTLQKDLRDKTVLIEKSHEMERALSRKtdelnKQLKDLSQKYTEVKNVKEKLVEENAKQTSEILAVQNLLQKQHVPL 1090
Cdd:TIGR00618 650 ALQLTLTQERVREHALSIRVLPKELLASRQ-----LALQKMQSEKEQLTYWKEMLAQCQTLLRELETHIEEYDREFNEIE 724
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1091 EQVEALKKSLNGTIENLKEELKSMQRcyekEQQTVTKLHQLLENQKNSSVPLAEHLQIKEA--------FEKEVGIIKAS 1162
Cdd:TIGR00618 725 NASSSLGSDLAAREDALNQSLKELMH----QARTVLKARTEAHFNNNEEVTAALQTGAELShlaaeiqfFNRLREEDTHL 800
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1163 LREKEEESQNKME--------EVSKLQSEVQNTKQALKKLET---------REVVDLSKYKATKSDLETQISSLNEKLAN 1225
Cdd:TIGR00618 801 LKTLEAEIGQEIPsdedilnlQCETLVQEEEQFLSRLEEKSAtlgeithqlLKYEECSKQLAQLTQEQAKIIQLSDKLNG 880
|
....
gi 530406156 1226 LNRK 1229
Cdd:TIGR00618 881 INQI 884
|
|
| PHA02798 |
PHA02798 |
ankyrin-like protein; Provisional |
137-235 |
5.46e-06 |
|
ankyrin-like protein; Provisional
Pssm-ID: 222931 [Multi-domain] Cd Length: 489 Bit Score: 50.60 E-value: 5.46e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 137 IQLLCDHGASVNAKDVDGRTPLV-----LATQMSRPTICQLLIDRGADVNSRDKQNRTAL--MLGCEY-GCRDAVEVLIK 208
Cdd:PHA02798 54 VKLFINLGANVNGLDNEYSTPLCtilsnIKDYKHMLDIVKILIENGADINKKNSDGETPLycLLSNGYiNNLEILLFMIE 133
|
90 100
....*....|....*....|....*...
gi 530406156 209 NGADISLLDALGHDS-SYYARIGDNLDI 235
Cdd:PHA02798 134 NGADTTLLDKDGFTMlQVYLQSNHHIDI 161
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
926-1226 |
5.53e-06 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 51.09 E-value: 5.53e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 926 LANYRKGQEEIVTLHAEIKAQKKELDTIQecikvkyapivsfeecerkfkATEKELKDQLSEQTQKYSVSEEEVKKNKQE 1005
Cdd:COG1196 231 LLKLRELEAELEELEAELEELEAELEELE---------------------AELAELEAELEELRLELEELELELEEAQAE 289
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1006 NDKLKKEIFTLQKDlrdktvlIEKSHEMERALSRKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQTSEILAVQNLLQK 1085
Cdd:COG1196 290 EYELLAELARLEQD-------IARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAE 362
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1086 QhvpLEQVEALKKSLNGTIENLKEELKSMQRCYEKEQQTVTKLHQLLENQKNSSVPLAEHLQIKEAFEKEvgiiKASLRE 1165
Cdd:COG1196 363 A---EEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEA----LAELEE 435
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 530406156 1166 KEEESQNKMEEVSKLQSEVQNTKQALKKLETREVVDLSKYKATKSDLETQISSLNEKLANL 1226
Cdd:COG1196 436 EEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLL 496
|
|
| Ank_2 |
pfam12796 |
Ankyrin repeats (3 copies); |
158-239 |
5.84e-06 |
|
Ankyrin repeats (3 copies);
Pssm-ID: 463710 [Multi-domain] Cd Length: 91 Bit Score: 45.88 E-value: 5.84e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 158 LVLATQMSRPTICQLLIDRGADVNSRDKQNRTALMLGCEYGCRDAVEVLIKNGAdislLDALGHDSS--YYARIGDNLDI 235
Cdd:pfam12796 1 LHLAAKNGNLELVKLLLENGADANLQDKNGRTALHLAAKNGHLEIVKLLLEHAD----VNLKDNGRTalHYAARSGHLEI 76
|
....
gi 530406156 236 LTLL 239
Cdd:pfam12796 77 VKLL 80
|
|
| PRK04778 |
PRK04778 |
septation ring formation regulator EzrA; Provisional |
973-1291 |
7.18e-06 |
|
septation ring formation regulator EzrA; Provisional
Pssm-ID: 179877 [Multi-domain] Cd Length: 569 Bit Score: 50.60 E-value: 7.18e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 973 KFKATEKELKD---QLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLRD--KTVLiEKSHEMERALsrktDELNKQ 1047
Cdd:PRK04778 99 RFRKAKHEINEiesLLDLIEEDIEQILEELQELLESEEKNREEVEQLKDLYRElrKSLL-ANRFSFGPAL----DELEKQ 173
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1048 LKDLSQKYTEVKNVKEklvEENAKQTSEILAVQ----NLLQKQ--HVPlEQVEALKKSLNGTIENLKEELKSMQRC-YEK 1120
Cdd:PRK04778 174 LENLEEEFSQFVELTE---SGDYVEAREILDQLeeelAALEQImeEIP-ELLKELQTELPDQLQELKAGYRELVEEgYHL 249
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1121 EQQTVTKLHQLLENQKNSSVPLAEHLQIKEAfEKEVGIIKA------SLREKEEESQNKMEE-VSKLQSEVQNTKQALKK 1193
Cdd:PRK04778 250 DHLDIEKEIQDLKEQIDENLALLEELDLDEA-EEKNEEIQEridqlyDILEREVKARKYVEKnSDTLPDFLEHAKEQNKE 328
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1194 LETR-EVVDLSkYKATKSDLETQiSSLNEKLANLNRKYEEVCEEVlhaKKKEISakdekellhFSieqEIKDQKERCDKS 1272
Cdd:PRK04778 329 LKEEiDRVKQS-YTLNESELESV-RQLEKQLESLEKQYDEITERI---AEQEIA---------YS---ELQEELEEILKQ 391
|
330
....*....|....*....
gi 530406156 1273 LTTITELQRRIQESAKQIE 1291
Cdd:PRK04778 392 LEEIEKEQEKLSEMLQGLR 410
|
|
| PHA03095 |
PHA03095 |
ankyrin-like protein; Provisional |
137-213 |
9.12e-06 |
|
ankyrin-like protein; Provisional
Pssm-ID: 222980 [Multi-domain] Cd Length: 471 Bit Score: 50.02 E-value: 9.12e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 137 IQLLCDHGASVNAKDVDGRTPLVLATQMS---RPTICQLLIDRGADVNSRDKQNRTALMLGCEYGCR-DAVEVLIKNGAD 212
Cdd:PHA03095 30 VRRLLAAGADVNFRGEYGKTPLHLYLHYSsekVKDIVRLLLEAGADVNAPERCGFTPLHLYLYNATTlDVIKLLIKAGAD 109
|
.
gi 530406156 213 I 213
Cdd:PHA03095 110 V 110
|
|
| COG5022 |
COG5022 |
Myosin heavy chain [General function prediction only]; |
787-1317 |
1.02e-05 |
|
Myosin heavy chain [General function prediction only];
Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 50.46 E-value: 1.02e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 787 KEIIALKSNIVELKKQLSELKKKCGED-QEKIHALTSENTNLKKMMSNqyVPVKTHEEVKMTLNDTLAKtnreLLDVKKK 865
Cdd:COG5022 892 KSISSLKLVNLELESEIIELKKSLSSDlIENLEFKTELIARLKKLLNN--IDLEEGPSIEYVKLPELNK----LHEVESK 965
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 866 FEDINQEFVKIKDKNEILKRNLENTQNQIKAEYISLAEHEAKMSSLSQS---MRKVQDSNAEILANYRKGQEEIVTLHAE 942
Cdd:COG5022 966 LKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQEStkqLKELPVEVAELQSASKIISSESTELSIL 1045
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 943 IKAQK-------KELDTIQECIKVKYAPIVSFEEC------------ERKFKATEKELKDQ-----------LSEQTQKY 992
Cdd:COG5022 1046 KPLQKlkgllllENNQLQARYKALKLRRENSLLDDkqlyqlestenlLKTINVKDLEVTNRnlvkpanvlqfIVAQMIKL 1125
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 993 SVSEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEMER----ALSRKTDELNKQLKDLSQKYT----EVKNVKEK 1064
Cdd:COG5022 1126 NLLQEISKFLSQLVNTLEPVFQKLSVLQLELDGLFWEANLEALpsppPFAALSEKRLYQSALYDEKSKlsssEVNDLKNE 1205
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1065 LVEENAKQTSEIL---AVQNLLQKQHVPLEQVEALKKSLNG----------TIENLKEELKSMQRCYEKEQQTVTKLHQL 1131
Cdd:COG5022 1206 LIALFSKIFSGWPrgdKLKKLISEGWVPTEYSTSLKGFNNLnkkfdtpasmSNEKLLSLLNSIDNLLSSYKLEEEVLPAT 1285
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1132 LENQ-KNSSVPLAEHLQIKEAFEKEVGIIKASLREKEEESQNKMEEVSKLQSEVQNTKQALKKLETREvVDLSKYKATKS 1210
Cdd:COG5022 1286 INSLlQYINVGLFNALRTKASSLRWKSATEVNYNSEELDDWCREFEISDVDEELEELIQAVKVLQLLK-DDLNKLDELLD 1364
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1211 DLETQissLNEKLANLNRKYEevCEEVLHAKKKEISAKDEkellhfsieqeikdqKERCDKSLTTITELQRRIQESAKQI 1290
Cdd:COG5022 1365 ACYSL---NPAEIQNLKSRYD--PADKENNLPKEILKKIE---------------ALLIKQELQLSLEGKDETEVHLSEI 1424
|
570 580
....*....|....*....|....*..
gi 530406156 1291 EAKDNKITELLNDVERLKQALNGLSQL 1317
Cdd:COG5022 1425 FSEEKSLISLDRNSIYKEEVLSSLSAL 1451
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
1008-1259 |
1.04e-05 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 49.93 E-value: 1.04e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1008 KLKK-EIFTLqKDLRDKTVliEKSHEMERA--LSRKTDELNKQLKDLSQKYTEVKNVKEKL--------VEENAKQTSEI 1076
Cdd:PRK05771 5 RMKKvLIVTL-KSYKDEVL--EALHELGVVhiEDLKEELSNERLRKLRSLLTKLSEALDKLrsylpklnPLREEKKKVSV 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1077 LAVQNLLQKQHVPLEQVEALKKSLNGTIENLKEELKSmqrcYEKEQQTVTKLH------QLLENQKNSSVPLAEhlqIKE 1150
Cdd:PRK05771 82 KSLEELIKDVEEELEKIEKEIKELEEEISELENEIKE----LEQEIERLEPWGnfdldlSLLLGFKYVSVFVGT---VPE 154
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1151 AFEKEVGIIKaSLREKEEESQNKMEE-----VSKLQSEVQNTkqALKKLETREVV---------DLSKYKATKSDLETQI 1216
Cdd:PRK05771 155 DKLEELKLES-DVENVEYISTDKGYVyvvvvVLKELSDEVEE--ELKKLGFERLEleeegtpseLIREIKEELEEIEKER 231
|
250 260 270 280
....*....|....*....|....*....|....*....|....*...
gi 530406156 1217 SSLNEKLANLNRKYEE---VCEEVL--HAKKKEISAKDEKELLHFSIE 1259
Cdd:PRK05771 232 ESLLEELKELAKKYLEellALYEYLeiELERAEALSKFLKTDKTFAIE 279
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
979-1357 |
1.09e-05 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 50.29 E-value: 1.09e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 979 KELKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTDELNKQLKDLSQKYTEV 1058
Cdd:PRK01156 172 KDVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEIERLSIEYNNAMDDYNNLKSALNELSSLEDMK 251
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1059 KNVKEKLVEENAKQTSEILAVQNL--LQKQHVPLEQVEALKK--------SLNGTIENLKEELKSMQRCYEKEQQTVTKL 1128
Cdd:PRK01156 252 NRYESEIKTAESDLSMELEKNNYYkeLEERHMKIINDPVYKNrnyindyfKYKNDIENKKQILSNIDAEINKYHAIIKKL 331
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1129 HQL-------LENQKNSSVPLAEHLQIKEAFEKEVGIIKA--SLREKEEESQNKMEEVSKLQSEVQNTKQALKKLETREv 1199
Cdd:PRK01156 332 SVLqkdyndyIKKKSRYDDLNNQILELEGYEMDYNSYLKSieSLKKKIEEYSKNIERMSAFISEILKIQEIDPDAIKKE- 410
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1200 vdLSKYKATKSDLETQISSLNEKLANLNRKYEE---------------VCEEVLHAKKKEISAKD--------EKELLHF 1256
Cdd:PRK01156 411 --LNEINVKLQDISSKVSSLNQRIRALRENLDElsrnmemlngqsvcpVCGTTLGEEKSNHIINHynekksrlEEKIREI 488
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1257 SIEQEIKDQKERCDKSLTTITELQ--RRIQESAKQIEAKDNKITELLNDVERLKQALNGLSQLT--------------YT 1320
Cdd:PRK01156 489 EIEVKDIDEKIVDLKKRKEYLESEeiNKSINEYNKIESARADLEDIKIKINELKDKHDKYEEIKnrykslkledldskRT 568
|
410 420 430
....*....|....*....|....*....|....*....
gi 530406156 1321 SGNPTKRQSQLID--TLQHQVKSLEQQLADADRQHQEVI 1357
Cdd:PRK01156 569 SWLNALAVISLIDieTNRSRSNEIKKQLNDLESRLQEIE 607
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
466-698 |
1.35e-05 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 48.99 E-value: 1.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 466 EQIKQLEDALKDVQKRMYESEGKVKQMQthflalkehltseaasgnhRLTEELKDQLKDLKVKYEGASAEVGKLRNQIKQ 545
Cdd:COG4942 20 DAAAEAEAELEQLQQEIAELEKELAALK-------------------KEEKALLKQLAALERRIAALARRIRALEQELAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 546 NEMIVEEFKRDEGKLIEENKRLQKELSmcEMERekkgrkVTEMEGQAKELSAKLALSIPAEKFENMK---------SSLS 616
Cdd:COG4942 81 LEAELAELEKEIAELRAELEAQKEELA--ELLR------ALYRLGRQPPLALLLSPEDFLDAVRRLQylkylaparREQA 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 617 NEVNEKAKKLVEMEREHEKSLSEIRQLKRELENVKAKLAQHVKpeEHEQVKSRLEQKSGELGKKITELTLKNQTLQKEIE 696
Cdd:COG4942 153 EELRADLAELAALRAELEAERAELEALLAELEEERAALEALKA--ERQKLLARLEKELAELAAELAELQQEAEELEALIA 230
|
..
gi 530406156 697 KV 698
Cdd:COG4942 231 RL 232
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
465-777 |
1.57e-05 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 49.55 E-value: 1.57e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 465 DEQIKQLE-DALKDVQKRMYESEGKVKQMQTHFLALKEhLTSEAASGNHRLtEELKDQLKDLKVKYEGASAEVGKLRNQI 543
Cdd:COG1196 199 ERQLEPLErQAEKAERYRELKEELKELEAELLLLKLRE-LEAELEELEAEL-EELEAELEELEAELAELEAELEELRLEL 276
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 544 KQNEMIVEEFKRDEGKLIEENKRLQKELSMCEMEREKKGRKVTEMEGQAKELSAKLALsipaekfenmkssLSNEVNEKA 623
Cdd:COG1196 277 EELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEE-------------LEEELEELE 343
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 624 KKLVEMEREHEKSLSEIRQLKRELENVKAKLAQHVKP-EEHEQVKSRLEQKSGELGKKITELTLKNQTLQKEIEKVYLDN 702
Cdd:COG1196 344 EELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEElEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEEL 423
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 530406156 703 KLLKEQAHNLTIEMKNHyvpLKVSEDMKKSHDAIIDDLNRKLLDVTQKYTEKKLEMEKLLLENDSLSKDVSRLET 777
Cdd:COG1196 424 EELEEALAELEEEEEEE---EEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLL 495
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1035-1251 |
1.59e-05 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 49.06 E-value: 1.59e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1035 RALSRKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQTSEILAVQnllqkqhvplEQVEALKKSLNGTIENLKEELKSM 1114
Cdd:COG3883 26 SELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQ----------AEIAEAEAEIEERREELGERARAL 95
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1115 QRcyekEQQTVTKLHQLLENQKNSSVplaehlqikeafekevgIIKASLREKEEESQNK-MEEVSKLQSEVQNTKQALKK 1193
Cdd:COG3883 96 YR----SGGSVSYLDVLLGSESFSDF-----------------LDRLSALSKIADADADlLEELKADKAELEAKKAELEA 154
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*...
gi 530406156 1194 LETREVVDLSKYKATKSDLETQISSLNEKLANLNRKYEEVcEEVLHAKKKEISAKDEK 1251
Cdd:COG3883 155 KLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAA-EAQLAELEAELAAAEAA 211
|
|
| COG5022 |
COG5022 |
Myosin heavy chain [General function prediction only]; |
921-1302 |
1.60e-05 |
|
Myosin heavy chain [General function prediction only];
Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 49.69 E-value: 1.60e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 921 SNAEILANYRKGQEEIVTLHAEIKAQKKELDTIQECIKVKYAPIV-SFEECERKFKATEKELKDQLSEQTQK-------- 991
Cdd:COG5022 804 SLLGSRKEYRSYLACIIKLQKTIKREKKLRETEEVEFSLKAEVLIqKFGRSLKAKKRFSLLKKETIYLQSAQrvelaerq 883
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 992 ---YSVSEEEVKKNKQENDKLKKEIF----TLQKDLRDKT-VLIEKSHEMERALS------------RKTDELNK----- 1046
Cdd:COG5022 884 lqeLKIDVKSISSLKLVNLELESEIIelkkSLSSDLIENLeFKTELIARLKKLLNnidleegpsieyVKLPELNKlheve 963
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1047 -QLKDLSQKYTEVKNVKEKLVEENAKQTSEILAVQNLLQKQHVPLEQVEALKKSLngtiENLKEELKSMQRCYEKEQQTV 1125
Cdd:COG5022 964 sKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQL----KELPVEVAELQSASKIISSES 1039
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1126 TKLHQLLENQKNSSVPLAEHLQIKEAFEKEVGIIKASLREKEEESQnkMEEVSKLQSEVQNTKQALKKLET---REVVDL 1202
Cdd:COG5022 1040 TELSILKPLQKLKGLLLLENNQLQARYKALKLRRENSLLDDKQLYQ--LESTENLLKTINVKDLEVTNRNLvkpANVLQF 1117
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1203 SKYKATKSDLETQISSLNEKLANLNRKYEEVCEEVLHAKKKEISAKDEKELLHFSIEQEIKDQKERCDKSLTTI-TELQR 1281
Cdd:COG5022 1118 IVAQMIKLNLLQEISKFLSQLVNTLEPVFQKLSVLQLELDGLFWEANLEALPSPPPFAALSEKRLYQSALYDEKsKLSSS 1197
|
410 420
....*....|....*....|.
gi 530406156 1282 RIQESAKQIEAKDNKITELLN 1302
Cdd:COG5022 1198 EVNDLKNELIALFSKIFSGWP 1218
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
466-664 |
1.76e-05 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 49.53 E-value: 1.76e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 466 EQIKQLEDALKDVQKrmYEsegKVKQMQTHFLALKEHLTSEAAsgnHRLTEELKDQLKDLKVKYEGASAEVGKLRNQIKQ 545
Cdd:COG4913 249 EQIELLEPIRELAER--YA---AARERLAELEYLRAALRLWFA---QRRLELLEAELEELRAELARLEAELERLEARLDA 320
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 546 NEMIVEEFKR----DEGKLIEenkRLQKELSMCEMEREKKGRKVTEMEGQAKELSAKLALSIPA------------EKFE 609
Cdd:COG4913 321 LREELDELEAqirgNGGDRLE---QLEREIERLERELEERERRRARLEALLAALGLPLPASAEEfaalraeaaallEALE 397
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 530406156 610 NMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKR-------ELENVKAKLAQHVKPEEHE 664
Cdd:COG4913 398 EELEALEEALAEAEAALRDLRRELRELEAEIASLERrksnipaRLLALRDALAEALGLDEAE 459
|
|
| TRPV5-6 |
cd22192 |
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 5 and 6; TRPV5 and ... |
64-191 |
1.96e-05 |
|
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 5 and 6; TRPV5 and TRPV6 (TRPV5/6) are two homologous members within the vanilloid subfamily of the transient receptor potential (TRP) family. TRPV5 and TRPV6 show only 30-40% homology with other members of the TRP family and have unique properties that differentiates them from other TRP channels. They mediate calcium uptake in epithelia and their expression is dramatically increased in numerous types of cancer. The structure of TRPV5/6 shows the typical topology features of all TRP family members, such as six transmembrane regions, a short hydrophobic stretch between transmembrane segments 5 and 6, which is predicted to form the Ca2+ pore, and large intracellular N- and C-terminal domains. The N-terminal domain of TRPV5/6 contains three ankyrin repeats. This structural element is present in several proteins and plays a role in protein-protein interactions. The N- and C-terminal tails of TRPV5/6 each contain an internal PDZ motif which can function as part of a molecular scaffold via interaction with PDZ-domain containing proteins. A major difference between the properties of TRPV5 and TRPV6 is in their tissue distribution: TRPV5 is predominantly expressed in the distal convoluted tubules (DCT) and connecting tubules (CNT) of the kidney, with limited expression in extrarenal tissues. In contrast, TRPV6 has a broader expression pattern such as expression in the intestine, kidney, placenta, epididymis, exocrine tissues, and a few other tissues.
Pssm-ID: 411976 [Multi-domain] Cd Length: 609 Bit Score: 49.24 E-value: 1.96e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 64 SKGNLECLNAILI-HGVDITTSDTAGRNALHLAAKYGHALCLQKLLQ-----YNCPTEHADLQGRTALHDAAMADCPSSI 137
Cdd:cd22192 26 KENDVQAIKKLLKcPSCDLFQRGALGETALHVAALYDNLEAAVVLMEaapelVNEPMTSDLYQGETALHIAVVNQNLNLV 105
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 530406156 138 QLLCDHGASVNAKDVD--------------GRTPLVLATQMSRPTICQLLIDRGADVNSRDKQNRTAL 191
Cdd:cd22192 106 RELIARGADVVSPRATgtffrpgpknliyyGEHPLSFAACVGNEEIVRLLIEHGADIRAQDSLGNTVL 173
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
514-807 |
2.02e-05 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 49.16 E-value: 2.02e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 514 LTEELKDQLKDLK-----------VKYEGASAEVGKLRNQIKQNEMIVEEFKRDEGKLIEENKRLQKELSMCEMEREKKG 582
Cdd:COG1196 194 ILGELERQLEPLErqaekaeryreLKEELKELEAELLLLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELR 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 583 RKVTEMEGQAKELSAKL-ALSIPAEKFENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKRELENVKAKLAQH---- 657
Cdd:COG1196 274 LELEELELELEEAQAEEyELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAeeel 353
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 658 ----VKPEEHEQVKSRLEQKSGELGKKITELTLKNQTLQKEIEKVYLDNKLLKEQAHNLTIEMKNHyvplkvsEDMKKSH 733
Cdd:COG1196 354 eeaeAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERL-------EEELEEL 426
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 530406156 734 DAIIDDLNRKLLDVTQKYTEKKLEMEKLLLENDSLSKDVSRLETVFvppEKHEKEIIALKSNIVELKKQLSELK 807
Cdd:COG1196 427 EEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEA---ALLEAALAELLEELAEAAARLLLLL 497
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
469-1354 |
2.25e-05 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 49.02 E-value: 2.25e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 469 KQLEDALKDVQKRMYESEGKVKQMQThfLALKEHLTSEAASGNHRLTEELKDQLKDLKVKYEGASAEvgkLRNQIKQNEM 548
Cdd:pfam01576 155 KLLEERISEFTSNLAEEEEKAKSLSK--LKNKHEAMISDLEERLKKEEKGRQELEKAKRKLEGESTD---LQEQIAELQA 229
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 549 IVEEFKRDEGKLIEENKRLQKELSMCEMEREKKGRKVTEMEGQAKELSAKLALSIPA-EKFENMKSSLSNEVnEKAKKLV 627
Cdd:pfam01576 230 QIAELRAQLAKKEEELQAALARLEEETAQKNNALKKIRELEAQISELQEDLESERAArNKAEKQRRDLGEEL-EALKTEL 308
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 628 EMEREHEKSLSEIRQlKRELENVKAKLAQHVKPEEHEQVKSRLEQKSGELGKKITELTLKNQTLQKEIEKvylDNKLLKE 707
Cdd:pfam01576 309 EDTLDTTAAQQELRS-KREQEVTELKKALEEETRSHEAQLQEMRQKHTQALEELTEQLEQAKRNKANLEK---AKQALES 384
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 708 QAHNLTIEMKNHYVPLKVSEDMKKSHDAIIDDLNRKLLDVTQKYTEKKLEMEKLLLENDSLSKDVSRLETVFVppeKHEK 787
Cdd:pfam01576 385 ENAELQAELRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLLNEAEGKNI---KLSK 461
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 788 EIIALKSNIVELKKQLSELKKKCGEDQEKIHALTSENTNLKKMMSNQyvpvkthEEVKMTLNDTLAKTNRELLDVKKKFE 867
Cdd:pfam01576 462 DVSSLESQLQDTQELLQEETRQKLNLSTRLRQLEDERNSLQEQLEEE-------EEAKRNVERQLSTLQAQLSDMKKKLE 534
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 868 DINQEFVKIKDKNEILKRNLENTQNQIKAEYISLAEHEAKMSSLSQSMRKV---QDSNAEILANYRKGQEEIVTLHAEIK 944
Cdd:pfam01576 535 EDAGTLEALEEGKKRLQRELEALTQQLEEKAAAYDKLEKTKNRLQQELDDLlvdLDHQRQLVSNLEKKQKKFDQMLAEEK 614
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 945 AqkkeldtiqecIKVKYApivsfEECER-KFKATEKELKD-QLSEQTQKYSVSEEEVKKNkqeNDKLKKEIFTL--QKDL 1020
Cdd:pfam01576 615 A-----------ISARYA-----EERDRaEAEAREKETRAlSLARALEEALEAKEELERT---NKQLRAEMEDLvsSKDD 675
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1021 RDKTVlieksHEME---RALSRKTDELNKQLKDLSQkytevknvkEKLVEENAKQTSEIlavqnllqkqhvpleQVEALK 1097
Cdd:pfam01576 676 VGKNV-----HELErskRALEQQVEEMKTQLEELED---------ELQATEDAKLRLEV---------------NMQALK 726
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1098 KSLNGTIENLKEELKsmqrcyEKEQQTVTKLHQLLENQKNSSVPLAEHLQIKEAFEKEVGIIKASLREKEEESQNKMEEV 1177
Cdd:pfam01576 727 AQFERDLQARDEQGE------EKRRQLVKQVRELEAELEDERKQRAQAVAAKKKLELDLKELEAQIDAANKGREEAVKQL 800
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1178 SKLQSEVQNTKQALK--KLETREVVDLSKYKATKS-DLETQISSLNEKLANLNR-------KYEEVCEEVLHAKKKEISA 1247
Cdd:pfam01576 801 KKLQAQMKDLQRELEeaRASRDEILAQSKESEKKLkNLEAELLQLQEDLAASERarrqaqqERDELADEIASGASGKSAL 880
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1248 KDEKELLhfsiEQEIKDQKERCDKSLTTITELQRRIQESAKQIEAKDNKITELLNDVERLKQALNGL--------SQLTY 1319
Cdd:pfam01576 881 QDEKRRL----EARIAQLEEELEEEQSNTELLNDRLRKSTLQVEQLTTELAAERSTSQKSESARQQLerqnkelkAKLQE 956
|
890 900 910
....*....|....*....|....*....|....*
gi 530406156 1320 TSGNPTKRQSQLIDTLQHQVKSLEQQLADADRQHQ 1354
Cdd:pfam01576 957 MEGTVKSKFKSSIAALEAKIAQLEEQLEQESRERQ 991
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
995-1229 |
2.36e-05 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 48.22 E-value: 2.36e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 995 SEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQTS 1074
Cdd:COG4942 18 QADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRA 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1075 EILAVQNLLQKQHV---PLEQVEALKKSLNGtiENLKEELKSMQRcyekeqqtvtkLHQLLENQKNSSVPLAEHLQIKEA 1151
Cdd:COG4942 98 ELEAQKEELAELLRalyRLGRQPPLALLLSP--EDFLDAVRRLQY-----------LKYLAPARREQAEELRADLAELAA 164
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 530406156 1152 FEKEVGIIKASLREKEEESQNKMEEVSKLQSEVQNTKQALKKLETREVVDLSKYKATKSDLETQISSLNEKLANLNRK 1229
Cdd:COG4942 165 LRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAER 242
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
692-1335 |
3.06e-05 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 48.89 E-value: 3.06e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 692 QKEIEKvyldNKLLKEQAHNLTIEMKNHYVPLKVSEDM---KKSHDAIID---DLNRKLLDVTQKYTEK--KLEMEKLLL 763
Cdd:TIGR01612 1506 AKAIEK----NKELFEQYKKDVTELLNKYSALAIKNKFaktKKDSEIIIKeikDAHKKFILEAEKSEQKikEIKKEKFRI 1581
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 764 ENDSLSKDVSRLETVFVPPEKHEKEIIALKsnIVELKKQLSELKKKCGEDQEKIHALT--SENTNLKKMMSNqyvpVKTH 841
Cdd:TIGR01612 1582 EDDAAKNDKSNKAAIDIQLSLENFENKFLK--ISDIKKKINDCLKETESIEKKISSFSidSQDTELKENGDN----LNSL 1655
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 842 EEVKMTLNDTlaKTNREllDVKKKFEDINQEFVKIKDKNEILKRNLE-NTQNQIKAEYISlaeHEAKMSSLSQSmrkVQD 920
Cdd:TIGR01612 1656 QEFLESLKDQ--KKNIE--DKKKELDELDSEIEKIEIDVDQHKKNYEiGIIEKIKEIAIA---NKEEIESIKEL---IEP 1725
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 921 SNAEILANYRKGQEEIVTLHAEIKAQKKELDTIQECIKVKYAPIVSFEECERKFKATEKELKD-QLSEQTQKYSVSEEEV 999
Cdd:TIGR01612 1726 TIENLISSFNTNDLEGIDPNEKLEEYNTEIGDIYEEFIELYNIIAGCLETVSKEPITYDEIKNtRINAQNEFLKIIEIEK 1805
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1000 KKNKQENDKLKKEIFTLQKDLRDKTvlieksHEMERALSRKTDELNKQLKDLSQKYTEVKN-VKEKLVEENAKQTSEILA 1078
Cdd:TIGR01612 1806 KSKSYLDDIEAKEFDRIINHFKKKL------DHVNDKFTKEYSKINEGFDDISKSIENVKNsTDENLLFDILNKTKDAYA 1879
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1079 vqNLLQKQHVP--------LEQVEALKKSLNGTIEN------LKEELKSMQRCYEKEQQTVTKLHQLLENQKNSsvplae 1144
Cdd:TIGR01612 1880 --GIIGKKYYSykdeaekiFINISKLANSINIQIQNnsgidlFDNINIAILSSLDSEKEDTLKFIPSPEKEPEI------ 1951
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1145 HLQIKEAFEKEVGIIKAS--LREKEEESQNKMEEVSKLQSEVQNTKqalkklETREVVDLSKYKATK--SDLETQISSLN 1220
Cdd:TIGR01612 1952 YTKIRDSYDTLLDIFKKSqdLHKKEQDTLNIIFENQQLYEKIQASN------ELKDTLSDLKYKKEKilNDVKLLLHKFD 2025
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1221 E--KLANLNRKYEEVCEEVLHAKKKEISAKDEKELLHFSIEQEIKDQKERCDKSLTTITELQRRIQESAKQIEAKDNKIT 1298
Cdd:TIGR01612 2026 ElnKLSCDSQNYDTILELSKQDKIKEKIDNYEKEKEKFGIDFDVKAMEEKFDNDIKDIEKFENNYKHSEKDNHDFSEEKD 2105
|
650 660 670
....*....|....*....|....*....|....*..
gi 530406156 1299 ELLNDVERLKQALNGLSQLTYTSGNPTKRQSQLIDTL 1335
Cdd:TIGR01612 2106 NIIQSKKKLKELTEAFNTEIKIIEDKIIEKNDLIDKL 2142
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
458-656 |
4.12e-05 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 48.09 E-value: 4.12e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 458 ERVKEDSDEQIKQLEDALKDVQKRMYESEGKVKQMQthflalKEH----LTSEAASGNHRLTeELKDQLKDLKVKYEGAS 533
Cdd:COG3206 167 ELRREEARKALEFLEEQLPELRKELEEAEAALEEFR------QKNglvdLSEEAKLLLQQLS-ELESQLAEARAELAEAE 239
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 534 AEVGKLRNQIKQNEMIVEEFKRDegkliEENKRLQKELSMCEMEREKKGRKVTEMEGQAKELSAKLAlsipaekfeNMKS 613
Cdd:COG3206 240 ARLAALRAQLGSGPDALPELLQS-----PVIQQLRAQLAELEAELAELSARYTPNHPDVIALRAQIA---------ALRA 305
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 530406156 614 SLSNEVnekAKKLVEMEREHEKSLSEIRQLKRELENVKAKLAQ 656
Cdd:COG3206 306 QLQQEA---QRILASLEAELEALQAREASLQAQLAQLEARLAE 345
|
|
| PspC_subgroup_1 |
NF033838 |
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... |
724-1097 |
5.02e-05 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 47.70 E-value: 5.02e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 724 KVSEDMKKSHDAIIDDLNRKLLDVTQKYTeKKLEMEKLLLENDSLSKDVSRLETVFvppEKHEKEIIALKSNIVELKKQL 803
Cdd:NF033838 73 EIQKSLDKRKHTQNVALNKKLSDIKTEYL-YELNVLKEKSEAELTSKTKKELDAAF---EQFKKDTLEPGKKVAEATKKV 148
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 804 SELKKKCGEDQEKIHALTSENTnlkkmmsnqyvpVKTHEEVKMTLNDTLAKTNRELLDVKKKF----EDINQEFVKIKDK 879
Cdd:NF033838 149 EEAEKKAKDQKEEDRRNYPTNT------------YKTLELEIAESDVEVKKAELELVKEEAKEprdeEKIKQAKAKVESK 216
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 880 NEILKRnLENTQNQIKAeyislAEHEAKMSSLSQSMRKVQDSNAEILANYRKGQEEIVTLHAEIKAQKKELDTIQECIKV 959
Cdd:NF033838 217 KAEATR-LEKIKTDREK-----AEEEAKRRADAKLKEAVEKNVATSEQDKPKRRAKRGVLGEPATPDKKENDAKSSDSSV 290
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 960 --------KYAPIVSFEECERKFKATEKELKDQLSEQTQKY----------SVSEEEVKKNKQENDKLKKEiftlQKDLR 1021
Cdd:NF033838 291 geetlpspSLKPEKKVAEAEKKVEEAKKKAKDQKEEDRRNYptntyktlelEIAESDVKVKEAELELVKEE----AKEPR 366
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1022 DKTVLIEKSHEMERALSRKT------DELNKQLKDLSQKYTEVKNVKEKLVEEnaKQTSEILAVQNLLQKQHVPLEQVEA 1095
Cdd:NF033838 367 NEEKIKQAKAKVESKKAEATrlekikTDRKKAEEEAKRKAAEEDKVKEKPAEQ--PQPAPAPQPEKPAPKPEKPAEQPKA 444
|
..
gi 530406156 1096 LK 1097
Cdd:NF033838 445 EK 446
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
415-896 |
5.41e-05 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 47.79 E-value: 5.41e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 415 NEILKKELEAMRTFCESAKQDRLKLQNELAHKVAECKALALECERVKEDSDEQIKQL-------EDALKDVQKRMYESEG 487
Cdd:pfam05483 189 NNNIEKMILAFEELRVQAENARLEMHFKLKEDHEKIQHLEEEYKKEINDKEKQVSLLliqitekENKMKDLTFLLEESRD 268
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 488 KVKQMQ----------THFLALKEHLTSE----------AASGNHRLTEELKDQLKDLKVKYEGASAEVGKLRNQIKQNE 547
Cdd:pfam05483 269 KANQLEektklqdenlKELIEKKDHLTKEledikmslqrSMSTQKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHS 348
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 548 MIVEEFKRDEGKLIE----ENKRLQK---ELSMCEMEREKKGRKVTEMEG--QAKELSAKLALSIPAEKfenmkSSLSNE 618
Cdd:pfam05483 349 FVVTEFEATTCSLEEllrtEQQRLEKnedQLKIITMELQKKSSELEEMTKfkNNKEVELEELKKILAED-----EKLLDE 423
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 619 vNEKAKKLVEMEREHEKSLSEIRQLK-RELENVKAKLAQHVKPEEHeqvksrLEQKSGELGKKITELTLKNQTLQKEIEK 697
Cdd:pfam05483 424 -KKQFEKIAEELKGKEQELIFLLQAReKEIHDLEIQLTAIKTSEEH------YLKEVEDLKTELEKEKLKNIELTAHCDK 496
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 698 VYLDNKLLKEQAHNLTIEMKNHYVPLKvseDMKKSHDAIIDDLNRKLLDVTQKYTEKKLEMEKLLLENDSLSKDVSRLET 777
Cdd:pfam05483 497 LLLENKELTQEASDMTLELKKHQEDII---NCKKQEERMLKQIENLEEKEMNLRDELESVREEFIQKGDEVKCKLDKSEE 573
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 778 vfvPPEKHEKEIIALKSNIVELKKQLSELKKKCGEDQEKIHALTSENTNLKKMMSNQYVPVKTHEevkmtlnDTLAKTNR 857
Cdd:pfam05483 574 ---NARSIEYEVLKKEKQMKILENKCNNLKKQIENKNKNIEELHQENKALKKKGSAENKQLNAYE-------IKVNKLEL 643
|
490 500 510
....*....|....*....|....*....|....*....
gi 530406156 858 ELLDVKKKFEDINQEFVKIKDKNEILKRNLENTQNQIKA 896
Cdd:pfam05483 644 ELASAKQKFEEIIDNYQKEIEDKKISEEKLLEEVEKAKA 682
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
1159-1385 |
5.94e-05 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 47.70 E-value: 5.94e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1159 IKASLREKEEESQNKM----EEVSKLQSEVQNTKQALKKLETRE-VVDLSkykATKSDLETQISSLNEKLANLNRKYEEV 1233
Cdd:COG3206 162 LEQNLELRREEARKALefleEQLPELRKELEEAEAALEEFRQKNgLVDLS---EEAKLLLQQLSELESQLAEARAELAEA 238
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1234 ceEVLHAKKKEISAKDEKELLHFSIEQEIKDQKERCDKSLTTITELQRRIQESAKQIEAKDNKITELLNDVERLKQALNG 1313
Cdd:COG3206 239 --EARLAALRAQLGSGPDALPELLQSPVIQQLRAQLAELEAELAELSARYTPNHPDVIALRAQIAALRAQLQQEAQRILA 316
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 530406156 1314 LSQLTYTSGNPTKRQ-SQLIDTLQHQVKSL---EQQLADADRQHQEVIAIYRTHLLSAAQGHMDEDVQEALLQIIQ 1385
Cdd:COG3206 317 SLEAELEALQAREASlQAQLAQLEARLAELpelEAELRRLEREVEVARELYESLLQRLEEARLAEALTVGNVRVID 392
|
|
| Ank_5 |
pfam13857 |
Ankyrin repeats (many copies); |
40-95 |
7.62e-05 |
|
Ankyrin repeats (many copies);
Pssm-ID: 433530 [Multi-domain] Cd Length: 56 Bit Score: 41.56 E-value: 7.62e-05
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*.
gi 530406156 40 ILAKKGVNPGKLDVEGRSVFHVVTSKGNLECLNAILIHGVDITTSDTAGRNALHLA 95
Cdd:pfam13857 1 LLEHGPIDLNRLDGEGYTPLHVAAKYGALEIVRVLLAYGVDLNLKDEEGLTALDLA 56
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
465-675 |
8.19e-05 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 46.93 E-value: 8.19e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 465 DEQIKQLEDALKDVQKRMYESEGKVKQMQTHFLALKEHLTSEAASgNHRLTEELKDQLKDLKVKYEGASAEVGKLRNQIK 544
Cdd:PHA02562 187 DMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKT-IKAEIEELTDELLNLVMDIEDPSAALNKLNTAAA 265
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 545 QNEMIVEEFKRDE-----------------------GKLIEENKRLQKELSMCEMEREKKGRKVTEMEGQAKELSAKlal 601
Cdd:PHA02562 266 KIKSKIEQFQKVIkmyekggvcptctqqisegpdriTKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLEL--- 342
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 530406156 602 sipAEKFENMKSSLSNEVNEKAKKLVEMER---EHEKSLSEIRQLKRELENVKAKLAQHVKPEEHEQVKSRLEQKSG 675
Cdd:PHA02562 343 ---KNKISTNKQSLITLVDKAKKVKAAIEElqaEFVDNAEELAKLQDELDKIVKTKSELVKEKYHRGIVTDLLKDSG 416
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
556-1267 |
1.02e-04 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 46.97 E-value: 1.02e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 556 DEGKLIEENKRLQKELSMCE-MEREKKGRKVTEMEGQAKELSaklalSIPAEKFENMKSSLSNEVNEKAKKLVEMEREHE 634
Cdd:TIGR01612 694 DKAKLDDLKSKIDKEYDKIQnMETATVELHLSNIENKKNELL-----DIIVEIKKHIHGEINKDLNKILEDFKNKEKELS 768
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 635 KSLSEIRQLKRELENVKAKLAQhVKPEEHEQ--VKSRLEQKSGELGKKITELTLKNQTLQKEIEKVYLDNKLLKEQAHNL 712
Cdd:TIGR01612 769 NKINDYAKEKDELNKYKSKISE-IKNHYNDQinIDNIKDEDAKQNYDKSKEYIKTISIKEDEIFKIINEMKFMKDDFLNK 847
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 713 TiemkNHYVPLK--VSEDMKKSHDAIIDDLNRKLLDVTQkytEKKLEMEKLLLENDSLSKDVSrletvfvppekheKEII 790
Cdd:TIGR01612 848 V----DKFINFEnnCKEKIDSEHEQFAELTNKIKAEISD---DKLNDYEKKFNDSKSLINEIN-------------KSIE 907
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 791 ALKSNIVELKKqLSELKKKCGEDQEKIHALTSENTNLKKMMSNQYVPVKTHEEVKMTLND----TLAKTNRELLDVKKK- 865
Cdd:TIGR01612 908 EEYQNINTLKK-VDEYIKICENTKESIEKFHNKQNILKEILNKNIDTIKESNLIEKSYKDkfdnTLIDKINELDKAFKDa 986
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 866 ----FEDINQEFVK-IKDKNEILKRNLENTQnqikaeYISLAEHEAKMSSLSQsmrKVQDSNAEI----LANYRKGQEEI 936
Cdd:TIGR01612 987 slndYEAKNNELIKyFNDLKANLGKNKENML------YHQFDEKEKATNDIEQ---KIEDANKNIpnieIAIHTSIYNII 1057
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 937 VTLHAEIKAQKKELDTiqECIKVKYAPIVSFEECERKFK---------------ATE-KELKDQLSEQTQK--YSVSEEE 998
Cdd:TIGR01612 1058 DEIEKEIGKNIELLNK--EILEEAEINITNFNEIKEKLKhynfddfgkeenikyADEiNKIKDDIKNLDQKidHHIKALE 1135
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 999 VKKNKQEN--DKLKKEIFTLQkDLRDKTVLIEKSHEMERALSRKTDELNKQlkdlSQKYTEVKNVKEKLVEENAKQTS-E 1075
Cdd:TIGR01612 1136 EIKKKSENyiDEIKAQINDLE-DVADKAISNDDPEEIEKKIENIVTKIDKK----KNIYDEIKKLLNEIAEIEKDKTSlE 1210
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1076 ILAVQNLLQKQHVP---LEQVEALKKSLNGTIENLKEELKSMQRCYEKEQQTVTKLHQLLE------------------- 1133
Cdd:TIGR01612 1211 EVKGINLSYGKNLGklfLEKIDEEKKKSEHMIKAMEAYIEDLDEIKEKSPEIENEMGIEMDikaemetfnishdddkdhh 1290
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1134 -----NQKNSSVPLAEHLQIKEAFEKEVGI--IKASLREKEEESQNKMEEVSKLQSEVQNTKQALKKLETREVVDlsKYK 1206
Cdd:TIGR01612 1291 iiskkHDENISDIREKSLKIIEDFSEESDIndIKKELQKNLLDAQKHNSDINLYLNEIANIYNILKLNKIKKIID--EVK 1368
|
730 740 750 760 770 780
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 530406156 1207 ATKSDLETQISSLNEKLANLNRKYEEVCEEVlhaKKKEISAKDEKELLHFSIEQEIKDQKE 1267
Cdd:TIGR01612 1369 EYTKEIEENNKNIKDELDKSEKLIKKIKDDI---NLEECKSKIESTLDDKDIDECIKKIKE 1426
|
|
| Ank_5 |
pfam13857 |
Ankyrin repeats (many copies); |
173-220 |
1.08e-04 |
|
Ankyrin repeats (many copies);
Pssm-ID: 433530 [Multi-domain] Cd Length: 56 Bit Score: 41.18 E-value: 1.08e-04
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 530406156 173 LIDRG-ADVNSRDKQNRTALMLGCEYGCRDAVEVLIKNGADISLLDALG 220
Cdd:pfam13857 1 LLEHGpIDLNRLDGEGYTPLHVAAKYGALEIVRVLLAYGVDLNLKDEEG 49
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
923-1385 |
1.13e-04 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 46.89 E-value: 1.13e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 923 AEILANYrKGQEEIVTLHAEIKAQKKELDTIQECIKVKYAPIVSFEECERK--------FKATEKELKDQLSE--QTQKY 992
Cdd:TIGR00618 166 KELLMNL-FPLDQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDtyherkqvLEKELKHLREALQQtqQSHAY 244
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 993 SVSEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEmERALSRKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQ 1072
Cdd:TIGR00618 245 LTQKREAQEEQLKKQQLLKQLRARIEELRAQEAVLEETQE-RINRARKAAPLAAHIKAVTQIEQQAQRIHTELQSKMRSR 323
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1073 TSEILAVQNLLQKQHVPLEQVEALKKSLNGTIENLKEELKSMQRCYEKEQQT-----VTKLHQLLENQKNSSVPLAEHLQ 1147
Cdd:TIGR00618 324 AKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHtltqhIHTLQQQKTTLTQKLQSLCKELD 403
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1148 IKEAFEKEVGIIKASLR---------EKEEESQNKMEEVSKLQSEVQNTKQALKKLE-------TREVVDLSKYKATKSD 1211
Cdd:TIGR00618 404 ILQREQATIDTRTSAFRdlqgqlahaKKQQELQQRYAELCAAAITCTAQCEKLEKIHlqesaqsLKEREQQLQTKEQIHL 483
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1212 LETQISSLNEKLANLNRKYE-EVCEEVLH--AKKKEISAKDEKELLHFSIEQEIKDQKERCDKSLTTITELQRRIQESAK 1288
Cdd:TIGR00618 484 QETRKKAVVLARLLELQEEPcPLCGSCIHpnPARQDIDNPGPLTRRMQRGEQTYAQLETSEEDVYHQLTSERKQRASLKE 563
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1289 QIEAKDNKITELLNDVERLKQALNGLSQLTYTSGNPTKRQSQLIDTL---------QHQVKSLEQQLADADRQHQEVIAI 1359
Cdd:TIGR00618 564 QMQEIQQSFSILTQCDNRSKEDIPNLQNITVRLQDLTEKLSEAEDMLaceqhallrKLQPEQDLQDVRLHLQQCSQELAL 643
|
490 500
....*....|....*....|....*.
gi 530406156 1360 YRTHLLSAAQGHMDEDVQEALLQIIQ 1385
Cdd:TIGR00618 644 KLTALHALQLTLTQERVREHALSIRV 669
|
|
| CagA_N |
pfam18971 |
CagA protein; The Helicobacter pylori type IV secretion effector CagA is a major bacterial ... |
883-1134 |
1.22e-04 |
|
CagA protein; The Helicobacter pylori type IV secretion effector CagA is a major bacterial virulence determinant and critical for gastric carcinogenesis. X-ray crystallographic analysis of the N-terminal CagA fragment (residues 1-876) revealed that the region has a structure comprised of three discrete domains. Domain I constitutes a mobile CagA N terminus, while Domain II tethers CagA to the plasma membrane by interacting with membrane phosphatidylserine. Domain III interacts intramolecularly with the intrinsically disordered C-terminal region, and this interaction potentiates the pathogenic scaffold/hub function of CagA.
Pssm-ID: 408741 [Multi-domain] Cd Length: 876 Bit Score: 46.69 E-value: 1.22e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 883 LKRNLENtqnQIKAEYISLAEHEAKMSSLSQSMRKVqdsnAEILANYRKGQEEIVTL--HAEIKAQKKELdtiqecikvk 960
Cdd:pfam18971 557 VRRNLEN---KLTAKGLSLQEANKLIKDFLSSNKEL----AGKALNFNKAVAEAKSTgnYDEVKKAQKDL---------- 619
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 961 yapivsfEECERKFKATEKELKDQLSEQTQKYSVSEEEVKKNKQendklKKEIFTL--QKDLRDKTVL--IEKSHEMERA 1036
Cdd:pfam18971 620 -------EKSLRKREHLEKEVEKKLESKSGNKNKMEAKAQANSQ-----KDEIFALinKEANRDARAIayTQNLKGIKRE 687
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1037 LSRKTDELNKQLKDLSQKYTEVKNVKEKlveENAKQTSEILAVQNLLQKQHVPLEQVEalkkslngTIENLKEELKSMQR 1116
Cdd:pfam18971 688 LSDKLEKISKDLKDFSKSFDEFKNGKNK---DFSKAEETLKALKGSVKDLGINPEWIS--------KVENLNAALNEFKN 756
|
250
....*....|....*...
gi 530406156 1117 CYEKEQQTVTKLHQLLEN 1134
Cdd:pfam18971 757 GKNKDFSKVTQAKSDLEN 774
|
|
| PTZ00322 |
PTZ00322 |
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional |
93-173 |
1.58e-04 |
|
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
Pssm-ID: 140343 [Multi-domain] Cd Length: 664 Bit Score: 46.04 E-value: 1.58e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 93 HLAAKyGHALCLQKLLQYNCPTEHADLQGRTALHDAAMADCPSSIQLLCDHGASVNAKDVDGRTPLVLATQMSRPTICQL 172
Cdd:PTZ00322 88 QLAAS-GDAVGARILLTGGADPNCRDYDGRTPLHIACANGHVQVVRVLLEFGADPTLLDKDGKTPLELAEENGFREVVQL 166
|
.
gi 530406156 173 L 173
Cdd:PTZ00322 167 L 167
|
|
| COG2433 |
COG2433 |
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only]; |
1187-1311 |
1.97e-04 |
|
Possible nuclease of RNase H fold, RuvC/YqgF family [General function prediction only];
Pssm-ID: 441980 [Multi-domain] Cd Length: 644 Bit Score: 46.00 E-value: 1.97e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1187 TKQALKKLETREVVDLSKYKATKSDLETQISSLNEKLANLnRKYEEVCEEVLHAKKKEIsAKDEKELLHFSIEQEIKDQK 1266
Cdd:COG2433 386 IEKELPEEEPEAEREKEHEERELTEEEEEIRRLEEQVERL-EAEVEELEAELEEKDERI-ERLERELSEARSEERREIRK 463
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 530406156 1267 ERcdksltTITELQRRIQESAKQIEAKDNKITELLNDVERLKQAL 1311
Cdd:COG2433 464 DR------EISRLDREIERLERELEEERERIEELKRKLERLKELW 502
|
|
| PTZ00322 |
PTZ00322 |
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional |
171-239 |
2.01e-04 |
|
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
Pssm-ID: 140343 [Multi-domain] Cd Length: 664 Bit Score: 45.66 E-value: 2.01e-04
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 530406156 171 QLLIDRGADVNSRDKQNRTALMLGCEYGCRDAVEVLIKNGADISLLDALGHDSSYYARIGDNLDILTLL 239
Cdd:PTZ00322 99 RILLTGGADPNCRDYDGRTPLHIACANGHVQVVRVLLEFGADPTLLDKDGKTPLELAEENGFREVVQLL 167
|
|
| Ank |
pfam00023 |
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ... |
153-185 |
2.02e-04 |
|
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin- actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity.
Pssm-ID: 459634 [Multi-domain] Cd Length: 34 Bit Score: 39.97 E-value: 2.02e-04
10 20 30
....*....|....*....|....*....|....
gi 530406156 153 DGRTPLVLA-TQMSRPTICQLLIDRGADVNSRDK 185
Cdd:pfam00023 1 DGNTPLHLAaGRRGNLEIVKLLLSKGADVNARDK 34
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
838-1348 |
2.03e-04 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 45.94 E-value: 2.03e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 838 VKTHEEVKMTLNDTLAKTNRElldvKKKFEDINQEFVKIKDKNEILKRNLENTQNQikaeyislaeHEAKMSSLSQSMRK 917
Cdd:pfam01576 133 IKKLEEDILLLEDQNSKLSKE----RKLLEERISEFTSNLAEEEEKAKSLSKLKNK----------HEAMISDLEERLKK 198
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 918 VQDSNAEILANYRKGQEEIVTLHAEIKAQKKELdtiqecikvkyapivsfEECERKFKATEKELKDQLSEQTQKYSVSEE 997
Cdd:pfam01576 199 EEKGRQELEKAKRKLEGESTDLQEQIAELQAQI-----------------AELRAQLAKKEEELQAALARLEEETAQKNN 261
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 998 EVKKNKQendkLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTDELNKQLKDL-----------SQKYTEVKNVKEKLV 1066
Cdd:pfam01576 262 ALKKIRE----LEAQISELQEDLESERAARNKAEKQRRDLGEELEALKTELEDTldttaaqqelrSKREQEVTELKKALE 337
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1067 EENAKQTSEilaVQNLLQKQHVPLEQ--------------VEALKKSLNGTIENLKEELKSMQRCYEKEQQTVTKLH-QL 1131
Cdd:pfam01576 338 EETRSHEAQ---LQEMRQKHTQALEElteqleqakrnkanLEKAKQALESENAELQAELRTLQQAKQDSEHKRKKLEgQL 414
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1132 LENQKNSSVPLAEHLQIKEAFEK---EVGIIKASLREKEEESQNKMEEVSKLQSEVQNTkQALKKLETREvvdlskykat 1208
Cdd:pfam01576 415 QELQARLSESERQRAELAEKLSKlqsELESVSSLLNEAEGKNIKLSKDVSSLESQLQDT-QELLQEETRQ---------- 483
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1209 KSDLETQISSLNEKLANLNRKYEEvceevlhakkkEISAKDEKELLHFSIEQEIKDQKERCDKSLTTITELQRRIQESAK 1288
Cdd:pfam01576 484 KLNLSTRLRQLEDERNSLQEQLEE-----------EEEAKRNVERQLSTLQAQLSDMKKKLEEDAGTLEALEEGKKRLQR 552
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1289 QIEAKDNKITELLNDVERLKQALNGLSQLTYTSGNPTKRQSQLIDTLQHQVKSLEQQLAD 1348
Cdd:pfam01576 553 ELEALTQQLEEKAAAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSNLEKKQKKFDQMLAE 612
|
|
| CCDC22 |
pfam05667 |
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 ... |
966-1315 |
2.07e-04 |
|
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 (CCDC22) is involved in regulation of NF-kappa-B signalling; the function may involve association with COMMD8 and a CUL1-dependent E3 ubiquitin ligase complex. It is part of the OMMD/CCDC22/CCDC93 (CCC) complex, which interacts with the multisubunit WASH complex required for endosomal deposition of F-actin and cargo trafficking in conjunction with the retromer. This entry also includes CCDC22 homologs from animals and plants.
Pssm-ID: 461708 [Multi-domain] Cd Length: 600 Bit Score: 45.79 E-value: 2.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 966 SFEECERKFKATEKELKDQLSEQTQKYSVSEEEVKKNKQEndklkkeifTLQKDLRDK-TVLIEKSHEMERALSRKTDEL 1044
Cdd:pfam05667 207 SLLERNAAELAAAQEWEEEWNSQGLASRLTPEEYRKRKRT---------KLLKRIAEQlRSAALAGTEATSGASRSAQDL 277
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1045 NKQLKDLSQKYTEVKNV--------KEKLV--EENAKQTSEILAV----QNLLQKQHVPLEQVEALKKSLNGTIENLKEE 1110
Cdd:pfam05667 278 AELLSSFSGSSTTDTGLtkgsrfthTEKLQftNEAPAATSSPPTKveteEELQQQREEELEELQEQLEDLESSIQELEKE 357
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1111 LKSMQrcyekeqQTVTKLHQLLENQKNSSVPLAEHLQIKEafeKEVGIIkaslrekeEESQNKMEevsKLQSEVQNTKQA 1190
Cdd:pfam05667 358 IKKLE-------SSIKQVEEELEELKEQNEELEKQYKVKK---KTLDLL--------PDAEENIA---KLQALVDASAQR 416
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1191 LKKLEtrevvdlSKYKATKSDLETQISSLNEKLANLNRKYEEVCEEV--LHAKKKEISAK-DEKELLHfsieQEIKDQKE 1267
Cdd:pfam05667 417 LVELA-------GQWEKHRVPLIEEYRALKEAKSNKEDESQRKLEEIkeLREKIKEVAEEaKQKEELY----KQLVAEYE 485
|
330 340 350 360
....*....|....*....|....*....|....*....|....*...
gi 530406156 1268 RCDKSlTTITELQRRIQESAKQIEAKDNKITELLNDVERLKQALNGLS 1315
Cdd:pfam05667 486 RLPKD-VSRSAYTRRILEIVKNIKKQKEEITKILSDTKSLQKEINSLT 532
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1046-1226 |
2.17e-04 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 44.53 E-value: 2.17e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1046 KQLKDLSQKYTE---VKNVKEKLVEENAKQTSEILAVQNLLQKQHVPLEQVEALKKSLNGTIENLKEELKSmqrcYEKEQ 1122
Cdd:COG1579 7 RALLDLQELDSEldrLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKK----YEEQL 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1123 QTVTKLHQLlenqknssvplaehlqikEAFEKEVGIIKASLREKEEESQNKMEEVSKLQSEVQNTKQALKKLETREVVDL 1202
Cdd:COG1579 83 GNVRNNKEY------------------EALQKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKK 144
|
170 180
....*....|....*....|....
gi 530406156 1203 SKYKATKSDLETQISSLNEKLANL 1226
Cdd:COG1579 145 AELDEELAELEAELEELEAEREEL 168
|
|
| Ank_4 |
pfam13637 |
Ankyrin repeats (many copies); |
88-140 |
2.22e-04 |
|
Ankyrin repeats (many copies);
Pssm-ID: 372654 [Multi-domain] Cd Length: 54 Bit Score: 40.34 E-value: 2.22e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 530406156 88 GRNALHLAAKYGHALCLQKLLQYNCPTEHADLQGRTALHDAAMADCPSSIQLL 140
Cdd:pfam13637 1 ELTALHAAAASGHLELLRLLLEKGADINAVDGNGETALHFAASNGNVEVLKLL 53
|
|
| PTZ00440 |
PTZ00440 |
reticulocyte binding protein 2-like protein; Provisional |
802-1307 |
2.34e-04 |
|
reticulocyte binding protein 2-like protein; Provisional
Pssm-ID: 240419 [Multi-domain] Cd Length: 2722 Bit Score: 45.98 E-value: 2.34e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 802 QLSELKKKCGEDQEKIHALTSENTNLKKMMSNqyVPVKTHEEVkMTLNDTLAKTNRELLDVKKKFEDINQEFVKIKDKNE 881
Cdd:PTZ00440 985 KLDKEKDEWEHFKSEIDKLNVNYNILNKKIDD--LIKKQHDDI-IELIDKLIKEKGKEIEEKVDQYISLLEKMKTKLSSF 1061
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 882 ILKRNLENTQNQIKAEYISLAEHeaKMSSLsqsMRKVQDSNAEILANYRKGQEEIVTLHAEIKAQ-------KKELDTIQ 954
Cdd:PTZ00440 1062 HFNIDIKKYKNPKIKEEIKLLEE--KVEAL---LKKIDENKNKLIEIKNKSHEHVVNADKEKNKQtehynkkKKSLEKIY 1136
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 955 ECIKVKYAPIVSFEECERKFKATEK-----------ELKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDlrdk 1023
Cdd:PTZ00440 1137 KQMEKTLKELENMNLEDITLNEVNEieieyerilidHIVEQINNEAKKSKTIMEEIESYKKDIDQVKKNMSKERND---- 1212
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1024 tvlieksHEMERALSRKTDELNKQLKDLSQKYTEVKNVKEKlvEENAKQTSEILAVQNLLqkqHVPLEQVealkKSLNGT 1103
Cdd:PTZ00440 1213 -------HLTTFEYNAYYDKATASYENIEELTTEAKGLKGE--ANRSTNVDELKEIKLQV---FSYLQQV----IKENNK 1276
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1104 IENLKEELKSMqrcyeKEQQTVTKLHQLLENQKNSSVPLAEH-LQIKEAFEKEVGIIKASLR--EKEEESQNKM---EEV 1177
Cdd:PTZ00440 1277 MENALHEIKNM-----YEFLISIDSEKILKEILNSTKKAEEFsNDAKKELEKTDNLIKQVEAkiEQAKEHKNKIygsLED 1351
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1178 SKLQSEVQNTKQALKKL-ETREVVD--LSKYKATKSDLETQISSLNEKLANLNrkyeevceevLHAKKKEISAKDEKEll 1254
Cdd:PTZ00440 1352 KQIDDEIKKIEQIKEEIsNKRKEINkyLSNIKSNKEKCDLHVRNASRGKDKID----------FLNKHEAIEPSNSKE-- 1419
|
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|...
gi 530406156 1255 hFSIeQEIKDQKERCDKSLTTITELQRRIQESAKQIEAKDNKITELLNDVERL 1307
Cdd:PTZ00440 1420 -VNI-IKITDNINKCKQYSNEAMETENKADENNDSIIKYEKEITNILNNSSIL 1470
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
967-1330 |
2.61e-04 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 45.73 E-value: 2.61e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 967 FEECERKFKATEKELKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTDELNK 1046
Cdd:pfam02463 165 SRLKRKKKEALKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQE 244
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1047 QLKDLSQKYTEVKNVKEKLVEENAKQTSEILAVQNLLQKQHVPLEQVEALKKSLNGTIENLKEELKSMQRCYEKEQQTVT 1126
Cdd:pfam02463 245 LLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEKK 324
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1127 KLHQLLENQKNSSVPLaehlqikEAFEKEVGIIKASLREKEEESQNKMEEVSKLQSEVQNTKQALKKLETREVVDLSKYK 1206
Cdd:pfam02463 325 KAEKELKKEKEEIEEL-------EKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEEL 397
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1207 ATKSDLETQISSLNEklanlNRKYEEVCEEVLHAKKKEISAKDEKELLHFSIEQEIKDQKERCDKSLTTITELQRRIQES 1286
Cdd:pfam02463 398 ELKSEEEKEAQLLLE-----LARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSED 472
|
330 340 350 360
....*....|....*....|....*....|....*....|....
gi 530406156 1287 AKQIEAKDNKITELLNDVERLKQALNGLSQLTYTSGNPTKRQSQ 1330
Cdd:pfam02463 473 LLKETQLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALI 516
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
471-672 |
2.69e-04 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 45.53 E-value: 2.69e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 471 LEDALKDVQKRMYESEGKVKQMQTHFLALKEHLTSEAASGNHRLTE------ELKDQLKDLKVKYEGASAEVGKLRnQIK 544
Cdd:COG4717 47 LLERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEYAElqeeleELEEELEELEAELEELREELEKLE-KLL 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 545 QNEMIVEEFKRDEGKLIEENKRLQkELSMCEMEREKKGRKVTEMEGQAKELSAKLAlsipaEKFENMKSSLSNEVNEKAK 624
Cdd:COG4717 126 QLLPLYQELEALEAELAELPERLE-ELEERLEELRELEEELEELEAELAELQEELE-----ELLEQLSLATEEELQDLAE 199
|
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 530406156 625 KLVEMEREHEKSLSEIRQLKRELENVKAKLAQHVKPEEHEQVKSRLEQ 672
Cdd:COG4717 200 ELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERLKE 247
|
|
| PHA02874 |
PHA02874 |
ankyrin repeat protein; Provisional |
123-247 |
2.74e-04 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165205 [Multi-domain] Cd Length: 434 Bit Score: 44.95 E-value: 2.74e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 123 TALHDAAMADCPSSIQLLCDHGASVNAKDVDGRTPLVLATQMSRPTICQLLIDRGAD----------------------- 179
Cdd:PHA02874 37 TPLIDAIRSGDAKIVELFIKHGADINHINTKIPHPLLTAIKIGAHDIIKLLIDNGVDtsilpipciekdmiktildcgid 116
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 180 VNSRDKQNRTALMLGCEYGCRDAVEVLIKNGADISLLDALGHDSSYYARIGDNLDILTLL--KTASENTN 247
Cdd:PHA02874 117 VNIKDAELKTFLHYAIKKGDLESIKMLFEYGADVNIEDDNGCYPIHIAIKHNFFDIIKLLleKGAYANVK 186
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
465-642 |
2.84e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 45.68 E-value: 2.84e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 465 DEQIKQLEDALKDVQKRMYESEGKVKQMQTHFLALKEHLTSEAASGNHRLTE----ELKDQLKDLkvkyEGASAEVGKLR 540
Cdd:COG4913 616 EAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAEreiaELEAELERL----DASSDDLAALE 691
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 541 NQIKQNEMIVEEFKRDEGKLIEENKRLQKELSMCEMEREKKGRKVTEMEGQAK-----ELSAKLALSIPAEKFENMKSSL 615
Cdd:COG4913 692 EQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARlelraLLEERFAAALGDAVERELRENL 771
|
170 180
....*....|....*....|....*..
gi 530406156 616 SNEVNEKAKKLVEMEREHEKSLSEIRQ 642
Cdd:COG4913 772 EERIDALRARLNRAEEELERAMRAFNR 798
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1035-1296 |
2.99e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 44.75 E-value: 2.99e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1035 RALSRKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQTSEILAVQNLLQKQHVPLEQVEALKKSLNGTIENLKEELKSM 1114
Cdd:COG4942 23 AEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQ 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1115 QRCYEKeqqtvtklhQLLENQKNSSVPLAEHLQIKEAFEKEVGIIKAsLREKEEESQNKMEEVSKLQSEVQNTKQALKkl 1194
Cdd:COG4942 103 KEELAE---------LLRALYRLGRQPPLALLLSPEDFLDAVRRLQY-LKYLAPARREQAEELRADLAELAALRAELE-- 170
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1195 etrevvdlskykATKSDLETQISSLNEKLANLNrkyeevceevlhakkkeiSAKDEKELLHFSIEQEIKDQKERCDKSLT 1274
Cdd:COG4942 171 ------------AERAELEALLAELEEERAALE------------------ALKAERQKLLARLEKELAELAAELAELQQ 220
|
250 260
....*....|....*....|..
gi 530406156 1275 TITELQRRIQESAKQIEAKDNK 1296
Cdd:COG4942 221 EAEELEALIARLEAEAAAAAER 242
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1026-1314 |
3.07e-04 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 45.44 E-value: 3.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1026 LIEKSHEMERALSRKT--DELNKQLKDLSQKYTEVKNVKEKLvEENAKQTSEIlavQNLLQKQHVPLEQVEALKKSLNGT 1103
Cdd:PRK03918 140 ILESDESREKVVRQILglDDYENAYKNLGEVIKEIKRRIERL-EKFIKRTENI---EELIKEKEKELEEVLREINEISSE 215
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1104 IENLKEELKSMqrcyEKEQQTVTKLHQLLENQKNSsvplaehlqiKEAFEKEVGIIKASLREKEEESQNKMEEVSKLQSE 1183
Cdd:PRK03918 216 LPELREELEKL----EKEVKELEELKEEIEELEKE----------LESLEGSKRKLEEKIRELEERIEELKKEIEELEEK 281
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1184 VQNTKQ---------ALKKLETREVVDLSKYKATKSDLETQISSLNEKLANLNRKYEEVCEevLHAKKKEIS---AKDEK 1251
Cdd:PRK03918 282 VKELKElkekaeeyiKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEE--LKKKLKELEkrlEELEE 359
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 530406156 1252 ELLHFSIEQEIKDQKERCDKSLT--TITELQRRIQESAK---QIEAKDNKIT----ELLNDVERLKQALNGL 1314
Cdd:PRK03918 360 RHELYEEAKAKKEELERLKKRLTglTPEKLEKELEELEKakeEIEEEISKITarigELKKEIKELKKAIEEL 431
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1157-1388 |
3.16e-04 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 45.31 E-value: 3.16e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1157 GIIKasLREKEEESQNKMEEVS-------------------------------KLQSEVQNTKQALKKLETREV-VDLSK 1204
Cdd:COG1196 166 GISK--YKERKEEAERKLEATEenlerledilgelerqleplerqaekaeryrELKEELKELEAELLLLKLRELeAELEE 243
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1205 YKATKSDLETQISSLNEKLANLNRKYEEVcEEVLHAKKKEISAKDEKELLHFS----IEQEIKDQKERCDKSLTTITELQ 1280
Cdd:COG1196 244 LEAELEELEAELEELEAELAELEAELEEL-RLELEELELELEEAQAEEYELLAelarLEQDIARLEERRRELEERLEELE 322
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1281 RRIQESAKQIEAKDNKITELLNDVERLKQALNGLSQLTYTSGNPTKRQSQLIDTLQHQVKSLEQQLADADRQhQEVIAIY 1360
Cdd:COG1196 323 EELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRA-AAELAAQ 401
|
250 260
....*....|....*....|....*...
gi 530406156 1361 RTHLLSAAQGHMDEdvQEALLQIIQMRQ 1388
Cdd:COG1196 402 LEELEEAEEALLER--LERLEEELEELE 427
|
|
| TRPV5-6 |
cd22192 |
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 5 and 6; TRPV5 and ... |
25-172 |
3.22e-04 |
|
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 5 and 6; TRPV5 and TRPV6 (TRPV5/6) are two homologous members within the vanilloid subfamily of the transient receptor potential (TRP) family. TRPV5 and TRPV6 show only 30-40% homology with other members of the TRP family and have unique properties that differentiates them from other TRP channels. They mediate calcium uptake in epithelia and their expression is dramatically increased in numerous types of cancer. The structure of TRPV5/6 shows the typical topology features of all TRP family members, such as six transmembrane regions, a short hydrophobic stretch between transmembrane segments 5 and 6, which is predicted to form the Ca2+ pore, and large intracellular N- and C-terminal domains. The N-terminal domain of TRPV5/6 contains three ankyrin repeats. This structural element is present in several proteins and plays a role in protein-protein interactions. The N- and C-terminal tails of TRPV5/6 each contain an internal PDZ motif which can function as part of a molecular scaffold via interaction with PDZ-domain containing proteins. A major difference between the properties of TRPV5 and TRPV6 is in their tissue distribution: TRPV5 is predominantly expressed in the distal convoluted tubules (DCT) and connecting tubules (CNT) of the kidney, with limited expression in extrarenal tissues. In contrast, TRPV6 has a broader expression pattern such as expression in the intestine, kidney, placenta, epididymis, exocrine tissues, and a few other tissues.
Pssm-ID: 411976 [Multi-domain] Cd Length: 609 Bit Score: 45.00 E-value: 3.22e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 25 LMKAAERGDVEKVTSILAKKGVNPGKLDVEGRSVFHVVTSKGNLEcLNAILIHGV------DITTSDTAGRNALHLAAKY 98
Cdd:cd22192 21 LLLAAKENDVQAIKKLLKCPSCDLFQRGALGETALHVAALYDNLE-AAVVLMEAApelvnePMTSDLYQGETALHIAVVN 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 99 GHALCLQKLLQYNcptehADLQ-----------GRTAL-----HDAAMADCPSS---IQLLCDHGASVNAKDVDGRTPL- 158
Cdd:cd22192 100 QNLNLVRELIARG-----ADVVspratgtffrpGPKNLiyygeHPLSFAACVGNeeiVRLLIEHGADIRAQDSLGNTVLh 174
|
170
....*....|....
gi 530406156 159 VLATQMSRPTICQL 172
Cdd:cd22192 175 ILVLQPNKTFACQM 188
|
|
| Ank_5 |
pfam13857 |
Ankyrin repeats (many copies); |
107-161 |
3.30e-04 |
|
Ankyrin repeats (many copies);
Pssm-ID: 433530 [Multi-domain] Cd Length: 56 Bit Score: 40.02 E-value: 3.30e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*.
gi 530406156 107 LLQY-NCPTEHADLQGRTALHDAAMADCPSSIQLLCDHGASVNAKDVDGRTPLVLA 161
Cdd:pfam13857 1 LLEHgPIDLNRLDGEGYTPLHVAAKYGALEIVRVLLAYGVDLNLKDEEGLTALDLA 56
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
975-1190 |
3.73e-04 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 44.44 E-value: 3.73e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 975 KATEKELKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLRdktvliekshEMERALSRKTDELNKQLKDLSQK 1054
Cdd:COG3883 29 QAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIA----------EAEAEIEERREELGERARALYRS 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1055 YTEVkNVKEKLVeeNAKQTSEIL----AVQNLLQKQHVPLEQVEALKKSLngtiENLKEELKSMQRCYEKEQQTVTKLHQ 1130
Cdd:COG3883 99 GGSV-SYLDVLL--GSESFSDFLdrlsALSKIADADADLLEELKADKAEL----EAKKAELEAKLAELEALKAELEAAKA 171
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1131 LLENQKnssvplAEHLQIKEAFEKEVGIIKASLREKEEESQNKMEEVSKLQSEVQNTKQA 1190
Cdd:COG3883 172 ELEAQQ------AEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAA 225
|
|
| PRK11281 |
PRK11281 |
mechanosensitive channel MscK; |
737-1100 |
3.94e-04 |
|
mechanosensitive channel MscK;
Pssm-ID: 236892 [Multi-domain] Cd Length: 1113 Bit Score: 44.90 E-value: 3.94e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 737 IDDLN-RKLLDVTQKYTEKKLEMEKLLLENdslskdvsrletvfvppekhekeIIALKSNIVELKKQLSELKKKCGEDQE 815
Cdd:PRK11281 45 LDALNkQKLLEAEDKLVQQDLEQTLALLDK-----------------------IDRQKEETEQLKQQLAQAPAKLRQAQA 101
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 816 KIHALTSENTN-LKKMMSNQYVPvktheevkmTLNDTLAKTNRELLDVKKKFEDINQEFVKikdkneiLKRNLENTQNQI 894
Cdd:PRK11281 102 ELEALKDDNDEeTRETLSTLSLR---------QLESRLAQTLDQLQNAQNDLAEYNSQLVS-------LQTQPERAQAAL 165
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 895 KAEYISLAEHEAKMSSLSQSMRKVQDSNAEILanyrkgQEEIVTLHAEIKAQKKEL---DTIQECIKVKYapivsfEECE 971
Cdd:PRK11281 166 YANSQRLQQIRNLLKGGKVGGKALRPSQRVLL------QAEQALLNAQNDLQRKSLegnTQLQDLLQKQR------DYLT 233
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 972 RKFKATEKELKD----------QLSEQTQKYSVSEEEVKKnKQENDKLKKEiftlqkdlrdktvlIEKSHEMERALSRKT 1041
Cdd:PRK11281 234 ARIQRLEHQLQLlqeainskrlTLSEKTVQEAQSQDEAAR-IQANPLVAQE--------------LEINLQLSQRLLKAT 298
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*....
gi 530406156 1042 DELNkqlkDLSQKYTEVKNVKEklveenakqtseilavqNLLQKQHVPLEQVEALKKSL 1100
Cdd:PRK11281 299 EKLN----TLTQQNLRVKNWLD-----------------RLTQSERNIKEQISVLKGSL 336
|
|
| PHA02798 |
PHA02798 |
ankyrin-like protein; Provisional |
137-214 |
4.24e-04 |
|
ankyrin-like protein; Provisional
Pssm-ID: 222931 [Multi-domain] Cd Length: 489 Bit Score: 44.44 E-value: 4.24e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 137 IQLLCDHGASVNAKDVDGRTPLVlaTQMSRPTICQL-----LIDRGADVNSRDKQNRTALMLGCEYGCR---DAVEVLIK 208
Cdd:PHA02798 92 VKILIENGADINKKNSDGETPLY--CLLSNGYINNLeillfMIENGADTTLLDKDGFTMLQVYLQSNHHidiEIIKLLLE 169
|
....*.
gi 530406156 209 NGADIS 214
Cdd:PHA02798 170 KGVDIN 175
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
553-817 |
4.39e-04 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 44.53 E-value: 4.39e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 553 FKRDEGKLIEEnkrLQKE-----LSMCEMEREKKGRKVTEMEGQAKELSAKL------ALSIPAEKFENMKSSLSNEVNE 621
Cdd:PRK05771 14 LKSYKDEVLEA---LHELgvvhiEDLKEELSNERLRKLRSLLTKLSEALDKLrsylpkLNPLREEKKKVSVKSLEELIKD 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 622 KAKKLVEMEREHEKSLSEIRQLKRELENVKAKLAQhVKP-------EEHEQVKSRLEQKSGELGKKITELTLKNQTLQKE 694
Cdd:PRK05771 91 VEEELEKIEKEIKELEEEISELENEIKELEQEIER-LEPwgnfdldLSLLLGFKYVSVFVGTVPEDKLEELKLESDVENV 169
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 695 IEKVYLDNKL------LKEQAHNLTIEMKNH-YVPLKVSE------------DMKKSHDAIIDDLNRKLLDVTQKYTEKK 755
Cdd:PRK05771 170 EYISTDKGYVyvvvvvLKELSDEVEEELKKLgFERLELEEegtpselireikEELEEIEKERESLLEELKELAKKYLEEL 249
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 530406156 756 LEMEKLLL----ENDSLSKDVSRLETVFVP---PEKHEKEIIALKSNIVELKKQLSELKKKcgEDQEKI 817
Cdd:PRK05771 250 LALYEYLEieleRAEALSKFLKTDKTFAIEgwvPEDRVKKLKELIDKATGGSAYVEFVEPD--EEEEEV 316
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
971-1137 |
4.62e-04 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 44.53 E-value: 4.62e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 971 ERKFKATEKELKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKT--------- 1041
Cdd:PRK05771 74 EEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERLEPWGNFDLDLSLLLGFKYvsvfvgtvp 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1042 -DELNKQLKDLSQKYTE-VKNVKEKLV------EENAKQTSEILAvQNLLQKQHVP--------LEQVEALKKSLNGTIE 1105
Cdd:PRK05771 154 eDKLEELKLESDVENVEyISTDKGYVYvvvvvlKELSDEVEEELK-KLGFERLELEeegtpselIREIKEELEEIEKERE 232
|
170 180 190
....*....|....*....|....*....|..
gi 530406156 1106 NLKEELKSMQRCYEKEQQTVtklHQLLENQKN 1137
Cdd:PRK05771 233 SLLEELKELAKKYLEELLAL---YEYLEIELE 261
|
|
| 46 |
PHA02562 |
endonuclease subunit; Provisional |
584-809 |
4.88e-04 |
|
endonuclease subunit; Provisional
Pssm-ID: 222878 [Multi-domain] Cd Length: 562 Bit Score: 44.62 E-value: 4.88e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 584 KVTEMEGQAKELSAKLALSIPA----EKFENMKSSLSNEVNEKAKKLVEMEREhekslsEIRQLKRELENVKAKLAQHVK 659
Cdd:PHA02562 175 KIRELNQQIQTLDMKIDHIQQQiktyNKNIEEQRKKNGENIARKQNKYDELVE------EAKTIKAEIEELTDELLNLVM 248
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 660 PEEheqvksRLEQKSGELGKKITELTLKNQTLQKEIeKVYLDNKL-----------------LKEQAHNLTI---EMKNH 719
Cdd:PHA02562 249 DIE------DPSAALNKLNTAAAKIKSKIEQFQKVI-KMYEKGGVcptctqqisegpdritkIKDKLKELQHsleKLDTA 321
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 720 YVPLKVSEDMkkshdaiIDDLNRKLLDVTQKYTEKKLEMEKLLLENDSLSKDVSRLETVFVppeKHEKEIIALKSNIVEL 799
Cdd:PHA02562 322 IDELEEIMDE-------FNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFV---DNAEELAKLQDELDKI 391
|
250
....*....|
gi 530406156 800 KKQLSELKKK 809
Cdd:PHA02562 392 VKTKSELVKE 401
|
|
| Ank |
pfam00023 |
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ... |
120-151 |
5.24e-04 |
|
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin- actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity.
Pssm-ID: 459634 [Multi-domain] Cd Length: 34 Bit Score: 38.81 E-value: 5.24e-04
10 20 30
....*....|....*....|....*....|...
gi 530406156 120 QGRTALHDAA-MADCPSSIQLLCDHGASVNAKD 151
Cdd:pfam00023 1 DGNTPLHLAAgRRGNLEIVKLLLSKGADVNARD 33
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
503-712 |
5.39e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 43.98 E-value: 5.39e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 503 LTSEAASGNHRLTEELKDQLKDLKVKYEGASAEVGKLRNQIKQNEMIVEEFKRDEGKLIEENKRLQKELSMCEMEREKKG 582
Cdd:COG4942 10 LLALAAAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELE 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 583 RKVTEMEGQAKELSAKLALSIPA----EKFENMKSSLSNEVNEKAKKLVEM----EREHEKSLSEIRQLKRELENVKAKL 654
Cdd:COG4942 90 KEIAELRAELEAQKEELAELLRAlyrlGRQPPLALLLSPEDFLDAVRRLQYlkylAPARREQAEELRADLAELAALRAEL 169
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*...
gi 530406156 655 AQhvKPEEHEQVKSRLEQKSGELGKKITELTLKNQTLQKEIEKVYLDNKLLKEQAHNL 712
Cdd:COG4942 170 EA--ERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEEL 225
|
|
| PLN02939 |
PLN02939 |
transferase, transferring glycosyl groups |
238-579 |
5.40e-04 |
|
transferase, transferring glycosyl groups
Pssm-ID: 215507 [Multi-domain] Cd Length: 977 Bit Score: 44.51 E-value: 5.40e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 238 LLKTASENTNKGRELWKKGPSLQQRNlTHMQDEVNVKSHQR-EHQNIQDLEIEN-------------EDL---------- 293
Cdd:PLN02939 65 LQSNTDENGQLENTSLRTVMELPQKS-TSSDDDHNRASMQRdEAIAAIDNEQQTnskdgeqlsdfqlEDLvgmiqnaekn 143
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 294 -----KERLRKIQQEQRILLDK------VNGLQLQLNEREKlKSLLAAKEKQHEESLRtiEALKNRFKYFESDHLGSGSH 362
Cdd:PLN02939 144 illlnQARLQALEDLEKILTEKealqgkINILEMRLSETDA-RIKLAAQEKIHVEILE--EQLEKLRNELLIRGATEGLC 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 363 FSN--------RKEDMLLKQGQMYM----ADSQCTSPGIPAHMQSRSMLRPLELSLPSQTSYSENEILKkeLEAMRTFCE 430
Cdd:PLN02939 221 VHSlskeldvlKEENMLLKDDIQFLkaelIEVAETEERVFKLEKERSLLDASLRELESKFIVAQEDVSK--LSPLQYDCW 298
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 431 SAKQDRLKLQNELAHKVAECKALALECERvkeDSDEQIKQLEDALKdvqkrmyesEGKVKQMQTHFLALkehltseaasg 510
Cdd:PLN02939 299 WEKVENLQDLLDRATNQVEKAALVLDQNQ---DLRDKVDKLEASLK---------EANVSKFSSYKVEL----------- 355
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 530406156 511 nhrlteeLKDQLKDLKVKYEGASAEVgklRNQIKQNEMIVEEFKRDEGKLIEENKRLQKELSMCEMERE 579
Cdd:PLN02939 356 -------LQQKLKLLEERLQASDHEI---HSYIQLYQESIKEFQDTLSKLKEESKKRSLEHPADDMPSE 414
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
1104-1315 |
6.07e-04 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 44.15 E-value: 6.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1104 IENLKEELKSMQ-RCYEKEQQTVTKLHQLLENQKNssvplAEHLQIKEAFEKEVGIIKASLREKEEESQNKMEEVSKLQS 1182
Cdd:PRK05771 33 IEDLKEELSNERlRKLRSLLTKLSEALDKLRSYLP-----KLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEE 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1183 EVQNTKQALKKLEtREVVDLSKYKATKSDLETQISSLN----------EKLANLNRKYEEVCEEVLHAKK---------- 1242
Cdd:PRK05771 108 EISELENEIKELE-QEIERLEPWGNFDLDLSLLLGFKYvsvfvgtvpeDKLEELKLESDVENVEYISTDKgyvyvvvvvl 186
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1243 KEISAKDEKELLHFS-IEQEIKDQK---ERCDKSLTTITELQRRIQESAKQIEAKDNKITE--------LLNDVERLKQA 1310
Cdd:PRK05771 187 KELSDEVEEELKKLGfERLELEEEGtpsELIREIKEELEEIEKERESLLEELKELAKKYLEellalyeyLEIELERAEAL 266
|
....*
gi 530406156 1311 LNGLS 1315
Cdd:PRK05771 267 SKFLK 271
|
|
| CagA_N |
pfam18971 |
CagA protein; The Helicobacter pylori type IV secretion effector CagA is a major bacterial ... |
1004-1289 |
6.91e-04 |
|
CagA protein; The Helicobacter pylori type IV secretion effector CagA is a major bacterial virulence determinant and critical for gastric carcinogenesis. X-ray crystallographic analysis of the N-terminal CagA fragment (residues 1-876) revealed that the region has a structure comprised of three discrete domains. Domain I constitutes a mobile CagA N terminus, while Domain II tethers CagA to the plasma membrane by interacting with membrane phosphatidylserine. Domain III interacts intramolecularly with the intrinsically disordered C-terminal region, and this interaction potentiates the pathogenic scaffold/hub function of CagA.
Pssm-ID: 408741 [Multi-domain] Cd Length: 876 Bit Score: 44.38 E-value: 6.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1004 QENDKLKKEIFTLQKDLRDKTVLIEKS-----------------HEMERALsRKTDELNKQLKDLSQKYTEVKNVKEKLV 1066
Cdd:pfam18971 573 QEANKLIKDFLSSNKELAGKALNFNKAvaeakstgnydevkkaqKDLEKSL-RKREHLEKEVEKKLESKSGNKNKMEAKA 651
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1067 EENAkQTSEILAVQNLLQKQHVP----LEQVEALKKSLNGTIENLKEELKSMQRCYEKeqqtvtklhqlLENQKNSSVPL 1142
Cdd:pfam18971 652 QANS-QKDEIFALINKEANRDARaiayTQNLKGIKRELSDKLEKISKDLKDFSKSFDE-----------FKNGKNKDFSK 719
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1143 AEH-LQIKEAFEKEVGIikaslrekeeesqnKMEEVSKlqseVQNTKQALKKLETREVVDLSKYKATKSDLETQISS--L 1219
Cdd:pfam18971 720 AEEtLKALKGSVKDLGI--------------NPEWISK----VENLNAALNEFKNGKNKDFSKVTQAKSDLENSVKDviI 781
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 530406156 1220 NEKLANlnrKYEEVCEEVLHAKKKEISAKDEK---ELLHFSIEQEIKDQKERCDKSLTTITELQRRIQESAKQ 1289
Cdd:pfam18971 782 NQKVTD---KVDNLNQAVSVAKAMGDFSRVEQvlaDLKNFSKEQLAQQAQKNEDFNTGKNSELYQSVKNSVNK 851
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1168-1312 |
7.25e-04 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 42.99 E-value: 7.25e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1168 EESQNKMEEVSKLQSEVQNTKQALKKLEtREVVDLSK----YKATKSDLETQISSLNEKLANLNRKYEEVCEEVLHAKKK 1243
Cdd:COG1579 3 PEDLRALLDLQELDSELDRLEHRLKELP-AELAELEDelaaLEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQ 81
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 530406156 1244 EISAKDEKELLhfSIEQEIKDQKERCDKSLTTITELQRRIQESAKQIEAKDNKITELLNDVERLKQALN 1312
Cdd:COG1579 82 LGNVRNNKEYE--ALQKEIESLKRRISDLEDEILELMERIEELEEELAELEAELAELEAELEEKKAELD 148
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
904-1293 |
7.56e-04 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 44.36 E-value: 7.56e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 904 HEAKMSSLSQ--SMRKVQDSNAEILANYRKGQEEIVTLHAEIK----AQKKELDTIQECIKVKYApiVSFEECERKFKAT 977
Cdd:PTZ00121 1056 HEGKAEAKAHvgQDEGLKPSYKDFDFDAKEDNRADEATEEAFGkaeeAKKTETGKAEEARKAEEA--KKKAEDARKAEEA 1133
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 978 EKELKDQLSEQTQKySVSEEEVKKNKQENDKLKKEIFTLQKDLRdKTVLIEKSHEMERALS-RKTDELNKqlKDLSQKYT 1056
Cdd:PTZ00121 1134 RKAEDARKAEEARK-AEDAKRVEIARKAEDARKAEEARKAEDAK-KAEAARKAEEVRKAEElRKAEDARK--AEAARKAE 1209
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1057 EVKNVKEKLVEENAKQTSEILAVQNLLQKQHVP--LEQVEALKKSLNGTIENLKEELKSMQRCYEKEQQTVTKLHQLLEN 1134
Cdd:PTZ00121 1210 EERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAkkAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEK 1289
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1135 QKNSSVPLAEHLQIKEAFEKEVGIIKAS--LREKEEESQNKMEEVSKLQSEVQNTKQALKKLETREVVDLSKYKATKSDL 1212
Cdd:PTZ00121 1290 KKADEAKKAEEKKKADEAKKKAEEAKKAdeAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAA 1369
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1213 ETQISSLNEKLANLNRKYEEVCEEVLHAKKKEISAKDEKELLHFSIEQEIKDQKERCDKSLTTITELQRRIQESAKQIEA 1292
Cdd:PTZ00121 1370 EKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEA 1449
|
.
gi 530406156 1293 K 1293
Cdd:PTZ00121 1450 K 1450
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1160-1355 |
7.85e-04 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 43.67 E-value: 7.85e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1160 KASLREKEEESQNKMEEVSKLQSEVQNTKQALKKLEtrevVDLSKYKATKSDLETQISSLNEKLAnlnRKYEEVCEEVLH 1239
Cdd:COG3883 22 QKELSELQAELEAAQAELDALQAELEELNEEYNELQ----AELEALQAEIDKLQAEIAEAEAEIE---ERREELGERARA 94
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1240 AKKKE---------ISAKDEKELLH--FSIEQEIKDQKERcdksLTTITELQRRIQESAKQIEAKDNKITELLNDVERLK 1308
Cdd:COG3883 95 LYRSGgsvsyldvlLGSESFSDFLDrlSALSKIADADADL----LEELKADKAELEAKKAELEAKLAELEALKAELEAAK 170
|
170 180 190 200
....*....|....*....|....*....|....*....|....*..
gi 530406156 1309 QALNGLsqltytsgnpTKRQSQLIDTLQHQVKSLEQQLADADRQHQE 1355
Cdd:COG3883 171 AELEAQ----------QAEQEALLAQLSAEEAAAEAQLAELEAELAA 207
|
|
| Ank_2 |
pfam12796 |
Ankyrin repeats (3 copies); |
16-85 |
8.42e-04 |
|
Ankyrin repeats (3 copies);
Pssm-ID: 463710 [Multi-domain] Cd Length: 91 Bit Score: 39.71 E-value: 8.42e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 530406156 16 ADWNKYDDR----LMKAAERGDVEKVTSILAKKGVNpgkLDVEGRSVFHVVTSKGNLECLNAILIHGVDITTSD 85
Cdd:pfam12796 21 ADANLQDKNgrtaLHLAAKNGHLEIVKLLLEHADVN---LKDNGRTALHYAARSGHLEIVKLLLEKGADINVKD 91
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1201-1358 |
8.78e-04 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 43.28 E-value: 8.78e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1201 DLSKYKATKSDLETQISSLNEKLANLNRKYEEVCEEVLhakkkeisakdekellhfSIEQEIKDQKERcdkslttITELQ 1280
Cdd:COG3883 17 QIQAKQKELSELQAELEAAQAELDALQAELEELNEEYN------------------ELQAELEALQAE-------IDKLQ 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1281 RRIQESAKQIEAKDNKITELLNDVERLKQALNGLSQLTyTSGNP-------------TKRQSQLIDTLQHQVKSLEQQLA 1347
Cdd:COG3883 72 AEIAEAEAEIEERREELGERARALYRSGGSVSYLDVLL-GSESFsdfldrlsalskiADADADLLEELKADKAELEAKKA 150
|
170
....*....|.
gi 530406156 1348 DADRQHQEVIA 1358
Cdd:COG3883 151 ELEAKLAELEA 161
|
|
| Ank_5 |
pfam13857 |
Ankyrin repeats (many copies); |
142-191 |
8.83e-04 |
|
Ankyrin repeats (many copies);
Pssm-ID: 433530 [Multi-domain] Cd Length: 56 Bit Score: 38.87 E-value: 8.83e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 530406156 142 DHG-ASVNAKDVDGRTPLVLATQMSRPTICQLLIDRGADVNSRDKQNRTAL 191
Cdd:pfam13857 3 EHGpIDLNRLDGEGYTPLHVAAKYGALEIVRVLLAYGVDLNLKDEEGLTAL 53
|
|
| Ank_5 |
pfam13857 |
Ankyrin repeats (many copies); |
79-126 |
9.64e-04 |
|
Ankyrin repeats (many copies);
Pssm-ID: 433530 [Multi-domain] Cd Length: 56 Bit Score: 38.48 E-value: 9.64e-04
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 530406156 79 VDITTSDTAGRNALHLAAKYGHALCLQKLLQYNCPTEHADLQGRTALH 126
Cdd:pfam13857 7 IDLNRLDGEGYTPLHVAAKYGALEIVRVLLAYGVDLNLKDEEGLTALD 54
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
1096-1268 |
1.01e-03 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 43.23 E-value: 1.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1096 LKKSLNGTIENLKEELKSMQRCYEKEQQTVTKLhQLLENQKnssvplaEHLQIKEAFEKEVGIIKASLREKEEESQNKME 1175
Cdd:PRK12704 25 RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKE-ALLEAKE-------EIHKLRNEFEKELRERRNELQKLEKRLLQKEE 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1176 EVSKLQSEVQNTKQALKKLETREVVDLSKYKATKSDLETQISSLNEKL---ANLNRkyEEVCEEVLHAKKKEisAKDEKE 1252
Cdd:PRK12704 97 NLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELeriSGLTA--EEAKEILLEKVEEE--ARHEAA 172
|
170
....*....|....*.
gi 530406156 1253 LLHFSIEQEIKDQKER 1268
Cdd:PRK12704 173 VLIKEIEEEAKEEADK 188
|
|
| PHA02946 |
PHA02946 |
ankyin-like protein; Provisional |
137-214 |
1.06e-03 |
|
ankyin-like protein; Provisional
Pssm-ID: 165256 [Multi-domain] Cd Length: 446 Bit Score: 43.12 E-value: 1.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 137 IQLLCDHGASVNAKDVDGRTPLVLATQMSRPTICQLLIDRGADVNSRDKQNRTAL--MLGCEYGCRDAVEVLIKNGADIS 214
Cdd:PHA02946 55 VEELLHRGYSPNETDDDGNYPLHIASKINNNRIVAMLLTHGADPNACDKQHKTPLyyLSGTDDEVIERINLLVQYGAKIN 134
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
974-1268 |
1.39e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 43.37 E-value: 1.39e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 974 FKATEK--ELKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKdlrdktvlIEKSHEME---RALSRKTDELNKQL 1048
Cdd:COG4913 606 FDNRAKlaALEAELAELEEELAEAEERLEALEAELDALQERREALQR--------LAEYSWDEidvASAEREIAELEAEL 677
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1049 KDLSQKYTEVKNVKEKLVEenakqtseilavqnlLQKQhvpLEQVEALKKSLNGTIENLKEELKSMQRcyekEQQTVTkl 1128
Cdd:COG4913 678 ERLDASSDDLAALEEQLEE---------------LEAE---LEELEEELDELKGEIGRLEKELEQAEE----ELDELQ-- 733
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1129 HQLLENQKNSSVPLAEHLQikEAFEkevgiikaslREKEEESQNKMEEvsKLQSEVQNTKQALKKLETReVVDL-----S 1203
Cdd:COG4913 734 DRLEAAEDLARLELRALLE--ERFA----------AALGDAVERELRE--NLEERIDALRARLNRAEEE-LERAmrafnR 798
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 530406156 1204 KYKATKSDLETQISSLNEKLAnlnrKYEEVCEEVLHAKKKEI------SAKDEKELLHFSIEQEIKDQKER 1268
Cdd:COG4913 799 EWPAETADLDADLESLPEYLA----LLDRLEEDGLPEYEERFkellneNSIEFVADLLSKLRRAIREIKER 865
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
462-695 |
1.56e-03 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 43.11 E-value: 1.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 462 EDSDEQIKQLEDALKDVQKRMYESEGKVKQMqthflalkehltsEAASgnhrlteELKDQLKDLKVKYEGASAEVGKLRN 541
Cdd:PRK02224 471 EEDRERVEELEAELEDLEEEVEEVEERLERA-------------EDLV-------EAEDRIERLEERREDLEELIAERRE 530
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 542 QIKQNEMIVEEFKRDEGKLIEENKRLQKELSMCEMEREKKGRKVTEMEGQAKELSAKLalsipaEKFENMKSSLS----- 616
Cdd:PRK02224 531 TIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERI------ESLERIRTLLAaiada 604
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 617 ----NEVNEKAKKLVEMEREHEKSLSEIRQLKRELEnvkAKLaQHVKPEEHEQVKSRLEQKSGELGKKITELTLKNQTLQ 692
Cdd:PRK02224 605 edeiERLREKREALAELNDERRERLAEKRERKRELE---AEF-DEARIEEAREDKERAEEYLEQVEEKLDELREERDDLQ 680
|
...
gi 530406156 693 KEI 695
Cdd:PRK02224 681 AEI 683
|
|
| Ank_4 |
pfam13637 |
Ankyrin repeats (many copies); |
21-71 |
1.67e-03 |
|
Ankyrin repeats (many copies);
Pssm-ID: 372654 [Multi-domain] Cd Length: 54 Bit Score: 38.02 E-value: 1.67e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 530406156 21 YDDRLMKAAERGDVEkVTSILAKKGVNPGKLDVEGRSVFHVVTSKGNLECL 71
Cdd:pfam13637 1 ELTALHAAAASGHLE-LLRLLLEKGADINAVDGNGETALHFAASNGNVEVL 50
|
|
| COG5022 |
COG5022 |
Myosin heavy chain [General function prediction only]; |
294-772 |
1.75e-03 |
|
Myosin heavy chain [General function prediction only];
Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 43.14 E-value: 1.75e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 294 KERLRKIQQEQRILLDKVNGLQLQLNEREkLKSLLAAKEKqheESLRTIEALKNRFKYFESdhlgsgshfSNRKEDMLLK 373
Cdd:COG5022 762 RRYLQALKRIKKIQVIQHGFRLRRLVDYE-LKWRLFIKLQ---PLLSLLGSRKEYRSYLAC---------IIKLQKTIKR 828
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 374 QGQMYMADSQCTSpgipahMQSRSMLRPLELSLPSQTSYSE-------NEILKKELEAMRTFCEsaKQDRLKLQNELAHK 446
Cdd:COG5022 829 EKKLRETEEVEFS------LKAEVLIQKFGRSLKAKKRFSLlkketiyLQSAQRVELAERQLQE--LKIDVKSISSLKLV 900
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 447 VAECKALALECERvKEDSDEQIK-----QLEDALKDVQKRMYESEGKVKQMQTHFLALKEHltseaasgnhrlteELKDQ 521
Cdd:COG5022 901 NLELESEIIELKK-SLSSDLIENlefktELIARLKKLLNNIDLEEGPSIEYVKLPELNKLH--------------EVESK 965
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 522 LKDLKVKYEGASAEVGKLRNQIKQNEMIVEEFKRDEGKLIEENKRLQKELSMCEMEREKKGRKVTEMEgQAKELSAKLAL 601
Cdd:COG5022 966 LKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVEVAELQSASK-IISSESTELSI 1044
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 602 SIPAEKFENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKR--------ELENVKAKLAQHVKPEEHEQVKSRLEQK 673
Cdd:COG5022 1045 LKPLQKLKGLLLLENNQLQARYKALKLRRENSLLDDKQLYQLEStenllktiNVKDLEVTNRNLVKPANVLQFIVAQMIK 1124
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 674 SGELGKKITELTLKNQTLQKEIEKVYLDNKLLKEQAHNLTIEMKNHYVPLKVSEDMKKSHDAIIDDLNRKLLDVTQKYTE 753
Cdd:COG5022 1125 LNLLQEISKFLSQLVNTLEPVFQKLSVLQLELDGLFWEANLEALPSPPPFAALSEKRLYQSALYDEKSKLSSSEVNDLKN 1204
|
490
....*....|....*....
gi 530406156 754 KKLEMEKLLLENDSLSKDV 772
Cdd:COG5022 1205 ELIALFSKIFSGWPRGDKL 1223
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
971-1121 |
2.40e-03 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 42.07 E-value: 2.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 971 ERKFKATEKELKDQLSE-QTQKYSVSEEEVKKNKQENDKLK-----------KEIFTLQKDLRDKTVLIEKSHEmerALS 1038
Cdd:PRK12704 30 EAKIKEAEEEAKRILEEaKKEAEAIKKEALLEAKEEIHKLRnefekelrerrNELQKLEKRLLQKEENLDRKLE---LLE 106
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1039 RKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQTSEILAVQNLLQKqhvpleqvEALKKSLNGTIENLKEELKSMQRCY 1118
Cdd:PRK12704 107 KREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAE--------EAKEILLEKVEEEARHEAAVLIKEI 178
|
...
gi 530406156 1119 EKE 1121
Cdd:PRK12704 179 EEE 181
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
516-661 |
2.41e-03 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 41.06 E-value: 2.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 516 EELKDQLKDLKVKYEGASAEVGKLRNQIKQNEMIVEEF----KRDEGKLieENKRLQKELSMCEMEREKKGRKVTEMEGQ 591
Cdd:COG1579 34 AELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVeariKKYEEQL--GNVRNNKEYEALQKEIESLKRRISDLEDE 111
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 592 AKELSAKLalsipaEKFENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKRELENVKAKLAQHVKPE 661
Cdd:COG1579 112 ILELMERI------EELEEELAELEAELAELEAELEEKKAELDEELAELEAELEELEAEREELAAKIPPE 175
|
|
| Ank_3 |
pfam13606 |
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ... |
187-214 |
2.46e-03 |
|
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin- actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities.
Pssm-ID: 463933 [Multi-domain] Cd Length: 30 Bit Score: 36.85 E-value: 2.46e-03
10 20
....*....|....*....|....*...
gi 530406156 187 NRTALMLGCEYGCRDAVEVLIKNGADIS 214
Cdd:pfam13606 2 GNTPLHLAARNGRLEIVKLLLENGADIN 29
|
|
| PRK00409 |
PRK00409 |
recombination and DNA strand exchange inhibitor protein; Reviewed |
1026-1168 |
2.51e-03 |
|
recombination and DNA strand exchange inhibitor protein; Reviewed
Pssm-ID: 234750 [Multi-domain] Cd Length: 782 Bit Score: 42.51 E-value: 2.51e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1026 LIEKSHEMERALSRKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKqtseilavqnLLQKQHVPLEQ-VEALKKSLNGTI 1104
Cdd:PRK00409 521 LIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDK----------LLEEAEKEAQQaIKEAKKEADEII 590
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 530406156 1105 ENLkeelksmqRCYEKEQQTVTKLHQLLENQKNssvpLAEHLQIKEAFEKEVGIIKASLREKEE 1168
Cdd:PRK00409 591 KEL--------RQLQKGGYASVKAHELIEARKR----LNKANEKKEKKKKKQKEKQEELKVGDE 642
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
941-1076 |
2.92e-03 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 42.07 E-value: 2.92e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 941 AEIKAQKKELDTIQECIKVKyapivsfEECERKFKATEKELKDQLSEQTQKysvsEEEVKKNKQENDKLKKEIFTLQKDL 1020
Cdd:PRK12704 51 AEAIKKEALLEAKEEIHKLR-------NEFEKELRERRNELQKLEKRLLQK----EENLDRKLELLEKREEELEKKEKEL 119
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 530406156 1021 RDKTVLIEKsheMERALSRKTDELNKQLKDLSQKYTE------VKNVKEKLVEENAKQTSEI 1076
Cdd:PRK12704 120 EQKQQELEK---KEEELEELIEEQLQELERISGLTAEeakeilLEKVEEEARHEAAVLIKEI 178
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
753-1309 |
2.93e-03 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 42.12 E-value: 2.93e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 753 EKKLEMEKLLLE---NDSLSKDVSRLETVFVPPEKHEKEIIALKSNIVELKKQLSELKKK----CGEDQEKIHALTSENT 825
Cdd:pfam10174 203 QKEKENIHLREElhrRNQLQPDPAKTKALQTVIEMKDTKISSLERNIRDLEDEVQMLKTNgllhTEDREEEIKQMEVYKS 282
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 826 NLKKMMSNQyvpvkthEEVKMTLNdtlaKTNRELLDVKKKFEDINQEFVKIKDKNEILKRNL---ENTQNQIKAEYISL- 901
Cdd:pfam10174 283 HSKFMKNKI-------DQLKQELS----KKESELLALQTKLETLTNQNSDCKQHIEVLKESLtakEQRAAILQTEVDALr 351
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 902 AEHEAKMSSLSQSMRKVQDSnaeilanyrkgQEEIVTLHAEIKAQKKELDtiqecikVKyapivsfeecERKFKATEKE- 980
Cdd:pfam10174 352 LRLEEKESFLNKKTKQLQDL-----------TEEKSTLAGEIRDLKDMLD-------VK----------ERKINVLQKKi 403
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 981 --LKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEK-SHEMERALSRKTDELNKQLKDLsqkyte 1057
Cdd:pfam10174 404 enLQEQLRDKDKQLAGLKERVKSLQTDSSNTDTALTTLEEALSEKERIIERlKEQREREDRERLEELESLKKEN------ 477
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1058 vKNVKEKLVeenakqtseilAVQNLLQKQHVPLEQVEALKKSLNGTIENLKEELKSMQRCYEKEQQTVTKLhqllENQKN 1137
Cdd:pfam10174 478 -KDLKEKVS-----------ALQPELTEKESSLIDLKEHASSLASSGLKKDSKLKSLEIAVEQKKEECSKL----ENQLK 541
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1138 SSVPLAEHLQIKEAFEKEVgiikaslREKEEESQNKMEEVSKLQSEVQNTKQALKKLETREVVDLSKYKATKSDLETQIS 1217
Cdd:pfam10174 542 KAHNAEEAVRTNPEINDRI-------RLLEQEVARYKEESGKAQAEVERLLGILREVENEKNDKDKKIAELESLTLRQMK 614
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1218 SLNEKLANLNRKYEEvcEEVLHAKKKEISAKDEKELLHFSIEQEIKDQKERCDKSLTTITELQRRIQESAKQIEAKDNKI 1297
Cdd:pfam10174 615 EQNKKVANIKHGQQE--MKKKGAQLLEEARRREDNLADNSQQLQLEELMGALEKTRQELDATKARLSSTQQSLAEKDGHL 692
|
570
....*....|..
gi 530406156 1298 TELlnDVERLKQ 1309
Cdd:pfam10174 693 TNL--RAERRKQ 702
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
1002-1369 |
3.16e-03 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 42.25 E-value: 3.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1002 NKQENDKLKKEIFTLQKDLRDKTVLIEKSHEMERALSRKTDELNKQLKDLSQKYtevknvkeklveenakqtseilavqn 1081
Cdd:PRK04863 277 HANERRVHLEEALELRRELYTSRRQLAAEQYRLVEMARELAELNEAESDLEQDY-------------------------- 330
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1082 llqkqhvpleqvealkkslngtiENLKEELKSMQ---RCYEKEQQTVTKLHQLLENqknssvpLAEHLQIKEafekEVGI 1158
Cdd:PRK04863 331 -----------------------QAASDHLNLVQtalRQQEKIERYQADLEELEER-------LEEQNEVVE----EADE 376
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1159 IKASLREKEEESQnkmEEVSKLQSEVQNTKQALKKLETRE------VVDLSKYKATKSDLETQISSLNEKLANLNRKYEE 1232
Cdd:PRK04863 377 QQEENEARAEAAE---EEVDELKSQLADYQQALDVQQTRAiqyqqaVQALERAKQLCGLPDLTADNAEDWLEEFQAKEQE 453
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1233 VCEEVLHAKKKeisakdekellhFSIEQEIKDQKERCDKSLTTIT-------------ELQRRIQES---AKQIEAKDNK 1296
Cdd:PRK04863 454 ATEELLSLEQK------------LSVAQAAHSQFEQAYQLVRKIAgevsrseawdvarELLRRLREQrhlAEQLQQLRMR 521
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1297 ITEL------LNDVERLKQALNGLSQLTYTS----GNPTKRQSQLIDTLQHQVKSLEQQLADAdRQHQEVIAIYRTHLLS 1366
Cdd:PRK04863 522 LSELeqrlrqQQRAERLLAEFCKRLGKNLDDedelEQLQEELEARLESLSESVSEARERRMAL-RQQLEQLQARIQRLAA 600
|
...
gi 530406156 1367 AAQ 1369
Cdd:PRK04863 601 RAP 603
|
|
| Ank_4 |
pfam13637 |
Ankyrin repeats (many copies); |
154-207 |
3.33e-03 |
|
Ankyrin repeats (many copies);
Pssm-ID: 372654 [Multi-domain] Cd Length: 54 Bit Score: 36.87 E-value: 3.33e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 530406156 154 GRTPLVLATQMSRPTICQLLIDRGADVNSRDKQNRTALMLGCEYGCRDAVEVLI 207
Cdd:pfam13637 1 ELTALHAAAASGHLELLRLLLEKGADINAVDGNGETALHFAASNGNVEVLKLLL 54
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
413-655 |
3.50e-03 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 41.85 E-value: 3.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 413 SENEILKKELEAMRTFCESAKQDRLKLQNELAHKVAECKALALECERVKEDSDE---QIKQLEDALKDVQKRMYESEGKV 489
Cdd:COG1196 267 AELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEEleeELAELEEELEELEEELEELEEEL 346
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 490 KQMQTHFLALKEHLTS--EAASGNHRLTEELKDQLKDLKVKYEGASAEVGKLRNQIKQNEMIVEEFKRDEGKLIEENKRL 567
Cdd:COG1196 347 EEAEEELEEAEAELAEaeEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEEL 426
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 568 QKELsmcemerekkgrkvtemegqAKELSAKLALSIPAEKFENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKREL 647
Cdd:COG1196 427 EEAL--------------------AELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAALAELLEEL 486
|
....*...
gi 530406156 648 ENVKAKLA 655
Cdd:COG1196 487 AEAAARLL 494
|
|
| PspC_subgroup_1 |
NF033838 |
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... |
413-710 |
3.75e-03 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 41.54 E-value: 3.75e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 413 SENEILKKELEAMRTFCESAKQDRLklqnELAHKVAECKALALECERVKEDSDEQIKQ--LEDALKDVQKRMYESEGKVK 490
Cdd:NF033838 111 SEAELTSKTKKELDAAFEQFKKDTL----EPGKKVAEATKKVEEAEKKAKDQKEEDRRnyPTNTYKTLELEIAESDVEVK 186
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 491 QMQTHFLALKEHLTSEaasgnhrlteelKDQLKDLKVKYEGASAEVGKLRNQIKQNEMIVEEFKRDEGklIEENKRLQKE 570
Cdd:NF033838 187 KAELELVKEEAKEPRD------------EEKIKQAKAKVESKKAEATRLEKIKTDREKAEEEAKRRAD--AKLKEAVEKN 252
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 571 LSMCEMEREKKGRKVTEMEGQA----KELSAKLALSIPAEkfENMKS-SLSNE--VNEKAKKLVEMER------------ 631
Cdd:NF033838 253 VATSEQDKPKRRAKRGVLGEPAtpdkKENDAKSSDSSVGE--ETLPSpSLKPEkkVAEAEKKVEEAKKkakdqkeedrrn 330
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 632 ---------EHEKSLSEIRQLKRELENVKAKLAQHVKPEEHEQVKSRLEQKSGELGKkiteltlknqtlqkeIEKVYLDN 702
Cdd:NF033838 331 yptntyktlELEIAESDVKVKEAELELVKEEAKEPRNEEKIKQAKAKVESKKAEATR---------------LEKIKTDR 395
|
....*...
gi 530406156 703 KLLKEQAH 710
Cdd:NF033838 396 KKAEEEAK 403
|
|
| PRK01156 |
PRK01156 |
chromosome segregation protein; Provisional |
883-1350 |
3.84e-03 |
|
chromosome segregation protein; Provisional
Pssm-ID: 100796 [Multi-domain] Cd Length: 895 Bit Score: 41.81 E-value: 3.84e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 883 LKRNLENTQNQIKAEYISLAEHEAKMSSLsqsMRKVQDSNAEILANYRKGQEEIVTLHaEIKAQKKELDTIQECIKVKYA 962
Cdd:PRK01156 188 LEEKLKSSNLELENIKKQIADDEKSHSIT---LKEIERLSIEYNNAMDDYNNLKSALN-ELSSLEDMKNRYESEIKTAES 263
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 963 PIVSFEECERKFKATEKELKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIFTLQKDLRDKTVLIEKSHEMERALS---- 1038
Cdd:PRK01156 264 DLSMELEKNNYYKELEERHMKIINDPVYKNRNYINDYFKYKNDIENKKQILSNIDAEINKYHAIIKKLSVLQKDYNdyik 343
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1039 --RKTDELNKQLKDLSQ---KYT----EVKNVKEKLVEENAKQTSEILAVQNLLQKQHVPLEQVEALKKSLNGTIENLKE 1109
Cdd:PRK01156 344 kkSRYDDLNNQILELEGyemDYNsylkSIESLKKKIEEYSKNIERMSAFISEILKIQEIDPDAIKKELNEINVKLQDISS 423
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1110 ELKSMQ---RCYEKEQQTVTKLHQLLENQKNSSVpLAEHLqikeAFEKEVGIIKASLREKE--EESQNKME-EVSKLQSE 1183
Cdd:PRK01156 424 KVSSLNqriRALRENLDELSRNMEMLNGQSVCPV-CGTTL----GEEKSNHIINHYNEKKSrlEEKIREIEiEVKDIDEK 498
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1184 VQNTKQALKKLETREVVDLSKYKATKSDLETQISSLNEKLANL---NRKYEEVCEEVLHAKKKEISAKDEKELLHFS--- 1257
Cdd:PRK01156 499 IVDLKKRKEYLESEEINKSINEYNKIESARADLEDIKIKINELkdkHDKYEEIKNRYKSLKLEDLDSKRTSWLNALAvis 578
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1258 -------------IEQEIKDQKERCDKSLTTITELQRRIQESAKQIEAKDNKITELLNDVERLKQALNGLSQLTytsgNP 1324
Cdd:PRK01156 579 lidietnrsrsneIKKQLNDLESRLQEIEIGFPDDKSYIDKSIREIENEANNLNNKYNEIQENKILIEKLRGKI----DN 654
|
490 500
....*....|....*....|....*.
gi 530406156 1325 TKRQSQLIDTLQHQVKSLEQQLADAD 1350
Cdd:PRK01156 655 YKKQIAEIDSIIPDLKEITSRINDIE 680
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
292-650 |
3.87e-03 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 41.85 E-value: 3.87e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 292 DLKERLRKIQQEQRILldKVNGLQLQLNEREKLKSLLAAKEKQHEESLRTIEALKNRFKyfesdhlgsgSHFSNRKEDML 371
Cdd:COG1196 217 ELKEELKELEAELLLL--KLRELEAELEELEAELEELEAELEELEAELAELEAELEELR----------LELEELELELE 284
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 372 LKQGQMYMADSQctspgipahmqsrsmlrplelslpsqtsYSENEILKKELEAMRTFCESAKQDRLKLQNELAHKVAECK 451
Cdd:COG1196 285 EAQAEEYELLAE----------------------------LARLEQDIARLEERRRELEERLEELEEELAELEEELEELE 336
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 452 ALALECERVKEDSDEQIKQLEDALKDVQKRMYESEGKVKQMQTHFLALKEHLTSE--AASGNHRLTEELKDQLKDLKVKY 529
Cdd:COG1196 337 EELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEAlrAAAELAAQLEELEEAEEALLERL 416
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 530 EGASAEVGKLRNQIKQNEMIVEEFKRDEGKLIEENKRLQKELsmcEMEREKKGRKVTEMEGQAKELSAKLALSIPAEKFE 609
Cdd:COG1196 417 ERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEE---EALLELLAELLEEAALLEAALAELLEELAEAAARL 493
|
330 340 350 360
....*....|....*....|....*....|....*....|....*.
gi 530406156 610 NMKSSLSNEVNE-----KAKKLVEMEREHEKSLSEIRQLKRELENV 650
Cdd:COG1196 494 LLLLEAEADYEGflegvKAALLLAGLRGLAGAVAVLIGVEAAYEAA 539
|
|
| trp |
TIGR00870 |
transient-receptor-potential calcium channel protein; The Transient Receptor Potential Ca2+ ... |
17-252 |
4.00e-03 |
|
transient-receptor-potential calcium channel protein; The Transient Receptor Potential Ca2+ Channel (TRP-CC) Family (TC. 1.A.4)The TRP-CC family has also been called the store-operated calcium channel (SOC) family. The prototypical members include the Drosophila retinal proteinsTRP and TRPL (Montell and Rubin, 1989; Hardie and Minke, 1993). SOC members of the family mediate the entry of extracellular Ca2+ into cells in responseto depletion of intracellular Ca2+ stores (Clapham, 1996) and agonist stimulated production of inositol-1,4,5 trisphosphate (IP3). One member of the TRP-CCfamily, mammalian Htrp3, has been shown to form a tight complex with the IP3 receptor (TC #1.A.3.2.1). This interaction is apparently required for IP3 tostimulate Ca2+ release via Htrp3. The vanilloid receptor subtype 1 (VR1), which is the receptor for capsaicin (the ?hot? ingredient in chili peppers) and servesas a heat-activated ion channel in the pain pathway (Caterina et al., 1997), is also a member of this family. The stretch-inhibitable non-selective cation channel(SIC) is identical to the vanilloid receptor throughout all of its first 700 residues, but it exhibits a different sequence in its last 100 residues. VR1 and SICtransport monovalent cations as well as Ca2+. VR1 is about 10x more permeable to Ca2+ than to monovalent ions. Ca2+ overload probably causes cell deathafter chronic exposure to capsaicin. (McCleskey and Gold, 1999). [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273311 [Multi-domain] Cd Length: 743 Bit Score: 41.61 E-value: 4.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 17 DWNKYDDRLMKAAERGDVekvtSILAKKGVNPGKLDVE-----GRSVFHVVTSKGNLECLNAILI-HGVDITTSDTAgrn 90
Cdd:TIGR00870 13 PLSDEEKAFLPAAERGDL----ASVYRDLEEPKKLNINcpdrlGRSALFVAAIENENLELTELLLnLSCRGAVGDTL--- 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 91 aLHLAAKyghalclqkllqyncptEHADLQGRTALHDAAMADCPSSIQLlcdhgasVNAKDVD----GRTPLVLATQMSR 166
Cdd:TIGR00870 86 -LHAISL-----------------EYVDAVEAILLHLLAAFRKSGPLEL-------ANDQYTSeftpGITALHLAAHRQN 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 167 PTICQLLIDRGADVNSR-------DKQNRTALMLGcEY--------GCRDAVEVLIKNGADISLLDALGHdssyyarigd 231
Cdd:TIGR00870 141 YEIVKLLLERGASVPARacgdffvKSQGVDSFYHG-ESplnaaaclGSPSIVALLSEDPADILTADSLGN---------- 209
|
250 260
....*....|....*....|.
gi 530406156 232 nlDILTLLKTASENTNKGREL 252
Cdd:TIGR00870 210 --TLLHLLVMENEFKAEYEEL 228
|
|
| PRK11281 |
PRK11281 |
mechanosensitive channel MscK; |
977-1390 |
4.69e-03 |
|
mechanosensitive channel MscK;
Pssm-ID: 236892 [Multi-domain] Cd Length: 1113 Bit Score: 41.44 E-value: 4.69e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 977 TEKELKDQLseqtqkysvseEEVKKNKQENDKLKkeifTLQKDLRDKTVLIEKSHEMERalsrKTDELNKQLKDLSQKYT 1056
Cdd:PRK11281 37 TEADVQAQL-----------DALNKQKLLEAEDK----LVQQDLEQTLALLDKIDRQKE----ETEQLKQQLAQAPAKLR 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1057 EVKNVKEKLVEENAKQTSEILAVQNL--LQKQhvpLEQVEALKKSLNGTIENLKEELKSMQRCYEKEQQTVT-------K 1127
Cdd:PRK11281 98 QAQAELEALKDDNDEETRETLSTLSLrqLESR---LAQTLDQLQNAQNDLAEYNSQLVSLQTQPERAQAALYansqrlqQ 174
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1128 LHQLLENQKNSSVPLAEHLQIKEAFEKevgiikASLREKEEESQNKMEEVSKLQsevqntkqalkkletrevvDLskYKA 1207
Cdd:PRK11281 175 IRNLLKGGKVGGKALRPSQRVLLQAEQ------ALLNAQNDLQRKSLEGNTQLQ-------------------DL--LQK 227
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1208 TKSDLETQISSLNEKLANLnrkyeevcEEVLHAKKKEISAKDEKEllhfsieQEIKDQKERCDKSLTTITELQRRIQESA 1287
Cdd:PRK11281 228 QRDYLTARIQRLEHQLQLL--------QEAINSKRLTLSEKTVQE-------AQSQDEAARIQANPLVAQELEINLQLSQ 292
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1288 KQIEAKDnKITELLNDVERLKQALNGLSQltytsgnpTKR---------------------QSQLIDTLQhQVKSLEQQL 1346
Cdd:PRK11281 293 RLLKATE-KLNTLTQQNLRVKNWLDRLTQ--------SERnikeqisvlkgslllsrilyqQQQALPSAD-LIEGLADRI 362
|
410 420 430 440 450
....*....|....*....|....*....|....*....|....*....|....
gi 530406156 1347 ADAdRQHQ-EV---------IAIYRTHLLSAAQGHMDEDVQEALLQIIQMRQGL 1390
Cdd:PRK11281 363 ADL-RLEQfEInqqrdalfqPDAYIDKLEAGHKSEVTDEVRDALLQLLDERREL 415
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
534-1192 |
4.76e-03 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 41.70 E-value: 4.76e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 534 AEVGKLRNQIKQNEMIVEEFKRDEGKLIEENKRLQKELsmcemerekkgRKVTEMEGQAKELSAKLAlsipaekfenmks 613
Cdd:pfam01576 12 EELQKVKERQQKAESELKELEKKHQQLCEEKNALQEQL-----------QAETELCAEAEEMRARLA------------- 67
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 614 slsnevnEKAKKLVEMEREHEKSLSEIRQLKRELENVKAKLAQHVKP-----EEHEQVKSRLEQKSGELGKKITELTLKN 688
Cdd:pfam01576 68 -------ARKQELEEILHELESRLEEEEERSQQLQNEKKKMQQHIQDleeqlDEEEAARQKLQLEKVTTEAKIKKLEEDI 140
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 689 QTLQKEIEKVYLDNKLLKEQAHNLTIEMKNHYVPLKVSEDMKKSHDAIIDDLNRKLldvtQKYTEKKLEMEKLllendsl 768
Cdd:pfam01576 141 LLLEDQNSKLSKERKLLEERISEFTSNLAEEEEKAKSLSKLKNKHEAMISDLEERL----KKEEKGRQELEKA------- 209
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 769 skdvsrletvfvpPEKHEKEIIALKSNIVELKKQLSELKKKCGEDQEKIHALtsentnlkkmmsnqyvpvKTHEEVKMTL 848
Cdd:pfam01576 210 -------------KRKLEGESTDLQEQIAELQAQIAELRAQLAKKEEELQAA------------------LARLEEETAQ 258
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 849 NDTLAKTNRELL-DVKKKFEDINQEFVKiKDKNEILKRNLENTQNQIKAEyislaeheakmsslsqsMRKVQDSNAEILA 927
Cdd:pfam01576 259 KNNALKKIRELEaQISELQEDLESERAA-RNKAEKQRRDLGEELEALKTE-----------------LEDTLDTTAAQQE 320
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 928 NYRKGQEEIVTLHAEIKAQKKELDTiqecikvkyapivSFEECERKFKATEKELKDQLsEQTQKYSVSEEevkKNKQend 1007
Cdd:pfam01576 321 LRSKREQEVTELKKALEEETRSHEA-------------QLQEMRQKHTQALEELTEQL-EQAKRNKANLE---KAKQ--- 380
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1008 KLKKEIFTLQKDLRdktVLIEKSHEMERalsrKTDELNKQLKDLSQKYTEVKNVKEKLVEENAKQTSEILAVQNLLQ--- 1084
Cdd:pfam01576 381 ALESENAELQAELR---TLQQAKQDSEH----KRKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLLNeae 453
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1085 -KQHVPLEQVEALKKSLNGTIENLKEELKsmqrcyeKEQQTVTKLHQLLENQKNSSVPLAEHLQIKEAFEKEVGIIKASL 1163
Cdd:pfam01576 454 gKNIKLSKDVSSLESQLQDTQELLQEETR-------QKLNLSTRLRQLEDERNSLQEQLEEEEEAKRNVERQLSTLQAQL 526
|
650 660 670
....*....|....*....|....*....|....*.
gi 530406156 1164 R------EKEEESQNKMEEVSK-LQSEVQNTKQALK 1192
Cdd:pfam01576 527 SdmkkklEEDAGTLEALEEGKKrLQRELEALTQQLE 562
|
|
| PHA02798 |
PHA02798 |
ankyrin-like protein; Provisional |
37-248 |
5.00e-03 |
|
ankyrin-like protein; Provisional
Pssm-ID: 222931 [Multi-domain] Cd Length: 489 Bit Score: 40.97 E-value: 5.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 37 VTSILAKKGVNPGKLDVEGRSVFHVVTSKG---NLECLNAILIHGVDITTSDTAGRNALHLAAKYGHA-------LCLQK 106
Cdd:PHA02798 91 IVKILIENGADINKKNSDGETPLYCLLSNGyinNLEILLFMIENGADTTLLDKDGFTMLQVYLQSNHHidieiikLLLEK 170
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 107 LLQYNcptEHADLQGRTALH-------DAAMADCpssIQLLCDHGASVNAKDVDGRTPLVlatqmsrPTICQLLIDRGA- 178
Cdd:PHA02798 171 GVDIN---THNNKEKYDTLHcyfkyniDRIDADI---LKLFVDNGFIINKENKSHKKKFM-------EYLNSLLYDNKRf 237
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 179 ------------DVNSRDKQNRTALMLGCEYGCRDAVEVLIKNGADISLLDALGHdssyyarigdnldilTLLKTASENT 246
Cdd:PHA02798 238 kknildfifsyiDINQVDELGFNPLYYSVSHNNRKIFEYLLQLGGDINIITELGN---------------TCLFTAFENE 302
|
..
gi 530406156 247 NK 248
Cdd:PHA02798 303 SK 304
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
442-820 |
5.34e-03 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 41.18 E-value: 5.34e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 442 ELAHKVAECKALALECERVKEDSDEQIKQLEDALKDVQKRMYESEGKVKQMQTHFLALKEHLtseaasgnhrltEELKDQ 521
Cdd:PRK02224 255 TLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARR------------EELEDR 322
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 522 LKDLKVKYEGASAEVGKLRNQIKQNEMIVEEFKRDEGKLIEENKRLQKELSMCEMEREKKGRKVTEMEGQAKELSAKLAL 601
Cdd:PRK02224 323 DEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGD 402
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 602 SipAEKFENMkSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKRELENVKA-----KLAQHVKPEEHEQVKSRLEQKSGE 676
Cdd:PRK02224 403 A--PVDLGNA-EDFLEELREERDELREREAELEATLRTARERVEEAEALLEagkcpECGQPVEGSPHVETIEEDRERVEE 479
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 677 LGKKITELTLKNQTLQKEIEKVyldnkllkeqahnltiemknhyvplkvsEDMKKSHDAiIDDLNRKLLDVTQKYTEKKL 756
Cdd:PRK02224 480 LEAELEDLEEEVEEVEERLERA----------------------------EDLVEAEDR-IERLEERREDLEELIAERRE 530
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 530406156 757 EMEKLLLENDSLSKDVSRLETvfvPPEKHEKEIIALKSNIVELKKQLSELKKKCGEDQEKIHAL 820
Cdd:PRK02224 531 TIEEKRERAEELRERAAELEA---EAEEKREAAAEAEEEAEEAREEVAELNSKLAELKERIESL 591
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
430-677 |
6.53e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 40.52 E-value: 6.53e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 430 ESAKQDRLKLQNELAHKVAECKALALECERVK---EDSDEQIKQLEDALKDVQKRMYESEGKVKQMQTHFLALKEHLtse 506
Cdd:COG4942 23 AEAEAELEQLQQEIAELEKELAALKKEEKALLkqlAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAEL--- 99
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 507 aasgnHRLTEELKDQLKDLkVKYEGASAEVGKLR----NQIKQNEMIVEEFKRDEGKLIEENKRLQKELSmcemerekkg 582
Cdd:COG4942 100 -----EAQKEELAELLRAL-YRLGRQPPLALLLSpedfLDAVRRLQYLKYLAPARREQAEELRADLAELA---------- 163
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 583 RKVTEMEGQAKELSAKLAlsipaeKFENMKSSLSNEVNEKAKKLVEMEREHEKSLSEIRQLKRE---LENVKAKLAQHVK 659
Cdd:COG4942 164 ALRAELEAERAELEALLA------ELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEaeeLEALIARLEAEAA 237
|
250
....*....|....*...
gi 530406156 660 PEEHEQVKSRLEQKSGEL 677
Cdd:COG4942 238 AAAERTPAAGFAALKGKL 255
|
|
| TRPV3 |
cd22194 |
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), type 3; TRPV3 is a ... |
103-191 |
6.60e-03 |
|
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), type 3; TRPV3 is a temperature-sensitive Transient Receptor Potential (TRP) ion channel that is activated by warm temperatures, synthetic small-molecule chemicals, and natural compounds from plants. TRPV3 function is regulated by physiological factors such as extracellular divalent cations and acidic pH, intracellular adenosine triphosphate, membrane voltage, and arachidonic acid. It is expressed in both neuronal and non-neuronal tissues including epidermal keratinocytes, epithelial cells in the gut, endothelial cells in blood vessels, and neurons in dorsal root ganglia and CNS. TRPV3 null mice have abnormal hair morphogenesis and compromised skin barrier function. It may play roles in inflammatory skin disorders, such as itch and pain sensation. TRPV3 is also expressed by many neuronal and non-neuronal tissues, showing that TRPV3 might play roles in other unknown cellular and physiological functions. TRPV3 belongs to the vanilloid TRP subfamily (TRPV), named after the founding member vanilloid receptor 1 (TRPV1). The structure of TRPV shows the typical topology features of all TRP ion channel family members, such as six transmembrane regions, a short hydrophobic stretch between transmembrane segments 5 and 6 and large intracellular N- and C-terminal domains.
Pssm-ID: 411978 [Multi-domain] Cd Length: 680 Bit Score: 40.90 E-value: 6.60e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 103 CLQKLLqyNCPTEHADLQGRTALHDAAMADCPSSIQLLCDHGASVNA---------KDVD-----GRTPLVLATQMSRPT 168
Cdd:cd22194 125 ILDRFI--NAEYTEEAYEGQTALNIAIERRQGDIVKLLIAKGADVNAhakgvffnpKYKHegfyfGETPLALAACTNQPE 202
|
90 100
....*....|....*....|....
gi 530406156 169 ICQLLIDRGAD-VNSRDKQNRTAL 191
Cdd:cd22194 203 IVQLLMEKESTdITSQDSRGNTVL 226
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
784-1344 |
6.96e-03 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 41.19 E-value: 6.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 784 KHEKEIIALKSNIVELKKQLSELKKKCGEDQEKIHALTSENTNLKKMmsnqyVPVKTHEEVKMTLNDTLAKTNRelLDVK 863
Cdd:TIGR01612 576 HLEKEIKDLFDKYLEIDDEIIYINKLKLELKEKIKNISDKNEYIKKA-----IDLKKIIENNNAYIDELAKISP--YQVP 648
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 864 KKFEDINQEFVKIKDK-NEILKRNLENTQNQ----IKAEYISLAEHEAKMSSLSQSMRKVQD--SNAEI------LANYR 930
Cdd:TIGR01612 649 EHLKNKDKIYSTIKSElSKIYEDDIDALYNElssiVKENAIDNTEDKAKLDDLKSKIDKEYDkiQNMETatvelhLSNIE 728
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 931 KGQEEIVTLHAEIKAQ-----KKELDTIQECIKVKYAPIVS-------FEECERKFKATEKELKDQLSEQTQKYSVSEEE 998
Cdd:TIGR01612 729 NKKNELLDIIVEIKKHihgeiNKDLNKILEDFKNKEKELSNkindyakEKDELNKYKSKISEIKNHYNDQINIDNIKDED 808
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 999 VKKNKQENDKLKKEIfTLQKDLRDKTV------------LIEKSHEMERALSRKTDELNKQLKDLSQKY-TEVKNVKEKL 1065
Cdd:TIGR01612 809 AKQNYDKSKEYIKTI-SIKEDEIFKIInemkfmkddflnKVDKFINFENNCKEKIDSEHEQFAELTNKIkAEISDDKLND 887
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1066 VEENAKQT-SEILAVQNLLQKQHvplEQVEALKKsLNGTIENLKEELKSMQRCYEKEQQTVTKLHQLLENQKNSSV---- 1140
Cdd:TIGR01612 888 YEKKFNDSkSLINEINKSIEEEY---QNINTLKK-VDEYIKICENTKESIEKFHNKQNILKEILNKNIDTIKESNLieks 963
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1141 --PLAEHLQIKEAFEKEVGIIKASLREKEEESQNKMEEVSKLQSEVQNTKQALKKLETREvvdlsKYKATkSDLETQISS 1218
Cdd:TIGR01612 964 ykDKFDNTLIDKINELDKAFKDASLNDYEAKNNELIKYFNDLKANLGKNKENMLYHQFDE-----KEKAT-NDIEQKIED 1037
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1219 LNEKLANLnrkyeevcEEVLHAKKKEISAKDEKELLHfSIEQEIKDQKERCDKSLTTITELQRRIQ-----ESAKQIEAK 1293
Cdd:TIGR01612 1038 ANKNIPNI--------EIAIHTSIYNIIDEIEKEIGK-NIELLNKEILEEAEINITNFNEIKEKLKhynfdDFGKEENIK 1108
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|...
gi 530406156 1294 -DNKITELLNDVERLKQALN-GLSQLTYTSgnptKRQSQLIDTLQHQVKSLEQ 1344
Cdd:TIGR01612 1109 yADEINKIKDDIKNLDQKIDhHIKALEEIK----KKSENYIDEIKAQINDLED 1157
|
|
| Gp58 |
pfam07902 |
gp58-like protein; Sequences found in this family are derived from a number of bacteriophage ... |
1023-1339 |
7.95e-03 |
|
gp58-like protein; Sequences found in this family are derived from a number of bacteriophage and prophage proteins. They are similar to gp58, a minor structural protein of Lactococcus delbrueckii bacteriophage LL-H.
Pssm-ID: 369586 [Multi-domain] Cd Length: 594 Bit Score: 40.71 E-value: 7.95e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1023 KTVLIEKSHE-------MERALSrKTDELNKQLKDL--SQKYTEVK-NVKEKLVE-ENAKQTSEILAVQNLLQKQHVplE 1091
Cdd:pfam07902 63 APVVVEGSGEstglfksLEEMLS-QLKELNLELTDTknSNLWSKIKlNNNGMLREyHNDTIKTEIVESAEGIATRIS--E 139
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1092 QVEALKKSLNGTIENLKEELKSMQR-CYEKEQQTVTKLHQLLENQKNSsvplaehlqikeafekevgiikaslrekeees 1170
Cdd:pfam07902 140 DTDKKLALINETISGIRREYQDADRqLSSSYQAGIEGLKATMASDKIG-------------------------------- 187
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1171 qnkmeevskLQSEVQNTKQALKKLETREVVDLS-KYKATKSDletQISSLNEKLANLNRKYEEVCEEVLHAKKKEISA-K 1248
Cdd:pfam07902 188 ---------LQAEIQASAQGLSQRYDNEIRKLSaKITTTSSG---TTEAYESKLDDLRAEFTRSNQGMRTELESKISGlQ 255
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1249 DEKELLHFSIEQEIKDQKERCDKSLTTITELQRRIQESAKQIEAkdnkitellndverLKQALNGL-SQLTYTSGNPTKR 1327
Cdd:pfam07902 256 STQQSTAYQISQEISNREGAVSRVQQDLDSYQRRLQDAEKNYSS--------------LTQTVKGLqSTVSDPNSKLESR 321
|
330
....*....|..
gi 530406156 1328 QSQLIDTLQHQV 1339
Cdd:pfam07902 322 ITQLAGLIEQKV 333
|
|
| HEC1 |
COG5185 |
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell ... |
742-1072 |
8.08e-03 |
|
Chromosome segregation protein NDC80, interacts with SMC proteins [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 444066 [Multi-domain] Cd Length: 594 Bit Score: 40.71 E-value: 8.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 742 RKLLDVTQKYTEKKLEMEKLLLENDSLSKDVSRLET-----VFVPPEKHEKEIIALKSNIVELKKQLSELK-KKCGEDQE 815
Cdd:COG5185 196 KKAEPSGTVNSIKESETGNLGSESTLLEKAKEIINIeealkGFQDPESELEDLAQTSDKLEKLVEQNTDLRlEKLGENAE 275
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 816 KIHALTSENTNLKKMMSNQYVPVKTHEEVKMTLNDTLAKTNR-ELLDVKKKFEDINQEFVKikdKNEILKRNLENTQNQI 894
Cdd:COG5185 276 SSKRLNENANNLIKQFENTKEKIAEYTKSIDIKKATESLEEQlAAAEAEQELEESKRETET---GIQNLTAEIEQGQESL 352
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 895 KAEYISLaeheakmsslsqsmrKVQDSNAEILANYRKGQEEIVTLHAEIKAQKKELDTIQECIKVKYAPIVSFEECERKF 974
Cdd:COG5185 353 TENLEAI---------------KEEIENIVGEVELSKSSEELDSFKDTIESTKESLDEIPQNQRGYAQEILATLEDTLKA 417
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 975 KATE-KELKDQLSEQTQKYSVSEEEVKKNKQENDKLKKEIftlqkDLRDKTVLIEKSHEMERALSRKTDELNKQLKDLSQ 1053
Cdd:COG5185 418 ADRQiEELQRQIEQATSSNEEVSKLLNELISELNKVMREA-----DEESQSRLEEAYDEINRSVRSKKEDLNEELTQIES 492
|
330
....*....|....*....
gi 530406156 1054 KYTEVKNVKEKLVEENAKQ 1072
Cdd:COG5185 493 RVSTLKATLEKLRAKLERQ 511
|
|
| PLN03229 |
PLN03229 |
acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional |
776-1176 |
8.14e-03 |
|
acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha; Provisional
Pssm-ID: 178768 [Multi-domain] Cd Length: 762 Bit Score: 40.61 E-value: 8.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 776 ETVFVPPEK--HEKEIIALKSNIVELKKQLSELKKKCGEDQEKIHALTSENTNlkKMMSnqyvpvKTHEEVKMTLNDTLa 853
Cdd:PLN03229 409 EGVPVDPERkvNMKKREAVKTPVRELEGEVEKLKEQILKAKESSSKPSELALN--EMIE------KLKKEIDLEYTEAV- 479
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 854 ktnrELLDVKKKFEDINQEFVKIKDKNEILKRNLENTQNQIKAEYISLAEHEAKMSSLSQSMRKVQDSNAEILANYRKGQ 933
Cdd:PLN03229 480 ----IAMGLQERLENLREEFSKANSQDQLMHPVLMEKIEKLKDEFNKRLSRAPNYLSLKYKLDMLNEFSRAKALSEKKSK 555
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 934 EEivTLHAEIKAQKKELDTIQEcikvkyapivsfeecerkFKATEKELKDQlseqtqkysVSEEEVKKNKQENDKLKKEI 1013
Cdd:PLN03229 556 AE--KLKAEINKKFKEVMDRPE------------------IKEKMEALKAE---------VASSGASSGDELDDDLKEKV 606
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1014 FTLQKDLRDKTVLIEKSHEMERALSRKTDelnkqlKDLSQKyTEVKNVKEKLVEENAKQTSEILAVQNLlqkqhvpleqv 1093
Cdd:PLN03229 607 EKMKKEIELELAGVLKSMGLEVIGVTKKN------KDTAEQ-TPPPNLQEKIESLNEEINKKIERVIRS----------- 668
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 1094 ealkKSLNGTIENLKEEL-KSMQRCYEKEQQTVTKLHQLLENQKNSSVPLAEHLQIKEAFEKEVgiikASLREKEEESQN 1172
Cdd:PLN03229 669 ----SDLKSKIELLKLEVaKASKTPDVTEKEKIEALEQQIKQKIAEALNSSELKEKFEELEAEL----AAARETAAESNG 740
|
....
gi 530406156 1173 KMEE 1176
Cdd:PLN03229 741 SLKN 744
|
|
| Ank_3 |
pfam13606 |
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ... |
153-181 |
8.26e-03 |
|
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin- actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities.
Pssm-ID: 463933 [Multi-domain] Cd Length: 30 Bit Score: 35.31 E-value: 8.26e-03
10 20
....*....|....*....|....*....
gi 530406156 153 DGRTPLVLATQMSRPTICQLLIDRGADVN 181
Cdd:pfam13606 1 DGNTPLHLAARNGRLEIVKLLLENGADIN 29
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
469-1065 |
8.61e-03 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 40.54 E-value: 8.61e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 469 KQLEDALKDVQKRMYESEGKV-------KQMQTHFLALKEHLTSEAASGNHRLTEE--LKDQLKDLKVKYEGASAEVGKL 539
Cdd:pfam01576 71 QELEEILHELESRLEEEEERSqqlqnekKKMQQHIQDLEEQLDEEEAARQKLQLEKvtTEAKIKKLEEDILLLEDQNSKL 150
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 540 RNQIKQNEMIVEEFKRDEGKLIEENKRLQK-----ELSMCEME----REKKGRKVTE-----MEGQAKELSAKLA----- 600
Cdd:pfam01576 151 SKERKLLEERISEFTSNLAEEEEKAKSLSKlknkhEAMISDLEerlkKEEKGRQELEkakrkLEGESTDLQEQIAelqaq 230
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 601 ----LSIPAEKFENMKSSLSNEVNEKAKK--LVEMEREHEKSLSEI------------------RQLKRELENVKAKLAQ 656
Cdd:pfam01576 231 iaelRAQLAKKEEELQAALARLEEETAQKnnALKKIRELEAQISELqedleseraarnkaekqrRDLGEELEALKTELED 310
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 657 HVKPEEHEQ-VKSRLEQKSGELGKKITELT--------------------LKNQTLQKEIEKVYLD--NKLLKEQAHNLT 713
Cdd:pfam01576 311 TLDTTAAQQeLRSKREQEVTELKKALEEETrsheaqlqemrqkhtqaleeLTEQLEQAKRNKANLEkaKQALESENAELQ 390
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 714 IEMKNHYVPLKVSEDMKKSHDAIIDDLNRKLLDVTQKYTEKKLEMEKLLLENDSLSKDVSRLETVFVppeKHEKEIIALK 793
Cdd:pfam01576 391 AELRTLQQAKQDSEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLLNEAEGKNI---KLSKDVSSLE 467
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 794 SNIVELKKQLSELKKKCGEDQEKIHALTSENTNLKKMMSNQyvpvkthEEVKMTLNDTLAKTNRELLDVKKKFEDINQEF 873
Cdd:pfam01576 468 SQLQDTQELLQEETRQKLNLSTRLRQLEDERNSLQEQLEEE-------EEAKRNVERQLSTLQAQLSDMKKKLEEDAGTL 540
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 874 VKIKDKNEILKRNLENTQNQIKAEYISLAEHEAKMSSLSQSMRKV---QDSNAEILANYRKGQEEIVTLHAEIKAqkkel 950
Cdd:pfam01576 541 EALEEGKKRLQRELEALTQQLEEKAAAYDKLEKTKNRLQQELDDLlvdLDHQRQLVSNLEKKQKKFDQMLAEEKA----- 615
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 951 dtiqecIKVKYApivsfEECER-KFKATEKELKD-QLSEQTQKYSVSEEEVKKNkqeNDKLKKEIFTL--QKDLRDKTVl 1026
Cdd:pfam01576 616 ------ISARYA-----EERDRaEAEAREKETRAlSLARALEEALEAKEELERT---NKQLRAEMEDLvsSKDDVGKNV- 680
|
650 660 670 680
....*....|....*....|....*....|....*....|..
gi 530406156 1027 ieksHEME---RALSRKTDELNKQLKDLSQKYTEVKNVKEKL 1065
Cdd:pfam01576 681 ----HELErskRALEQQVEEMKTQLEELEDELQATEDAKLRL 718
|
|
| CCDC22 |
pfam05667 |
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 ... |
727-1017 |
9.03e-03 |
|
Coiled-coil domain-containing protein 22; Human coiled-coil domain-containing protein 22 (CCDC22) is involved in regulation of NF-kappa-B signalling; the function may involve association with COMMD8 and a CUL1-dependent E3 ubiquitin ligase complex. It is part of the OMMD/CCDC22/CCDC93 (CCC) complex, which interacts with the multisubunit WASH complex required for endosomal deposition of F-actin and cargo trafficking in conjunction with the retromer. This entry also includes CCDC22 homologs from animals and plants.
Pssm-ID: 461708 [Multi-domain] Cd Length: 600 Bit Score: 40.40 E-value: 9.03e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 727 EDMKKSHDAIIDDLNRKLLDVTQKYTEKKLEMEKLlleNDSLSKDVSRLETVfvppekhEKEIIALKSNiVELKKQLSEL 806
Cdd:pfam05667 327 EELQQQREEELEELQEQLEDLESSIQELEKEIKKL---ESSIKQVEEELEEL-------KEQNEELEKQ-YKVKKKTLDL 395
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 807 KKKCGEDQEKIHALTSENTNLKKMMSNQY----VP-VKTHEEVKmtlndtLAKTNRELlDVKKKFEDINQEFVKIKDKNE 881
Cdd:pfam05667 396 LPDAEENIAKLQALVDASAQRLVELAGQWekhrVPlIEEYRALK------EAKSNKED-ESQRKLEEIKELREKIKEVAE 468
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 530406156 882 ILKRNlENTQNQIKAEYISLAeheaKMSSLSQSMRKVqdsnAEILANYRKGQEEIVTLHAEIKAQKKELDTIQECIkvky 961
Cdd:pfam05667 469 EAKQK-EELYKQLVAEYERLP----KDVSRSAYTRRI----LEIVKNIKKQKEEITKILSDTKSLQKEINSLTGKL---- 535
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 530406156 962 apivsfeecERKFKATEKEL-----KDQLSEQTQKYSVSE----EEVKKNKQENDKLKKEIFTLQ 1017
Cdd:pfam05667 536 ---------DRTFTVTDELVfkdakKDESVRKAYKYLAALhencEQLIQTVEETGTIMREIRDLE 591
|
|
|