|
Name |
Accession |
Description |
Interval |
E-value |
| CCDC144C |
pfam14915 |
CCDC144C protein coiled-coil region; This family includes the human protein CCDC144C and the ... |
1272-1575 |
1.89e-165 |
|
CCDC144C protein coiled-coil region; This family includes the human protein CCDC144C and the ankyrin repeat domain-containing protein 26-like 1 found in eukaryotes. Its function remains unknown, however, it is known to contain a coiled-coil domain which corresponds to this region. The ankyrin repeat which features in this protein is a common amino acid motif.
Pssm-ID: 464371 [Multi-domain] Cd Length: 304 Bit Score: 508.76 E-value: 1.89e-165
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1272 NSMLQEEIAMLRLEIDTIKNQNQEKEKKCFEDLKIVKEKNEDLQKTIKQNEETLTQTISQYNGRLSVLTAENAMLNSKLE 1351
Cdd:pfam14915 1 NCMLQDEIAMLRLEIDTIKNQNQEKEKKYLEDIEILKEKNDDLQKTLKLNEETLTKTVFQYNGQLNVLKAENTMLNSKLE 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1352 NEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARDECSRLQDKMNFDVSNLKDNNEILSQQLFKTESKL 1431
Cdd:pfam14915 81 NEKQNKERLETEVESYRSRLAAAIQDHEQSQTSKRDLELAFQRERDEWLRLQDKMNFDVSNLRDENEILSQQLSKAESKA 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1432 NSLEIEFHHTRDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQSENMLLRQQLDDA 1511
Cdd:pfam14915 161 NSLENELHRTRDALREKTLLLESVQRDLSQAQCQKKELEHMYQNEQDKVNKYIGKQESLEERLAQLQSENMLLRQQLEDA 240
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1034567240 1512 HNKADNKEKTVINIQDQFHAIVQKLQAESEKQSLLLEERNKELISECNHLKERQYQYENEKAER 1575
Cdd:pfam14915 241 QNKADAKEKTVIDIQDQFQDIVKKLQAESEKQVLLLEERNKELINECNHLKERLYQYEKEKAER 304
|
|
| DUF3496 |
pfam12001 |
Domain of unknown function (DUF3496); This presumed domain is functionally uncharacterized. ... |
1881-1989 |
2.30e-57 |
|
Domain of unknown function (DUF3496); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is about 110 amino acids in length.
Pssm-ID: 463425 [Multi-domain] Cd Length: 109 Bit Score: 193.73 E-value: 2.30e-57
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1881 KSQMELRIKDLESELSKIKTSQEDFNKTELEKYKQLYLEELKVRKSLSSKLTKTNERLAEVNTKLLVEKQQSRSLFTTLT 1960
Cdd:pfam12001 1 RSQMELRIKDLESELSKMKTSQEDSNKIELEKYKQLYLEELKVRKSLSNKLNKTNERLAEVSTKLLVEKQQNRSLLSTLT 80
|
90 100
....*....|....*....|....*....
gi 1034567240 1961 TRPVMEPPCVGNLNNSLDLNRKLIPRENL 1989
Cdd:pfam12001 81 TRPVLESPCVGNLNNSLVLNRNFIPRENL 109
|
|
| ANKYR |
COG0666 |
Ankyrin repeat [Signal transduction mechanisms]; |
44-210 |
7.37e-39 |
|
Ankyrin repeat [Signal transduction mechanisms];
Pssm-ID: 440430 [Multi-domain] Cd Length: 289 Bit Score: 147.79 E-value: 7.37e-39
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 44 DRDLGKIHKAASAGNVAKVQQILLLRKNGLNDRDKMNRTALHLACANGHPEVVTLLVDRKCQLNVCDNENRTALMKAVQC 123
Cdd:COG0666 51 DALGALLLLAAALAGDLLVALLLLAAGADINAKDDGGNTLLHAAARNGDLEIVKLLLEAGADVNARDKDGETPLHLAAYN 130
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 124 QEEKCATILLEHGADPNLADVHGNTALHYAVYNEDISVATKLLLYDANIEAKNKDDLTPLLLAVSGKKQQMVEFLIKKKA 203
Cdd:COG0666 131 GNLEIVKLLLEAGADVNAQDNDGNTPLHLAAANGNLEIVKLLLEAGADVNARDNDGETPLHLAAENGHLEIVKLLLEAGA 210
|
....*..
gi 1034567240 204 NVNAVDK 210
Cdd:COG0666 211 DVNAKDN 217
|
|
| ANKYR |
COG0666 |
Ankyrin repeat [Signal transduction mechanisms]; |
50-218 |
6.53e-38 |
|
Ankyrin repeat [Signal transduction mechanisms];
Pssm-ID: 440430 [Multi-domain] Cd Length: 289 Bit Score: 145.10 E-value: 6.53e-38
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 50 IHKAASAGNVAKVQqiLLLRKNG-LNDRDKMNRTALHLACANGHPEVVTLLVDRKCQLNVCDNENRTALMKAVQCQEEKC 128
Cdd:COG0666 91 LHAAARNGDLEIVK--LLLEAGAdVNARDKDGETPLHLAAYNGNLEIVKLLLEAGADVNAQDNDGNTPLHLAAANGNLEI 168
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 129 ATILLEHGADPNLADVHGNTALHYAVYNEDISVATKLLLYDANIEAKNKDDLTPLLLAVSGKKQQMVEFLIKKKANVNAV 208
Cdd:COG0666 169 VKLLLEAGADVNARDNDGETPLHLAAENGHLEIVKLLLEAGADVNAKDNDGKTALDLAAENGNLEIVKLLLEAGADLNAK 248
|
170
....*....|
gi 1034567240 209 DKLESSHQLI 218
Cdd:COG0666 249 DKDGLTALLL 258
|
|
| ANKYR |
COG0666 |
Ankyrin repeat [Signal transduction mechanisms]; |
50-210 |
1.74e-31 |
|
Ankyrin repeat [Signal transduction mechanisms];
Pssm-ID: 440430 [Multi-domain] Cd Length: 289 Bit Score: 126.22 E-value: 1.74e-31
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 50 IHKAASAGNVAKVQqilLLRKNG--LNDRDKMNRTALHLACANGHPEVVTLLVDRKCQLNVCDNENRTALMKAVQCQEEK 127
Cdd:COG0666 124 LHLAAYNGNLEIVK---LLLEAGadVNAQDNDGNTPLHLAAANGNLEIVKLLLEAGADVNARDNDGETPLHLAAENGHLE 200
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 128 CATILLEHGADPNLADVHGNTALHYAVYNEDISVATKLLLYDANIEAKNKDDLTPLLLAVSGKKQQMVEFLIKKKANVNA 207
Cdd:COG0666 201 IVKLLLEAGADVNAKDNDGKTALDLAAENGNLEIVKLLLEAGADLNAKDKDGLTALLLAAAAGAALIVKLLLLALLLLAA 280
|
...
gi 1034567240 208 VDK 210
Cdd:COG0666 281 ALL 283
|
|
| ANKYR |
COG0666 |
Ankyrin repeat [Signal transduction mechanisms]; |
44-210 |
1.26e-28 |
|
Ankyrin repeat [Signal transduction mechanisms];
Pssm-ID: 440430 [Multi-domain] Cd Length: 289 Bit Score: 118.13 E-value: 1.26e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 44 DRDLGKIHKAASAGNVAKVQQILLLRKNGLNDRDKMNRTALHLACANGHPEVVTLLVDRKCQLNVCDNENRTALMKAVQC 123
Cdd:COG0666 18 LLLLALLLLAAALLLLLLLLLLLLLALLALALADALGALLLLAAALAGDLLVALLLLAAGADINAKDDGGNTLLHAAARN 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 124 QEEKCATILLEHGADPNLADVHGNTALHYAVYNEDISVATKLLLYDANIEAKNKDDLTPLLLAVSGKKQQMVEFLIKKKA 203
Cdd:COG0666 98 GDLEIVKLLLEAGADVNARDKDGETPLHLAAYNGNLEIVKLLLEAGADVNAQDNDGNTPLHLAAANGNLEIVKLLLEAGA 177
|
....*..
gi 1034567240 204 NVNAVDK 210
Cdd:COG0666 178 DVNARDN 184
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1105-1772 |
8.67e-21 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 100.52 E-value: 8.67e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1105 ELLTVKIKKMEDKVNVLQRELSETKEIKSQLEHQKVEWERELCSLRFSL-----------NQEEEKRRNADTLYEKI--- 1170
Cdd:TIGR02168 270 EELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLeeleaqleeleSKLDELAEELAELEEKLeel 349
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1171 REQLRRKEEQYRKEVEVKQQLELSLQTLEMELRTVKSNLNQVVQERNDAQRQLSR-----EQNARMLQ--DGILTNHLSK 1243
Cdd:TIGR02168 350 KEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERlearlERLEDRRErlQQEIEELLKK 429
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1244 QKEIEMAQKKMNSENSHSHEEEKDLSHKN-----SMLQEEIAMLRLEIDTIKNQNQEKEKKCfEDLKIVKEKNEDLQKTI 1318
Cdd:TIGR02168 430 LEEAELKELQAELEELEEELEELQEELERleealEELREELEEAEQALDAAERELAQLQARL-DSLERLQENLEGFSEGV 508
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1319 KQNEETLTQtISQYNGRLSVL--------TAENAMLNSKL-----ENEKQSKERLEAEVESYHSRLAAAIHDR------D 1379
Cdd:TIGR02168 509 KALLKNQSG-LSGILGVLSELisvdegyeAAIEAALGGRLqavvvENLNAAKKAIAFLKQNELGRVTFLPLDSikgteiQ 587
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1380 QSETSKRELELAFQRARDEC----SRLQDKMNF------DVSNLKDNNEILSQQLFktESKLNSLEIEFHHTRDAL---- 1445
Cdd:TIGR02168 588 GNDREILKNIEGFLGVAKDLvkfdPKLRKALSYllggvlVVDDLDNALELAKKLRP--GYRIVTLDGDLVRPGGVItggs 665
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1446 REKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQSENMLLRQQLDDAHNKADNKEKTVINI 1525
Cdd:TIGR02168 666 AKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQL 745
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1526 QDQfhaiVQKLQAESEKQSLLLEERNKELISECNHLKERqyqyENEKAEREVVVRQLQQELadtlkkQSMSEASLEVTSR 1605
Cdd:TIGR02168 746 EER----IAQLSKELTELEAEIEELEERLEEAEEELAEA----EAEIEELEAQIEQLKEEL------KALREALDELRAE 811
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1606 YRiNLEDETQDLKKKLGQIRNQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQK---------- 1675
Cdd:TIGR02168 812 LT-LLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNerasleeala 890
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1676 --NLLNANLSEDEKEQLKKLMELKQslecNLDQEMKKNVELEREITGFKNLLKMTRKKLNEYENGEFsfhGDLKTSQFEM 1753
Cdd:TIGR02168 891 llRSELEELSEELRELESKRSELRR----ELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTL---EEAEALENKI 963
|
730
....*....|....*....
gi 1034567240 1754 DIQINKLKHKIDDLTAELE 1772
Cdd:TIGR02168 964 EDDEEEARRRLKRLENKIK 982
|
|
| Ank_2 |
pfam12796 |
Ankyrin repeats (3 copies); |
84-176 |
1.62e-20 |
|
Ankyrin repeats (3 copies);
Pssm-ID: 463710 [Multi-domain] Cd Length: 91 Bit Score: 87.86 E-value: 1.62e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 84 LHLACANGHPEVVTLLVDRKCQLNVCDNENRTALMKAVQCQEEKCATILLEHgADPNLADvHGNTALHYAVYNEDISVAT 163
Cdd:pfam12796 1 LHLAAKNGNLELVKLLLENGADANLQDKNGRTALHLAAKNGHLEIVKLLLEH-ADVNLKD-NGRTALHYAARSGHLEIVK 78
|
90
....*....|...
gi 1034567240 164 KLLLYDANIEAKN 176
Cdd:pfam12796 79 LLLEKGADINVKD 91
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1098-1916 |
1.75e-20 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 99.36 E-value: 1.75e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1098 ELKKNHCELLTVKIKKMEDKVNVLQRELSETKEIKSQLEHQKVEWERELCSLRFSLNQEEEKRRNADTLYEKIREQLRRK 1177
Cdd:TIGR02168 221 ELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRL 300
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1178 EEQYRKEVEVKQQLELSLQTLEMELRTVKSNLNQVVQERNDAQRQLsreqnarmlqDGILTNHLSKQKEIEMAQKKMnsE 1257
Cdd:TIGR02168 301 EQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKL----------EELKEELESLEAELEELEAEL--E 368
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1258 NSHSHEEEKDLSHKNsmLQEEIAMLRLEIDTIKNQNQEKEKKcfedLKIVKEKNEDLQKTIKQNEETLTqtisqyNGRLS 1337
Cdd:TIGR02168 369 ELESRLEELEEQLET--LRSKVAQLELQIASLNNEIERLEAR----LERLEDRRERLQQEIEELLKKLE------EAELK 436
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1338 VLTAENAMLNSKLENEKQSKERLEAEVESYHSRLAAAIHDRDQsetSKRELELAFQRARDECSRLQDKMNF--DVSNLKD 1415
Cdd:TIGR02168 437 ELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDA---AERELAQLQARLDSLERLQENLEGFseGVKALLK 513
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1416 NNEILSQ------QLFKTESKLnsleiefhhtRDALrEKTLGlERVQKDLSQT-QCQMKEMEQKYQNEQVKVN----KYI 1484
Cdd:TIGR02168 514 NQSGLSGilgvlsELISVDEGY----------EAAI-EAALG-GRLQAVVVENlNAAKKAIAFLKQNELGRVTflplDSI 581
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1485 GKQESVEERLSQLQSENMLLRQQLDdaHNKADNKEKTVINIQDQFHAIVQKLQAESEKQSLLLEERNkeLISECNHLKER 1564
Cdd:TIGR02168 582 KGTEIQGNDREILKNIEGFLGVAKD--LVKFDPKLRKALSYLLGGVLVVDDLDNALELAKKLRPGYR--IVTLDGDLVRP 657
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1565 QYQYENEKAEREVVVRQLQQELADTLKKQSMSEASLEVTSRYRINLEDETQDLKKKLGQIRNQLQEAQDRHTEAVRCAEK 1644
Cdd:TIGR02168 658 GGVITGGSAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLAR 737
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1645 MQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLlnanlsEDEKEQLKKLMELKQSLECNLDQEMKKNVELEREITGFKNL 1724
Cdd:TIGR02168 738 LEAEVEQLEERIAQLSKELTELEAEIEELEERL------EEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAE 811
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1725 LKMTRKKLNEYENGEFSF---HGDLKTSQFEMDIQINKLKHKIDDLTAELETAgskclhldtknQILQEELlsmktvQKK 1801
Cdd:TIGR02168 812 LTLLNEEAANLRERLESLerrIAATERRLEDLEEQIEELSEDIESLAAEIEEL-----------EELIEEL------ESE 874
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1802 CEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQYKQEIeERARQEIAEKLKEVNLFLQ-AQAASQENLEQFREN----- 1875
Cdd:TIGR02168 875 LEALLNERASLEEALALLRSELEELSEELRELESKRSEL-RRELEELREKLAQLELRLEgLEVRIDNLQERLSEEysltl 953
|
810 820 830 840
....*....|....*....|....*....|....*....|....*
gi 1034567240 1876 -NFASMKSQMELRIKDLESELSKIKTSQEDF---NKTELEKYKQL 1916
Cdd:TIGR02168 954 eEAEALENKIEDDEEEARRRLKRLENKIKELgpvNLAAIEEYEEL 998
|
|
| Ank_2 |
pfam12796 |
Ankyrin repeats (3 copies); |
50-143 |
5.75e-20 |
|
Ankyrin repeats (3 copies);
Pssm-ID: 463710 [Multi-domain] Cd Length: 91 Bit Score: 86.32 E-value: 5.75e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 50 IHKAASAGNVAKVQQiLLLRKNGLNDRDKMNRTALHLACANGHPEVVTLLVDrKCQLNVCDNeNRTALMKAVQCQEEKCA 129
Cdd:pfam12796 1 LHLAAKNGNLELVKL-LLENGADANLQDKNGRTALHLAAKNGHLEIVKLLLE-HADVNLKDN-GRTALHYAARSGHLEIV 77
|
90
....*....|....
gi 1034567240 130 TILLEHGADPNLAD 143
Cdd:pfam12796 78 KLLLEKGADINVKD 91
|
|
| ANKYR |
COG0666 |
Ankyrin repeat [Signal transduction mechanisms]; |
50-183 |
1.08e-19 |
|
Ankyrin repeat [Signal transduction mechanisms];
Pssm-ID: 440430 [Multi-domain] Cd Length: 289 Bit Score: 91.94 E-value: 1.08e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 50 IHKAASAGNVAKVQqiLLLrKNG--LNDRDKMNRTALHLACANGHPEVVTLLVDRKCQLNVCDNENRTALMKAVQCQEEK 127
Cdd:COG0666 157 LHLAAANGNLEIVK--LLL-EAGadVNARDNDGETPLHLAAENGHLEIVKLLLEAGADVNAKDNDGKTALDLAAENGNLE 233
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 1034567240 128 CATILLEHGADPNLADVHGNTALHYAVYNEDISVATKLLLYDANIEAKNKDDLTPL 183
Cdd:COG0666 234 IVKLLLEAGADLNAKDKDGLTALLLAAAAGAALIVKLLLLALLLLAAALLDLLTLL 289
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1111-1917 |
2.55e-19 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 95.51 E-value: 2.55e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1111 IKKMEDKVNVLQRELSET---KEIKSQLEhqkvEWERELCSLRFslNQEEEKRrnadtlyEKIREQLRRKEEQYRKEVEV 1187
Cdd:TIGR02168 195 LNELERQLKSLERQAEKAeryKELKAELR----ELELALLVLRL--EELREEL-------EELQEELKEAEEELEELTAE 261
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1188 KQQLELSLQTLEMELRTVKSNLNQVVQERNDAQRQLSR-EQNARMLQDGiLTNHLSKQKEIEMAQKKMNSENSHSHEEEK 1266
Cdd:TIGR02168 262 LQELEEKLEELRLEVSELEEEIEELQKELYALANEISRlEQQKQILRER-LANLERQLEELEAQLEELESKLDELAEELA 340
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1267 DLSHKNSMLQEEIAMLRLEIDTIKNQNQE---KEKKCFEDLKIVKEKNEDLQKTIKQNEETLTQ---TISQYNGRLSVLT 1340
Cdd:TIGR02168 341 ELEEKLEELKEELESLEAELEELEAELEElesRLEELEEQLETLRSKVAQLELQIASLNNEIERleaRLERLEDRRERLQ 420
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1341 AENAMLNSKLENEKQSK-----ERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARDECSRLQDKMNFdVSNLKD 1415
Cdd:TIGR02168 421 QEIEELLKKLEEAELKElqaelEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDS-LERLQE 499
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1416 NNEILSQ---QLFKTESKLNSLE------IEFHHTRDALREKTLGlERVQKDLSQT-QCQMKEMEQKYQNEQVKVN---- 1481
Cdd:TIGR02168 500 NLEGFSEgvkALLKNQSGLSGILgvlselISVDEGYEAAIEAALG-GRLQAVVVENlNAAKKAIAFLKQNELGRVTflpl 578
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1482 KYIGKQESVEERLSQLQSENMLLRQQLDdaHNKADNKEKTVINIQDQFHAIVQKLQ------------------------ 1537
Cdd:TIGR02168 579 DSIKGTEIQGNDREILKNIEGFLGVAKD--LVKFDPKLRKALSYLLGGVLVVDDLDnalelakklrpgyrivtldgdlvr 656
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1538 --------AESEKQSLL--------LEERNKELISECNHLKERQYQYENEKAEREVVVRQLQQELADTLKKQSMSEASLE 1601
Cdd:TIGR02168 657 pggvitggSAKTNSSILerrreieeLEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLA 736
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1602 VTSRYRINLEDETQDLKKKLGQIRNQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLlnan 1681
Cdd:TIGR02168 737 RLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAEL---- 812
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1682 lsEDEKEQLKKLMELKQSLECNLDQEMKKNVELEREITGFKNLLKMTRKKLNEYENGEFSFHGDLKTSQFEMD---IQIN 1758
Cdd:TIGR02168 813 --TLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERAsleEALA 890
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1759 KLKHKIDDLTAELETAGSKCLHLDTKNQILQEELLSMKTVQKKCE-KLQKNKKKL-EQEVINLRSHIERNMVELGQVKQY 1836
Cdd:TIGR02168 891 LLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEvRIDNLQERLsEEYSLTLEEAEALENKIEDDEEEA 970
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1837 KQEIeERARQEIAEkLKEVNLflqaqaasqENLEQFRENN--FASMKSQMElrikDLESELSKIKTSQEDFNKTELEKYK 1914
Cdd:TIGR02168 971 RRRL-KRLENKIKE-LGPVNL---------AAIEEYEELKerYDFLTAQKE----DLTEAKETLEEAIEEIDREARERFK 1035
|
...
gi 1034567240 1915 QLY 1917
Cdd:TIGR02168 1036 DTF 1038
|
|
| ANKYR |
COG0666 |
Ankyrin repeat [Signal transduction mechanisms]; |
65-210 |
4.06e-18 |
|
Ankyrin repeat [Signal transduction mechanisms];
Pssm-ID: 440430 [Multi-domain] Cd Length: 289 Bit Score: 87.32 E-value: 4.06e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 65 ILLLRKNGLNDRDKMNRTALHLACANGHPEVVTLLVDRKCQLNVCDNENRTALMKAVQCQEEKCATILLEHGADPNLADV 144
Cdd:COG0666 6 LLLLLLLAALLLLLLLALLLLAAALLLLLLLLLLLLLALLALALADALGALLLLAAALAGDLLVALLLLAAGADINAKDD 85
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034567240 145 HGNTALHYAVYNEDISVATKLLLYDANIEAKNKDDLTPLLLAVSGKKQQMVEFLIKKKANVNAVDK 210
Cdd:COG0666 86 GGNTLLHAAARNGDLEIVKLLLEAGADVNARDKDGETPLHLAAYNGNLEIVKLLLEAGADVNAQDN 151
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1094-1856 |
3.97e-16 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 85.12 E-value: 3.97e-16
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1094 ERLLELKKNHCELLTVKIKKMEDKVNVLQRELSE-TKEIKSQLEHQKVEWERELCSLRFSLNQEEEKRRNADTLYEKIRE 1172
Cdd:TIGR02169 243 ERQLASLEEELEKLTEEISELEKRLEEIEQLLEElNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEE 322
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1173 QLRRKEEQYRKEVEVKQQLELSLQTLEMELRTVKSNLNQVVQERNDAQRQL-SREQNARMLQDgiltNHLSKQKEIEMAQ 1251
Cdd:TIGR02169 323 RLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELeEVDKEFAETRD----ELKDYREKLEKLK 398
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1252 KKMNSenshsheeekdLSHKNSMLQEEIAMLRLEIDTIKNQNQEKEkkcfEDLKIVKEKNEDLQKTIKQNEETLTQTISQ 1331
Cdd:TIGR02169 399 REINE-----------LKRELDRLQEELQRLSEELADLNAAIAGIE----AKINELEEEKEDKALEIKKQEWKLEQLAAD 463
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1332 yngrlsvltaenamlnskLENEKQSKERLEAEVESYHSRLaaaihdrdqsetSKRELELAFQRARDECSRLQDKMNFDVS 1411
Cdd:TIGR02169 464 ------------------LSKYEQELYDLKEEYDRVEKEL------------SKLQRELAEAEAQARASEERVRGGRAVE 513
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1412 NLKDNNeilSQQLFKTESKLNSLE-----------------------------IEFHHTRDALREKTLGLERVQKDLSQT 1462
Cdd:TIGR02169 514 EVLKAS---IQGVHGTVAQLGSVGeryataievaagnrlnnvvveddavakeaIELLKRRKAGRATFLPLNKMRDERRDL 590
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1463 QCQMK-----------EMEQKYQNEQVKVNKYIGKQESVEERLSQLQSENML-LRQQLDD---AHNKADNKEKTVINIQD 1527
Cdd:TIGR02169 591 SILSEdgvigfavdlvEFDPKYEPAFKYVFGDTLVVEDIEAARRLMGKYRMVtLEGELFEksgAMTGGSRAPRGGILFSR 670
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1528 QFHAIVQKLQAESEKqsllLEERNKELISECNHLKER--QYQYENEKAEREVVVRQ-----LQQELADTLKKQSMSEASL 1600
Cdd:TIGR02169 671 SEPAELQRLRERLEG----LKRELSSLQSELRRIENRldELSQELSDASRKIGEIEkeieqLEQEEEKLKERLEELEEDL 746
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1601 EVTSRYRINLEDETQDLKKKLGQIRNQLQEAQdrhtEAVRCAEKMQDHK--QKLEKDNAKLKVTVKKQMDKIEELQKNLl 1678
Cdd:TIGR02169 747 SSLEQEIENVKSELKELEARIEELEEDLHKLE----EALNDLEARLSHSriPEIQAELSKLEEEVSRIEARLREIEQKL- 821
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1679 naNLSEDEKEQLKKLMELKQS----LECNLDQEMKKNVELEREITGFKNLLKMTRKKLNEYEngefSFHGDLKTSQFEMD 1754
Cdd:TIGR02169 822 --NRLTLEKEYLEKEIQELQEqridLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLE----SRLGDLKKERDELE 895
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1755 IQINKLKHKIDDLTAELETAGSKCLHLDTKNQILQEEL-----------------LSMKTVQKKCEKLQKNKKKLEQevI 1817
Cdd:TIGR02169 896 AQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELseiedpkgedeeipeeeLSLEDVQAELQRVEEEIRALEP--V 973
|
810 820 830 840
....*....|....*....|....*....|....*....|
gi 1034567240 1818 NLRSHIERNMVELGQVK-QYKQEIEERARQEIAEKLKEVN 1856
Cdd:TIGR02169 974 NMLAIQEYEEVLKRLDElKEKRAKLEEERKAILERIEEYE 1013
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1083-1955 |
3.99e-15 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 81.76 E-value: 3.99e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1083 LLKIQDAALSCERLL-ELKKNHCELLTVKiKKMEDKVNVLQRELSETKEIKSQLEHQKVEWERELCSLRFSLNQEEEKRR 1161
Cdd:pfam01576 14 LQKVKERQQKAESELkELEKKHQQLCEEK-NALQEQLQAETELCAEAEEMRARLAARKQELEEILHELESRLEEEEERSQ 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1162 NADTLYEKIREQLRRKEEQYRKEVEVKQQLELSLQTLEMELRTVKSNLNQVvqerNDAQRQLSREQNARMLQDGILTNHL 1241
Cdd:pfam01576 93 QLQNEKKKMQQHIQDLEEQLDEEEAARQKLQLEKVTTEAKIKKLEEDILLL----EDQNSKLSKERKLLEERISEFTSNL 168
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1242 SKQKE---------------IEMAQKKMNSENSHSHEEEK---DLSHKNSMLQEEIAMLRLEIDTIKNQNQEKEkkcfed 1303
Cdd:pfam01576 169 AEEEEkakslsklknkheamISDLEERLKKEEKGRQELEKakrKLEGESTDLQEQIAELQAQIAELRAQLAKKE------ 242
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1304 lkivkeknEDLQKTIKQNEETLTQTiSQYNGRLSVLTAENAMLNSKLENEKQSKER-------LEAEVESYHSRLAAAIH 1376
Cdd:pfam01576 243 --------EELQAALARLEEETAQK-NNALKKIRELEAQISELQEDLESERAARNKaekqrrdLGEELEALKTELEDTLD 313
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1377 DR--DQSETSKRELELA-FQRARDECSRLQDKMNFDVSNLKDNN-EILSQQLFKTESKLNSLEiefhHTRDALREKTLGL 1452
Cdd:pfam01576 314 TTaaQQELRSKREQEVTeLKKALEEETRSHEAQLQEMRQKHTQAlEELTEQLEQAKRNKANLE----KAKQALESENAEL 389
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1453 ERVQKDLSQ----TQCQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQSENMLLRQQLDDAHNKADNKEKTVINIQDQ 1528
Cdd:pfam01576 390 QAELRTLQQakqdSEHKRKKLEGQLQELQARLSESERQRAELAEKLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLESQ 469
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1529 FHAiVQKLQAESEKQSLLLEERNKELISECNHLKERQYQYENEKAEREVVVRQLQQELADTLKKQ----SMSEASLEVTS 1604
Cdd:pfam01576 470 LQD-TQELLQEETRQKLNLSTRLRQLEDERNSLQEQLEEEEEAKRNVERQLSTLQAQLSDMKKKLeedaGTLEALEEGKK 548
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1605 RYRINLEDETQDLKKK------LGQIRNQLQE-------AQDRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKI- 1670
Cdd:pfam01576 549 RLQRELEALTQQLEEKaaaydkLEKTKNRLQQelddllvDLDHQRQLVSNLEKKQKKFDQMLAEEKAISARYAEERDRAe 628
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1671 ----EELQKNLLNANLSEDEKEQLKKLMELKQSLECNLDQ------EMKKNV-ELEREITGFKNLLKMTRKKLNEYENgE 1739
Cdd:pfam01576 629 aearEKETRALSLARALEEALEAKEELERTNKQLRAEMEDlvsskdDVGKNVhELERSKRALEQQVEEMKTQLEELED-E 707
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1740 FSFHGDLKTsQFEMDIQINKLKHKiDDLTAELETAGSKCLHLDTKNQILQEELlsmKTVQKKCEKLQKNKKKLEQEVINL 1819
Cdd:pfam01576 708 LQATEDAKL-RLEVNMQALKAQFE-RDLQARDEQGEEKRRQLVKQVRELEAEL---EDERKQRAQAVAAKKKLELDLKEL 782
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1820 RSHIE----------RNMVEL-GQVKQYKQEIEE--RARQEIAEKLKEVNLFLQAQAAsqeNLEQFRENNFAS--MKSQM 1884
Cdd:pfam01576 783 EAQIDaankgreeavKQLKKLqAQMKDLQRELEEarASRDEILAQSKESEKKLKNLEA---ELLQLQEDLAASerARRQA 859
|
890 900 910 920 930 940 950
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1034567240 1885 ELRIKDLESEL---SKIKTSQEDfNKTELEKYKQLYLEELKVRKSLSSKLTKTNERLAEVNTKLLVEKQQSRSL 1955
Cdd:pfam01576 860 QQERDELADEIasgASGKSALQD-EKRRLEARIAQLEEELEEEQSNTELLNDRLRKSTLQVEQLTTELAAERST 932
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
1111-1892 |
2.45e-14 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 79.39 E-value: 2.45e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1111 IKKMEDKVNVLQRELSETKEIKsqlEHQKVEWERELCSLRFSLnQEEEKRRNAdtlyekiREQLRRKEEQYRKEVevKQQ 1190
Cdd:pfam15921 80 LEEYSHQVKDLQRRLNESNELH---EKQKFYLRQSVIDLQTKL-QEMQMERDA-------MADIRRRESQSQEDL--RNQ 146
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1191 LELSLQTLEMElRTVKSNLnqvvqeRNDAQRQLSREQNARMLQDGILTNHLSKQKEIEMAQ-KKMNSENSHSHEEEKDL- 1268
Cdd:pfam15921 147 LQNTVHELEAA-KCLKEDM------LEDSNTQIEQLRKMMLSHEGVLQEIRSILVDFEEASgKKIYEHDSMSTMHFRSLg 219
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1269 SHKNSMLQE---EIAMLRLEIDTIKNQNQEKEKKCFEDLKIVKEKNED-LQKTIKQNE---ETLTQTISQYNGRLSVLTA 1341
Cdd:pfam15921 220 SAISKILREldtEISYLKGRIFPVEDQLEALKSESQNKIELLLQQHQDrIEQLISEHEveiTGLTEKASSARSQANSIQS 299
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1342 ENAMLNSKLENEKQSKERLEAEVESYHSRLAAAIHD-----RDQSETSKRELELA---FQRARDEcsrlQDKMNFDVSNL 1413
Cdd:pfam15921 300 QLEIIQEQARNQNSMYMRQLSDLESTVSQLRSELREakrmyEDKIEELEKQLVLAnseLTEARTE----RDQFSQESGNL 375
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1414 KDNNEILSQQLFKTESKLnSLE---------------IEFHHTRDALREKTLGLERVQKDLS--QTQCQmKEMEQKYQNE 1476
Cdd:pfam15921 376 DDQLQKLLADLHKREKEL-SLEkeqnkrlwdrdtgnsITIDHLRRELDDRNMEVQRLEALLKamKSECQ-GQMERQMAAI 453
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1477 QvkvnkyiGKQESVEERLS---QLQSENMLLRQQLDDAHNKA---DNKEKTVINIQDQFHAIVQKLQAESEKQSLLLEER 1550
Cdd:pfam15921 454 Q-------GKNESLEKVSSltaQLESTKEMLRKVVEELTAKKmtlESSERTVSDLTASLQEKERAIEATNAEITKLRSRV 526
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1551 NKELiSECNHLKERQYQYENEKAErevvvrqlqqelADTLKKQsMSEASlEVTSRYRINLEDETQdLKKKLGQIRNQLQe 1630
Cdd:pfam15921 527 DLKL-QELQHLKNEGDHLRNVQTE------------CEALKLQ-MAEKD-KVIEILRQQIENMTQ-LVGQHGRTAGAMQ- 589
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1631 aqdrhTEAVRCAEKMQDHKQKLEKdnakLKVTVKKQMDKIEELQknllnANLSEDEKEQLKKLMELKQSLECnldqemKK 1710
Cdd:pfam15921 590 -----VEKAQLEKEINDRRLELQE----FKILKDKKDAKIRELE-----ARVSDLELEKVKLVNAGSERLRA------VK 649
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1711 NVELEREitGFKNLLKMTRKKLNEYENGEFSFHGDLKTSQFEMDIQINKLKHKIDDLTAELETA--------GS------ 1776
Cdd:pfam15921 650 DIKQERD--QLLNEVKTSRNELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQTrntlksmeGSdghamk 727
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1777 --------------KCLHLDTKNQILQEellSMKTVQKKCEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQYKQEIEE 1842
Cdd:pfam15921 728 vamgmqkqitakrgQIDALQSKIQFLEE---AMTNANKEKHFLKEEKNKLSQELSTVATEKNKMAGELEVLRSQERRLKE 804
|
810 820 830 840 850
....*....|....*....|....*....|....*....|....*....|.
gi 1034567240 1843 R-ARQEIAekLKEVNLflqaQAASQENLEQFRENNFASMKSQMELRIKDLE 1892
Cdd:pfam15921 805 KvANMEVA--LDKASL----QFAECQDIIQRQEQESVRLKLQHTLDVKELQ 849
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1172-1933 |
1.26e-13 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 77.03 E-value: 1.26e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1172 EQLRRKEEQYRKEVEvkqQLELSLQTLEMELRTVKSNLNQVVQERNDAQR--QLSREqnarmLQDGILTNHLsKQKEIEM 1249
Cdd:TIGR02169 166 AEFDRKKEKALEELE---EVEENIERLDLIIDEKRQQLERLRREREKAERyqALLKE-----KREYEGYELL-KEKEALE 236
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1250 AQKKmnsenshshEEEKDLSHknsmLQEEIAMLRLEIDTIKNQNQEKEKKCFEDLKIVKEKNEDLQKTIKQNEETLTQTI 1329
Cdd:TIGR02169 237 RQKE---------AIERQLAS----LEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEI 303
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1330 SQ-------YNGRLSVLTAENAMLNSKLENEKQSKERLEAEVESYHSRLAAAI----HDRDQSETSKRELE---LAFQRA 1395
Cdd:TIGR02169 304 ASlersiaeKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTeeyaELKEELEDLRAELEevdKEFAET 383
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1396 RDECSRLQDKM----------NFDVSNLKDNNEILSQQLFKTESKLNSLEIEFHHTRDALREKTLGLERVQKDLSQTQCQ 1465
Cdd:TIGR02169 384 RDELKDYREKLeklkreinelKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAAD 463
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1466 MKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQSENMLLRQQLDDAHN-----KADN-------------KEKTVINIQ- 1526
Cdd:TIGR02169 464 LSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRGGRAveevlKASIqgvhgtvaqlgsvGERYATAIEv 543
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1527 ---DQFHAIVQKLQAESEKQSLLLEERN------------------KELISECNHLK--------ERQYQ---------- 1567
Cdd:TIGR02169 544 aagNRLNNVVVEDDAVAKEAIELLKRRKagratflplnkmrderrdLSILSEDGVIGfavdlvefDPKYEpafkyvfgdt 623
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1568 --YENEKAEREVVVR----QLQQELADtlKKQSMS----------------EASLEVTSRYRINLEDETQDLKKKLGQIR 1625
Cdd:TIGR02169 624 lvVEDIEAARRLMGKyrmvTLEGELFE--KSGAMTggsraprggilfsrsePAELQRLRERLEGLKRELSSLQSELRRIE 701
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1626 NQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLlnanlsEDEKEQLKKLMELKQSLECNLD 1705
Cdd:TIGR02169 702 NRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEI------ENVKSELKELEARIEELEEDLH 775
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1706 QEMKKNVELEREITGFKnllkmTRKKLNEYEngefsfhgDLKTSQFEMDIQINKLKHKIDDLTAELETAGSKCLHL---- 1781
Cdd:TIGR02169 776 KLEEALNDLEARLSHSR-----IPEIQAELS--------KLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELqeqr 842
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1782 -DTKNQI--LQEELLSMKT----VQKKCEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQYKQEIEERarqeiAEKLKE 1854
Cdd:TIGR02169 843 iDLKEQIksIEKEIENLNGkkeeLEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQ-----IEKKRK 917
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1855 VNLFLQAQAASQENLEQFRENNFASMKS---------QMELRIKDLESELSKIktsqEDFNKTELEKYKqlylEELKVRK 1925
Cdd:TIGR02169 918 RLSELKAKLEALEEELSEIEDPKGEDEEipeeelsleDVQAELQRVEEEIRAL----EPVNMLAIQEYE----EVLKRLD 989
|
....*...
gi 1034567240 1926 SLSSKLTK 1933
Cdd:TIGR02169 990 ELKEKRAK 997
|
|
| sbcc |
TIGR00618 |
exonuclease SbcC; All proteins in this family for which functions are known are part of an ... |
1121-1815 |
1.43e-13 |
|
exonuclease SbcC; All proteins in this family for which functions are known are part of an exonuclease complex with sbcD homologs. This complex is involved in the initiation of recombination to regulate the levels of palindromic sequences in DNA. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 129705 [Multi-domain] Cd Length: 1042 Bit Score: 76.55 E-value: 1.43e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1121 LQRELSETKEIKSQLEhqKVEWERELCSLRFSLNQEEEKRRNADTLYEKIREQLrrKEEQYRKEVEVKQQLELSLQTLEM 1200
Cdd:TIGR00618 158 LKAKSKEKKELLMNLF--PLDQYTQLALMEFAKKKSLHGKAELLTLRSQLLTLC--TPCMPDTYHERKQVLEKELKHLRE 233
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1201 ELRTVKSNLNQVVQERNDAQRQLSREQNARMLQDGI------LTNHLSKQKEIEMA------------------------ 1250
Cdd:TIGR00618 234 ALQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIeelraqEAVLEETQERINRArkaaplaahikavtqieqqaqrih 313
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1251 ---QKKMNSENSHSHEEEKDLSHKNSMLQEEIAMLRLEIDTIKNQNQEKEKKCFEDLKIVKEKNEDLQKTIKQNEETLTQ 1327
Cdd:TIGR00618 314 telQSKMRSRAKLLMKRAAHVKQQSSIEEQRRLLQTLHSQEIHIRDAHEVATSIREISCQQHTLTQHIHTLQQQKTTLTQ 393
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1328 TISQYNGRLSVLTAENAM---LNSKLENEKQSKERLEAEVE---SYHSRLAAAIHDRDQSET-SKRELELAFQRARDECS 1400
Cdd:TIGR00618 394 KLQSLCKELDILQREQATidtRTSAFRDLQGQLAHAKKQQElqqRYAELCAAAITCTAQCEKlEKIHLQESAQSLKEREQ 473
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1401 RLQDKMNF--DVSNLKDNNEILSQQLFKTESKLNSLEIEFHHTRDALREktlglervqkdLSQTQCQMKEMEQKYQNEQV 1478
Cdd:TIGR00618 474 QLQTKEQIhlQETRKKAVVLARLLELQEEPCPLCGSCIHPNPARQDIDN-----------PGPLTRRMQRGEQTYAQLET 542
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1479 KVNKYIGKQESVEERLSQLQSENMLLRQQLdDAHNKADNKEKTVINIQDQFHAIVQKL---QAESEKQSLLLEERNKELI 1555
Cdd:TIGR00618 543 SEEDVYHQLTSERKQRASLKEQMQEIQQSF-SILTQCDNRSKEDIPNLQNITVRLQDLtekLSEAEDMLACEQHALLRKL 621
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1556 SECNHLKERQyQYENEKAEREVVVRQLQQELADTLKKQSMSEASLEVTS------RYRINLEDETQDLKKKLGQIRNQLQ 1629
Cdd:TIGR00618 622 QPEQDLQDVR-LHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVlpkellASRQLALQKMQSEKEQLTYWKEMLA 700
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1630 EAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLLnanlsEDEKEQLKKLMELKQslecnldqemK 1709
Cdd:TIGR00618 701 QCQTLLRELETHIEEYDREFNEIENASSSLGSDLAAREDALNQSLKELM-----HQARTVLKARTEAHF----------N 765
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1710 KNVELEREITGFKNLLKMTRKKLNEYENGEFSFHgdlKTSQFEMDIQiNKLKHKIDDLTAELETAGSKCLHLDTKNQILQ 1789
Cdd:TIGR00618 766 NNEEVTAALQTGAELSHLAAEIQFFNRLREEDTH---LLKTLEAEIG-QEIPSDEDILNLQCETLVQEEEQFLSRLEEKS 841
|
730 740
....*....|....*....|....*.
gi 1034567240 1790 EELLSMKTVQKKCEKLQKNKKKLEQE 1815
Cdd:TIGR00618 842 ATLGEITHQLLKYEECSKQLAQLTQE 867
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
1285-1945 |
1.49e-13 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 76.21 E-value: 1.49e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1285 EIDTIKNQ--NQEKEKKCFEDLKIVKEKN----EDLQKTIKQNEETLTQTISQYNGRLSVLTAENAMLNSKLENEKQSKE 1358
Cdd:TIGR04523 41 KLKTIKNElkNKEKELKNLDKNLNKDEEKinnsNNKIKILEQQIKDLNDKLKKNKDKINKLNSDLSKINSEIKNDKEQKN 120
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1359 RLEAEVESyhsrlaaaihdrdqsetSKRELElafqrardECSRLQDKMNFDVSNLKDNNEILSQQLFKTESKLNSLEIEF 1438
Cdd:TIGR04523 121 KLEVELNK-----------------LEKQKK--------ENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENEL 175
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1439 HHTRDALREKtlglervQKDLSQTQCQMKEMEQKYQNEQvkvnKYIGKQESVEERLSQLQSENMLLRQQLDDAHNKADNK 1518
Cdd:TIGR04523 176 NLLEKEKLNI-------QKNIDKIKNKLLKLELLLSNLK----KKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEK 244
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1519 EKTVINIQDQFHAIVQKLQAESEKqsllLEERNKELISECNHLKERQYQYENEKAEREVVVRQLQQELADTLKkqsmsea 1598
Cdd:TIGR04523 245 TTEISNTQTQLNQLKDEQNKIKKQ----LSEKQKELEQNNKKIKELEKQLNQLKSEISDLNNQKEQDWNKELK------- 313
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1599 slevtsryrinleDETQDLKKKLGQIRNQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNll 1678
Cdd:TIGR04523 314 -------------SELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNEIEKLKKE-- 378
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1679 nanlSEDEKEQLKKLMELKQSLECNLDQEMKKNVELEREITGF---KNLLKMTRKKLNEYENGEFSFHGDLKTSQFEMDI 1755
Cdd:TIGR04523 379 ----NQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLqqeKELLEKEIERLKETIIKNNSEIKDLTNQDSVKEL 454
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1756 QINKLKHKIDDLTAELETAGSKCLHLDTKNQILQEELlsmKTVQKKCEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQ 1835
Cdd:TIGR04523 455 IIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKEL---KSKEKELKKLNEEKKELEEKVKDLTKKISSLKEKIEKLES 531
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1836 YKQEIEERARQ------EIAEKLKEVNLFLQAQaASQENLEQFRENNFASMKSQMEL--RIKDLESELSKIKTSQEDFNK 1907
Cdd:TIGR04523 532 EKKEKESKISDledelnKDDFELKKENLEKEID-EKNKEIEELKQTQKSLKKKQEEKqeLIDQKEKEKKDLIKEIEEKEK 610
|
650 660 670
....*....|....*....|....*....|....*...
gi 1034567240 1908 TELEKYKQLYLEELKVRKsLSSKLTKTNERLAEVNTKL 1945
Cdd:TIGR04523 611 KISSLEKELEKAKKENEK-LSSIIKNIKSKKNKLKQEV 647
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1507-1945 |
1.98e-13 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 76.26 E-value: 1.98e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1507 QLDDAHNKADNKEKTVINIQDQFHAIVQKLQAESEKQSLL--LEERNKELISECNHL--KERQYQYENEKAEREV----- 1577
Cdd:PRK03918 156 GLDDYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIkeKEKELEEVLREINEIssELPELREELEKLEKEVkelee 235
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1578 ---VVRQLQQELADTLKKQSMSEASLEVTSRYRINLEDETQDLKKKLGQIrNQLQEAQDRHTEAVRCAEKMQDHKQKLEK 1654
Cdd:PRK03918 236 lkeEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKEL-KELKEKAEEYIKLSEFYEEYLDELREIEK 314
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1655 DNAKLKV---TVKKQMDKIEELQKNLlnANLSEDEKEQLKKLMELKQSLEcNLDQEMKKNVELEREITGFKNL-LKMTRK 1730
Cdd:PRK03918 315 RLSRLEEeinGIEERIKELEEKEERL--EELKKKLKELEKRLEELEERHE-LYEEAKAKKEELERLKKRLTGLtPEKLEK 391
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1731 KLNEYENGEFSFHGDLKtsqfEMDIQINKLKHKIDDLTA---ELETAGSKC------LHLDTKNQILQEELLSMKTVQKK 1801
Cdd:PRK03918 392 ELEELEKAKEEIEEEIS----KITARIGELKKEIKELKKaieELKKAKGKCpvcgreLTEEHRKELLEEYTAELKRIEKE 467
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1802 CEKLQKNKKKLEQEVINLRSHI--ERNMVELGQVKQYKQEIEER----------ARQEIAEKLKEVNLFLQAQAASQENl 1869
Cdd:PRK03918 468 LKEIEEKERKLRKELRELEKVLkkESELIKLKELAEQLKELEEKlkkynleeleKKAEEYEKLKEKLIKLKGEIKSLKK- 546
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1870 EQFRENNFASMKSQMELRIKDLESELSKIKT--------SQEDFNKT--ELEKYKQLYLE----------ELKVRKSLSS 1929
Cdd:PRK03918 547 ELEKLEELKKKLAELEKKLDELEEELAELLKeleelgfeSVEELEERlkELEPFYNEYLElkdaekelerEEKELKKLEE 626
|
490
....*....|....*.
gi 1034567240 1930 KLTKTNERLAEVNTKL 1945
Cdd:PRK03918 627 ELDKAFEELAETEKRL 642
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1295-1955 |
7.54e-13 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 74.32 E-value: 7.54e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1295 EKEKKCFEDLKIVKEKNEDLQKTI--------KQNEETLTQTISQYNGRLSVLTAENAMLNSKLENEKQSKERLEAEVES 1366
Cdd:TIGR02168 206 ERQAEKAERYKELKAELRELELALlvlrleelREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEE 285
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1367 YHSRLAAAihdrdQSETSKRELELAFQRARDEcsrlqdkmnfdvsNLKDNNEILSQQLFKTESKLNSLEIEFHHTRDALR 1446
Cdd:TIGR02168 286 LQKELYAL-----ANEISRLEQQKQILRERLA-------------NLERQLEELEAQLEELESKLDELAEELAELEEKLE 347
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1447 EKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQSENMLLRQQLDDAhnkADNKEKTVINIQ 1526
Cdd:TIGR02168 348 ELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERL---EDRRERLQQEIE 424
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1527 DQfhaivqkLQAESEKQSLLLEERNKELISECNHLKERQYQYENEKAEREVVVRQLQQELADTLKKQSMSEASLEVTSRY 1606
Cdd:TIGR02168 425 EL-------LKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERL 497
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1607 RINLEDETQDLKKKLgQIRNQLQEAQDRHTEAVRCAEK--------MQDHKQKLE--------------KDNAKLKVTV- 1663
Cdd:TIGR02168 498 QENLEGFSEGVKALL-KNQSGLSGILGVLSELISVDEGyeaaieaaLGGRLQAVVvenlnaakkaiaflKQNELGRVTFl 576
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1664 -----------KKQMDKIEELQKNLLNANLSEDEKEQLKKLME-----------LKQSLEC------------------- 1702
Cdd:TIGR02168 577 pldsikgteiqGNDREILKNIEGFLGVAKDLVKFDPKLRKALSyllggvlvvddLDNALELakklrpgyrivtldgdlvr 656
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1703 -----------------NLDQEMKKNV----ELEREITGFKNLLKMTRKKLNEYENGEfsfhGDLKTSQFEMDIQINKLK 1761
Cdd:TIGR02168 657 pggvitggsaktnssilERRREIEELEekieELEEKIAELEKALAELRKELEELEEEL----EQLRKELEELSRQISALR 732
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1762 HKIDDLTAELETAGSKCLHLDTKNQILQEEllsMKTVQKKCEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQYKQEIE 1841
Cdd:TIGR02168 733 KDLARLEAEVEQLEERIAQLSKELTELEAE---IEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELR 809
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1842 ERARQEIAE--KLKEVNLFLQAQAASQENLEQFRENNFAsmksQMELRIKDLESELSKIKTSQEDFNKtELEKYKQLYLE 1919
Cdd:TIGR02168 810 AELTLLNEEaaNLRERLESLERRIAATERRLEDLEEQIE----ELSEDIESLAAEIEELEELIEELES-ELEALLNERAS 884
|
730 740 750
....*....|....*....|....*....|....*.
gi 1034567240 1920 ELKVRKSLSSKLTKTNERLAEVNTKLLVEKQQSRSL 1955
Cdd:TIGR02168 885 LEEALALLRSELEELSEELRELESKRSELRRELEEL 920
|
|
| PHA02875 |
PHA02875 |
ankyrin repeat protein; Provisional |
9-220 |
1.27e-12 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165206 [Multi-domain] Cd Length: 413 Bit Score: 72.33 E-value: 1.27e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 9 GESPLgSFARRQRSSAGGGGEPGEGAYsqPGYHVRDRDlGKIHKAASAGNVAKVQQILLLRKNGLNDRDKMNRTALHLAC 88
Cdd:PHA02875 35 GISPI-KLAMKFRDSEAIKLLMKHGAI--PDVKYPDIE-SELHDAVEEGDVKAVEELLDLGKFADDVFYKDGMTPLHLAT 110
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 89 ANGHPEVVTLLVDRKCQLNVCDNENRTALMKAVQCQEEKCATILLEHGADPNLADVHGNTALHYAVYNEDISVATKLLLY 168
Cdd:PHA02875 111 ILKKLDIMKLLIARGADPDIPNTDKFSPLHLAVMMGDIKGIELLIDHKACLDIEDCCGCTPLIIAMAKGDIAICKMLLDS 190
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|...
gi 1034567240 169 DANIEAKNKD-DLTPLLLAVSGKKQQMVEFLIKKKANVNAVDKLESSHQLISE 220
Cdd:PHA02875 191 GANIDYFGKNgCVAALCYAIENNKIDIVRLFIKRGADCNIMFMIEGEECTILD 243
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
1199-1945 |
1.50e-12 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 73.33 E-value: 1.50e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1199 EMELRTVKSNLNQVVQER--NDAQRQLSREQNARMLQDgiltnhlskqkeIEMAQKKMNsenshsheeekdLSHKNSMLQ 1276
Cdd:pfam12128 192 EGKFRDVKSMIVAILEDDgvVPPKSRLNRQQVEHWIRD------------IQAIAGIMK------------IRPEFTKLQ 247
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1277 EEIAMLR---LEIDTIKNQNQEKEKKCFEDLKIVKEKNEDLQKTIKQNEETLTQTISQYNGRLSVLTAENAMLNSKLEN- 1352
Cdd:pfam12128 248 QEFNTLEsaeLRLSHLHFGYKSDETLIASRQEERQETSAELNQLLRTLDDQWKEKRDELNGELSAADAAVAKDRSELEAl 327
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1353 EKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARD---ECSRLQDKM----NFDVSNLKDN----NEILS 1421
Cdd:pfam12128 328 EDQHGAFLDADIETAAADQEQLPSWQSELENLEERLKALTGKHQDvtaKYNRRRSKIkeqnNRDIAGIKDKlakiREARD 407
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1422 QQLFKTESKLNSLEIEFHHTRDA----LREKTLG----LERVQKDLSQTQCQMKEMEQKYQNeQVKVNKYIGKQESVEER 1493
Cdd:pfam12128 408 RQLAVAEDDLQALESELREQLEAgkleFNEEEYRlksrLGELKLRLNQATATPELLLQLENF-DERIERAREEQEAANAE 486
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1494 LSQLQSEnmllRQQLDDAHNKADNKEKtviniqdQFHAIVQKLQAESEKQSLLLEERNKELISECNHLKERQYQYENEKA 1573
Cdd:pfam12128 487 VERLQSE----LRQARKRRDQASEALR-------QASRRLEERQSALDELELQLFPQAGTLLHFLRKEAPDWEQSIGKVI 555
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1574 EREVVVR-QLQQELADTLKKQSMSEASLEVTSRyRINLED---ETQDLKKKLGQIRNQLQEAQDRHTEAvrcAEKMQDHK 1649
Cdd:pfam12128 556 SPELLHRtDLDPEVWDGSVGGELNLYGVKLDLK-RIDVPEwaaSEEELRERLDKAEEALQSAREKQAAA---EEQLVQAN 631
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1650 QKLekDNAKLKVTVKKQMDKIEELQKNLLNanlseDEKEQLKklMELKQSLECNLDQEMKKNVELEREitgfKNLLKMTR 1729
Cdd:pfam12128 632 GEL--EKASREETFARTALKNARLDLRRLF-----DEKQSEK--DKKNKALAERKDSANERLNSLEAQ----LKQLDKKH 698
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1730 KKLNEYENGEFSFHGDLKtsQFEMDIQINKLKHKIDDLTAELETAGSKclhLDTKNQILQEEllsMKTVQKKCEKLQKNK 1809
Cdd:pfam12128 699 QAWLEEQKEQKREARTEK--QAYWQVVEGALDAQLALLKAAIAARRSG---AKAELKALETW---YKRDLASLGVDPDVI 770
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1810 KKLEQEVINLRSHIERNMVELGQVKQYKQEIEER---ARQEIAEKLKEVN---LFLQAQAASQE-----NLEQFRENNFA 1878
Cdd:pfam12128 771 AKLKREIRTLERKIERIAVRRQEVLRYFDWYQETwlqRRPRLATQLSNIEraiSELQQQLARLIadtklRRAKLEMERKA 850
|
730 740 750 760 770 780 790
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034567240 1879 SMKSQMEL--RIKDLESELSKIKTSQEDFNKTELE---KYKQLYLEELK-VRKSLSSKLTKTNERLAEVNTKL 1945
Cdd:pfam12128 851 SEKQQVRLseNLRGLRCEMSKLATLKEDANSEQAQgsiGERLAQLEDLKlKRDYLSESVKKYVEHFKNVIADH 923
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
839-1630 |
1.85e-12 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 73.17 E-value: 1.85e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 839 SANNYKSMEPELENVRSSpprgdRTSKVSLKEELQQDMQRFKNEIGMLKVEFQALEKEKVQLQKEVEeerkkhrnnemEV 918
Cdd:TIGR02168 230 LVLRLEELREELEELQEE-----LKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELY-----------AL 293
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 919 SANIHDGATDDAEDDDDDDGLIQKRKSGETDHQQFPRKEN---KEYASSGPALQMKEVKSTEKEKRTSKESVNSPVFGKA 995
Cdd:TIGR02168 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDelaEELAELEEKLEELKEELESLEAELEELEAELEELESR 373
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 996 SlltgGLLQVDDDSSLSEIDEdegrptkkTSNEKNKVKNQIQSMDD-VDDLTQSSETASEDcelphssyknfmllIEQLG 1074
Cdd:TIGR02168 374 L----EELEEQLETLRSKVAQ--------LELQIASLNNEIERLEArLERLEDRRERLQQE--------------IEELL 427
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1075 MEckdsvslLKIQDAALSCERLLELKKnhcelltvKIKKMEDKVNVLQRELSETKEIKSQLEHQKVEWERELCSLRFSLN 1154
Cdd:TIGR02168 428 KK-------LEEAELKELQAELEELEE--------ELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLD 492
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1155 QEEEKRRNADTLYEKIREQLRRK----------------EEQYRKEVEVkqQLELSLQTLEMElrtvksNLNQVVQ---- 1214
Cdd:TIGR02168 493 SLERLQENLEGFSEGVKALLKNQsglsgilgvlselisvDEGYEAAIEA--ALGGRLQAVVVE------NLNAAKKaiaf 564
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1215 -ERNDAQRQL-------------SREQNARMLQDGILTNHLSKQKEIEMAQKKMNSENSHSHEEEkDLSHKNSMLQEEIA 1280
Cdd:TIGR02168 565 lKQNELGRVTflpldsikgteiqGNDREILKNIEGFLGVAKDLVKFDPKLRKALSYLLGGVLVVD-DLDNALELAKKLRP 643
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1281 MLR---LEIDTIKN---------------QNQEKE-KKCFEDLKIVKEKNEDLQKTIKQNEETLtqtiSQYNGRLSVLTA 1341
Cdd:TIGR02168 644 GYRivtLDGDLVRPggvitggsaktnssiLERRREiEELEEKIEELEEKIAELEKALAELRKEL----EELEEELEQLRK 719
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1342 ENAMLNSKLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARDECSRLQDKMNfdvsNLKDNNEILS 1421
Cdd:TIGR02168 720 ELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIE----ELEAQIEQLK 795
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1422 QQLFKTESKLNSLEIEFHHTRDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQSEn 1501
Cdd:TIGR02168 796 EELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESE- 874
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1502 mllrqqLDDAHNKADNKEKTVINIQDQFHAIVQKLQAesekqsllLEERNKELISECNHLKERQYQYENEKAEREVVVRQ 1581
Cdd:TIGR02168 875 ------LEALLNERASLEEALALLRSELEELSEELRE--------LESKRSELRRELEELREKLAQLELRLEGLEVRIDN 940
|
810 820 830 840
....*....|....*....|....*....|....*....|....*....
gi 1034567240 1582 LQQELADTLkkqsmsEASLEVTSRYRINLEDETQDLKKKLGQIRNQLQE 1630
Cdd:TIGR02168 941 LQERLSEEY------SLTLEEAEALENKIEDDEEEARRRLKRLENKIKE 983
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1157-1952 |
3.08e-12 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 72.31 E-value: 3.08e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1157 EEKRRNADTLYEKIREQLRRKEEQYRKEVEVKQQL-------------ELSLQTLEMELRTVKSNLNQVVQERNDaqrQL 1223
Cdd:pfam02463 166 RLKRKKKEALKKLIEETENLAELIIDLEELKLQELklkeqakkaleyyQLKEKLELEEEYLLYLDYLKLNEERID---LL 242
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1224 SREQNARMLQDGILTNHLSKQKEIEMAQKKMNSENShshEEEKDLSHKNSMLQEEIAMLRLEIDTIKNQNQEKEKKCFED 1303
Cdd:pfam02463 243 QELLRDEQEEIESSKQEIEKEEEKLAQVLKENKEEE---KEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKES 319
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1304 LKIVKEKNEDLQKTIKQNEETLTQTISQYNGRLSVLTAENAMLNSKLENEKQSKERLEAEVESYHSRLAAAIhdRDQSET 1383
Cdd:pfam02463 320 EKEKKKAEKELKKEKEEIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAK--LKEEEL 397
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1384 SKRELELAFQRARDECSRLQDKMNFDVSNLKDNNEILSQQLFKTESKLNSLEIEFHHTRDALR-EKTLGLERVQKDLSQT 1462
Cdd:pfam02463 398 ELKSEEEKEAQLLLELARQLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLlKDELELKKSEDLLKET 477
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1463 QCQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQSENMLLRQ-QLDDAHNKADNKEKTVINIQdqfHAIVQKLQAESE 1541
Cdd:pfam02463 478 QLVKLQEQLELLLSRQKLEERSQKESKARSGLKVLLALIKDGVGgRIISAHGRLGDLGVAVENYK---VAISTAVIVEVS 554
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1542 KQSLLLEERNKELISECNHLKERQYQYENEKAEREVVVRQLQQELADTLKKQSMSEASLEVTSRYRINLEDET--QDLKK 1619
Cdd:pfam02463 555 ATADEVEERQKLVRALTELPLGARKLRLLIPKLKLPLKSIAVLEIDPILNLAQLDKATLEADEDDKRAKVVEGilKDTEL 634
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1620 KLGQIRNQLQEAQDRHTEAVRCAEKM-QDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLLNANLSEDEKEQLKKLMELKQ 1698
Cdd:pfam02463 635 TKLKESAKAKESGLRKGVSLEEGLAEkSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREKEELKK 714
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1699 SLECNLDQEMKKNVELEREITGFKNLLKMTRKKLNEYENGEFSFHGDLKTSQFEMDIQINKLKHKIDDLTAELETAGSKc 1778
Cdd:pfam02463 715 LKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVEEEKE- 793
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1779 lhldtKNQILQEELLSMKTVQKKCEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQYKQEIEERARQEIAEKLKEVNLF 1858
Cdd:pfam02463 794 -----EKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITKEEL 868
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1859 LQAQAASQENLEQFRENNFASMKSQMELRIKDLESELSKIKTSQEDFNKTELEKYKQLYLEELKVRKSL--SSKLTKTNE 1936
Cdd:pfam02463 869 LQELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEEEPeeLLLEEADEK 948
|
810
....*....|....*.
gi 1034567240 1937 RLAEVNTKLLVEKQQS 1952
Cdd:pfam02463 949 EKEENNKEEEEERNKR 964
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1083-1630 |
3.34e-12 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 72.28 E-value: 3.34e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1083 LLKIQDAALSCERL---LELKKNHCELLTVKIKKMEDKVNVLQRELSETKEIKSQLEHQKVEWERELCSLRFSLNQEEEK 1159
Cdd:COG1196 231 LLKLRELEAELEELeaeLEELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIARLEER 310
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1160 RRNADTLYEKIREQLRRKEEQYRKEVEVKQQLELSLQTLEMELRTVKSNLNQV-------VQERNDAQRQLSREQNARML 1232
Cdd:COG1196 311 RRELEERLEELEEELAELEEELEELEEELEELEEELEEAEEELEEAEAELAEAeealleaEAELAEAEEELEELAEELLE 390
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1233 QDGILTNHLSKQKEIEMAQKKMNSENSHSHEEEKDLSHKNSMLQEEIAMLRLEIDTIKNQNQEKEkkcfEDLKIVKEKNE 1312
Cdd:COG1196 391 ALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELE----EEEEALLELLA 466
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1313 DLQKTIKQNEETLTQTISQYNGRLSVLTAENAMLNSKLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAf 1392
Cdd:COG1196 467 ELLEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALA- 545
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1393 qrardecSRLQDKMNFDVSNLKDNNEILSQQLFKTESKLNSLEIEfhhtRDALREKTLGLERVQKDLSQTQCQMKEMEQK 1472
Cdd:COG1196 546 -------AALQNIVVEDDEVAAAAIEYLKAAKAGRATFLPLDKIR----ARAALAAALARGAIGAAVDLVASDLREADAR 614
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1473 Y---QNEQVKVNKYIGKQESVEERLSQLQSENMLLRQQLDDAHNKADNKEKTVINIQDQFHAIVQKLQAESEKQS---LL 1546
Cdd:COG1196 615 YyvlGDTLLGRTLVAARLEAALRRAVTLAGRLREVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAeeeLE 694
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1547 LEERNKELISECNHLKERQYQYENEKAEREVVVRQLQQELADTLKKQSMSEASLEVTSRYRINLEDETQDLKKKLGQIRN 1626
Cdd:COG1196 695 LEEALLAEEEEERELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELERLER 774
|
....
gi 1034567240 1627 QLQE 1630
Cdd:COG1196 775 EIEA 778
|
|
| PHA03100 |
PHA03100 |
ankyrin repeat protein; Provisional |
54-210 |
8.47e-12 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 222984 [Multi-domain] Cd Length: 422 Bit Score: 69.69 E-value: 8.47e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 54 ASAGNVAKVQQILLLRKNGLNDRDKMNRTALHLACAN--GHPEVVTLLVDRKCQLNVCDNENRTALMKAVQCQEEKCATI 131
Cdd:PHA03100 80 YNLTDVKEIVKLLLEYGANVNAPDNNGITPLLYAISKksNSYSIVEYLLDNGANVNIKNSDGENLLHLYLESNKIDLKIL 159
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 132 ------------------LLEHGADPNLADVHGNTALHYAVYNEDISVATKLLLYDANIEAKNKDDLTPLLLAVSGKKQQ 193
Cdd:PHA03100 160 kllidkgvdinaknrvnyLLSYGVPINIKDVYGFTPLHYAVYNNNPEFVKYLLDLGANPNLVNKYGDTPLHIAILNNNKE 239
|
170
....*....|....*..
gi 1034567240 194 MVEFLIKKKANVNAVDK 210
Cdd:PHA03100 240 IFKLLLNNGPSIKTIIE 256
|
|
| PHA03095 |
PHA03095 |
ankyrin-like protein; Provisional |
94-212 |
2.68e-11 |
|
ankyrin-like protein; Provisional
Pssm-ID: 222980 [Multi-domain] Cd Length: 471 Bit Score: 68.51 E-value: 2.68e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 94 EVVTLLVDRKCQLNVCDNENRTALMKAVQCQEEKCATI---LLEHGADPNLADVHGNTALHYAVYNED-ISVATKLLLYD 169
Cdd:PHA03095 28 EEVRRLLAAGADVNFRGEYGKTPLHLYLHYSSEKVKDIvrlLLEAGADVNAPERCGFTPLHLYLYNATtLDVIKLLIKAG 107
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 1034567240 170 ANIEAKNKDDLTPLLLAVSGK--KQQMVEFLIKKKANVNAVDKLE 212
Cdd:PHA03095 108 ADVNAKDKVGRTPLHVYLSGFniNPKVIRLLLRKGADVNALDLYG 152
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1157-1772 |
2.89e-11 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 69.20 E-value: 2.89e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1157 EEKRRNADTL---------YEKIREQLRRKEEQYRkeVEVKQQLELSLQTLEMELRTVKSNLNQVVQERNDAQRQLSREQ 1227
Cdd:COG1196 196 GELERQLEPLerqaekaerYRELKEELKELEAELL--LLKLRELEAELEELEAELEELEAELEELEAELAELEAELEELR 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1228 NArmlqdgiltnHLSKQKEIEMAQKKMNSENSHSHEEEKDLSHknsmLQEEIAMLRLEIDTIKNQNQEKEkkcfedlkiv 1307
Cdd:COG1196 274 LE----------LEELELELEEAQAEEYELLAELARLEQDIAR----LEERRRELEERLEELEEELAELE---------- 329
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1308 kEKNEDLQKTIKQNEETLTQtisqyngrlsvltaenamlnskLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRE 1387
Cdd:COG1196 330 -EELEELEEELEELEEELEE----------------------AEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAE 386
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1388 LELAFQRARDECSRLQDKMNFDVSNLKDNNEILSQQLFKTESKLNSLEIEFHHTRDALREKTLGLERVQKDLSQTQCQMK 1467
Cdd:COG1196 387 ELLEALRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLA 466
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1468 EMEQKYQNEQVKVnkyigkQESVEERLSQLQSENMLLRQQLDDAHNKADNKEKTVINIQDQFHAIVQKLQAESEKQSLLL 1547
Cdd:COG1196 467 ELLEEAALLEAAL------AELLEELAEAAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAAL 540
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1548 EERNKELIsecnhlkeRQYQYENEKAEREVVVRQLQQELA-------DTLKKQSMSEASLE--VTSRYRINLEDETQDLK 1618
Cdd:COG1196 541 EAALAAAL--------QNIVVEDDEVAAAAIEYLKAAKAGratflplDKIRARAALAAALArgAIGAAVDLVASDLREAD 612
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1619 KKLGQIRNQLQEAQDRHTEAVRCAEKMQDHKQKLE------------KDNAKLKVTVKKQMDKIEELQKNLLNANLSEDE 1686
Cdd:COG1196 613 ARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRLRevtlegeggsagGSLTGGSRRELLAALLEAEAELEELAERLAEEE 692
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1687 KEQLKKLMELKQSLEcNLDQEMKKNVELEREITGFKNLLKMTRKKLNEYENGEFSFHGDLKTSQFEMDIQINKLKHKIDD 1766
Cdd:COG1196 693 LELEEALLAEEEEER-ELAEAEEERLEEELEEEALEEQLEAEREELLEELLEEEELLEEEALEELPEPPDLEELERELER 771
|
....*.
gi 1034567240 1767 LTAELE 1772
Cdd:COG1196 772 LEREIE 777
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
1018-1933 |
3.31e-11 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 68.92 E-value: 3.31e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1018 EGRPTKKTSNEKNKVKNQIQSMDDVDDLTQSSETASEDCELPHSSYKNFM---------LLIEQLGMECKDSVSLLKIQD 1088
Cdd:TIGR00606 167 EGKALKQKFDEIFSATRYIKALETLRQVRQTQGQKVQEHQMELKYLKQYKekaceirdqITSKEAQLESSREIVKSYENE 246
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1089 AALSCERLLELKKNhcellTVKIKKMEDKVNVLQRELSETKEIKSQLEHQKVEwerELCSLRFSLNQEEEKRRNADTLYE 1168
Cdd:TIGR00606 247 LDPLKNRLKEIEHN-----LSKIMKLDNEIKALKSRKKQMEKDNSELELKMEK---VFQGTDEQLNDLYHNHQRTVREKE 318
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1169 KIREQLRRKEEQYRKEVEvkqqlELSLQTLEMELRTVKSNLNQVVQERNDAQRQLSREQNARMLQDGILTNHLSKQKEI- 1247
Cdd:TIGR00606 319 RELVDCQRELEKLNKERR-----LLNQEKTELLVEQGRLQLQADRHQEHIRARDSLIQSLATRLELDGFERGPFSERQIk 393
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1248 ---EMAQKKMNSENSHSHEEEKDLSHKNSMLQEEIAMLRLEI----DTIKNQNQEKEKKCfEDLKIVKEKNEDLQ---KT 1317
Cdd:TIGR00606 394 nfhTLVIERQEDEAKTAAQLCADLQSKERLKQEQADEIRDEKkglgRTIELKKEILEKKQ-EELKFVIKELQQLEgssDR 472
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1318 IKQNEETLTQTISQYNgrlsvLTAENAMLNSKLENEKqSKERLEAEVESYHSRLAAAIHDRDQSETSKRELEL------- 1390
Cdd:TIGR00606 473 ILELDQELRKAERELS-----KAEKNSLTETLKKEVK-SLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMltkdkmd 546
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1391 AFQRARDECSRLQDKMNFDVS-------------NLKDNNEILSQQLFKTESKLNSLEIEFHHTRDALREKTLGLERVQK 1457
Cdd:TIGR00606 547 KDEQIRKIKSRHSDELTSLLGyfpnkkqledwlhSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYED 626
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1458 DLSQTqCQMKEMEQKYQNEQVKVNK-------YIGKQESVEERLSQLQSEN----------MLLRQQLDDAHNKADNKEK 1520
Cdd:TIGR00606 627 KLFDV-CGSQDEESDLERLKEEIEKsskqramLAGATAVYSQFITQLTDENqsccpvcqrvFQTEAELQEFISDLQSKLR 705
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1521 TVINIQDQFHAIVQKLQAESEKQSLLLEERNKEL---ISECNHLKERQYQYENEKAEREVVVRQLQQELADTLKKQSMSE 1597
Cdd:TIGR00606 706 LAPDKLKSTESELKKKEKRRDEMLGLAPGRQSIIdlkEKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAK 785
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1598 ASL-EVTSRYRinLEDETQDLKKKLGQIRNQLQEAQ-DRHTEAVRcaEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQK 1675
Cdd:TIGR00606 786 VCLtDVTIMER--FQMELKDVERKIAQQAAKLQGSDlDRTVQQVN--QEKQEKQHELDTVVSKIELNRKLIQDQQEQIQH 861
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1676 NLLNANLSEDEKEQLKKLMELKQSLECNLDQEMKKNVELEREITGFKNLLKMTRKKLNEYENGEFSFHGDLKTSQFEMDI 1755
Cdd:TIGR00606 862 LKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKKAQD 941
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1756 QINKLKHKIDDLTAELETAGSKClhLDTKNQILQEELLSMKTVQKKCEKLQKNKKKLEQEVINLR-----SHIERNMVEL 1830
Cdd:TIGR00606 942 KVNDIKEKVKNIHGYMKDIENKI--QDGKDDYLKQKETELNTVNAQLEECEKHQEKINEDMRLMRqdidtQKIQERWLQD 1019
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1831 GQVKQYKQEIEERARQEIAEKLKEVNlflQAQAASQENLEQFRENNFASMKSQMEL---RIKDLESEL--SKIKTSQEDF 1905
Cdd:TIGR00606 1020 NLTLRKRENELKEVEEELKQHLKEMG---QMQVLQMKQEHQKLEENIDLIKRNHVLalgRQKGYEKEIkhFKKELREPQF 1096
|
970 980
....*....|....*....|....*...
gi 1034567240 1906 NKTElEKYKQLYLeELKVRKSLSSKLTK 1933
Cdd:TIGR00606 1097 RDAE-EKYREMMI-VMRTTELVNKDLDI 1122
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
1099-1535 |
3.70e-11 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 68.51 E-value: 3.70e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1099 LKKNHCELLTVKIKKMEDKVNVLQRELSETKEIKSQLEHQKVEWERELCSLRFSLNQEEEKRRNADTLYEKIREQLRRKE 1178
Cdd:TIGR04523 194 NKLLKLELLLSNLKKKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSEKQ 273
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1179 EQYRKEVEVKQQLELSLQTLEMEL------------RTVKSNLNQVVQERNDAQRQLSR-EQNARMLQDGIL-------- 1237
Cdd:TIGR04523 274 KELEQNNKKIKELEKQLNQLKSEIsdlnnqkeqdwnKELKSELKNQEKKLEEIQNQISQnNKIISQLNEQISqlkkeltn 353
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1238 --TNHLSKQKEIEMAQKKMNSENshshEEEKDLSHKNSMLQEEIAMLRLEIDTIKNQNQEKEKKcfedLKIVKEKNEDLQ 1315
Cdd:TIGR04523 354 seSENSEKQRELEEKQNEIEKLK----KENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQ----IKKLQQEKELLE 425
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1316 KTIKQneetLTQTISQYNGRLSVLTAENAMLNSKLENEKQSKERLEAEVESYHSRLAAAIHDRDQSE---TSKRELELAF 1392
Cdd:TIGR04523 426 KEIER----LKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQkelKSKEKELKKL 501
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1393 QRARDECSRLQDKMNFDVSNLKDNNEILSQQLFKTESKLNSLE-----IEFHHTRDALREKTLG-------LERVQKDLS 1460
Cdd:TIGR04523 502 NEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEdelnkDDFELKKENLEKEIDEknkeieeLKQTQKSLK 581
|
410 420 430 440 450 460 470
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034567240 1461 QTQCQMKEMEQKYQNEQVKVNK----YIGKQESVEERLSQLQSENMLLRQQLDDAHNKADNKEKTVINIQDQFHAIVQK 1535
Cdd:TIGR04523 582 KKQEEKQELIDQKEKEKKDLIKeieeKEKKISSLEKELEKAKKENEKLSSIIKNIKSKKNKLKQEVKQIKETIKEIRNK 660
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
1112-1962 |
5.63e-11 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 68.07 E-value: 5.63e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1112 KKMEDKVNVLQRELSETKEIKSQLEHQKVEwERELCSLRFSLNQEEEKRRNADTLYEKIREQLRRKEEQYRKEVEVKQQL 1191
Cdd:pfam02463 177 KLIEETENLAELIIDLEELKLQELKLKEQA-KKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEIES 255
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1192 ELSLQTLEMELRTVKSNLN-------QVVQERNDAQRQLSREQNARMLQDGILTNHLSKQKEIEMAQ-KKMNSENSHSHE 1263
Cdd:pfam02463 256 SKQEIEKEEEKLAQVLKENkeeekekKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKEkKKAEKELKKEKE 335
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1264 EEKDLSHKNSMLQEEIAMLRLEIDTIKNQNQEKEKKCFEDLKIVKEKNEDLQKTIKQNEETLT------QTISQYNGRLS 1337
Cdd:pfam02463 336 EIEELEKELKELEIKREAEEEEEEELEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELElkseeeKEAQLLLELAR 415
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1338 VLTAENAMLNSKLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARDECSRLQDKMNFDVSNLKDNN 1417
Cdd:pfam02463 416 QLEDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKL 495
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1418 EILSQQLFKTESKLNSLEIEFHHTRDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVKVNKyIGKQESVEERLSQL 1497
Cdd:pfam02463 496 EERSQKESKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEV-EERQKLVRALTELP 574
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1498 QSENMLLRQQLDDAHNKADNKEKTVINIQDQfhaiVQKLQAESEKQSLLLEERNKELISECnHLKERQYQYENEKAEREV 1577
Cdd:pfam02463 575 LGARKLRLLIPKLKLPLKSIAVLEIDPILNL----AQLDKATLEADEDDKRAKVVEGILKD-TELTKLKESAKAKESGLR 649
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1578 VVRQLQQELADTLKKQSMSEASLEVTSRYRINLEDETQDLKKKLGQIRNQLQEAQDRHTEavrcaEKMQDHKQKLEKDNA 1657
Cdd:pfam02463 650 KGVSLEEGLAEKSEVKASLSELTKELLEIQELQEKAESELAKEEILRRQLEIKKKEQREK-----EELKKLKLEAEELLA 724
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1658 KLKVTVKKQMDKIEELQKNLLNANLSEDEKEQLKKlMELKQSLECNLDQEMKKNVELEREITGFKNLLKMTRKKLNEYEN 1737
Cdd:pfam02463 725 DRVQEAQDKINEELKLLKQKIDEEEEEEEKSRLKK-EEKEEEKSELSLKEKELAEEREKTEKLKVEEEKEEKLKAQEEEL 803
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1738 GEFSFHGDLKTSQFEMDIQINKLKHKIDDLTAELetagskcLHLDTKNQILQEELLSMKT--VQKKCEKLQKNKKKLEQE 1815
Cdd:pfam02463 804 RALEEELKEEAELLEEEQLLIEQEEKIKEEELEE-------LALELKEEQKLEKLAEEELerLEEEITKEELLQELLLKE 876
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1816 VINLRSHIERNMVELGQVKQYKQEIEERARQEIAEKLKEVNLFLQAQAASQENLEQFREnnfasMKSQMELRIKDLESEL 1895
Cdd:pfam02463 877 EELEEQKLKDELESKEEKEKEEKKELEEESQKLNLLEEKENEIEERIKEEAEILLKYEE-----EPEELLLEEADEKEKE 951
|
810 820 830 840 850 860
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034567240 1896 SKIKTSQEDFNKTELEKYKQLYLEELKVRKSLSSKLTKTNERLAEVNTKLLVEKQQSRSLFTTLTTR 1962
Cdd:pfam02463 952 ENNKEEEEERNKRLLLAKEELGKVNLMAIEEFEEKEERYNKDELEKERLEEEKKKLIRAIIEETCQR 1018
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1302-1874 |
7.89e-11 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 67.40 E-value: 7.89e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1302 EDLKIVKEKNEDLQKTIKQNEETLTQTISqyngRLSVLTAENAMLNSKLENEKQSKERLEAEVEsyhsRLAAAIHDRDQS 1381
Cdd:PRK03918 179 ERLEKFIKRTENIEELIKEKEKELEEVLR----EINEISSELPELREELEKLEKEVKELEELKE----EIEELEKELESL 250
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1382 ETSKRELELAFQRARDECSRLQDKMNFDVSNLKDNNEIlsQQLFKTESKLNSLEIEFhhtRDALREKTLGLERVQKDLSQ 1461
Cdd:PRK03918 251 EGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKEL--KEKAEEYIKLSEFYEEY---LDELREIEKRLSRLEEEING 325
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1462 TQCQMKEMEQKyqneQVKVNKYIGKQESVEERLSQLQSENMLLrqqlDDAHNKADNKEKTVINIQDQfhaIVQKLQAESE 1541
Cdd:PRK03918 326 IEERIKELEEK----EERLEELKKKLKELEKRLEELEERHELY----EEAKAKKEELERLKKRLTGL---TPEKLEKELE 394
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1542 KqsllLEERNKELISECNHLKERQYQYENEKAEREVVVRQLQ----------QELADTLKKQSMSEASLEVT--SRYRIN 1609
Cdd:PRK03918 395 E----LEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKkakgkcpvcgRELTEEHRKELLEEYTAELKriEKELKE 470
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1610 LEDETQDLKKKLGQIRNQLQEaQDRHTEAVRCAEKMQDHKQKLEKDN-------AKLKVTVKKQMDKIEELQKNLlnanl 1682
Cdd:PRK03918 471 IEEKERKLRKELRELEKVLKK-ESELIKLKELAEQLKELEEKLKKYNleelekkAEEYEKLKEKLIKLKGEIKSL----- 544
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1683 sEDEKEQLKKLMELKQSLECNLDQEMKKNVELEREIT--GFKNL--LKMTRKKLNEYENGEFSfhgdLKTSQFEMDIQIN 1758
Cdd:PRK03918 545 -KKELEKLEELKKKLAELEKKLDELEEELAELLKELEelGFESVeeLEERLKELEPFYNEYLE----LKDAEKELEREEK 619
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1759 KLKhkidDLTAELETAGSKCLHLDTKNQILQEEL--LSMKTVQKKCEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQY 1836
Cdd:PRK03918 620 ELK----KLEEELDKAFEELAETEKRLEELRKELeeLEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKT 695
|
570 580 590
....*....|....*....|....*....|....*...
gi 1034567240 1837 KQEIEERaRQEIAEKLKEVNLFLQAQAASQENLEQFRE 1874
Cdd:PRK03918 696 LEKLKEE-LEEREKAKKELEKLEKALERVEELREKVKK 732
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1172-1924 |
8.37e-11 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 67.51 E-value: 8.37e-11
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1172 EQLRRKEEQYRKEVEVKQQLELSLQTLEmelrtvkSNLNQVVQERNDAQRQL---------SREQNARMLQ-----DGIL 1237
Cdd:pfam01576 5 EEMQAKEEELQKVKERQQKAESELKELE-------KKHQQLCEEKNALQEQLqaetelcaeAEEMRARLAArkqelEEIL 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1238 TNHLSKQKEIEMAQKKMNSENSHSHEEEKDLSHKnsMLQEEIAMLRLEIDtiKNQNQEKEKKCFEDLKIVKEKNEDLQKT 1317
Cdd:pfam01576 78 HELESRLEEEEERSQQLQNEKKKMQQHIQDLEEQ--LDEEEAARQKLQLE--KVTTEAKIKKLEEDILLLEDQNSKLSKE 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1318 IKQNEEtltqtisqyngRLSVLTAENAMLNSKLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARD 1397
Cdd:pfam01576 154 RKLLEE-----------RISEFTSNLAEEEEKAKSLSKLKNKHEAMISDLEERLKKEEKGRQELEKAKRKLEGESTDLQE 222
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1398 ECSRLQDKMnfdvsnlkdnnEILSQQLFKTESKLNSLEiefhhtrDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQ 1477
Cdd:pfam01576 223 QIAELQAQI-----------AELRAQLAKKEEELQAAL-------ARLEEETAQKNNALKKIRELEAQISELQEDLESER 284
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1478 VKVNKYIGKQESVEERLSQLQSEnmlLRQQLDDAHNKAD---NKEKTVIN----IQDQFHAIVQKLQAESEKQSLLLEER 1550
Cdd:pfam01576 285 AARNKAEKQRRDLGEELEALKTE---LEDTLDTTAAQQElrsKREQEVTElkkaLEEETRSHEAQLQEMRQKHTQALEEL 361
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1551 NKELisecNHLKERQYQYENEKAEREVVVRQLQQELadtlkkQSMSEASLEVTSRyRINLEDETQDLKKKLGQIRNQLQE 1630
Cdd:pfam01576 362 TEQL----EQAKRNKANLEKAKQALESENAELQAEL------RTLQQAKQDSEHK-RKKLEGQLQELQARLSESERQRAE 430
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1631 AQDRHTEAVRCAEKMQDHKQKLEKDNAKLK---VTVKKQMDKIEELQKNLLNANLSEDEKeqLKKLMELKQSLECNLDQE 1707
Cdd:pfam01576 431 LAEKLSKLQSELESVSSLLNEAEGKNIKLSkdvSSLESQLQDTQELLQEETRQKLNLSTR--LRQLEDERNSLQEQLEEE 508
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1708 MKKNVELEREITGFKNLLKMTRKKLNEYENGEFSFHGDLKtsqfemdiqinKLKHKIDDLTAELETAGSKCLHLDTKNQI 1787
Cdd:pfam01576 509 EEAKRNVERQLSTLQAQLSDMKKKLEEDAGTLEALEEGKK-----------RLQRELEALTQQLEEKAAAYDKLEKTKNR 577
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1788 LQEEL--LSMKTVQKK--CEKLQKNKKKLEQEVINlrshiERNMVelgqvkqyKQEIEERARQEIAEKLKEVNlFLQAQA 1863
Cdd:pfam01576 578 LQQELddLLVDLDHQRqlVSNLEKKQKKFDQMLAE-----EKAIS--------ARYAEERDRAEAEAREKETR-ALSLAR 643
|
730 740 750 760 770 780
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034567240 1864 ASQENLEQfrennfasmKSQMELRIKDLESELSKIKTSQEDFNKT--ELEKYKQLY---LEELKVR 1924
Cdd:pfam01576 644 ALEEALEA---------KEELERTNKQLRAEMEDLVSSKDDVGKNvhELERSKRALeqqVEEMKTQ 700
|
|
| PHA02878 |
PHA02878 |
ankyrin repeat protein; Provisional |
75-209 |
1.04e-10 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 222939 [Multi-domain] Cd Length: 477 Bit Score: 66.44 E-value: 1.04e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 75 DRDKMNrTALHLACANGHPEVVTLLVDRKCQLNVCDNENRTALMKAVQCQEEKCATILLEHGADPNLADVHGNTALHYAV 154
Cdd:PHA02878 164 DRHKGN-TALHYATENKDQRLTELLLSYGANVNIPDKTNNSPLHHAVKHYNKPIVHILLENGASTDARDKCGNTPLHISV 242
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*..
gi 1034567240 155 -YNEDISVATKLLLYDANIEAKNK-DDLTPLLLAVsgKKQQMVEFLIKKKANVNAVD 209
Cdd:PHA02878 243 gYCKDYDILKLLLEHGVDVNAKSYiLGLTALHSSI--KSERKLKLLLEYGADINSLN 297
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
1170-1849 |
1.06e-10 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 67.05 E-value: 1.06e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1170 IREQLRRKEEQYRKEVEVKQQLELSLQTLEMELRTVKSNLNQVVQERNDaqrqLSREQNARMLQDGILTNHLSKQKEiem 1249
Cdd:pfam05483 97 IEAELKQKENKLQENRKIIEAQRKAIQELQFENEKVSLKLEEEIQENKD----LIKENNATRHLCNLLKETCARSAE--- 169
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1250 AQKKMNSENSHSHEEEKDLSHKNSMLQEEIAMLRLEIDtikNQNQEKEKKCFEDLKIVKEKNEDLQKTIKQNEETLTQTI 1329
Cdd:pfam05483 170 KTKKYEYEREETRQVYMDLNNNIEKMILAFEELRVQAE---NARLEMHFKLKEDHEKIQHLEEEYKKEINDKEKQVSLLL 246
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1330 SQYNGRlsvltaENAM--LNSKLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARDECSRLQDKMN 1407
Cdd:pfam05483 247 IQITEK------ENKMkdLTFLLEESRDKANQLEEKTKLQDENLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQ 320
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1408 F---DVSNLKDNNEILSQQLFKTESKLNSLEIEFHHTRDALrEKTLGLERVQKDLSQTQCQMKEME-QKYQNEQVKVNKY 1483
Cdd:pfam05483 321 IatkTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSL-EELLRTEQQRLEKNEDQLKIITMElQKKSSELEEMTKF 399
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1484 IGKQESVEERLSQLQSENMLLRQQLDDAHNKADN--------------KEKTVINIQDQFHAI----------VQKLQAE 1539
Cdd:pfam05483 400 KNNKEVELEELKKILAEDEKLLDEKKQFEKIAEElkgkeqelifllqaREKEIHDLEIQLTAIktseehylkeVEDLKTE 479
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1540 SEKQSLlleeRNKELISECNHLKERQYQYENEKAEREVVVRQLQQELADTLKKQSMSEASLEVTSRYRINLEDETQDLKK 1619
Cdd:pfam05483 480 LEKEKL----KNIELTAHCDKLLLENKELTQEASDMTLELKKHQEDIINCKKQEERMLKQIENLEEKEMNLRDELESVRE 555
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1620 KLGQIRNQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQmdkIEELQKNLlnANLSEDEKEQLKKLMELKQS 1699
Cdd:pfam05483 556 EFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKCNNLKKQ---IENKNKNI--EELHQENKALKKKGSAENKQ 630
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1700 LECNLDQEMKKNVELEREITGFKNLLKMTRKKLNEYENGEFSFHGDLKTSQFEMDIQIN-------KLKHKIDDLTAELE 1772
Cdd:pfam05483 631 LNAYEIKVNKLELELASAKQKFEEIIDNYQKEIEDKKISEEKLLEEVEKAKAIADEAVKlqkeidkRCQHKIAEMVALME 710
|
650 660 670 680 690 700 710
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034567240 1773 TagskclHLDTKNQILQEELLSMKTVQKKCEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQYKQEIEERARQEIA 1849
Cdd:pfam05483 711 K------HKHQYDKIIEERDSELGLYKNKEQEQSSAKAALEIELSNIKAELLSLKKQLEIEKEEKEKLKMEAKENTA 781
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
1070-1633 |
1.17e-10 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 67.07 E-value: 1.17e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1070 IEQLGMECKDSVSLLKIQDAALSCERLLELKKNHCELLTVKIKKMEDKVNVLQRELSETKEIKS----QLEHQKVEWERE 1145
Cdd:pfam15921 278 VEITGLTEKASSARSQANSIQSQLEIIQEQARNQNSMYMRQLSDLESTVSQLRSELREAKRMYEdkieELEKQLVLANSE 357
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1146 LCSLRFSLNQEEEKRRNADTLYEKIREQLRRKEEQYRKEVEVKQQL---------------------ELSLQTLEMELRT 1204
Cdd:pfam15921 358 LTEARTERDQFSQESGNLDDQLQKLLADLHKREKELSLEKEQNKRLwdrdtgnsitidhlrrelddrNMEVQRLEALLKA 437
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1205 VKSNlnqvVQERNDAQRQLSREQNARMLQDGILTNHLSKQKEI------EMAQKKMNSENShsheeEKDLSHKNSMLQE- 1277
Cdd:pfam15921 438 MKSE----CQGQMERQMAAIQGKNESLEKVSSLTAQLESTKEMlrkvveELTAKKMTLESS-----ERTVSDLTASLQEk 508
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1278 ---------EIAMLRLEIDT-------IKNQN---QEKEKKCfEDLKIVKEKNEDLQKTIKQNEETLTQTISQYNGRLSV 1338
Cdd:pfam15921 509 eraieatnaEITKLRSRVDLklqelqhLKNEGdhlRNVQTEC-EALKLQMAEKDKVIEILRQQIENMTQLVGQHGRTAGA 587
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1339 LTAENAMLNSK-------------LENEKQSKER-LEAEV---ESYHSRLAAAIHDRDQS----ETSKRELELAFQRARD 1397
Cdd:pfam15921 588 MQVEKAQLEKEindrrlelqefkiLKDKKDAKIReLEARVsdlELEKVKLVNAGSERLRAvkdiKQERDQLLNEVKTSRN 667
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1398 ECSRLQDKMNFDVSNLKDNNEILSQQLFKTESKLNSLEIEFHHTRDALR----------------EKTLGLERVQKDLSQ 1461
Cdd:pfam15921 668 ELNSLSEDYEVLKRNFRNKSEEMETTTNKLKMQLKSAQSELEQTRNTLKsmegsdghamkvamgmQKQITAKRGQIDALQ 747
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1462 TQCQMKEMEQKYQNEQV-----KVNKYIGKQESVEERLSQLQSENMLLRQQLDDAHNKADNKEKTVINIQDQF---HAIV 1533
Cdd:pfam15921 748 SKIQFLEEAMTNANKEKhflkeEKNKLSQELSTVATEKNKMAGELEVLRSQERRLKEKVANMEVALDKASLQFaecQDII 827
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1534 QKLQAESEKQSLLLEERNKEL----ISECNHLKERQYQYENEKAEREVVVRQlqQELADTLKKQSMSEASLEvtsryrin 1609
Cdd:pfam15921 828 QRQEQESVRLKLQHTLDVKELqgpgYTSNSSMKPRLLQPASFTRTHSNVPSS--QSTASFLSHHSRKTNALK-------- 897
|
650 660
....*....|....*....|....
gi 1034567240 1610 lEDETQDLKKKLGQIRNQLQEAQD 1633
Cdd:pfam15921 898 -EDPTRDLKQLLQELRSVINEEPT 920
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1175-1598 |
1.21e-10 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 67.02 E-value: 1.21e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1175 RRKEEQYRKEVEVKQQLELSLQTLEMELRTVKSNLNQVVQERNDAQRQLSReqnarmlqdgiltnhlsKQKEIEMAQKKM 1254
Cdd:TIGR02169 670 RSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGE-----------------IEKEIEQLEQEE 732
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1255 NSENshshEEEKDLSHKNSMLQEEIAMLRLEIDTIKNQNQEKEkkcfEDLKIVKEKNEDLqktikqneetltqtisqyng 1334
Cdd:TIGR02169 733 EKLK----ERLEELEEDLSSLEQEIENVKSELKELEARIEELE----EDLHKLEEALNDL-------------------- 784
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1335 rlsvltaENAMLNSKLENEKQSKErleaEVESYHSRLAAAIHDRDQsETSKRELELAFqrARDECSRLQDKMNfdvsNLK 1414
Cdd:TIGR02169 785 -------EARLSHSRIPEIQAELS----KLEEEVSRIEARLREIEQ-KLNRLTLEKEY--LEKEIQELQEQRI----DLK 846
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1415 DNNEILSQQLFKTESKLNSLEIEFHHTRDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVKVNKYIGKQESVEERL 1494
Cdd:TIGR02169 847 EQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKL 926
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1495 SQLQSENmllrQQLDDAhNKADNKEKTVINIQDQFHAIVQKLQAESEKqsllLEERNKELISECNHLKERQYQYENEKAE 1574
Cdd:TIGR02169 927 EALEEEL----SEIEDP-KGEDEEIPEEELSLEDVQAELQRVEEEIRA----LEPVNMLAIQEYEEVLKRLDELKEKRAK 997
|
410 420
....*....|....*....|....*.
gi 1034567240 1575 REVVVRQLQQELA--DTLKKQSMSEA 1598
Cdd:TIGR02169 998 LEEERKAILERIEeyEKKKREVFMEA 1023
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1158-1726 |
1.65e-10 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 66.63 E-value: 1.65e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1158 EKRRNADTLYEKIREQLRRKEEQYRKEVEVKQQLELSLQTLEMELRTVKSNLNQVVQERNDAQRQLSR-EQNARMLQdgi 1236
Cdd:PRK03918 158 DDYENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKlEKEVKELE--- 234
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1237 ltnhlSKQKEIEMAQKKMNSENSHSHEEEKDLSHknsmLQEEIAMLRLEIDTIKNQNQE-----KEKKCFEDLKIVKEKN 1311
Cdd:PRK03918 235 -----ELKEEIEELEKELESLEGSKRKLEEKIRE----LEERIEELKKEIEELEEKVKElkelkEKAEEYIKLSEFYEEY 305
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1312 EDLQKTIKQNEETLTQTISQYNGRLSVLTAENAMLNSKLENEKQSKERLE----------------AEVESYHSRLAA-- 1373
Cdd:PRK03918 306 LDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEeleerhelyeeakakkEELERLKKRLTGlt 385
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1374 ---AIHDRDQSETSKRELELAFQRARDECSRLQDKmnfdVSNLKDNNEILSQQlfKTESKLNSLEIEFHHTRDALREKTL 1450
Cdd:PRK03918 386 pekLEKELEELEKAKEEIEEEISKITARIGELKKE----IKELKKAIEELKKA--KGKCPVCGRELTEEHRKELLEEYTA 459
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1451 GLERVQKDLSQTqcqmKEMEQKYQNEQVKVNKYIGKQ-------------ESVEERLSQLQSENM--------LLRQQLD 1509
Cdd:PRK03918 460 ELKRIEKELKEI----EEKERKLRKELRELEKVLKKEseliklkelaeqlKELEEKLKKYNLEELekkaeeyeKLKEKLI 535
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1510 DAHNKADNKEKTVINIQD---QFHAIVQKLQAESEKQSLLLEERNKELISECNHLKERQYQYEnEKAEREVVVRQLQQEL 1586
Cdd:PRK03918 536 KLKGEIKSLKKELEKLEElkkKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELE-PFYNEYLELKDAEKEL 614
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1587 ADTLKKQSMSEASLEVTsryrinlEDETQDLKKKLGQIRNQLQEAQDRHTEAVRcaEKMQDHKQKLEKDNAKLKVTVKKQ 1666
Cdd:PRK03918 615 EREEKELKKLEEELDKA-------FEELAETEKRLEELRKELEELEKKYSEEEY--EELREEYLELSRELAGLRAELEEL 685
|
570 580 590 600 610 620
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1667 MDKIEELQKNLlnanlsEDEKEQLKKLMELKQSLEcNLDQEMKKNVELEREITGFKNLLK 1726
Cdd:PRK03918 686 EKRREEIKKTL------EKLKEELEEREKAKKELE-KLEKALERVEELREKVKKYKALLK 738
|
|
| PLN03192 |
PLN03192 |
Voltage-dependent potassium channel; Provisional |
66-280 |
1.85e-10 |
|
Voltage-dependent potassium channel; Provisional
Pssm-ID: 215625 [Multi-domain] Cd Length: 823 Bit Score: 66.43 E-value: 1.85e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 66 LLLRKNGLNDRDKMNRTALHLAcANGHPEVVTLLVDRKCQLNVCDNENRTALMKAVQCQEEKCATILLEHGADPNLADVH 145
Cdd:PLN03192 512 LLGDNGGEHDDPNMASNLLTVA-STGNAALLEELLKAKLDPDIGDSKGRTPLHIAASKGYEDCVLVLLKHACNVHIRDAN 590
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 146 GNTAL-------HYAVYN------------------------EDISVATKLLLYDANIEAKNKDDLTPLLLAVSGKKQQM 194
Cdd:PLN03192 591 GNTALwnaisakHHKIFRilyhfasisdphaagdllctaakrNDLTAMKELLKQGLNVDSEDHQGATALQVAMAEDHVDM 670
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 195 VEFLIKKKANVNAVDKLES-SHQLISEYKEERIPKHSSQNSNSVDESSEDSLSRLSGKPGVDDSwpTSDDEdlnfdtKNV 273
Cdd:PLN03192 671 VRLLIMNGADVDKANTDDDfSPTELRELLQKRELGHSITIVDSVPADEPDLGRDGGSRPGRLQG--TSSDN------QCR 742
|
....*..
gi 1034567240 274 PKPSLAK 280
Cdd:PLN03192 743 PRVSIYK 749
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
1106-1726 |
2.13e-10 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 66.40 E-value: 2.13e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1106 LLTVKIKKMEDKVNVLQRELSET-KEIKSQLEHQKVEWERELCSLRFSLNQEEEKRRNADTLYEKIREQLRRKE----EQ 1180
Cdd:pfam12128 262 HLHFGYKSDETLIASRQEERQETsAELNQLLRTLDDQWKEKRDELNGELSAADAAVAKDRSELEALEDQHGAFLdadiET 341
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1181 YRKEVEVKQQLELSLQTLEMELRTVKSNLNQVvQERNDAQRQLSREQNARMLQDgiLTNHLSKQKEIEMAQKKmnsensh 1260
Cdd:pfam12128 342 AAADQEQLPSWQSELENLEERLKALTGKHQDV-TAKYNRRRSKIKEQNNRDIAG--IKDKLAKIREARDRQLA------- 411
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1261 shEEEKDLSHKNSMLQEEIAMLRLEIdtikNQNQEKEKKCFEDLKIvkeknedLQKTIKQNEETLTQTisqyngrlsvlt 1340
Cdd:pfam12128 412 --VAEDDLQALESELREQLEAGKLEF----NEEEYRLKSRLGELKL-------RLNQATATPELLLQL------------ 466
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1341 aenAMLNSKLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARDECSRLQDK-----------MNFD 1409
Cdd:pfam12128 467 ---ENFDERIERAREEQEAANAEVERLQSELRQARKRRDQASEALRQASRRLEERQSALDELELQlfpqagtllhfLRKE 543
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1410 VSNLKDN--NEILSQQLFKT-------------ESKLNSLEI--------EFHHTRDALREKtlgLERVQKDLSQTQCQM 1466
Cdd:pfam12128 544 APDWEQSigKVISPELLHRTdldpevwdgsvggELNLYGVKLdlkridvpEWAASEEELRER---LDKAEEALQSAREKQ 620
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1467 KEMEQkyqnEQVKVNKYIGKQESVEERLSQLQSENMLLRQQLDDAH-NKADNKEKTVINIQDQFHAIVQKLQAESE---- 1541
Cdd:pfam12128 621 AAAEE----QLVQANGELEKASREETFARTALKNARLDLRRLFDEKqSEKDKKNKALAERKDSANERLNSLEAQLKqldk 696
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1542 KQSLLLEERNKELISecnHLKERQYQYENEKAEREVVVRQLQQELA----------DTLKKQSMSE-ASLEVTSRYRINL 1610
Cdd:pfam12128 697 KHQAWLEEQKEQKRE---ARTEKQAYWQVVEGALDAQLALLKAAIAarrsgakaelKALETWYKRDlASLGVDPDVIAKL 773
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1611 EDETQDLKKKLGQIrnqlqeAQDRHteAVRCAEKMQDHKQKLEKDNakLKVTVKKQMDKIEELQKNLlnANLSEDEKEQL 1690
Cdd:pfam12128 774 KREIRTLERKIERI------AVRRQ--EVLRYFDWYQETWLQRRPR--LATQLSNIERAISELQQQL--ARLIADTKLRR 841
|
650 660 670
....*....|....*....|....*....|....*.
gi 1034567240 1691 KKLMELKQSLEcnldqemKKNVELEREITGFKNLLK 1726
Cdd:pfam12128 842 AKLEMERKASE-------KQQVRLSENLRGLRCEMS 870
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
1265-1955 |
4.91e-10 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 64.74 E-value: 4.91e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1265 EKDLSHKNSMLQEEIAMLRLEIDTIKNQNQEKEKKCFEDLKIVKEkNEDLqktIKQNEETltqtiSQYNGRLSVLTAENA 1344
Cdd:pfam05483 98 EAELKQKENKLQENRKIIEAQRKAIQELQFENEKVSLKLEEEIQE-NKDL---IKENNAT-----RHLCNLLKETCARSA 168
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1345 MLNSKLENEKQSKERLEAEVESYHSRLAAAIHD-RDQSETSKRELELAFQRARDECSRLQDKMNFDVSNLKDNNEILSQQ 1423
Cdd:pfam05483 169 EKTKKYEYEREETRQVYMDLNNNIEKMILAFEElRVQAENARLEMHFKLKEDHEKIQHLEEEYKKEINDKEKQVSLLLIQ 248
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1424 LFKTESKLNSLEIEFHHTRDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQlqsenml 1503
Cdd:pfam05483 249 ITEKENKMKDLTFLLEESRDKANQLEEKTKLQDENLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQI------- 321
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1504 lrqqlddahnkadnKEKTVINIQDQFHAIVQKLQAESEKQSLLLEERNKELISECNHLKERQYQYENEKAEREVVVRQLQ 1583
Cdd:pfam05483 322 --------------ATKTICQLTEEKEAQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMELQ 387
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1584 QEladtlkkqsmsEASLEVTSRYRINLEDETQDLKKKLGQIRNQLQEAQdrhtEAVRCAEKMQDHKQKL---------EK 1654
Cdd:pfam05483 388 KK-----------SSELEEMTKFKNNKEVELEELKKILAEDEKLLDEKK----QFEKIAEELKGKEQELifllqarekEI 452
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1655 DNAKLKVTVKKQMDK-----IEELQKNLLNANLSEDEKEQLKKLMELKQSlecNLDQEMKKNV-ELEREITGFKNLLKMT 1728
Cdd:pfam05483 453 HDLEIQLTAIKTSEEhylkeVEDLKTELEKEKLKNIELTAHCDKLLLENK---ELTQEASDMTlELKKHQEDIINCKKQE 529
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1729 RKKLNEYEN---GEFSFHGDLKTSQFEMDIQINKLKHKIDDLTAELETAGSKCLHLDTKNQILQEELLSMKtvqKKCEKL 1805
Cdd:pfam05483 530 ERMLKQIENleeKEMNLRDELESVREEFIQKGDEVKCKLDKSEENARSIEYEVLKKEKQMKILENKCNNLK---KQIENK 606
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1806 QKNKKKLEQE--VINLRSHIERNMVELGQVKQYKQEIE-ERARQEIAEKLKEVNLFLQAQAASQENL----EQFRENNFA 1878
Cdd:pfam05483 607 NKNIEELHQEnkALKKKGSAENKQLNAYEIKVNKLELElASAKQKFEEIIDNYQKEIEDKKISEEKLleevEKAKAIADE 686
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1879 SMKSQMEL------RIKDLESELSKIKTSQEDF---NKTELEKYKQLYLEELKVRKSLSSKLTKTNERLAEVNTKLLVEK 1949
Cdd:pfam05483 687 AVKLQKEIdkrcqhKIAEMVALMEKHKHQYDKIieeRDSELGLYKNKEQEQSSAKAALEIELSNIKAELLSLKKQLEIEK 766
|
....*.
gi 1034567240 1950 QQSRSL 1955
Cdd:pfam05483 767 EEKEKL 772
|
|
| SMC_N |
pfam02463 |
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The ... |
885-1755 |
6.90e-10 |
|
RecF/RecN/SMC N terminal domain; This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Pssm-ID: 426784 [Multi-domain] Cd Length: 1161 Bit Score: 64.61 E-value: 6.90e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 885 MLKVEFQALEKEKVQLQKEVEEERKKHRNNEMEVSANIHDGATDDAEDDDDDDGLIQKRKSgETDHQQFPRKENKEYASS 964
Cdd:pfam02463 202 LKEQAKKALEYYQLKEKLELEEEYLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQEIEK-EEEKLAQVLKENKEEEKE 280
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 965 GPALQMKEVKSTEKEKRTSKEsvnspvfgkasLLTGGLLQVDDDSSLSEIDEDEGRPTKKTSNEKNKVKNQIQSMDDVDD 1044
Cdd:pfam02463 281 KKLQEEELKLLAKEEEELKSE-----------LLKLERRKVDDEEKLKESEKEKKKAEKELKKEKEEIEELEKELKELEI 349
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1045 LTQSSETASEDCELphssyknfmLLIEQLGMECKDSVSLLKIQDAALSCERLLELKKnhcELLTVKIKKMEDKVNVLQRE 1124
Cdd:pfam02463 350 KREAEEEEEEELEK---------LQEKLEQLEEELLAKKKLESERLSSAAKLKEEEL---ELKSEEEKEAQLLLELARQL 417
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1125 LSETKEIKSQLEHQKVEWERELCSLRFSLNQEEEKRRNADTLYEKIREQLRRKEEQYRKEVEVKQQLELSLQTLEMELRT 1204
Cdd:pfam02463 418 EDLLKEEKKEELEILEEEEESIELKQGKLTEEKEELEKQELKLLKDELELKKSEDLLKETQLVKLQEQLELLLSRQKLEE 497
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1205 VKSNLnQVVQERNDAQRQLSREQNARMLQDGILTNHLSKQKEIEMAQKKMNSENSHSHEEEKDLSHKNSMLQEEIAMLRL 1284
Cdd:pfam02463 498 RSQKE-SKARSGLKVLLALIKDGVGGRIISAHGRLGDLGVAVENYKVAISTAVIVEVSATADEVEERQKLVRALTELPLG 576
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1285 EIDTIknqnQEKEKKCFEDLKIVKEKNEDLQKTIKQNEETLTqtisqyngrlsvltaenAMLNSKLENEKQSKERLEAEV 1364
Cdd:pfam02463 577 ARKLR----LLIPKLKLPLKSIAVLEIDPILNLAQLDKATLE-----------------ADEDDKRAKVVEGILKDTELT 635
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1365 ESYHSRLAAAIHDRDQSETSKRELELAFQRARDECSRLQDKmnfdvsNLKDNNEILSQQLFKTESKLNSLEIEFHHTRDA 1444
Cdd:pfam02463 636 KLKESAKAKESGLRKGVSLEEGLAEKSEVKASLSELTKELL------EIQELQEKAESELAKEEILRRQLEIKKKEQREK 709
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1445 LREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVKVNKYIGK-QESVEERLSQLQSENMLLRQQLDDAHNKADNKEKTVI 1523
Cdd:pfam02463 710 EELKKLKLEAEELLADRVQEAQDKINEELKLLKQKIDEEEEEeEKSRLKKEEKEEEKSELSLKEKELAEEREKTEKLKVE 789
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1524 NIQDQFHAIVQKLQAESEKQSLLLEERNKELISECNHLKERQYQYENEKAEREVVVRQLQQELADTLKKQSMSEA----- 1598
Cdd:pfam02463 790 EEKEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKLAEEELERLEEEITkeell 869
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1599 SLEVTSRYRINLEDETQDLKKKLGQIRNQLQEAQDRHTEAV-------RCAEKMQDHKQKLEKDNAKLK------VTVKK 1665
Cdd:pfam02463 870 QELLLKEEELEEQKLKDELESKEEKEKEEKKELEEESQKLNlleekenEIEERIKEEAEILLKYEEEPEellleeADEKE 949
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1666 QMDKIEELQKNLLNANLSEDEKEQLKKLMELKQSLECN---LDQEMKKNVELEREITGFKNLLKMTRKKLNEYENGEFSF 1742
Cdd:pfam02463 950 KEENNKEEEEERNKRLLLAKEELGKVNLMAIEEFEEKEeryNKDELEKERLEEEKKKLIRAIIEETCQRLKEFLELFVSI 1029
|
890
....*....|...
gi 1034567240 1743 HGDLKTSQFEMDI 1755
Cdd:pfam02463 1030 NKGWNKVFFYLEL 1042
|
|
| PHA02876 |
PHA02876 |
ankyrin repeat protein; Provisional |
39-214 |
7.96e-10 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165207 [Multi-domain] Cd Length: 682 Bit Score: 63.93 E-value: 7.96e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 39 GYHVRDRDLGK---IHKAASAGNVAKVQQILLLRKNGLNDRDKMNRTALHLACANGH-PEVVTLLVDRKCQLNVCDNENR 114
Cdd:PHA02876 263 GFSVNSIDDCKntpLHHASQAPSLSRLVPKLLERGADVNAKNIKGETPLYLMAKNGYdTENIRTLIMLGADVNAADRLYI 342
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 115 TALMKAVQCQEEKCATI-LLEHGADPNLADVHGNTALHYAVYNEDISVATKLLLYDANIEAKNKDDLTPLLLAVSGKKQQ 193
Cdd:PHA02876 343 TPLHQASTLDRNKDIVItLLELGANVNARDYCDKTPIHYAAVRNNVVIINTLLDYGADIEALSQKIGTALHFALCGTNPY 422
|
170 180
....*....|....*....|..
gi 1034567240 194 M-VEFLIKKKANVNAVDKLESS 214
Cdd:PHA02876 423 MsVKTLIDRGANVNSKNKDLST 444
|
|
| PHA02878 |
PHA02878 |
ankyrin repeat protein; Provisional |
125-210 |
1.15e-09 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 222939 [Multi-domain] Cd Length: 477 Bit Score: 62.98 E-value: 1.15e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 125 EEKCATILLEHGADPNLADVH-GNTALHYAVYNEDISVATKLLLYDANIEAKNKDDLTPLLLAVSGKKQQMVEFLIKKKA 203
Cdd:PHA02878 146 EAEITKLLLSYGADINMKDRHkGNTALHYATENKDQRLTELLLSYGANVNIPDKTNNSPLHHAVKHYNKPIVHILLENGA 225
|
....*..
gi 1034567240 204 NVNAVDK 210
Cdd:PHA02878 226 STDARDK 232
|
|
| PHA03095 |
PHA03095 |
ankyrin-like protein; Provisional |
55-179 |
1.88e-09 |
|
ankyrin-like protein; Provisional
Pssm-ID: 222980 [Multi-domain] Cd Length: 471 Bit Score: 62.35 E-value: 1.88e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 55 SAGNVAKVqqILLLRKNG--LNDRDKMNRTALHL-ACANGHPEVVTLLVDRKCQLNVCDNENRTALMK--AVQCQEEKCA 129
Cdd:PHA03095 58 SSEKVKDI--VRLLLEAGadVNAPERCGFTPLHLyLYNATTLDVIKLLIKAGADVNAKDKVGRTPLHVylSGFNINPKVI 135
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 1034567240 130 TILLEHGADPNLADVHGNTALHYAVYNEDISVATKLLLYDANIEAKNKDD 179
Cdd:PHA03095 136 RLLLRKGADVNALDLYGMTPLAVLLKSRNANVELLRLLIDAGADVYAVDD 185
|
|
| PHA03100 |
PHA03100 |
ankyrin repeat protein; Provisional |
50-214 |
3.02e-09 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 222984 [Multi-domain] Cd Length: 422 Bit Score: 61.60 E-value: 3.02e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 50 IHKAASAGNVAKVQQILllrKNG--LNDRDKMNRTALHLACANGH-----PEVVTLLVDRKCQLNVCDNENRTALMKAVQ 122
Cdd:PHA03100 39 LYLAKEARNIDVVKILL---DNGadINSSTKNNSTPLHYLSNIKYnltdvKEIVKLLLEYGANVNAPDNNGITPLLYAIS 115
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 123 CQEEKCATI--LLEHGADPNLADVHGNTALHYAV---------------YNEDISVATK---LLLYDANIEAKNKDDLTP 182
Cdd:PHA03100 116 KKSNSYSIVeyLLDNGANVNIKNSDGENLLHLYLesnkidlkilkllidKGVDINAKNRvnyLLSYGVPINIKDVYGFTP 195
|
170 180 190
....*....|....*....|....*....|..
gi 1034567240 183 LLLAVSGKKQQMVEFLIKKKANVNAVDKLESS 214
Cdd:PHA03100 196 LHYAVYNNNPEFVKYLLDLGANPNLVNKYGDT 227
|
|
| PHA02876 |
PHA02876 |
ankyrin repeat protein; Provisional |
50-211 |
3.33e-09 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165207 [Multi-domain] Cd Length: 682 Bit Score: 62.00 E-value: 3.33e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 50 IHKAASAGNvAKVQQILLLRKNGLNDRDKMNRTALHLACANGHPEVVTLLVDRKCQLNvcdnENRTALMKAVQCQEEKCA 129
Cdd:PHA02876 182 IHYAAERGN-AKMVNLLLSYGADVNIIALDDLSVLECAVDSKNIDTIKAIIDNRSNIN----KNDLSLLKAIRNEDLETS 256
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 130 TILLEHGADPNLADVHGNTALHYAVYNEDIS-VATKLLLYDANIEAKNKDDLTPL-LLAVSGKKQQMVEFLIKKKANVNA 207
Cdd:PHA02876 257 LLLYDAGFSVNSIDDCKNTPLHHASQAPSLSrLVPKLLERGADVNAKNIKGETPLyLMAKNGYDTENIRTLIMLGADVNA 336
|
....
gi 1034567240 208 VDKL 211
Cdd:PHA02876 337 ADRL 340
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1385-1941 |
3.40e-09 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 62.39 E-value: 3.40e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1385 KRELELAFQRARDECSRLQDkmnfdVSNLKDNNEilsQQLFKTESKLNSLEIEfhhtRDALREKTLGLERVQKDLsqtqc 1464
Cdd:PRK03918 171 IKEIKRRIERLEKFIKRTEN-----IEELIKEKE---KELEEVLREINEISSE----LPELREELEKLEKEVKEL----- 233
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1465 qmKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQSENMLLRQQLDDAHNKAdnKEKTVINIQDQFHAIVQKLQAESEKQS 1544
Cdd:PRK03918 234 --EELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKV--KELKELKEKAEEYIKLSEFYEEYLDEL 309
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1545 LLLEERNKELISECNHLKERQYQYENEKAErevvVRQLQQELADTLKKQSMSEASLEVTSRYRiNLEDETQDLKKKLGQi 1624
Cdd:PRK03918 310 REIEKRLSRLEEEINGIEERIKELEEKEER----LEELKKKLKELEKRLEELEERHELYEEAK-AKKEELERLKKRLTG- 383
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1625 rNQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKN-----LLNANLSEDEKEQL--------K 1691
Cdd:PRK03918 384 -LTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAkgkcpVCGRELTEEHRKELleeytaelK 462
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1692 KLMELKQSLECNLDQEMKKNVELEREITGFKNLLKMtRKKLNEYENGEFSFHGdlktsqfemdIQINKLKHKiddlTAEL 1771
Cdd:PRK03918 463 RIEKELKEIEEKERKLRKELRELEKVLKKESELIKL-KELAEQLKELEEKLKK----------YNLEELEKK----AEEY 527
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1772 ETAGSKCLHLDTKNQILQEELLSMKTVQKKCEKLQKNKKKLEQEVINLRSHIERNMVE-LGQVKQYKQEIEERARQEIae 1850
Cdd:PRK03918 528 EKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFEsVEELEERLKELEPFYNEYL-- 605
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1851 KLKEVNLFLQAQAASQENLEQFRENNFAsmksQMELRIKDLESELSKIKTSQEDFNKTELEKYKQLYLEELKVRKSLSSK 1930
Cdd:PRK03918 606 ELKDAEKELEREEKELKKLEEELDKAFE----ELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAE 681
|
570
....*....|.
gi 1034567240 1931 LTKTNERLAEV 1941
Cdd:PRK03918 682 LEELEKRREEI 692
|
|
| PHA03095 |
PHA03095 |
ankyrin-like protein; Provisional |
50-209 |
5.16e-09 |
|
ankyrin-like protein; Provisional
Pssm-ID: 222980 [Multi-domain] Cd Length: 471 Bit Score: 61.19 E-value: 5.16e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 50 IHKAASAGNVAKVQQiLLLRKNG-LNDRDKMNRTALH--LACANGHPEVVTLLVDRKCQLNVCDNENRT---ALMKAVQC 123
Cdd:PHA03095 87 LHLYLYNATTLDVIK-LLIKAGAdVNAKDKVGRTPLHvyLSGFNINPKVIRLLLRKGADVNALDLYGMTplaVLLKSRNA 165
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 124 QEE----------------------------------KCATILLEHGADPNLADVHGNTALHYAVYNEDI--SVATKLLL 167
Cdd:PHA03095 166 NVEllrllidagadvyavddrfrsllhhhlqsfkpraRIVRELIRAGCDPAATDMLGNTPLHSMATGSSCkrSLVLPLLI 245
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 1034567240 168 YDANIEAKNKDDLTPLLLAVSGKKQQMVEFLIKKKANVNAVD 209
Cdd:PHA03095 246 AGISINARNRYGQTPLHYAAVFNNPRACRRLIALGADINAVS 287
|
|
| PTZ00322 |
PTZ00322 |
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional |
129-282 |
1.13e-08 |
|
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
Pssm-ID: 140343 [Multi-domain] Cd Length: 664 Bit Score: 60.30 E-value: 1.13e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 129 ATILLEHGADPNLADVHGNTALHYAVYNEDISVATKLLLYDANIEAKNKDDLTPLLLAVSGKKQQMVeflikkkanvnav 208
Cdd:PTZ00322 98 ARILLTGGADPNCRDYDGRTPLHIACANGHVQVVRVLLEFGADPTLLDKDGKTPLELAEENGFREVV------------- 164
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1034567240 209 dklesshQLISEYKEEripkHSSQNSNSVDESsedslsrLSGKPGVDDSWPTSDDEDlnfDTKNVPKPSLAKLM 282
Cdd:PTZ00322 165 -------QLLSRHSQC----HFELGANAKPDS-------FTGKPPSLEDSPISSHHP---DFSAVPQPMMGSLI 217
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
1088-1638 |
1.38e-08 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 60.05 E-value: 1.38e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1088 DAALSCERLLELKKNHCELLTVKIKKMEDK-----VNVLQRELSETKEIKSQLEHQKVEWERELCSLRFSLNQEEEKRRN 1162
Cdd:PRK02224 173 DARLGVERVLSDQRGSLDQLKAQIEEKEEKdlherLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREE 252
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1163 ADTLYEKIRE------QLRRKEEQYRKEVEVKQQLELSLQTLEMELRTvKSNLNQVVQERNDAQRQ-LSREQNArmLQDG 1235
Cdd:PRK02224 253 LETLEAEIEDlretiaETEREREELAEEVRDLRERLEELEEERDDLLA-EAGLDDADAEAVEARREeLEDRDEE--LRDR 329
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1236 ILtnhlskqkEIEMAQKKMNSENSHSHEEEKDLSHKNSMLQEEIAMLRLEIDTIKnqnqekekkcfEDLKIVKEKNEDLQ 1315
Cdd:PRK02224 330 LE--------ECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAR-----------EAVEDRREEIEELE 390
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1316 KTIKQNEETLTQTISQYNGrlsvLTAENAMLNSKLENEKQSKERLEAEVESYHSRLAAA---------------IHDR-- 1378
Cdd:PRK02224 391 EEIEELRERFGDAPVDLGN----AEDFLEELREERDELREREAELEATLRTARERVEEAealleagkcpecgqpVEGSph 466
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1379 ----DQSETSKRELELAFQRARDECSRLQDKMNfdvsnlkdnneiLSQQLFKTESKLNSLE---------IEFHHTRDAL 1445
Cdd:PRK02224 467 vetiEEDRERVEELEAELEDLEEEVEEVEERLE------------RAEDLVEAEDRIERLEerredleelIAERRETIEE 534
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1446 REKTLGLERVQKDLSQTQCQMK-----EMEQKYQNEQVKVNKYIGKQESVEERLSQLQSenmlLRQQLDDAHNKADN--- 1517
Cdd:PRK02224 535 KRERAEELRERAAELEAEAEEKreaaaEAEEEAEEAREEVAELNSKLAELKERIESLER----IRTLLAAIADAEDEier 610
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1518 ---KEKTVINIQDQFHaivQKLQAESEKQSLLLEERNKELISECNHLKERQYQYENEKAERevvVRQLQQELADTLKKQS 1594
Cdd:PRK02224 611 lreKREALAELNDERR---ERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEEK---LDELREERDDLQAEIG 684
|
570 580 590 600
....*....|....*....|....*....|....*....|....
gi 1034567240 1595 MSEASLEvtsryrinledETQDLKKKLGQIRNQLQEAQDRHTEA 1638
Cdd:PRK02224 685 AVENELE-----------ELEELRERREALENRVEALEALYDEA 717
|
|
| PHA02874 |
PHA02874 |
ankyrin repeat protein; Provisional |
73-218 |
4.45e-08 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165205 [Multi-domain] Cd Length: 434 Bit Score: 58.05 E-value: 4.45e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 73 LNDRDKMNRTALHLACANGHPEVVTLLVDRKCQLNVCDNENRTALMKAVQCQEEKCATILLEHGADPNLADVHGNTALHY 152
Cdd:PHA02874 117 VNIKDAELKTFLHYAIKKGDLESIKMLFEYGADVNIEDDNGCYPIHIAIKHNFFDIIKLLLEKGAYANVKDNNGESPLHN 196
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034567240 153 AVYNEDISVATKLLLYDANIEAKNKDDLTPLLLAVSgKKQQMVEFLIKKKA-NVNAVDKLESSHQLI 218
Cdd:PHA02874 197 AAEYGDYACIKLLIDHGNHIMNKCKNGFTPLHNAII-HNRSAIELLINNASiNDQDIDGSTPLHHAI 262
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
1125-1961 |
5.21e-08 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 58.52 E-value: 5.21e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1125 LSETKEIKSQLEH-----QKVEWERELCSLRFSLNQEEEKRRNADTLYEK-------IREQLRRKEEQYRKEVEVKQQLE 1192
Cdd:TIGR00606 165 LSEGKALKQKFDEifsatRYIKALETLRQVRQTQGQKVQEHQMELKYLKQykekaceIRDQITSKEAQLESSREIVKSYE 244
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1193 LSLQTLEMELRTVKSNLNQVVQERND----AQRQLSREQNARMLQ--------------DGILTNHLSKQKEIEMAQKKM 1254
Cdd:TIGR00606 245 NELDPLKNRLKEIEHNLSKIMKLDNEikalKSRKKQMEKDNSELElkmekvfqgtdeqlNDLYHNHQRTVREKERELVDC 324
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1255 NSENSHSHEEEKDLSHKNSMLQEEIAMLRLEIDTIKNQNQEK----------------EKKCFEDLKI------VKEKNE 1312
Cdd:TIGR00606 325 QRELEKLNKERRLLNQEKTELLVEQGRLQLQADRHQEHIRARdsliqslatrleldgfERGPFSERQIknfhtlVIERQE 404
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1313 DLQKTIKQNEETLTQTISQYNGRLSVLTAENAMLNSKLENEkqsKERLEAEVEsyhsRLAAAIHDRDQSETSKR---ELE 1389
Cdd:TIGR00606 405 DEAKTAAQLCADLQSKERLKQEQADEIRDEKKGLGRTIELK---KEILEKKQE----ELKFVIKELQQLEGSSDrilELD 477
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1390 LAFQRARDECSRLQDKMNfdvsnlkdnneilSQQLFKTESKLNSLEIEFHHTRDALREKTLGLERVQKDLSQTQCQMKEM 1469
Cdd:TIGR00606 478 QELRKAERELSKAEKNSL-------------TETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDK 544
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1470 EQKYQneQVKVNKYIGKQESVEerlsqlQSENMLLRQQLDDAHNKADNKektvINIQDQFHAIVQKLQAESEKQSLLLEE 1549
Cdd:TIGR00606 545 MDKDE--QIRKIKSRHSDELTS------LLGYFPNKKQLEDWLHSKSKE----INQTRDRLAKLNKELASLEQNKNHINN 612
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1550 RNKELISECNHLKERQYQYENEKAErEVVVRQLQQELADTLKKQSMSEASLEVTSRYRINLEDETQD---LKKKLGQIRN 1626
Cdd:TIGR00606 613 ELESKEEQLSSYEDKLFDVCGSQDE-ESDLERLKEEIEKSSKQRAMLAGATAVYSQFITQLTDENQSccpVCQRVFQTEA 691
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1627 QLQEAQDRHTEAVRCA----EKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLlnANLSEDEKEQLKKLMELKQSLEC 1702
Cdd:TIGR00606 692 ELQEFISDLQSKLRLApdklKSTESELKKKEKRRDEMLGLAPGRQSIIDLKEKEI--PELRNKLQKVNRDIQRLKNDIEE 769
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1703 NLDQEMKKNVELERE---------ITGFKNLLKMTRKKLNEYENGEFSFHGDLKTSQF-----EMDIQINKLKHKIDDLT 1768
Cdd:TIGR00606 770 QETLLGTIMPEEESAkvcltdvtiMERFQMELKDVERKIAQQAAKLQGSDLDRTVQQVnqekqEKQHELDTVVSKIELNR 849
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1769 AELETAGSKCLHLDTKNQILQEELLSMKTVQKKCEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQYKQEIEERARQEI 1848
Cdd:TIGR00606 850 KLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELI 929
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1849 AEK---LKEVNLFLQAQAASQENLEQFRENNFASMKSQMELRIKDLESELSKIKTSQEdfnktELEKYKQLYLEELKVRK 1925
Cdd:TIGR00606 930 SSKetsNKKAQDKVNDIKEKVKNIHGYMKDIENKIQDGKDDYLKQKETELNTVNAQLE-----ECEKHQEKINEDMRLMR 1004
|
890 900 910
....*....|....*....|....*....|....*.
gi 1034567240 1926 SlSSKLTKTNERLAEVNTKLLVEKQQSRSLFTTLTT 1961
Cdd:TIGR00606 1005 Q-DIDTQKIQERWLQDNLTLRKRENELKEVEEELKQ 1039
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
1104-1717 |
8.39e-08 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 57.81 E-value: 8.39e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1104 CELLTVKIKKMEDKVNVLQRELSETKEIKSQLEHQKVEWERELCSLRFslnQEEEKRRNADTLYEKIREQLRRKEEQYRK 1183
Cdd:pfam05483 157 CNLLKETCARSAEKTKKYEYEREETRQVYMDLNNNIEKMILAFEELRV---QAENARLEMHFKLKEDHEKIQHLEEEYKK 233
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1184 EVEVKQQLELSLQTLEMELRTVKSNLNQVVQERNDAQRQLSREQNarmLQDGILTNHLSKQKEIEMAQKKMNSENSHSHE 1263
Cdd:pfam05483 234 EINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKTK---LQDENLKELIEKKDHLTKELEDIKMSLQRSMS 310
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1264 EEKDLSHKNSMLQEEIAMLRLEidtiKNQNQEKEKKCFEDLKIVKEKNEDLQKTIKQNEETLTQTISQYNGRLSVLTAEN 1343
Cdd:pfam05483 311 TQKALEEDLQIATKTICQLTEE----KEAQMEELNKAKAAHSFVVTEFEATTCSLEELLRTEQQRLEKNEDQLKIITMEL 386
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1344 AMLNSKLENEKQSKERLEAEVESYHSRLA---AAIHDRDQSET-------SKRELELAFQRARDECSRLQdkmnFDVSNL 1413
Cdd:pfam05483 387 QKKSSELEEMTKFKNNKEVELEELKKILAedeKLLDEKKQFEKiaeelkgKEQELIFLLQAREKEIHDLE----IQLTAI 462
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1414 KDNNEILSQQL--FKTE---SKLNSLEIEFHHTRDALREK---------TLGLERVQKDLSQTQCQMKEMEQKYQNEQVK 1479
Cdd:pfam05483 463 KTSEEHYLKEVedLKTElekEKLKNIELTAHCDKLLLENKeltqeasdmTLELKKHQEDIINCKKQEERMLKQIENLEEK 542
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1480 VNKYIGKQESVEERLSQLQSEnmlLRQQLDDAHNKADNKEKTVINIQDQFHAIVQK---LQAESEKQSLLLEernkELIS 1556
Cdd:pfam05483 543 EMNLRDELESVREEFIQKGDE---VKCKLDKSEENARSIEYEVLKKEKQMKILENKcnnLKKQIENKNKNIE----ELHQ 615
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1557 ECNHLKERQYQYENEKAEREVVVRQLQQELADTLKKQSmseaslEVTSRYRINLEDETQDLKKKLGQIrnqlQEAQDRHT 1636
Cdd:pfam05483 616 ENKALKKKGSAENKQLNAYEIKVNKLELELASAKQKFE------EIIDNYQKEIEDKKISEEKLLEEV----EKAKAIAD 685
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1637 EAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKI-EELQKNLLNANLSEDEKEQLKKLMELKQSLECNLDQEMKKNVELE 1715
Cdd:pfam05483 686 EAVKLQKEIDKRCQHKIAEMVALMEKHKHQYDKIiEERDSELGLYKNKEQEQSSAKAALEIELSNIKAELLSLKKQLEIE 765
|
..
gi 1034567240 1716 RE 1717
Cdd:pfam05483 766 KE 767
|
|
| Ank_2 |
pfam12796 |
Ankyrin repeats (3 copies); |
150-210 |
8.48e-08 |
|
Ankyrin repeats (3 copies);
Pssm-ID: 463710 [Multi-domain] Cd Length: 91 Bit Score: 51.66 E-value: 8.48e-08
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034567240 150 LHYAVYNEDISVATKLLLYDANIEAKNKDDLTPLLLAVSGKKQQMVEFLIkKKANVNAVDK 210
Cdd:pfam12796 1 LHLAAKNGNLELVKLLLENGADANLQDKNGRTALHLAAKNGHLEIVKLLL-EHADVNLKDN 60
|
|
| HCR |
pfam07111 |
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ... |
1172-1716 |
8.76e-08 |
|
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.
Pssm-ID: 284517 [Multi-domain] Cd Length: 749 Bit Score: 57.45 E-value: 8.76e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1172 EQLRRKEEQYR--KEVEVKQQLELSLQTLEME---------------LRT-------VKSNLNQVVQERNDAQRQLSREQ 1227
Cdd:pfam07111 73 QELRRLEEEVRllRETSLQQKMRLEAQAMELDalavaekagqaeaegLRAalagaemVRKNLEEGSQRELEEIQRLHQEQ 152
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1228 NARMLQ--DGILTNHLSKQKEIEmaqKKMNSENSHSHEEEKDLshknSMLQEEIAMLRLEIDTIKnQNQEKEKKCFEDLK 1305
Cdd:pfam07111 153 LSSLTQahEEALSSLTSKAEGLE---KSLNSLETKRAGEAKQL----AEAQKEAELLRKQLSKTQ-EELEAQVTLVESLR 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1306 -----------------IVKEKNEDLQKTIKQNEETLTQTISQYNGRLSVLTAENAMLNSKLENEKQSKERLEAE----- 1363
Cdd:pfam07111 225 kyvgeqvppevhsqtweLERQELLDTMQHLQEDRADLQATVELLQVRVQSLTHMLALQEEELTRKIQPSDSLEPEfpkkc 304
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1364 ---VESYHSRLAAAIHDRDQSETSKRElelAFQRARDECSRLQDKmnfdVSNLKDNNEILSQQLFKTESKL-------NS 1433
Cdd:pfam07111 305 rslLNRWREKVFALMVQLKAQDLEHRD---SVKQLRGQVAELQEQ----VTSQSQEQAILQRALQDKAAEVevermsaKG 377
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1434 LEIEFHHTRDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQlqsenmllrqqlddahn 1513
Cdd:pfam07111 378 LQMELSRAQEARRRQQQQTASAEEQLKFVVNAMSSTQIWLETTMTRVEQAVARIPSLSNRLSY----------------- 440
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1514 kADNKEKTVINIQDQFHAIVQkLQAESEKQSLLLEERNKELISECNHLKERQYQYEnekAEREVVVRQLQQELAdtlKKQ 1593
Cdd:pfam07111 441 -AVRKVHTIKGLMARKVALAQ-LRQESCPPPPPAPPVDADLSLELEQLREERNRLD---AELQLSAHLIQQEVG---RAR 512
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1594 SMSEASLEVTSRYRINLEDETQDLKKKLGQIRNQLQEAQDRHTEAVRCA--------EKMQDHKQKLEKDNAKLKVTVKK 1665
Cdd:pfam07111 513 EQGEAERQQLSEVAQQLEQELQRAQESLASVGQQLEVARQGQQESTEEAaslrqeltQQQEIYGQALQEKVAEVETRLRE 592
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|.
gi 1034567240 1666 QMDKIEElqknllnaNLSEDEKEQLKKLMELKQsLECNLDQEMKKNVELER 1716
Cdd:pfam07111 593 QLSDTKR--------RLNEARREQAKAVVSLRQ-IQHRATQEKERNQELRR 634
|
|
| Smc |
COG1196 |
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ... |
1619-1959 |
2.28e-07 |
|
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain] Cd Length: 983 Bit Score: 56.48 E-value: 2.28e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1619 KKLGQIRNQLQEAQDRHTEavrcaekMQDHKQKLEKDnAKLKVTVKKQMDKIEELQKNLLNANLsEDEKEQLKKLMELKQ 1698
Cdd:COG1196 179 RKLEATEENLERLEDILGE-------LERQLEPLERQ-AEKAERYRELKEELKELEAELLLLKL-RELEAELEELEAELE 249
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1699 SLECNLDQEMKKNVELEREITGFKNLLKMTRKKLNEYENGEFSfhgdLKTSQFEMDIQINKLKHKIDDLTAELETAGSKC 1778
Cdd:COG1196 250 ELEAELEELEAELAELEAELEELRLELEELELELEEAQAEEYE----LLAELARLEQDIARLEERRRELEERLEELEEEL 325
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1779 LHLDTKNQILQEELLSMKTVQKKCEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQYKQEIEERARQEIAEKLKEVNLF 1858
Cdd:COG1196 326 AELEEELEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEEL 405
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1859 LQAQAASQENLEQFRENnfasmKSQMELRIKDLESELSKIKTSQEDfnktELEKYKQLYLEELKVRKSLSSKLTKTNERL 1938
Cdd:COG1196 406 EEAEEALLERLERLEEE-----LEELEEALAELEEEEEEEEEALEE----AAEEEAELEEEEEALLELLAELLEEAALLE 476
|
330 340
....*....|....*....|.
gi 1034567240 1939 AEVNtKLLVEKQQSRSLFTTL 1959
Cdd:COG1196 477 AALA-ELLEELAEAAARLLLL 496
|
|
| Ank_4 |
pfam13637 |
Ankyrin repeats (many copies); |
113-166 |
3.05e-07 |
|
Ankyrin repeats (many copies);
Pssm-ID: 372654 [Multi-domain] Cd Length: 54 Bit Score: 48.81 E-value: 3.05e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 1034567240 113 NRTALMKAVQCQEEKCATILLEHGADPNLADVHGNTALHYAVYNEDISVATKLL 166
Cdd:pfam13637 1 ELTALHAAAASGHLELLRLLLEKGADINAVDGNGETALHFAASNGNVEVLKLLL 54
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1172-1656 |
4.60e-07 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 55.16 E-value: 4.60e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1172 EQLRRKEEQYRKEVEVKQQLELSLQTLEMELRTVKSNLNQVVQERNDAQRQLSREQNARMLQDgiLTNHL-SKQKEIEMA 1250
Cdd:COG4717 74 KELEEELKEAEEKEEEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELEA--LEAELaELPERLEEL 151
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1251 QKKMnsenshshEEEKDLSHKNSMLQEEIAMLRLEIDTIKNQNQEKEKKCFEDLkivKEKNEDLQKTIKQNEETLTQTIS 1330
Cdd:COG4717 152 EERL--------EELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDL---AEELEELQQRLAELEEELEEAQE 220
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1331 QYNGRLSVLTAENAMLNSKLENEKQSKERLEAEVESyhSRLAAAIHDRDQSETSKRELELAFQRA--------RDECSRL 1402
Cdd:COG4717 221 ELEELEEELEQLENELEAAALEERLKEARLLLLIAA--ALLALLGLGGSLLSLILTIAGVLFLVLgllallflLLAREKA 298
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1403 QDKMNFDVSNLKDNNEILSQQlfKTESKLNSLEIEFHHTRDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVKVNK 1482
Cdd:COG4717 299 SLGKEAEELQALPALEELEEE--ELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEELQLEELEQEIAALL 376
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1483 YIGKQESVEErlsqlqsenmlLRQQLdDAHNKADNKEKTVINIQDQFHAIVQKLQAESEKQSLlleernkelisecNHLK 1562
Cdd:COG4717 377 AEAGVEDEEE-----------LRAAL-EQAEEYQELKEELEELEEQLEELLGELEELLEALDE-------------EELE 431
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1563 ERQYQYENEKAEREVVVRQLQQELADTlkkqsmsEASLEVTSRyrinlEDETQDLKKKLGQIRNQLQEAQDRHTEAVRCA 1642
Cdd:COG4717 432 EELEELEEELEELEEELEELREELAEL-------EAELEQLEE-----DGELAELLQELEELKAELRELAEEWAALKLAL 499
|
490
....*....|....
gi 1034567240 1643 EKMQDHKQKLEKDN 1656
Cdd:COG4717 500 ELLEEAREEYREER 513
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
1110-1701 |
6.31e-07 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 54.64 E-value: 6.31e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1110 KIKKMEDKVNVLQRELSETKEIKSQLehqkvewERELCSLRFSLNQEEEKRRNADTLYEKIREQLRRKEEQYRKEVEVKQ 1189
Cdd:TIGR04523 97 KINKLNSDLSKINSEIKNDKEQKNKL-------EVELNKLEKQKKENKKNIDKFLTEIKKKEKELEKLNNKYNDLKKQKE 169
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1190 QLELSLQTLEMELRTVKSNLNQVVQERNdaqrqlsreqnarmlqdgILTNHLSKQKEIEMAQKKMNSENSHSHEEEKDLS 1269
Cdd:TIGR04523 170 ELENELNLLEKEKLNIQKNIDKIKNKLL------------------KLELLLSNLKKKIQKNKSLESQISELKKQNNQLK 231
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1270 HKNSMLQEEIAMLRLEIDTIKNQNQEKEKKCFEDLKIVKEKNEDLQKTIKQNEEtLTQTISQYNGRLSVLTAE-----NA 1344
Cdd:TIGR04523 232 DNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSEKQKELEQNNKKIKE-LEKQLNQLKSEISDLNNQkeqdwNK 310
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1345 MLNSKLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARDECSRLQDKmnfdVSNLKDNNEILSQQL 1424
Cdd:TIGR04523 311 ELKSELKNQEKKLEEIQNQISQNNKIISQLNEQISQLKKELTNSESENSEKQRELEEKQNE----IEKLKKENQSYKQEI 386
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1425 FKTESKLNSLEIEFHHTRDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQSENMLL 1504
Cdd:TIGR04523 387 KNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKELLEKEIERLKETIIKNNSEIKDLTNQDSVKELIIKNLDNTRESL 466
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1505 RQQLDDAHNKADNKEKTVINIQDQFHAIVQKLQAESEKQSlLLEERNKELISECNHLKERQYQYENEKAEREVVVRQLQQ 1584
Cdd:TIGR04523 467 ETQLKVLSRSINKIKQNLEQKQKELKSKEKELKKLNEEKK-ELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLED 545
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1585 ELA---DTLKKQsmseaslevtsryriNLEDETQDLKKKLGQIRNQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKV 1661
Cdd:TIGR04523 546 ELNkddFELKKE---------------NLEKEIDEKNKEIEELKQTQKSLKKKQEEKQELIDQKEKEKKDLIKEIEEKEK 610
|
570 580 590 600
....*....|....*....|....*....|....*....|
gi 1034567240 1662 TVKKQMDKIEELQKNllNANLSEDEKEQLKKLMELKQSLE 1701
Cdd:TIGR04523 611 KISSLEKELEKAKKE--NEKLSSIIKNIKSKKNKLKQEVK 648
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1465-1950 |
6.98e-07 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 54.76 E-value: 6.98e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1465 QMKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQSENMLLRQQLDDAHNKADNKEKTVINIQDQFHAIVQKLQAeSEKQS 1544
Cdd:PTZ00121 1288 EKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEA-AEEKA 1366
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1545 LLLEERNKELISECNHLKERQYqyENEKAEREVVVRQLQQELADTLKKQSMSEASLEvTSRYRINLEDETQDLKKKLGQI 1624
Cdd:PTZ00121 1367 EAAEKKKEEAKKKADAAKKKAE--EKKKADEAKKKAEEDKKKADELKKAAAAKKKAD-EAKKKAEEKKKADEAKKKAEEA 1443
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1625 RnQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMD----KIEELQKNLLNANLSEDEK---EQLKKLMELK 1697
Cdd:PTZ00121 1444 K-KADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADeakkKAEEAKKKADEAKKAAEAKkkaDEAKKAEEAK 1522
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1698 QSLECNLDQEMKKNVELER--------EITGFKNLLKM-TRKKLNEYENGEFSFHGDLKTSQFEMDIQiNKLKHKIDDLT 1768
Cdd:PTZ00121 1523 KADEAKKAEEAKKADEAKKaeekkkadELKKAEELKKAeEKKKAEEAKKAEEDKNMALRKAEEAKKAE-EARIEEVMKLY 1601
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1769 AELETAGSKCLHLDTKNQILQEELLSMKTVQKKCEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQYKQE----IEERA 1844
Cdd:PTZ00121 1602 EEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEdkkkAEEAK 1681
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1845 RQEIAEKLKEVNLFLQAQAASQenLEQFRENNFASMKSQMELRikdLESELSKIKTSQEDfNKTELEKYKQlylEELKVR 1924
Cdd:PTZ00121 1682 KAEEDEKKAAEALKKEAEEAKK--AEELKKKEAEEKKKAEELK---KAEEENKIKAEEAK-KEAEEDKKKA---EEAKKD 1752
|
490 500
....*....|....*....|....*.
gi 1034567240 1925 KSLSSKLTKTNERLAEVNTKLLVEKQ 1950
Cdd:PTZ00121 1753 EEEKKKIAHLKKEEEKKAEEIRKEKE 1778
|
|
| Ank_5 |
pfam13857 |
Ankyrin repeats (many copies); |
132-186 |
8.06e-07 |
|
Ankyrin repeats (many copies);
Pssm-ID: 433530 [Multi-domain] Cd Length: 56 Bit Score: 47.73 E-value: 8.06e-07
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*.
gi 1034567240 132 LLEHG-ADPNLADVHGNTALHYAVYNEDISVATKLLLYDANIEAKNKDDLTPLLLA 186
Cdd:pfam13857 1 LLEHGpIDLNRLDGEGYTPLHVAAKYGALEIVRVLLAYGVDLNLKDEEGLTALDLA 56
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1619-1953 |
9.07e-07 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 54.30 E-value: 9.07e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1619 KKLGQIRNQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLLNAnlsEDEKEQLKKLMELKQ 1698
Cdd:PRK03918 165 KNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKL---EKEVKELEELKEEIE 241
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1699 SLECNLDQEMKKNVELEREITGFKNLLKMTRKKLNEYENGEfsfhGDLKTSQFEMD--IQINKLKHKIDDLTAELETAGS 1776
Cdd:PRK03918 242 ELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKV----KELKELKEKAEeyIKLSEFYEEYLDELREIEKRLS 317
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1777 KclhLDTKNQILQEELLSMKTVQKKCEKLQKNKKKLEQEVINLRSHI---ERNMVELGQVKQYKQEIEERARQEIAEKLK 1853
Cdd:PRK03918 318 R---LEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERHelyEEAKAKKEELERLKKRLTGLTPEKLEKELE 394
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1854 EVNlflQAQAASQENLEQFREnnfasMKSQMELRIKDLESELSKIKTSQ-----------EDFNKTELEKYKQLYLEELK 1922
Cdd:PRK03918 395 ELE---KAKEEIEEEISKITA-----RIGELKKEIKELKKAIEELKKAKgkcpvcgreltEEHRKELLEEYTAELKRIEK 466
|
330 340 350
....*....|....*....|....*....|.
gi 1034567240 1923 VRKSLSSKLTKTNERLAEVNTKLLVEKQQSR 1953
Cdd:PRK03918 467 ELKEIEEKERKLRKELRELEKVLKKESELIK 497
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1371-1634 |
1.02e-06 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 53.23 E-value: 1.02e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1371 LAAAIHDRDQSETSKRELELAFQRARDECSRLQDKmnfdVSNLKDNNEILSQQLFKTESKLNSLEIEFHHTRDALREKTL 1450
Cdd:COG4942 15 AAAQADAAAEAEAELEQLQQEIAELEKELAALKKE----EKALLKQLAALERRIAALARRIRALEQELAALEAELAELEK 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1451 GLERVQKDLSQTQCQMKEMEQKYQNeqvkvnkyIGKQESVEERLSQLQSENMLLRQQLDDAHNKADnkektviniQDQFH 1530
Cdd:COG4942 91 EIAELRAELEAQKEELAELLRALYR--------LGRQPPLALLLSPEDFLDAVRRLQYLKYLAPAR---------REQAE 153
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1531 AIVQKLQAESEKQSLLLEERnKELISECNHLKERQYQYENEKAEREVVVRQLQQELADTLKKQSmseaslevtsryriNL 1610
Cdd:COG4942 154 ELRADLAELAALRAELEAER-AELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELA--------------EL 218
|
250 260
....*....|....*....|....
gi 1034567240 1611 EDETQDLKKKLGQIRNQLQEAQDR 1634
Cdd:COG4942 219 QQEAEELEALIARLEAEAAAAAER 242
|
|
| CCDC158 |
pfam15921 |
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ... |
1079-1698 |
1.02e-06 |
|
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.
Pssm-ID: 464943 [Multi-domain] Cd Length: 1112 Bit Score: 54.35 E-value: 1.02e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1079 DSVSLLKIQDAALSCERLLELKKNHCELLTVKIKKMEDKVNVLQRELSETKEIKSQLEHQKVEW---EREL--------- 1146
Cdd:pfam15921 208 DSMSTMHFRSLGSAISKILRELDTEISYLKGRIFPVEDQLEALKSESQNKIELLLQQHQDRIEQlisEHEVeitglteka 287
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1147 -------CSLRFSLNQEEEKRRNADTLYEKIREQLRRKEEQYRKEV-EVKQQLELSLQTLEMELRTVKSNLNQVVQERNd 1218
Cdd:pfam15921 288 ssarsqaNSIQSQLEIIQEQARNQNSMYMRQLSDLESTVSQLRSELrEAKRMYEDKIEELEKQLVLANSELTEARTERD- 366
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1219 aqrQLSREQnarmlqdGILTNHLSKqkeIEMAQKKMNSENSHSHEEEKDLSHKNSMLQEEIAMLRLEIDT---------- 1288
Cdd:pfam15921 367 ---QFSQES-------GNLDDQLQK---LLADLHKREKELSLEKEQNKRLWDRDTGNSITIDHLRRELDDrnmevqrlea 433
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1289 -IKNQNQEKEKKCFEDLKIVKEKNEDLQKT------IKQNEETLTQTISQYNG-RLSVLTAENAM--LNSKLENEKQSKE 1358
Cdd:pfam15921 434 lLKAMKSECQGQMERQMAAIQGKNESLEKVssltaqLESTKEMLRKVVEELTAkKMTLESSERTVsdLTASLQEKERAIE 513
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1359 RLEAEVESYHSRLAAAIHDRDQSETSKRELelafQRARDECSRLQDKMNFD---VSNLKDNNEILSQQLFKTESKLNSLE 1435
Cdd:pfam15921 514 ATNAEITKLRSRVDLKLQELQHLKNEGDHL----RNVQTECEALKLQMAEKdkvIEILRQQIENMTQLVGQHGRTAGAMQ 589
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1436 IEFHHTRDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVKVNKYIgkqESVEERLSQLQSenmlLRQQLDDAHNKA 1515
Cdd:pfam15921 590 VEKAQLEKEINDRRLELQEFKILKDKKDAKIRELEARVSDLELEKVKLV---NAGSERLRAVKD----IKQERDQLLNEV 662
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1516 DNKEKTVINIQDQFHAIVQKLQAESEKQSLL---LEERNKELISECNHLKERQYQYENEKAEREVVVRQLQQELADTLKK 1592
Cdd:pfam15921 663 KTSRNELNSLSEDYEVLKRNFRNKSEEMETTtnkLKMQLKSAQSELEQTRNTLKSMEGSDGHAMKVAMGMQKQITAKRGQ 742
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1593 QSMSEASLEVTSRYRINLEDETQDLKKKLGQIRNQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEE 1672
Cdd:pfam15921 743 IDALQSKIQFLEEAMTNANKEKHFLKEEKNKLSQELSTVATEKNKMAGELEVLRSQERRLKEKVANMEVALDKASLQFAE 822
|
650 660
....*....|....*....|....*.
gi 1034567240 1673 LQkNLLNANLSEDEKEQLKKLMELKQ 1698
Cdd:pfam15921 823 CQ-DIIQRQEQESVRLKLQHTLDVKE 847
|
|
| PHA03095 |
PHA03095 |
ankyrin-like protein; Provisional |
66-210 |
1.10e-06 |
|
ankyrin-like protein; Provisional
Pssm-ID: 222980 [Multi-domain] Cd Length: 471 Bit Score: 53.49 E-value: 1.10e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 66 LLLRK-NGLNDRDKMNRTALHLACANGHP--EVVTLLVDRKCQLNVCDNENRTAL-MKAVQCQeekCATI----LLEHGA 137
Cdd:PHA03095 172 LLIDAgADVYAVDDRFRSLLHHHLQSFKPraRIVRELIRAGCDPAATDMLGNTPLhSMATGSS---CKRSlvlpLLIAGI 248
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1034567240 138 DPNLADVHGNTALHYA-VYNEDISVAtKLLLYDANIEAKNKDDLTPLLLAVSGKKQQMVEFLIKKKANVNAVDK 210
Cdd:PHA03095 249 SINARNRYGQTPLHYAaVFNNPRACR-RLIALGADINAVSSDGNTPLSLMVRNNNGRAVRAALAKNPSAETVAA 321
|
|
| PHA02876 |
PHA02876 |
ankyrin repeat protein; Provisional |
121-292 |
1.29e-06 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165207 [Multi-domain] Cd Length: 682 Bit Score: 53.53 E-value: 1.29e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 121 VQCQEEKCATILLEHGADPNLADVHGNTALHYAVYNEDISVATKLLLYDANIEAKNKDDLTPLLLAVSGKKQQMVEFLIK 200
Cdd:PHA02876 153 IQQDELLIAEMLLEGGADVNAKDIYCITPIHYAAERGNAKMVNLLLSYGADVNIIALDDLSVLECAVDSKNIDTIKAIID 232
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 201 KKANVNAVDkLESSHQLISEYKEERIPKHSSQNS-NSVDESSEDSLSRLSGKPGVDDSWPTSDDEDLNFDTKNVPKPSLA 279
Cdd:PHA02876 233 NRSNINKND-LSLLKAIRNEDLETSLLLYDAGFSvNSIDDCKNTPLHHASQAPSLSRLVPKLLERGADVNAKNIKGETPL 311
|
170
....*....|...
gi 1034567240 280 KLMTASQQSRKNL 292
Cdd:PHA02876 312 YLMAKNGYDTENI 324
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1143-1374 |
1.46e-06 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 52.84 E-value: 1.46e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1143 ERELCSLRFSLNQEEEKRRNADTLYEKIREQLRRKEEQYrkevevkQQLELSLQTLEMELRTVKSNLNQVVQERNDAQRQ 1222
Cdd:COG4942 26 EAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRI-------AALARRIRALEQELAALEAELAELEKEIAELRAE 98
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1223 LSREQN--ARMLQDgiltnhLSKQKEIEMAQKKMNSENSHSHEEEKD-LSHKNSMLQEEIAMLRLEIDTIKNQNQEKEKK 1299
Cdd:COG4942 99 LEAQKEelAELLRA------LYRLGRQPPLALLLSPEDFLDAVRRLQyLKYLAPARREQAEELRADLAELAALRAELEAE 172
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034567240 1300 cFEDLKIVKEKNEDLQKTIKQNEETLTQTISQYNGRLSVLTAENAMLNSKLENEKQSKERLEAEVESYHSRLAAA 1374
Cdd:COG4942 173 -RAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAERTPAA 246
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1488-1959 |
1.65e-06 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 53.64 E-value: 1.65e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1488 ESVEERLSQLQSENMLLRQQLDDAHNKADNKEktviniqdqfhAIVQKLQAES----------EKQSLLLEERNKELISE 1557
Cdd:pfam01576 85 EEEEERSQQLQNEKKKMQQHIQDLEEQLDEEE-----------AARQKLQLEKvtteakikklEEDILLLEDQNSKLSKE 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1558 CNHLKERQYQYENEKAEREVVVRQLQQ----------ELADTLKKQSMSEASLEVTSRyriNLEDETQDLKKKLGQIRNQ 1627
Cdd:pfam01576 154 RKLLEERISEFTSNLAEEEEKAKSLSKlknkheamisDLEERLKKEEKGRQELEKAKR---KLEGESTDLQEQIAELQAQ 230
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1628 LQEAQdrhTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQ---MDKIEELQKNLLNANLSEDEKEQLKK-----LMELKQS 1699
Cdd:pfam01576 231 IAELR---AQLAKKEEELQAALARLEEETAQKNNALKKIrelEAQISELQEDLESERAARNKAEKQRRdlgeeLEALKTE 307
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1700 LECNLD-----QEMKKNVELEreitgfknlLKMTRKKLNEyengefsfhgdlKTSQFEMDIQINKLKH--KIDDLTAELE 1772
Cdd:pfam01576 308 LEDTLDttaaqQELRSKREQE---------VTELKKALEE------------ETRSHEAQLQEMRQKHtqALEELTEQLE 366
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1773 TAGSKCLHLDTKNQILQEELLS----MKTVQKKCEKLQKNKKKLEQEVINLRSHI---ERNMVELG-------------- 1831
Cdd:pfam01576 367 QAKRNKANLEKAKQALESENAElqaeLRTLQQAKQDSEHKRKKLEGQLQELQARLsesERQRAELAeklsklqselesvs 446
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1832 ------------------QVKQYKQEIEERARQEIAEKLKEVNLFLQAQAASQENLEQFRENNFAsmKSQMELRIKDLES 1893
Cdd:pfam01576 447 sllneaegkniklskdvsSLESQLQDTQELLQEETRQKLNLSTRLRQLEDERNSLQEQLEEEEEA--KRNVERQLSTLQA 524
|
490 500 510 520 530 540 550
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034567240 1894 ELSKIKTSQEDFNKT-----ELEKYKQLYLEELKVR----KSLSSKLTKTNERLAEVNTKLLVEKQQSRSLFTTL 1959
Cdd:pfam01576 525 QLSDMKKKLEEDAGTlealeEGKKRLQRELEALTQQleekAAAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSNL 599
|
|
| Ank_5 |
pfam13857 |
Ankyrin repeats (many copies); |
68-117 |
2.01e-06 |
|
Ankyrin repeats (many copies);
Pssm-ID: 433530 [Multi-domain] Cd Length: 56 Bit Score: 46.57 E-value: 2.01e-06
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1034567240 68 LRKNG---LNDRDKMNRTALHLACANGHPEVVTLLVDRKCQLNVCDNENRTAL 117
Cdd:pfam13857 1 LLEHGpidLNRLDGEGYTPLHVAAKYGALEIVRVLLAYGVDLNLKDEEGLTAL 53
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1101-1327 |
2.15e-06 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 53.14 E-value: 2.15e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1101 KNHCELLTVKIKKMEDKVNVLQRELSETKEIKSQLEHQKVEWERELCSLRFSLNQEEEKRRNADTLYEKIREQLRRKEEQ 1180
Cdd:TIGR02168 795 KEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESE 874
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1181 YRKEVEVKQQLELSLQTLEMELRTVKSNLNQVVQERNDAQRQLS--REQNARM-LQDGILTNHLSKQKEIEMAQKKMNSE 1257
Cdd:TIGR02168 875 LEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEelREKLAQLeLRLEGLEVRIDNLQERLSEEYSLTLE 954
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034567240 1258 NSHSHEEEKDLSHKNsmLQEEIAMLRLEIDTIKNQNQ------EKEKKCFEDLKivKEKnEDLQKTIKQNEETLTQ 1327
Cdd:TIGR02168 955 EAEALENKIEDDEEE--ARRRLKRLENKIKELGPVNLaaieeyEELKERYDFLT--AQK-EDLTEAKETLEEAIEE 1025
|
|
| Ank_4 |
pfam13637 |
Ankyrin repeats (many copies); |
50-100 |
3.17e-06 |
|
Ankyrin repeats (many copies);
Pssm-ID: 372654 [Multi-domain] Cd Length: 54 Bit Score: 46.11 E-value: 3.17e-06
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|...
gi 1034567240 50 IHKAASAGNVAKVQqilLLRKNG--LNDRDKMNRTALHLACANGHPEVVTLLV 100
Cdd:pfam13637 5 LHAAAASGHLELLR---LLLEKGadINAVDGNGETALHFAASNGNVEVLKLLL 54
|
|
| PHA02874 |
PHA02874 |
ankyrin repeat protein; Provisional |
56-237 |
4.59e-06 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165205 [Multi-domain] Cd Length: 434 Bit Score: 51.50 E-value: 4.59e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 56 AGNVAKVQQILLLRKNGLNDRDKMNRTALHLACANGHPEVVTLLVDRKCQLNVCDNENRTALMKAVQ------------- 122
Cdd:PHA02874 11 SGDIEAIEKIIKNKGNCINISVDETTTPLIDAIRSGDAKIVELFIKHGADINHINTKIPHPLLTAIKigahdiikllidn 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 123 ----------CQEEKCATILLEHGADPNLADVHGNTALHYAVYNEDISVATKLLLYDANIEAKNKDDLTPLLLAVSGKKQ 192
Cdd:PHA02874 91 gvdtsilpipCIEKDMIKTILDCGIDVNIKDAELKTFLHYAIKKGDLESIKMLFEYGADVNIEDDNGCYPIHIAIKHNFF 170
|
170 180 190 200
....*....|....*....|....*....|....*....|....*.
gi 1034567240 193 QMVEFLIKKKANVNAVD-KLESSHQLISEYKEERIPKHSSQNSNSV 237
Cdd:PHA02874 171 DIIKLLLEKGAYANVKDnNGESPLHNAAEYGDYACIKLLIDHGNHI 216
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1095-1790 |
6.01e-06 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 51.68 E-value: 6.01e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1095 RLLELKKNHCELLTVKIKKMEDkVNVLQRELSETKEIKSQLEHQKVEWERELCSLRFSLNQEEEKRRNADTLYekiREQL 1174
Cdd:PTZ00121 1213 KAEEARKAEDAKKAEAVKKAEE-AKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELK---KAEE 1288
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1175 RRKEEQYRKEVEVKQQLELSLQTleMELRTVKSNLNQVVQERNDAQRQLSREQNARMLQDGILTNHLSKQKEIEMAQKKM 1254
Cdd:PTZ00121 1289 KKKADEAKKAEEKKKADEAKKKA--EEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKA 1366
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1255 NSENSHSHEEEKDLSHKNSMLQEeiamlRLEIDTIKNQNQEKEKKCfEDLKIVKEKNEDLQKTIKQNEEtltqtisqyng 1334
Cdd:PTZ00121 1367 EAAEKKKEEAKKKADAAKKKAEE-----KKKADEAKKKAEEDKKKA-DELKKAAAAKKKADEAKKKAEE----------- 1429
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1335 rlsVLTAENAmlNSKLENEKQSKE-RLEAEVESYHSRLAAAIHDRDQSETSKRELELAfqRARDECSRLQDKMNFDVSNL 1413
Cdd:PTZ00121 1430 ---KKKADEA--KKKAEEAKKADEaKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEA--KKADEAKKKAEEAKKKADEA 1502
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1414 KDnneilsqqlfKTESKLNSLEIEFHHTRDALREKTLGLERVQKDlsqtqcQMKEMEQKYQNEQVKVNKYIGKQES---V 1490
Cdd:PTZ00121 1503 KK----------AAEAKKKADEAKKAEEAKKADEAKKAEEAKKAD------EAKKAEEKKKADELKKAEELKKAEEkkkA 1566
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1491 EERLSQLQSENMLLRQQlddahNKADNKEKTVINIQDQFHAIVQKLQAESEKQSllLEERNK-ELISECNHLKERQYQYE 1569
Cdd:PTZ00121 1567 EEAKKAEEDKNMALRKA-----EEAKKAEEARIEEVMKLYEEEKKMKAEEAKKA--EEAKIKaEELKKAEEEKKKVEQLK 1639
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1570 NEKAEREVVVRQLQQELADTLKKQSMSEASLEVTSRYRINLEDETQDLKKKLGQIRNQLQEAqdRHTEAVRCAEKMQDHK 1649
Cdd:PTZ00121 1640 KKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEA--KKAEELKKKEAEEKKK 1717
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1650 -QKLEKDNAKLKVTVKKQMDKIEELQKNLLNANLSEDEKEQLKKLMELKQSLECNLDQEMKKNVELEREITGFKNLLKMT 1728
Cdd:PTZ00121 1718 aEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVD 1797
|
650 660 670 680 690 700
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034567240 1729 RKKLNEYENGEFSFHGDLKTSQFemdiqINKLKHKIDDLTAELetagskclhLDTKNQILQE 1790
Cdd:PTZ00121 1798 KKIKDIFDNFANIIEGGKEGNLV-----INDSKEMEDSAIKEV---------ADSKNMQLEE 1845
|
|
| MAD |
pfam05557 |
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ... |
1380-1925 |
6.76e-06 |
|
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
Pssm-ID: 461677 [Multi-domain] Cd Length: 660 Bit Score: 51.28 E-value: 6.76e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1380 QSETSKRELELAFQRARDECSRLQDKMNFDVSNLKDNNEILSQQLFKTESKLNSLEIEFHHTRDALREKTLGLERVQKDL 1459
Cdd:pfam05557 13 QLQNEKKQMELEHKRARIELEKKASALKRQLDRESDRNQELQKRIRLLEKREAEAEEALREQAELNRLKKKYLEALNKKL 92
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1460 SQTQCQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQSENMLLRQQLDDAHNKADNKEKTVINIQDQfhaivQKLQAE 1539
Cdd:pfam05557 93 NEKESQLADAREVISCLKNELSELRRQIQRAELELQSTNSELEELQERLDLLKAKASEAEQLRQNLEKQ-----QSSLAE 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1540 SEKQSLLLEERNKelisecnhlkerqyQYENEKAerevVVRQLQQELADTLKKQSMSEASLEVTSRYRIN------LEDE 1613
Cdd:pfam05557 168 AEQRIKELEFEIQ--------------SQEQDSE----IVKNSKSELARIPELEKELERLREHNKHLNENienkllLKEE 229
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1614 TQDLKKKLGQIRNQLQEAQDRHTEAVRCAEKMQDHKqKLEKDNAKLKVTVKKQMDKIEELQ---------KNLLNANLSE 1684
Cdd:pfam05557 230 VEDLKRKLEREEKYREEAATLELEKEKLEQELQSWV-KLAQDTGLNLRSPEDLSRRIEQLQqreivlkeeNSSLTSSARQ 308
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1685 DEKEQLKKLMELKQSLECNLDQEM-------------KKNVELEREITGFKNLLKMTRKKLNEYENGEfsfhgDLKTSQF 1751
Cdd:pfam05557 309 LEKARRELEQELAQYLKKIEDLNKklkrhkalvrrlqRRVLLLTKERDGYRAILESYDKELTMSNYSP-----QLLERIE 383
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1752 EMDIQINKLKHKIDDLTAELETAGSKCLHLDTKNQILQEELLSMKTvqkkcEKLQKNKKKLEQEVINLRSHIERNMVELG 1831
Cdd:pfam05557 384 EAEDMTQKMQAHNEEMEAQLSVAEEELGGYKQQAQTLERELQALRQ-----QESLADPSYSKEEVDSLRRKLETLELERQ 458
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1832 QVKQYKQEIE-ERARQEIAEKLKEV---------NLFLQAQAASQENLEQFRENNfASMKSQMELRIKDLESELSKIKTS 1901
Cdd:pfam05557 459 RLREQKNELEmELERRCLQGDYDPKktkvlhlsmNPAAEAYQQRKNQLEKLQAEI-ERLKRLLKKLEDDLEQVLRLPETT 537
|
570 580
....*....|....*....|....
gi 1034567240 1902 QEDFNKTELEKYKQLYLEELKVRK 1925
Cdd:pfam05557 538 STMNFKEVLDLRKELESAELKNQR 561
|
|
| DR0291 |
COG1579 |
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ... |
1752-1925 |
7.18e-06 |
|
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];
Pssm-ID: 441187 [Multi-domain] Cd Length: 236 Bit Score: 49.54 E-value: 7.18e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1752 EMDIQINKLKHKIDDLTAELEtagskclHLDTKNQILQEELlsmKTVQKKCEKLQKNKKKLEQEVINLRSHIERNMVELG 1831
Cdd:COG1579 14 ELDSELDRLEHRLKELPAELA-------ELEDELAALEARL---EAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQLG 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1832 QVKQYKQ------EIE--ERARQEIAEKLKEVNLFLQAQAASQENLEQF---RENNFASMKSQMELRIKDLESELSKIKT 1900
Cdd:COG1579 84 NVRNNKEyealqkEIEslKRRISDLEDEILELMERIEELEEELAELEAElaeLEAELEEKKAELDEELAELEAELEELEA 163
|
170 180
....*....|....*....|....*
gi 1034567240 1901 SQEDFNKTELEKYKQLYlEELKVRK 1925
Cdd:COG1579 164 EREELAAKIPPELLALY-ERIRKRK 187
|
|
| TRPV5-6 |
cd22192 |
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 5 and 6; TRPV5 and ... |
82-202 |
8.00e-06 |
|
Transient Receptor Potential channel, Vanilloid subfamily (TRPV), types 5 and 6; TRPV5 and TRPV6 (TRPV5/6) are two homologous members within the vanilloid subfamily of the transient receptor potential (TRP) family. TRPV5 and TRPV6 show only 30-40% homology with other members of the TRP family and have unique properties that differentiates them from other TRP channels. They mediate calcium uptake in epithelia and their expression is dramatically increased in numerous types of cancer. The structure of TRPV5/6 shows the typical topology features of all TRP family members, such as six transmembrane regions, a short hydrophobic stretch between transmembrane segments 5 and 6, which is predicted to form the Ca2+ pore, and large intracellular N- and C-terminal domains. The N-terminal domain of TRPV5/6 contains three ankyrin repeats. This structural element is present in several proteins and plays a role in protein-protein interactions. The N- and C-terminal tails of TRPV5/6 each contain an internal PDZ motif which can function as part of a molecular scaffold via interaction with PDZ-domain containing proteins. A major difference between the properties of TRPV5 and TRPV6 is in their tissue distribution: TRPV5 is predominantly expressed in the distal convoluted tubules (DCT) and connecting tubules (CNT) of the kidney, with limited expression in extrarenal tissues. In contrast, TRPV6 has a broader expression pattern such as expression in the intestine, kidney, placenta, epididymis, exocrine tissues, and a few other tissues.
Pssm-ID: 411976 [Multi-domain] Cd Length: 609 Bit Score: 51.17 E-value: 8.00e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 82 TALHLACANGHPEVVTLLVDRKCQLN------VCDNENRTALMK--------AVQCQEEKCATILLEHGADPNLADVHGN 147
Cdd:cd22192 91 TALHIAVVNQNLNLVRELIARGADVVspratgTFFRPGPKNLIYygehplsfAACVGNEEIVRLLIEHGADIRAQDSLGN 170
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034567240 148 TALHYAVYNEDISVATK----LLLYDANIEA------KNKDDLTPLLLAVSGKKQQMVEFLIKKK 202
Cdd:cd22192 171 TVLHILVLQPNKTFACQmydlILSYDKEDDLqpldlvPNNQGLTPFKLAAKEGNIVMFQHLVQKR 235
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1669-1953 |
8.57e-06 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 51.21 E-value: 8.57e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1669 KIEELQKNL--LNANLSEDEK---------EQLKKLMELKQSLEcnldqemkknvELEREITGfkNLLKMTRKKLNEyen 1737
Cdd:TIGR02168 180 KLERTRENLdrLEDILNELERqlkslerqaEKAERYKELKAELR-----------ELELALLV--LRLEELREELEE--- 243
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1738 gefsfhgdLKTSQFEMDIQINKLKHKIDDLTAELETAGSKCLHLDTKNQILQEELLSMKTVQKKCEK-----------LQ 1806
Cdd:TIGR02168 244 --------LQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQqkqilrerlanLE 315
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1807 KNKKKLEQEVINLRSHIERNMVELGQVKQYKQEIE---ERARQEIAEKLKEVNLFLQAQAASQENLEQFReNNFASMKSQ 1883
Cdd:TIGR02168 316 RQLEELEAQLEELESKLDELAEELAELEEKLEELKeelESLEAELEELEAELEELESRLEELEEQLETLR-SKVAQLELQ 394
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1034567240 1884 MEL---RIKDLESELSKIKTSQEDFNKTELEKYKQLYLEELK-VRKSLSSK---LTKTNERLAEVNTKLLVEKQQSR 1953
Cdd:TIGR02168 395 IASlnnEIERLEARLERLEDRRERLQQEIEELLKKLEEAELKeLQAELEELeeeLEELQEELERLEEALEELREELE 471
|
|
| rad50 |
TIGR00606 |
rad50; All proteins in this family for which functions are known are involvedin recombination, ... |
870-1768 |
1.21e-05 |
|
rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Pssm-ID: 129694 [Multi-domain] Cd Length: 1311 Bit Score: 50.82 E-value: 1.21e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 870 EELQQDMQRFKNEIGMLKVEFQALEKEKVQLQK---EVEEERKKHRNNEMEVSANIHDGATDDAEDDDDDDGLIQKRKSG 946
Cdd:TIGR00606 251 KNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKdnsELELKMEKVFQGTDEQLNDLYHNHQRTVREKERELVDCQRELEK 330
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 947 ETDHQQFPRKENKEYASSGPALQMK-EVKSTEKEKRTSKESVNSPVFGKASLLTGGLLQVDDDSSLS---EIDEDEGRPT 1022
Cdd:TIGR00606 331 LNKERRLLNQEKTELLVEQGRLQLQaDRHQEHIRARDSLIQSLATRLELDGFERGPFSERQIKNFHTlviERQEDEAKTA 410
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1023 KK-----TSNEKNKVKNQIQSMDDVDDLTQSSETASEDCELPHSSYKNFMLLIEQLGMECKDsvsLLKIQDAALSCERLL 1097
Cdd:TIGR00606 411 AQlcadlQSKERLKQEQADEIRDEKKGLGRTIELKKEILEKKQEELKFVIKELQQLEGSSDR---ILELDQELRKAEREL 487
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1098 EL--KKNHCELLTVKIKKMEDKVNVLQRELSETKEIKSQLEHQKVEWERELCSLRFSLNQEEEKRRNADTLYEKIREQL- 1174
Cdd:TIGR00606 488 SKaeKNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKDEQIRKIKSRHSDELTSLLg 567
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1175 -----RRKEEQYRKEVEVKQQLELSLQTLEMELRTVKSNLNQVVQERNDAQRQLSReqnarmLQDGILTNHLSKQKEIEM 1249
Cdd:TIGR00606 568 yfpnkKQLEDWLHSKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSS------YEDKLFDVCGSQDEESDL 641
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1250 AQKKmnsenshshEEEKDLSHKNSMLQEEIAMLRLEIDTIKNQNQEKEKKCFEDLKIVKEKNEdlqktikqneetltqTI 1329
Cdd:TIGR00606 642 ERLK---------EEIEKSSKQRAMLAGATAVYSQFITQLTDENQSCCPVCQRVFQTEAELQE---------------FI 697
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1330 SQYNGRLSVLTAENAMLNSKLENEKQSKERLEAEVESyhsrlaaaihdrDQSETSKRELELAFQRARDEcsrlqdKMNFD 1409
Cdd:TIGR00606 698 SDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPG------------RQSIIDLKEKEIPELRNKLQ------KVNRD 759
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1410 VSNLKDNNEilsqqlfKTESKLNSLEIEFHHTRDALREKTLglervqkdLSQTQCQMKEMEQKYQNEQVKVNKYIG---- 1485
Cdd:TIGR00606 760 IQRLKNDIE-------EQETLLGTIMPEEESAKVCLTDVTI--------MERFQMELKDVERKIAQQAAKLQGSDLdrtv 824
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1486 -----KQESVEERLSQLQSENMLLRQQLDDAHNKADNKEKTVINIQDQFHAIVQKLQaesekQSLLLEERNKELISECNH 1560
Cdd:TIGR00606 825 qqvnqEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLQ-----RRQQFEEQLVELSTEVQS 899
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1561 LKERQYQYENEKAEREVVVRQLQQELADTLKKQSMSEASlevtsryrinLEDETQDLKKKLGQ-------IRNQLQEAQD 1633
Cdd:TIGR00606 900 LIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKK----------AQDKVNDIKEKVKNihgymkdIENKIQDGKD 969
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1634 RH--------TEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLLNANLSEDEKEQLKKLME-LKQSLECNL 1704
Cdd:TIGR00606 970 DYlkqketelNTVNAQLEECEKHQEKINEDMRLMRQDIDTQKIQERWLQDNLTLRKRENELKEVEEELKQhLKEMGQMQV 1049
|
890 900 910 920 930 940 950
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034567240 1705 DQEMKKNVELEREITGFKNLLKMTRKKLNEYENGEFSFHGDLKTSQF--------EMDIQINKLKHKIDDLT 1768
Cdd:TIGR00606 1050 LQMKQEHQKLEENIDLIKRNHVLALGRQKGYEKEIKHFKKELREPQFrdaeekyrEMMIVMRTTELVNKDLD 1121
|
|
| trp |
TIGR00870 |
transient-receptor-potential calcium channel protein; The Transient Receptor Potential Ca2+ ... |
82-202 |
1.25e-05 |
|
transient-receptor-potential calcium channel protein; The Transient Receptor Potential Ca2+ Channel (TRP-CC) Family (TC. 1.A.4)The TRP-CC family has also been called the store-operated calcium channel (SOC) family. The prototypical members include the Drosophila retinal proteinsTRP and TRPL (Montell and Rubin, 1989; Hardie and Minke, 1993). SOC members of the family mediate the entry of extracellular Ca2+ into cells in responseto depletion of intracellular Ca2+ stores (Clapham, 1996) and agonist stimulated production of inositol-1,4,5 trisphosphate (IP3). One member of the TRP-CCfamily, mammalian Htrp3, has been shown to form a tight complex with the IP3 receptor (TC #1.A.3.2.1). This interaction is apparently required for IP3 tostimulate Ca2+ release via Htrp3. The vanilloid receptor subtype 1 (VR1), which is the receptor for capsaicin (the ?hot? ingredient in chili peppers) and servesas a heat-activated ion channel in the pain pathway (Caterina et al., 1997), is also a member of this family. The stretch-inhibitable non-selective cation channel(SIC) is identical to the vanilloid receptor throughout all of its first 700 residues, but it exhibits a different sequence in its last 100 residues. VR1 and SICtransport monovalent cations as well as Ca2+. VR1 is about 10x more permeable to Ca2+ than to monovalent ions. Ca2+ overload probably causes cell deathafter chronic exposure to capsaicin. (McCleskey and Gold, 1999). [Transport and binding proteins, Cations and iron carrying compounds]
Pssm-ID: 273311 [Multi-domain] Cd Length: 743 Bit Score: 50.46 E-value: 1.25e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 82 TALHLACANGHPEVVTLLVDRKCQLNV---CD-------------NENRTALMKAVQcqEEKCATILLEHGADPNLADVH 145
Cdd:TIGR00870 130 TALHLAAHRQNYEIVKLLLERGASVPAracGDffvksqgvdsfyhGESPLNAAACLG--SPSIVALLSEDPADILTADSL 207
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034567240 146 GNTALHYAVYNEDISVATK---------LLLYDANI-------EAKNKDDLTPLLLAVSGKKQQMVEFLIKKK 202
Cdd:TIGR00870 208 GNTLLHLLVMENEFKAEYEelscqmynfALSLLDKLrdskeleVILNHQGLTPLKLAAKEGRIVLFRLKLAIK 280
|
|
| PHA02874 |
PHA02874 |
ankyrin repeat protein; Provisional |
41-209 |
1.46e-05 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165205 [Multi-domain] Cd Length: 434 Bit Score: 49.96 E-value: 1.46e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 41 HVRDRDLGK-IHKAASAGNVAKVQQILLLRKNgLNDRDKMNRTALHLACANGHPEVVTLLVDRKCQLNVCDNENRTALMK 119
Cdd:PHA02874 118 NIKDAELKTfLHYAIKKGDLESIKMLFEYGAD-VNIEDDNGCYPIHIAIKHNFFDIIKLLLEKGAYANVKDNNGESPLHN 196
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 120 AVQCQEEKCATILLEHGADPNLADVHGNTALHYAV-YNEDisvATKLLLYDANIEAKNKDDLTPLLLAVSGK-KQQMVEF 197
Cdd:PHA02874 197 AAEYGDYACIKLLIDHGNHIMNKCKNGFTPLHNAIiHNRS---AIELLINNASINDQDIDGSTPLHHAINPPcDIDIIDI 273
|
170
....*....|..
gi 1034567240 198 LIKKKANVNAVD 209
Cdd:PHA02874 274 LLYHKADISIKD 285
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
1471-1734 |
1.67e-05 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 50.02 E-value: 1.67e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1471 QKYQNEQVKVnkyigKQESVEERLSQLQSENMLLRQQLDDAHNKadnkektviniqdqfhaiVQKLQAESekQSLLLEER 1550
Cdd:COG3206 159 EAYLEQNLEL-----RREEARKALEFLEEQLPELRKELEEAEAA------------------LEEFRQKN--GLVDLSEE 213
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1551 NKELISECNHLKERQYQYENEKAEREVVVRQLQQELADTLKKQSMSEASLEVtsryrinledetQDLKKKLGQIRNQLQE 1630
Cdd:COG3206 214 AKLLLQQLSELESQLAEARAELAEAEARLAALRAQLGSGPDALPELLQSPVI------------QQLRAQLAELEAELAE 281
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1631 AQ----DRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEElQKNLLNANLsEDEKEQLKKLMELKQSLEcnldq 1706
Cdd:COG3206 282 LSarytPNHPDVIALRAQIAALRAQLQQEAQRILASLEAELEALQA-REASLQAQL-AQLEARLAELPELEAELR----- 354
|
250 260
....*....|....*....|....*...
gi 1034567240 1707 emkknvELEREITGFKNLLKMTRKKLNE 1734
Cdd:COG3206 355 ------RLEREVEVARELYESLLQRLEE 376
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1034-1638 |
1.74e-05 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 50.30 E-value: 1.74e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1034 NQIQSMDDVDDLTqssetasedcELphssYKNFML-------LIEQLgmecKDSVSLLK-----IQDAALSCERLLELKK 1101
Cdd:COG4913 198 HKTQSFKPIGDLD----------DF----VREYMLeepdtfeAADAL----VEHFDDLEraheaLEDAREQIELLEPIRE 259
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1102 NHcelltVKIKKMEDKVNVLQRELSETKEIKSQLEHQkvEWERELCSLRFSLNQEEEKRRNADTLYEKIREQLRRKEEQY 1181
Cdd:COG4913 260 LA-----ERYAAARERLAELEYLRAALRLWFAQRRLE--LLEAELEELRAELARLEAELERLEARLDALREELDELEAQI 332
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1182 RK-EVEVKQQLELSLQTLEMELRTVKSNLNQVVQERNDAqrQLSREQNARMLQDgiltnhlsKQKEIEMAQKKMNSENSH 1260
Cdd:COG4913 333 RGnGGDRLEQLEREIERLERELEERERRRARLEALLAAL--GLPLPASAEEFAA--------LRAEAAALLEALEEELEA 402
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1261 SHEEEKDLSHKNSMLQEEIAMLRLEIDTIKN------QNQEK-----EKKC---FEDLKI------VKEKNEDLQKTI-- 1318
Cdd:COG4913 403 LEEALAEAEAALRDLRRELRELEAEIASLERrksnipARLLAlrdalAEALgldEAELPFvgelieVRPEEERWRGAIer 482
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1319 ------------KQNEETLTQTISQYNGRLSVLTaenamlnSKLENEKQSKERLEAEVESYHSRLAAAIHD-RD--QSET 1383
Cdd:COG4913 483 vlggfaltllvpPEHYAAALRWVNRLHLRGRLVY-------ERVRTGLPDPERPRLDPDSLAGKLDFKPHPfRAwlEAEL 555
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1384 SKR------ELELAFQRAR----DECsrlQDKMNFDVSNLKDNNEILSQQL--FKTESKLNSLEIEfhhtRDALREKtlg 1451
Cdd:COG4913 556 GRRfdyvcvDSPEELRRHPraitRAG---QVKGNGTRHEKDDRRRIRSRYVlgFDNRAKLAALEAE----LAELEEE--- 625
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1452 LERVQKDLSQTQCQMKEMEQK---------YQNEQVKVnkyigkqESVEERLSQLQSEnmllRQQLDDAHNKadnkektV 1522
Cdd:COG4913 626 LAEAEERLEALEAELDALQERrealqrlaeYSWDEIDV-------ASAEREIAELEAE----LERLDASSDD-------L 687
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1523 INIQDQFHAIVQKLQaESEKQSLLLEERNKELISECNHLKERQYQYENEKAEREVVVRQLQQELADTLKKQSMSEaslEV 1602
Cdd:COG4913 688 AALEEQLEELEAELE-ELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGD---AV 763
|
650 660 670
....*....|....*....|....*....|....*.
gi 1034567240 1603 TSRYRINLEDETQDLKKKLGQIRNQLQEAQDRHTEA 1638
Cdd:COG4913 764 ERELRENLEERIDALRARLNRAEEELERAMRAFNRE 799
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1400-1874 |
1.91e-05 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 49.77 E-value: 1.91e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1400 SRLQDKMNFDVSNLKDNNEILSQQLFKTESKLNSLEI---EFHHTRDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNE 1476
Cdd:COG4717 49 ERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEkeeEYAELQEELEELEEELEELEAELEELREELEKLEKLLQLL 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1477 QvkvnkYIGKQESVEERLSQLQSENMLLRQQLD---DAHNKADNKEKTVINIQDQFHAIVQKLQAESEKQSLLLEERNKE 1553
Cdd:COG4717 129 P-----LYQELEALEAELAELPERLEELEERLEelrELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEE 203
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1554 LISECNHLKERQYQYENEKAEREVVVRQLQQELADTLKKQSMSEA-----------SLEVTSRYRINLEDE--------- 1613
Cdd:COG4717 204 LQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERLKEArlllliaaallALLGLGGSLLSLILTiagvlflvl 283
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1614 ------TQDLKKKLGQIRNQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLLNAnlsEDEK 1687
Cdd:COG4717 284 gllallFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREA---EELE 360
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1688 EQLKkLMELKQSLECNLDQemkKNVELEREItgfknllkmtRKKLNEYEngefsfhgdlktsqfemdiQINKLKHKIDDL 1767
Cdd:COG4717 361 EELQ-LEELEQEIAALLAE---AGVEDEEEL----------RAALEQAE-------------------EYQELKEELEEL 407
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1768 TAELEtagskclhldtknQILQEELLSMKTVQKkcEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQYKQEIEE----- 1842
Cdd:COG4717 408 EEQLE-------------ELLGELEELLEALDE--EELEEELEELEEELEELEEELEELREELAELEAELEQLEEdgela 472
|
490 500 510
....*....|....*....|....*....|....*
gi 1034567240 1843 RARQEIAEKLKEVNLFLQ---AQAASQENLEQFRE 1874
Cdd:COG4717 473 ELLQELEELKAELRELAEewaALKLALELLEEARE 507
|
|
| Ank_4 |
pfam13637 |
Ankyrin repeats (many copies); |
146-199 |
2.05e-05 |
|
Ankyrin repeats (many copies);
Pssm-ID: 372654 [Multi-domain] Cd Length: 54 Bit Score: 43.80 E-value: 2.05e-05
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....
gi 1034567240 146 GNTALHYAVYNEDISVATKLLLYDANIEAKNKDDLTPLLLAVSGKKQQMVEFLI 199
Cdd:pfam13637 1 ELTALHAAAASGHLELLRLLLEKGADINAVDGNGETALHFAASNGNVEVLKLLL 54
|
|
| PTZ00322 |
PTZ00322 |
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional |
51-179 |
2.18e-05 |
|
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
Pssm-ID: 140343 [Multi-domain] Cd Length: 664 Bit Score: 49.51 E-value: 2.18e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 51 HKAASAGNVAKvqQILLLRKNGLNDRDKMNRTALHLACANGHPEVVTLlvdrkcqlnvcdnenrtalmkavqcqeekcat 130
Cdd:PTZ00322 88 QLAASGDAVGA--RILLTGGADPNCRDYDGRTPLHIACANGHVQVVRV-------------------------------- 133
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|..
gi 1034567240 131 iLLEHGADPNLADVHGNTALHYAVYNEDISVATKLLLY---DANIEAKNKDD 179
Cdd:PTZ00322 134 -LLEFGADPTLLDKDGKTPLELAEENGFREVVQLLSRHsqcHFELGANAKPD 184
|
|
| SCP-1 |
pfam05483 |
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major ... |
1085-1629 |
3.57e-05 |
|
Synaptonemal complex protein 1 (SCP-1); Synaptonemal complex protein 1 (SCP-1) is the major component of the transverse filaments of the synaptonemal complex. Synaptonemal complexes are structures that are formed between homologous chromosomes during meiotic prophase.
Pssm-ID: 114219 [Multi-domain] Cd Length: 787 Bit Score: 48.95 E-value: 3.57e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1085 KIQDAALSCERLLELKKNHCELLTVKIKKMEDKVNVLQRELSETKEIKSQLEhQKVEWERElcslrfSLNQEEEKRRNAD 1164
Cdd:pfam05483 223 KIQHLEEEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLE-EKTKLQDE------NLKELIEKKDHLT 295
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1165 TLYEKIREQLRRKEEqyrkevevkqqlelSLQTLEMELRTVKSNLNQVVQERNDAQRQLSREQNAR-MLQDGILTNHLSK 1243
Cdd:pfam05483 296 KELEDIKMSLQRSMS--------------TQKALEEDLQIATKTICQLTEEKEAQMEELNKAKAAHsFVVTEFEATTCSL 361
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1244 QKEIEMAQKKMNSENSHSHEEEKDLSHKNSMLQEEIAML---RLEIDTIKNQNQEKEKKCFEdlkivKEKNEDLQKTIKQ 1320
Cdd:pfam05483 362 EELLRTEQQRLEKNEDQLKIITMELQKKSSELEEMTKFKnnkEVELEELKKILAEDEKLLDE-----KKQFEKIAEELKG 436
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1321 NEETLTQTISQYNGRLSVLTAENAMLNSKLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELelaFQRARDECS 1400
Cdd:pfam05483 437 KEQELIFLLQAREKEIHDLEIQLTAIKTSEEHYLKEVEDLKTELEKEKLKNIELTAHCDKLLLENKEL---TQEASDMTL 513
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1401 RLQDKMNfDVSNLKDNNEILSQQLFKTESKLNSLEIEFHHTRDALREK----TLGLERVQKDLSQTQCQMKEMEQKYQNE 1476
Cdd:pfam05483 514 ELKKHQE-DIINCKKQEERMLKQIENLEEKEMNLRDELESVREEFIQKgdevKCKLDKSEENARSIEYEVLKKEKQMKIL 592
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1477 QVKVNKYIGKQESVEERLSQLQSENMLLR-------QQLDDAHNKADNKEKTVINIQDQFHAIVQKLQAE------SEKQ 1543
Cdd:pfam05483 593 ENKCNNLKKQIENKNKNIEELHQENKALKkkgsaenKQLNAYEIKVNKLELELASAKQKFEEIIDNYQKEiedkkiSEEK 672
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1544 SLLLEERNKELISECNHLKERQYQYENEKAEREVVVRQLQQELADTLKKQSMSEASL-----EVTSRYRINLEDETQDLK 1618
Cdd:pfam05483 673 LLEEVEKAKAIADEAVKLQKEIDKRCQHKIAEMVALMEKHKHQYDKIIEERDSELGLyknkeQEQSSAKAALEIELSNIK 752
|
570
....*....|.
gi 1034567240 1619 KKLGQIRNQLQ 1629
Cdd:pfam05483 753 AELLSLKKQLE 763
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
1085-1366 |
4.29e-05 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 48.90 E-value: 4.29e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1085 KIQDAALSCERLLELKKNHCELLTVKIKKMEDKVNVLQRELSETKEIKSQLEHQKVEWERELCSLRFSLNQEEEKRRNAD 1164
Cdd:TIGR02168 716 QLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLK 795
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1165 TLYEKIREQLRRKEEQYRKEVEVKQQLELSLQTLEMELRTVKSNLNQVVQERNDAQRQLSR--------EQNARMLQDGI 1236
Cdd:TIGR02168 796 EELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESlaaeieelEELIEELESEL 875
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1237 ltNHLSKQK------------EIEMAQKKMNSENSHSHEEEKDLSHKNSML---QEEIAMLRLEIDTIKNQNQEKEKKCF 1301
Cdd:TIGR02168 876 --EALLNERasleealallrsELEELSEELRELESKRSELRRELEELREKLaqlELRLEGLEVRIDNLQERLSEEYSLTL 953
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034567240 1302 EDLKIVKEKNEDLQKTIKQNEETLTQTI--------------SQYNGRLSVLTAENAMLNSKLENEKQSKERLEAEVES 1366
Cdd:TIGR02168 954 EEAEALENKIEDDEEEARRRLKRLENKIkelgpvnlaaieeyEELKERYDFLTAQKEDLTEAKETLEEAIEEIDREARE 1032
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1580-1792 |
4.35e-05 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 48.22 E-value: 4.35e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1580 RQLQQELADTLKKQSMSEASLEVTSRYRINLEDETQDLKKKLGQIRNQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAKL 1659
Cdd:COG4942 23 AEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIAELRAELEAQ 102
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1660 KVTVKKQMDKIEEL-QKNLLNANLSEDEKEQLKKLMELKQSLECNLDQEMKKNVELEREITGFKNLLKMTRKKLNEYENG 1738
Cdd:COG4942 103 KEELAELLRALYRLgRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELEAERAELEALLAE 182
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....
gi 1034567240 1739 EFSFHGDLKTSQFEMDIQINKLKHKIDDLTAELETAGSKCLHLDTKNQILQEEL 1792
Cdd:COG4942 183 LEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEA 236
|
|
| SMC_prok_A |
TIGR02169 |
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ... |
1112-1536 |
5.61e-05 |
|
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain] Cd Length: 1164 Bit Score: 48.53 E-value: 5.61e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1112 KKMEDKVNVLQRELSETKEIKSQLEHQKVEWERELCSLRFSLNQEEEKRRNAdtlyEKIREQLRRKEEQYRKEVEvkqQL 1191
Cdd:TIGR02169 670 RSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEI----EKEIEQLEQEEEKLKERLE---EL 742
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1192 ELSLQTLEmelrtvksnlnqvvQERNDAQRQLsrEQNARMLQDgiltnhlsKQKEIEMAQKKMNS-ENSHSHEEEKDLSH 1270
Cdd:TIGR02169 743 EEDLSSLE--------------QEIENVKSEL--KELEARIEE--------LEEDLHKLEEALNDlEARLSHSRIPEIQA 798
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1271 KNSMLQEEIAMLRLEIDTI-KNQNQEKEKKCFEDLKI--VKEKNEDLQKTIKQNEETLtqtisqyngrlsvltaenAMLN 1347
Cdd:TIGR02169 799 ELSKLEEEVSRIEARLREIeQKLNRLTLEKEYLEKEIqeLQEQRIDLKEQIKSIEKEI------------------ENLN 860
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1348 SKLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELelafQRARDECSRLQDKMNFDVSNLKDNNEILSQQLFKT 1427
Cdd:TIGR02169 861 GKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLREL----ERKIEELEAQIEKKRKRLSELKAKLEALEEELSEI 936
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1428 ESKLNSLEIEfhhtrdalREKTLGLERVQKDLSQTQCQMKEMEQkyqneqvkVN-KYIGKQESVEERLSQLQSENMLL-- 1504
Cdd:TIGR02169 937 EDPKGEDEEI--------PEEELSLEDVQAELQRVEEEIRALEP--------VNmLAIQEYEEVLKRLDELKEKRAKLee 1000
|
410 420 430
....*....|....*....|....*....|....*...
gi 1034567240 1505 -RQQLDDAHNKADNKEKTV-----INIQDQFHAIVQKL 1536
Cdd:TIGR02169 1001 eRKAILERIEEYEKKKREVfmeafEAINENFNEIFAEL 1038
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1525-1945 |
6.03e-05 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 48.23 E-value: 6.03e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1525 IQDQFHAIVQKLQAESEKQSLLLEERNKELISECNHLKERQYQYenekaerevvvRQLQQELADTLKKQSMSEASLEVTS 1604
Cdd:COG4717 47 LLERLEKEADELFKPQGRKPELNLKELKELEEELKEAEEKEEEY-----------AELQEELEELEEELEELEAELEELR 115
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1605 RYRINLEDETQ--DLKKKLGQIRNQLQEAQDRHTEAVrcaEKMQDHKQkLEKDNAKLKVTVKKQMDKIEELQKNLLNANL 1682
Cdd:COG4717 116 EELEKLEKLLQllPLYQELEALEAELAELPERLEELE---ERLEELRE-LEEELEELEAELAELQEELEELLEQLSLATE 191
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1683 SEDE--KEQLKKLMELKQSLECNLDQEMKKNVELEREITGFKNLLKMTRKKLNEYENGEFSFHGDLKTSQFEMDIQINKL 1760
Cdd:COG4717 192 EELQdlAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERLKEARLLLLIAAALLALLGLGGSLLSL 271
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1761 KHKIDDLTAELETAGSKCLHLDTKNQILQEELLSMKTVQKKCEKLQKNKKK--LEQEVINLRSHIERNMVELGQVKQYKQ 1838
Cdd:COG4717 272 ILTIAGVLFLVLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEelLAALGLPPDLSPEELLELLDRIEELQE 351
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1839 EIEERARQEIAEKLKEVNLFLQA--QAASQENLEQFRE-----NNFASMKSQMELRIKDLESELSKIKTSQEDFNKTELE 1911
Cdd:COG4717 352 LLREAEELEEELQLEELEQEIAAllAEAGVEDEEELRAaleqaEEYQELKEELEELEEQLEELLGELEELLEALDEEELE 431
|
410 420 430
....*....|....*....|....*....|....
gi 1034567240 1912 KYKQLYLEELkvrKSLSSKLTKTNERLAEVNTKL 1945
Cdd:COG4717 432 EELEELEEEL---EELEEELEELREELAELEAEL 462
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1068-1549 |
6.29e-05 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 48.23 E-value: 6.29e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1068 LLIEQLGmecKDSVSLLKIQDaalsceRLLELKKNHCELLTVKIKKMEDKvnvlQRELSETKEIKSQLEHQKVEWERELC 1147
Cdd:COG4717 46 MLLERLE---KEADELFKPQG------RKPELNLKELKELEEELKEAEEK----EEEYAELQEELEELEEELEELEAELE 112
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1148 SLRFSLNQEEEKRRNADTL--YEKIREQLRRKEEQYRkevEVKQQLElSLQTLEMELRTVKSNLNQVVQERNDAQRQLSR 1225
Cdd:COG4717 113 ELREELEKLEKLLQLLPLYqeLEALEAELAELPERLE---ELEERLE-ELRELEEELEELEAELAELQEELEELLEQLSL 188
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1226 EQNARMLQDGILTNHLSKQKEIEMAQKKMNSENSHSHEEEKDLSHKNSMLQEE------------IAMLRLEIDTIKNQN 1293
Cdd:COG4717 189 ATEEELQDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALeerlkearllllIAAALLALLGLGGSL 268
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1294 QEKEKKCFEDLKIVKEKNEDLQKTIKQNEETLTQTISQYNGRLSVLTAENAMLNSKLENEKQSKERLEAEVESYHSRLAA 1373
Cdd:COG4717 269 LSLILTIAGVLFLVLGLLALLFLLLAREKASLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEE 348
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1374 AIHDRDQSETSKRELELafQRARDECSRLQDKmnFDVSNLKDNNEILSQ--QLFKTESKLNSLEIEFHHTRDALRE--KT 1449
Cdd:COG4717 349 LQELLREAEELEEELQL--EELEQEIAALLAE--AGVEDEEELRAALEQaeEYQELKEELEELEEQLEELLGELEEllEA 424
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1450 LGLERVQKDLSQTQCQMKEMEQKYQNEQvkvnkyiGKQESVEERLSQLQSENML--LRQQLDDAHNKADNKEKTVINIQD 1527
Cdd:COG4717 425 LDEEELEEELEELEEELEELEEELEELR-------EELAELEAELEQLEEDGELaeLLQELEELKAELRELAEEWAALKL 497
|
490 500
....*....|....*....|...
gi 1034567240 1528 QFHAIVQKLQ-AESEKQSLLLEE 1549
Cdd:COG4717 498 ALELLEEAREeYREERLPPVLER 520
|
|
| PRK05771 |
PRK05771 |
V-type ATP synthase subunit I; Validated |
1661-1935 |
6.61e-05 |
|
V-type ATP synthase subunit I; Validated
Pssm-ID: 235600 [Multi-domain] Cd Length: 646 Bit Score: 48.00 E-value: 6.61e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1661 VTVKKQMDK-IEELQKnlLNA----NLSEDEK-EQLKKLMELKQSLECNLDQeMKKNVELEReiTGFKNLLKMTRKKLNE 1734
Cdd:PRK05771 12 VTLKSYKDEvLEALHE--LGVvhieDLKEELSnERLRKLRSLLTKLSEALDK-LRSYLPKLN--PLREEKKKVSVKSLEE 86
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1735 YENGEFSFHGDLKTSQFEMDIQINKLKHKIDDLTAELETAgSKCLHLDtknqILQEELLSMKTVQKKCEKLQKNKKKLEQ 1814
Cdd:PRK05771 87 LIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIERL-EPWGNFD----LDLSLLLGFKYVSVFVGTVPEDKLEELK 161
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1815 EVINLRSHIErnmvelgqVKQYKQE------IEERARQEIAEKLKEVNLflqaqaasqENLEQFRENNFASMKSQMELRI 1888
Cdd:PRK05771 162 LESDVENVEY--------ISTDKGYvyvvvvVLKELSDEVEEELKKLGF---------ERLELEEEGTPSELIREIKEEL 224
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1889 KDLESELSKIKTSQEDFNKtELEKYKQLYLEEL---KVRKSLSSKLTKTN 1935
Cdd:PRK05771 225 EEIEKERESLLEELKELAK-KYLEELLALYEYLeieLERAEALSKFLKTD 273
|
|
| PRK02224 |
PRK02224 |
DNA double-strand break repair Rad50 ATPase; |
1358-1921 |
6.68e-05 |
|
DNA double-strand break repair Rad50 ATPase;
Pssm-ID: 179385 [Multi-domain] Cd Length: 880 Bit Score: 48.11 E-value: 6.68e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1358 ERLEAEVESYHSRLAAAIHDRDQSETSKR--ELELAFQRARDECSRLQDKMNFDVSNLKDNNEILSQQLFKTEsKLNSLE 1435
Cdd:PRK02224 179 ERVLSDQRGSLDQLKAQIEEKEEKDLHERlnGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERRE-ELETLE 257
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1436 IEFhhtrDALREKTLGLERVQKDLSQT----QCQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQSENMLLRQQLDDA 1511
Cdd:PRK02224 258 AEI----EDLRETIAETEREREELAEEvrdlRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEEC 333
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1512 HNKADNKEKtviniqdqfhaivqklQAESEKQSLL-LEERNKELISECNHLKERQYQYENEKAEREVVVRQLQQELADTL 1590
Cdd:PRK02224 334 RVAAQAHNE----------------EAESLREDADdLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELR 397
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1591 KKQSMSEASLEVTSRYRINLEDETQDLKKKLGQIRNQLQEAQDRHTEAVR------CAEKMQDHKQKLEKDnaklkvTVK 1664
Cdd:PRK02224 398 ERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEAlleagkCPECGQPVEGSPHVE------TIE 471
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1665 KQMDKIEELqknllnanlsEDEKEQLK-KLMELKQSLEcnldqEMKKNVELEREITGFKNLLKMTRKKLNEYENGefsfh 1743
Cdd:PRK02224 472 EDRERVEEL----------EAELEDLEeEVEEVEERLE-----RAEDLVEAEDRIERLEERREDLEELIAERRET----- 531
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1744 gdlktsQFEMDIQINKLKHKIDDLTAELETAGSKCLHLDTKNQILQEELlsmktvqKKCE-KLQKNKKKLEQevinlrsh 1822
Cdd:PRK02224 532 ------IEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEV-------AELNsKLAELKERIES-------- 590
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1823 IERNMVELGQVKQYKQEIEERA--RQEIAEKLKEvnlflqaqaaSQENLEQFREnnfasmksqmelRIKDLESEL--SKI 1898
Cdd:PRK02224 591 LERIRTLLAAIADAEDEIERLRekREALAELNDE----------RRERLAEKRE------------RKRELEAEFdeARI 648
|
570 580
....*....|....*....|...
gi 1034567240 1899 KTSQEDfnKTELEKYKQLYLEEL 1921
Cdd:PRK02224 649 EEARED--KERAEEYLEQVEEKL 669
|
|
| PRK11281 |
PRK11281 |
mechanosensitive channel MscK; |
1307-1511 |
6.97e-05 |
|
mechanosensitive channel MscK;
Pssm-ID: 236892 [Multi-domain] Cd Length: 1113 Bit Score: 48.37 E-value: 6.97e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1307 VKEKNEDLQKTIKQneetLTQTISQYNGRLSVLTAEN-------------AMLNSKLENEKQSKERLEAEVESYHSRLAA 1373
Cdd:PRK11281 78 QKEETEQLKQQLAQ----APAKLRQAQAELEALKDDNdeetretlstlslRQLESRLAQTLDQLQNAQNDLAEYNSQLVS 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1374 AihdRDQSETSKRELELAFQRArdecsrlqDKMNFDVSNLKDNNEILS---QQLFKTESKLNSLEIEFHHT--------- 1441
Cdd:PRK11281 154 L---QTQPERAQAALYANSQRL--------QQIRNLLKGGKVGGKALRpsqRVLLQAEQALLNAQNDLQRKslegntqlq 222
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1442 ------RDALREKTLGLERV---------QKDLSQTQCQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQ---------- 1496
Cdd:PRK11281 223 dllqkqRDYLTARIQRLEHQlqllqeainSKRLTLSEKTVQEAQSQDEAARIQANPLVAQELEINLQLSQrllkatekln 302
|
250
....*....|....*.
gi 1034567240 1497 -LQSENMLLRQQLDDA 1511
Cdd:PRK11281 303 tLTQQNLRVKNWLDRL 318
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
1094-1542 |
8.68e-05 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 47.75 E-value: 8.68e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1094 ERLLELKKNHCELltVKIKKMEDKVNVLQRELSETKEIKSQLEHQKVEWERELCSLRFSLNQEEEKRRNADTLYEKIREQ 1173
Cdd:PRK03918 273 KEIEELEEKVKEL--KELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKEL 350
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1174 LRRKEEqyrkevevkqqLELSLQTLEmELRTVKSNLNQVVQERNDaqrqLSREQNARMLQDgiltnhLSKQK-EIEMAQK 1252
Cdd:PRK03918 351 EKRLEE-----------LEERHELYE-EAKAKKEELERLKKRLTG----LTPEKLEKELEE------LEKAKeEIEEEIS 408
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1253 KMNSENSHSHEEEKDLSHKNSMLQ-------------------EEIAMLRLEIDTIKNQNQEKEKKcFEDLKIVKEKNED 1313
Cdd:PRK03918 409 KITARIGELKKEIKELKKAIEELKkakgkcpvcgrelteehrkELLEEYTAELKRIEKELKEIEEK-ERKLRKELRELEK 487
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1314 L---QKTIKQNEETLTQtISQYNGRLSVLTAENamLNSKLENEKQSKERLeAEVESYHSRLAAAIHDRDQSETSKRELEL 1390
Cdd:PRK03918 488 VlkkESELIKLKELAEQ-LKELEEKLKKYNLEE--LEKKAEEYEKLKEKL-IKLKGEIKSLKKELEKLEELKKKLAELEK 563
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1391 AFQRARDECSRLQDKM-NFDVSNLKDNNEILSQ---------QLFKTESKLNSLEIEFHHTRDALREKTLGLERVQKDLS 1460
Cdd:PRK03918 564 KLDELEEELAELLKELeELGFESVEELEERLKElepfyneylELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLE 643
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1461 QTQCQMKEMEQKYQNEqvkvnkyigKQESVEERLSQLQSENMLLRQQLDDAHNKADNKEKTVINIQDQFHAIvQKLQAES 1540
Cdd:PRK03918 644 ELRKELEELEKKYSEE---------EYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEER-EKAKKEL 713
|
..
gi 1034567240 1541 EK 1542
Cdd:PRK03918 714 EK 715
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
1487-1737 |
9.78e-05 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 47.70 E-value: 9.78e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1487 QESVEERLSQLQSENMLLR----QQLDDAHNKADNKEKTVINiqdqfhAIVQKLQAESEKQSLLLE----ERNKELISE- 1557
Cdd:COG3206 80 DSPLETQIEILKSRPVLERvvdkLNLDEDPLGEEASREAAIE------RLRKNLTVEPVKGSNVIEisytSPDPELAAAv 153
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1558 CNHLKE--RQYQYENEKAEREVVVRQLQQELADTLKKQSMSEASLEvtsRYR-----INLEDETQDLKKKLGQIRNQLQE 1630
Cdd:COG3206 154 ANALAEayLEQNLELRREEARKALEFLEEQLPELRKELEEAEAALE---EFRqknglVDLSEEAKLLLQQLSELESQLAE 230
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1631 AQDRHTEAVRCAEKMQDHKQKLEKDNAKLK--VTVKKQMDKIEELQKNLlnANLSE---DEKEQLKKLMELKQSLECNLD 1705
Cdd:COG3206 231 ARAELAEAEARLAALRAQLGSGPDALPELLqsPVIQQLRAQLAELEAEL--AELSArytPNHPDVIALRAQIAALRAQLQ 308
|
250 260 270
....*....|....*....|....*....|...
gi 1034567240 1706 QEMKKN-VELEREITGFKNLLKMTRKKLNEYEN 1737
Cdd:COG3206 309 QEAQRIlASLEAELEALQAREASLQAQLAQLEA 341
|
|
| Ank_5 |
pfam13857 |
Ankyrin repeats (many copies); |
98-153 |
1.07e-04 |
|
Ankyrin repeats (many copies);
Pssm-ID: 433530 [Multi-domain] Cd Length: 56 Bit Score: 41.95 E-value: 1.07e-04
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*.
gi 1034567240 98 LLVDRKCQLNVCDNENRTALMKAVQCQEEKCATILLEHGADPNLADVHGNTALHYA 153
Cdd:pfam13857 1 LLEHGPIDLNRLDGEGYTPLHVAAKYGALEIVRVLLAYGVDLNLKDEEGLTALDLA 56
|
|
| EzrA |
pfam06160 |
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like ... |
1618-1950 |
1.36e-04 |
|
Septation ring formation regulator, EzrA; During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. The structure contains 5 spectrin like alpha helical repeats.
Pssm-ID: 428797 [Multi-domain] Cd Length: 542 Bit Score: 46.77 E-value: 1.36e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1618 KKKLGQIRNQLQEAQDRhteavrcAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLLNANLS-------------- 1683
Cdd:pfam06160 85 KKALDEIEELLDDIEED-------IKQILEELDELLESEEKNREEVEELKDKYRELRKTLLANRFSygpaidelekqlae 157
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1684 -EDEKEQLKKLMELKQSLECN--LDQEMKKNVELEREITGFKNLLKMTRK----KLNEYENGefsfHGDLKTSQF----- 1751
Cdd:pfam06160 158 iEEEFSQFEELTESGDYLEARevLEKLEEETDALEELMEDIPPLYEELKTelpdQLEELKEG----YREMEEEGYalehl 233
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1752 EMDIQINKLKHKIDDLTAELETagskcLHLDT---KNQILQEELLSM-----------KTVQKKCEKLQKNKKKLEQEVI 1817
Cdd:pfam06160 234 NVDKEIQQLEEQLEENLALLEN-----LELDEaeeALEEIEERIDQLydllekevdakKYVEKNLPEIEDYLEHAEEQNK 308
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1818 NLRSHIER-------NMVELGQVKQYKQEIE--ERARQEIAEKLKEvnlflQAQAasqenleqfrennfasmKSQMELRI 1888
Cdd:pfam06160 309 ELKEELERvqqsytlNENELERVRGLEKQLEelEKRYDEIVERLEE-----KEVA-----------------YSELQEEL 366
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034567240 1889 KDLESELSKIKTSQEDFNktelEKYKQLYLEELKVRKslssKLTKTNERLAEVntKLLVEKQ 1950
Cdd:pfam06160 367 EEILEQLEEIEEEQEEFK----ESLQSLRKDELEARE----KLDEFKLELREI--KRLVEKS 418
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1612-1884 |
1.37e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 46.68 E-value: 1.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1612 DETQDLKKKLGQIRNQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLlnANLSEDEKEQLK 1691
Cdd:COG4942 20 DAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAEL--AELEKEIAELRA 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1692 KLMELKQSLEcnldqemkknvelereitgfKNLLKMTRKKLNEYENGEFSFHGDLKTSQFEMDIQ--INKLKHKIDDLTA 1769
Cdd:COG4942 98 ELEAQKEELA--------------------ELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKylAPARREQAEELRA 157
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1770 ELETagskclhLDTKNQILQEELLSMKTVQKKCEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQYKQEIEERARQEIA 1849
Cdd:COG4942 158 DLAE-------LAALRAELEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIA 230
|
250 260 270
....*....|....*....|....*....|....*
gi 1034567240 1850 EklkevnlfLQAQAASQEnlEQFRENNFASMKSQM 1884
Cdd:COG4942 231 R--------LEAEAAAAA--ERTPAAGFAALKGKL 255
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1110-1469 |
1.48e-04 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 47.22 E-value: 1.48e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1110 KIKKMEDKVNVLQRELSETKEIKSQLEHQKVEWERELCSLRFSLNQEEEKRRNADTlyekiREQLRRKEEQYRkevevkq 1189
Cdd:COG4913 611 KLAALEAELAELEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASA-----EREIAELEAELE------- 678
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1190 QLELS---LQTLEMELRTVKSNLNQVVQERNDAQRQLSREQNARmlqdgiltnhlsKQKEIEMAQKKMNSENSHSHEEEK 1266
Cdd:COG4913 679 RLDASsddLAALEEQLEELEAELEELEEELDELKGEIGRLEKEL------------EQAEEELDELQDRLEAAEDLARLE 746
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1267 DLSHKNSMLQEEIAmlrleidtiknqnQEKEKKCFEDLkivKEKNEDLQKTIKQNEETLTQTISQYNGRLSVLTAE---- 1342
Cdd:COG4913 747 LRALLEERFAAALG-------------DAVERELRENL---EERIDALRARLNRAEEELERAMRAFNREWPAETADldad 810
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1343 -------NAMLNsKLENekqskERLEAevesYHSRLAAAIHdrDQSETSKRELELAFQRARDECSRlqdkmnfdvsNLKD 1415
Cdd:COG4913 811 leslpeyLALLD-RLEE-----DGLPE----YEERFKELLN--ENSIEFVADLLSKLRRAIREIKE----------RIDP 868
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1034567240 1416 NNEILSQQLFKTESKLnSLEI------EFHHTRDALREKTLGLERVQKDLSQTQC-QMKEM 1469
Cdd:COG4913 869 LNDSLKRIPFGPGRYL-RLEArprpdpEVREFRQELRAVTSGASLFDEELSEARFaALKRL 928
|
|
| Ank |
pfam00023 |
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ... |
81-111 |
2.00e-04 |
|
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin- actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity.
Pssm-ID: 459634 [Multi-domain] Cd Length: 34 Bit Score: 40.35 E-value: 2.00e-04
10 20 30
....*....|....*....|....*....|..
gi 1034567240 81 RTALHLACA-NGHPEVVTLLVDRKCQLNVCDN 111
Cdd:pfam00023 3 NTPLHLAAGrRGNLEIVKLLLSKGADVNARDK 34
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
1567-1695 |
2.07e-04 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 46.31 E-value: 2.07e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1567 QYENEKAEREVVVRQLQQElADTLKKQSMSEASLEVtSRYRINLEDETQDLKKKLGQIRNQLQ---EAQDRHTEAVrcaE 1643
Cdd:PRK12704 32 KIKEAEEEAKRILEEAKKE-AEAIKKEALLEAKEEI-HKLRNEFEKELRERRNELQKLEKRLLqkeENLDRKLELL---E 106
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 1034567240 1644 KMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLLN-----ANLSEDE-KEQLKKLME 1695
Cdd:PRK12704 107 KREEELEKKEKELEQKQQELEKKEEELEELIEEQLQeleriSGLTAEEaKEILLEKVE 164
|
|
| TRPV |
cd21882 |
Transient Receptor Potential channel, Vanilloid subfamily (TRPV); The vanilloid TRP subfamily ... |
81-202 |
2.38e-04 |
|
Transient Receptor Potential channel, Vanilloid subfamily (TRPV); The vanilloid TRP subfamily (TRPV), named after the vanilloid receptor 1 (TRPV1), consists of six members: four thermo-sensing channels (TRPV1, TRPV2, TRPV3, and TRPV4) and two Ca2+ selective channels (TRPV5 and TRPV6). The calcium-selective channels TRPV5 and TRPV6 can be heterotetramers and are important for general Ca2+ homeostasis. All four channels within the TRPV1-4 group show temperature-invoked currents when expressed in heterologous cell systems, ranging from activation at ~25C for TRPV4 to ~52C for TRPV2. The structure of TRPV shows the typical topology features of all Transient Receptor Potential (TRP) ion channel family members, such as six transmembrane regions, a short hydrophobic stretch between transmembrane segments 5 and 6 and large intracellular N- and C-terminal domains. The TRP family consists of membrane proteins that function as ion channels that communicate between the cell and its environment, by a vast array of physical or chemical stimuli, including radiation (in the form of temperature, infrared ,or light) and pressure (osmotic or mechanical). TRP channels are formed by a tetrameric complex of channel subunits. Based on sequence identity, the mammalian TRP channel family is classified into six subfamilies, with significant sequence similarity within the transmembrane domains, but very low similarity in their N- and C-terminal cytoplasmic regions. The six subfamilies are named based on their first member: TRPC (canonical), TRPV (vanilloid), TRPM (melastatin), TRPA (ankyrin), TRPML (mucolipin), and TRPP (polycystic).
Pssm-ID: 411975 [Multi-domain] Cd Length: 600 Bit Score: 46.03 E-value: 2.38e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 81 RTALHLACANGHPEVVTLLVDRKC--QLNVCDNENRTA-----------LMKAVQCQEEKCATILLEHGADP---NLADV 144
Cdd:cd21882 74 QTALHIAIENRNLNLVRLLVENGAdvSARATGRFFRKSpgnlfyfgelpLSLAACTNQEEIVRLLLENGAQPaalEAQDS 153
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1034567240 145 HGNTALHYAVYNED---------ISVATKLLLYDANI-------EAKNKDDLTPLLLAVSGKKQQMVEFLIKKK 202
Cdd:cd21882 154 LGNTVLHALVLQADntpensafvCQMYNLLLSYGAHLdptqqleEIPNHQGLTPLKLAAVEGKIVMFQHILQRE 227
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1470-1689 |
2.41e-04 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 45.98 E-value: 2.41e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1470 EQKYQNEQVKVNKYIGKQESVEERLSQLQsenmllrQQLDDAHNKADNKEKTVINIQDQfhaiVQKLQAESEKQSLLLEE 1549
Cdd:COG3883 15 DPQIQAKQKELSELQAELEAAQAELDALQ-------AELEELNEEYNELQAELEALQAE----IDKLQAEIAEAEAEIEE 83
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1550 RNKELisecnhlKER-QYQYENEKA-----------------EREVVVRQLQQELADTLKKQSMSEASLEvtsRYRINLE 1611
Cdd:COG3883 84 RREEL-------GERaRALYRSGGSvsyldvllgsesfsdflDRLSALSKIADADADLLEELKADKAELE---AKKAELE 153
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034567240 1612 DETQDLKKKLGQIRNQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLLNANLSEDEKEQ 1689
Cdd:COG3883 154 AKLAELEALKAELEAAKAELEAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAA 231
|
|
| COG1340 |
COG1340 |
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; |
1547-1855 |
3.11e-04 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 44.90 E-value: 3.11e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1547 LEERNKELISECNHLKERQYQYENEKAErevvVRQLQQELADTLKKqsmseaslevtsryrinLEDETQDLKKKLGQIRN 1626
Cdd:COG1340 13 LEEKIEELREEIEELKEKRDELNEELKE----LAEKRDELNAQVKE-----------------LREEAQELREKRDELNE 71
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1627 QLQEAQDRHTEAvrcAEKMQDHKQKLEKDNAKLKVTVKKQMD------KIEELQKNLLNANLS-EDEKEQLKKLMELKQS 1699
Cdd:COG1340 72 KVKELKEERDEL---NEKLNELREELDELRKELAELNKAGGSidklrkEIERLEWRQQTEVLSpEEEKELVEKIKELEKE 148
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1700 LEcnldqEMKKNVELEREITGFKNLLKMTRKKLNEYengefsfHGDLKTSQFEMDI---QINKLKHKIDDLTAELETAgs 1776
Cdd:COG1340 149 LE-----KAKKALEKNEKLKELRAELKELRKEAEEI-------HKKIKELAEEAQElheEMIELYKEADELRKEADEL-- 214
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034567240 1777 kclhldtknqilqeellsmktvQKKCEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQYKQEIEERARQEIAEKLKEV 1855
Cdd:COG1340 215 ----------------------HKEIVEAQEKADELHEEIIELQKELRELRKELKKLRKKQRALKREKEKEELEEKAEE 271
|
|
| MAD |
pfam05557 |
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ... |
1273-1855 |
3.23e-04 |
|
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
Pssm-ID: 461677 [Multi-domain] Cd Length: 660 Bit Score: 45.89 E-value: 3.23e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1273 SMLQEEIAMLRLEIDTIKNQNQEKEKKCFEDLKIVKEKNEDLQKTIKQNE--ETLTQTISQYNGRLSVLTAENAMLNSKL 1350
Cdd:pfam05557 12 SQLQNEKKQMELEHKRARIELEKKASALKRQLDRESDRNQELQKRIRLLEkrEAEAEEALREQAELNRLKKKYLEALNKK 91
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1351 ENEKQSKerlEAEVESYHSRLAAAIHDRdQSETSKRELELAFQRARDECSR----LQDKMNFDVSNLKDNNEILSQQLFK 1426
Cdd:pfam05557 92 LNEKESQ---LADAREVISCLKNELSEL-RRQIQRAELELQSTNSELEELQerldLLKAKASEAEQLRQNLEKQQSSLAE 167
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1427 TESKLNSLEIEFH-HTRDALrektlglerVQKDLSQTQCQMKEMEqkyqneqvkvnKYIGKQESVEERLSQLQSENMLLR 1505
Cdd:pfam05557 168 AEQRIKELEFEIQsQEQDSE---------IVKNSKSELARIPELE-----------KELERLREHNKHLNENIENKLLLK 227
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1506 QQLDDAHNKADNKEKT---VINIQDQFHAIVQKLQA-ESEKQSLLLE--------ERNKELISECNHLKERQYQYENEKA 1573
Cdd:pfam05557 228 EEVEDLKRKLEREEKYreeAATLELEKEKLEQELQSwVKLAQDTGLNlrspedlsRRIEQLQQREIVLKEENSSLTSSAR 307
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1574 EREVVVRQLQQELADTLKKqsmseaslevtsryrinledeTQDLKKKLGQIRNQLQEAQDRhteavrcaekmqdhkqkle 1653
Cdd:pfam05557 308 QLEKARRELEQELAQYLKK---------------------IEDLNKKLKRHKALVRRLQRR------------------- 347
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1654 kdnaklKVTVKKQMDKIEELQKN----LLNANLSEDEKEQLKKLMELKQSLEcNLDQEMKKNVE-LEREITGFKNLLKM- 1727
Cdd:pfam05557 348 ------VLLLTKERDGYRAILESydkeLTMSNYSPQLLERIEEAEDMTQKMQ-AHNEEMEAQLSvAEEELGGYKQQAQTl 420
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1728 -----TRKKLNEYENGEFSFHG--DLKTSQFEMDIQINKLKHKIDDLTAELEtagSKCLHLDTKNQILQEELLSMKTVQK 1800
Cdd:pfam05557 421 erelqALRQQESLADPSYSKEEvdSLRRKLETLELERQRLREQKNELEMELE---RRCLQGDYDPKKTKVLHLSMNPAAE 497
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|....*
gi 1034567240 1801 KCEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQYKQEIEERARQEIAEKLKEV 1855
Cdd:pfam05557 498 AYQQRKNQLEKLQAEIERLKRLLKKLEDDLEQVLRLPETTSTMNFKEVLDLRKEL 552
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1458-1638 |
3.56e-04 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 45.21 E-value: 3.56e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1458 DLSQTQCQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQSENMLLRQQLDDAHNKADNKEKTVINIQDQFHAIVQKLQ 1537
Cdd:COG3883 17 QIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALQAEIDKLQAEIAEAEAEIEERREELGERARALY 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1538 AESEKQSLL---------------------LEERNKELISEcnhLKERQYQYENEKAEREVVVRQLQQELADTLKKQSMS 1596
Cdd:COG3883 97 RSGGSVSYLdvllgsesfsdfldrlsalskIADADADLLEE---LKADKAELEAKKAELEAKLAELEALKAELEAAKAEL 173
|
170 180 190 200
....*....|....*....|....*....|....*....|..
gi 1034567240 1597 EASLEVTSRYRINLEDETQDLKKKLGQIRNQLQEAQDRHTEA 1638
Cdd:COG3883 174 EAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAA 215
|
|
| Cast |
pfam10174 |
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part ... |
1171-1835 |
4.70e-04 |
|
RIM-binding protein of the cytomatrix active zone; This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Pssm-ID: 431111 [Multi-domain] Cd Length: 766 Bit Score: 45.20 E-value: 4.70e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1171 REQLRRKEE---------QYRKEVEVKQQLELSLQTLEMELRTVKsNLNQVVQ--------------------ERNDAQR 1221
Cdd:pfam10174 43 KERALRKEEaarisvlkeQYRVTQEENQHLQLTIQALQDELRAQR-DLNQLLQqdfttspvdgedkfstpeltEENFRRL 121
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1222 QLSREQNARML----------------QDGIL-TNHLSKQKEIEMAQKKMNSENSHSHEEEKdlshknsmlQEEIAMLRL 1284
Cdd:pfam10174 122 QSEHERQAKELfllrktleemelrietQKQTLgARDESIKKLLEMLQSKGLPKKSGEEDWER---------TRRIAEAEM 192
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1285 EIDTIKNQNQEKEKKCFEDLKIVKEKNEDLQ-----KTIKQNEETLTQTISQYNGRLSVLTAENAML--NSKLENEKQSK 1357
Cdd:pfam10174 193 QLGHLEVLLDQKEKENIHLREELHRRNQLQPdpaktKALQTVIEMKDTKISSLERNIRDLEDEVQMLktNGLLHTEDREE 272
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1358 ERLEAEVESYHSRLAAAIHDRDQSETSKRELEL-AFQrardecSRLqDKMNFDVSNLKDNNEILSQQLFKTESKLNSLEI 1436
Cdd:pfam10174 273 EIKQMEVYKSHSKFMKNKIDQLKQELSKKESELlALQ------TKL-ETLTNQNSDCKQHIEVLKESLTAKEQRAAILQT 345
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1437 EFHHTRDALREKTLGLERVQKDLSQTQ----CQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQSENMLLR------Q 1506
Cdd:pfam10174 346 EVDALRLRLEEKESFLNKKTKQLQDLTeeksTLAGEIRDLKDMLDVKERKINVLQKKIENLQEQLRDKDKQLAglkervK 425
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1507 QLDDAHNKADNKEKTVINIQDQFHAIVQKLQAESEKQSLLLEERNKELISECNHLKERQYQYENEKAEREVVVRQLQQel 1586
Cdd:pfam10174 426 SLQTDSSNTDTALTTLEEALSEKERIIERLKEQREREDRERLEELESLKKENKDLKEKVSALQPELTEKESSLIDLKE-- 503
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1587 adtlKKQSMSEASLEVTSRYRiNLEDETQDLKKKLGQIRNQLQEAQDRhTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQ 1666
Cdd:pfam10174 504 ----HASSLASSGLKKDSKLK-SLEIAVEQKKEECSKLENQLKKAHNA-EEAVRTNPEINDRIRLLEQEVARYKEESGKA 577
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1667 MDKIEELQKNLLNANLSEDEKEqlKKLMELKQSLECNLDQEMKKNVELEREITGfknllkMTRKKLNEYENGefsfhgdl 1746
Cdd:pfam10174 578 QAEVERLLGILREVENEKNDKD--KKIAELESLTLRQMKEQNKKVANIKHGQQE------MKKKGAQLLEEA-------- 641
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1747 ktSQFEMDIQINKLKHKIDDLTAELETAGSKC----LHLDTKNQILQ-----------------EELLSMK------TVQ 1799
Cdd:pfam10174 642 --RRREDNLADNSQQLQLEELMGALEKTRQELdatkARLSSTQQSLAekdghltnlraerrkqlEEILEMKqeallaAIS 719
|
730 740 750 760
....*....|....*....|....*....|....*....|..
gi 1034567240 1800 KK------CEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQ 1835
Cdd:pfam10174 720 EKdanialLELSSSKKKKTQEEVMALKREKDRLVHQLKQQTQ 761
|
|
| PRK10929 |
PRK10929 |
putative mechanosensitive channel protein; Provisional |
1116-1342 |
6.37e-04 |
|
putative mechanosensitive channel protein; Provisional
Pssm-ID: 236798 [Multi-domain] Cd Length: 1109 Bit Score: 45.04 E-value: 6.37e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1116 DKVNVLQRELSETKEIKSQLE----HQKV--EWERELCSLRFSLNQEEEKRRNadtlyekIREQLRRKE-EQyrKEVEVK 1188
Cdd:PRK10929 45 EIVEALQSALNWLEERKGSLErakqYQQVidNFPKLSAELRQQLNNERDEPRS-------VPPNMSTDAlEQ--EILQVS 115
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1189 QQL-ELS--LQTLEMELRTVKSNLNQVVQERNDAQRQLSreQNARMLQdgILTNHLSKQKEIEMAQKKMNSENSHSHEEE 1265
Cdd:PRK10929 116 SQLlEKSrqAQQEQDRAREISDSLSQLPQQQTEARRQLN--EIERRLQ--TLGTPNTPLAQAQLTALQAESAALKALVDE 191
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1266 KDLSHKNSMLQEEIAMLRLEI----------------DTIKNQNQEKEKKCFEDLKIVKEKNEDLQKTI----KQNEEtL 1325
Cdd:PRK10929 192 LELAQLSANNRQELARLRSELakkrsqqldaylqalrNQLNSQRQREAERALESTELLAEQSGDLPKSIvaqfKINRE-L 270
|
250
....*....|....*..
gi 1034567240 1326 TQTISQYNGRLSVLTAE 1342
Cdd:PRK10929 271 SQALNQQAQRMDLIASQ 287
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1276-1500 |
7.34e-04 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 44.37 E-value: 7.34e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1276 QEEIAMLRLEIDTIKNQNQEKEKKcfedLKIVKEKNEDLQKTIKQneetLTQTISQYNGRLSVLTAENAMLNSKLENEKQ 1355
Cdd:COG4942 19 ADAAAEAEAELEQLQQEIAELEKE----LAALKKEEKALLKQLAA----LERRIAALARRIRALEQELAALEAELAELEK 90
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1356 SKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARDECSRLQDKMNFDVSNLKDNNEILSQQLFKTESKLNSLE 1435
Cdd:COG4942 91 EIAELRAELEAQKEELAELLRALYRLGRQPPLALLLSPEDFLDAVRRLQYLKYLAPARREQAEELRADLAELAALRAELE 170
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1034567240 1436 IEFHHTRDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQSE 1500
Cdd:COG4942 171 AERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAE 235
|
|
| ANK |
smart00248 |
ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four ... |
79-108 |
8.03e-04 |
|
ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four consecutive copies. They are involved in protein-protein interactions. The core of the repeat seems to be an helix-loop-helix structure.
Pssm-ID: 197603 [Multi-domain] Cd Length: 30 Bit Score: 38.72 E-value: 8.03e-04
10 20 30
....*....|....*....|....*....|
gi 1034567240 79 MNRTALHLACANGHPEVVTLLVDRKCQLNV 108
Cdd:smart00248 1 DGRTPLHLAAENGNLEVVKLLLDKGADINA 30
|
|
| COG5022 |
COG5022 |
Myosin heavy chain [General function prediction only]; |
1112-1738 |
8.16e-04 |
|
Myosin heavy chain [General function prediction only];
Pssm-ID: 227355 [Multi-domain] Cd Length: 1463 Bit Score: 44.68 E-value: 8.16e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1112 KKMEDKVNVLQRELsETKEIKSQLEHQKVEWERELCSLRFSLNQEEEKRRNA---DTLYEKI---REQLRRKEEQYRKEV 1185
Cdd:COG5022 813 RSYLACIIKLQKTI-KREKKLRETEEVEFSLKAEVLIQKFGRSLKAKKRFSLlkkETIYLQSaqrVELAERQLQELKIDV 891
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1186 EVKQQLELSLQTLEMELRTVKSNLNQVVQERNDAQRQLSREQnARMLQDGILTNHLSKQKEIEMAQKKMNSENSHSHEEE 1265
Cdd:COG5022 892 KSISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARL-KKLLNNIDLEEGPSIEYVKLPELNKLHEVESKLKETS 970
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1266 KDLSHKNSMLQEEIAMLRLEIDTIKNQNQEKEKKcfedlkivKEKNEDLQKTIKQNEETltqtisqyNGRLSVLTAENAM 1345
Cdd:COG5022 971 EEYEDLLKKSTILVREGNKANSELKNFKKELAEL--------SKQYGALQESTKQLKEL--------PVEVAELQSASKI 1034
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1346 LNSKLENEKQSKErlEAEVESyhsrlaaaIHDRDQSETSKRELELAFQRARDECSRLQDKMNFDVSNLKDNNEILSQQLF 1425
Cdd:COG5022 1035 ISSESTELSILKP--LQKLKG--------LLLLENNQLQARYKALKLRRENSLLDDKQLYQLESTENLLKTINVKDLEVT 1104
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1426 KTESKLNSLEIEF-----------HHTRDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQV---KVNKYIGKQESV- 1490
Cdd:COG5022 1105 NRNLVKPANVLQFivaqmiklnllQEISKFLSQLVNTLEPVFQKLSVLQLELDGLFWEANLEALpspPPFAALSEKRLYq 1184
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1491 ----EERLSQLQSENMLLRQQLD---DAHNKADNKEKTVINIQDQ---FHAIVQKLQAESEKQSLLLEER---NKELISE 1557
Cdd:COG5022 1185 salyDEKSKLSSSEVNDLKNELIalfSKIFSGWPRGDKLKKLISEgwvPTEYSTSLKGFNNLNKKFDTPAsmsNEKLLSL 1264
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1558 CNHLKERQYQYENEKAEREVVVRQLQQELADTLKKQ-SMSEASLEVTSRYRINlEDETQDLKKKLGQIRNQLQEAQDRHT 1636
Cdd:COG5022 1265 LNSIDNLLSSYKLEEEVLPATINSLLQYINVGLFNAlRTKASSLRWKSATEVN-YNSEELDDWCREFEISDVDEELEELI 1343
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1637 EAVRCaekmqdhKQKLEKDNAKLKVTVkkqmdKIEELQKNLLNANLSEDEKEQLKklmelkqslECNLDQEMKKNV---- 1712
Cdd:COG5022 1344 QAVKV-------LQLLKDDLNKLDELL-----DACYSLNPAEIQNLKSRYDPADK---------ENNLPKEILKKIeall 1402
|
650 660
....*....|....*....|....*....
gi 1034567240 1713 ---ELEREITGFKNLLKMTRKKLNEYENG 1738
Cdd:COG5022 1403 ikqELQLSLEGKDETEVHLSEIFSEEKSL 1431
|
|
| ClpA |
COG0542 |
ATP-dependent Clp protease, ATP-binding subunit ClpA [Posttranslational modification, protein ... |
1397-1564 |
8.49e-04 |
|
ATP-dependent Clp protease, ATP-binding subunit ClpA [Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 440308 [Multi-domain] Cd Length: 836 Bit Score: 44.69 E-value: 8.49e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1397 DE-CSRLqdKMNfdvsnlkdnNEILSQQLFKTESKLNSLEIEfhhtRDAL-REKTLG----LERVQKDLSQTQCQMKEME 1470
Cdd:COG0542 396 DEaAARV--RME---------IDSKPEELDELERRLEQLEIE----KEALkKEQDEAsferLAELRDELAELEEELEALK 460
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1471 QKYQNEQVKVNKYIGKQESVEERLSQLQSenmlLRQQLDDAHNKADNKEKTVINI--QDQFHAIV--------QKLQaES 1540
Cdd:COG0542 461 ARWEAEKELIEEIQELKEELEQRYGKIPE----LEKELAELEEELAELAPLLREEvtEEDIAEVVsrwtgipvGKLL-EG 535
|
170 180
....*....|....*....|....*
gi 1034567240 1541 EKQSLL-LEErnkelisecnHLKER 1564
Cdd:COG0542 536 EREKLLnLEE----------ELHER 550
|
|
| Ank_3 |
pfam13606 |
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ... |
80-108 |
9.41e-04 |
|
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin- actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities.
Pssm-ID: 463933 [Multi-domain] Cd Length: 30 Bit Score: 38.39 E-value: 9.41e-04
10 20
....*....|....*....|....*....
gi 1034567240 80 NRTALHLACANGHPEVVTLLVDRKCQLNV 108
Cdd:pfam13606 2 GNTPLHLAARNGRLEIVKLLLENGADINA 30
|
|
| Rabaptin |
pfam03528 |
Rabaptin; |
1488-1708 |
9.92e-04 |
|
Rabaptin;
Pssm-ID: 367545 [Multi-domain] Cd Length: 486 Bit Score: 43.94 E-value: 9.92e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1488 ESVEERLSQLQSEN---MLLRQQLDDAHNKADNKektviniqdqFHAIVQKLQAESEKQSLLLEERNKELISECNHLKER 1564
Cdd:pfam03528 4 EDLQQRVAELEKENaefYRLKQQLEAEFNQKRAK----------FKELYLAKEEDLKRQNAVLQEAQVELDALQNQLALA 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1565 QYQYENEKAeREVVVRQLQQELADTLKKQ----------SMSEASLEVTSRYRINLEDE-------TQDLKKKLGQIRNQ 1627
Cdd:pfam03528 74 RAEMENIKA-VATVSENTKQEAIDEVKSQwqeevaslqaIMKETVREYEVQFHRRLEQEraqwnqyRESAEREIADLRRR 152
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1628 LQEAQDRhteavrcaEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLLNAnlsEDEKEQLK--KLMELKQSLE---- 1701
Cdd:pfam03528 153 LSEGQEE--------ENLEDEMKKAQEDAEKLRSVVMPMEKEIAALKAKLTEA---EDKIKELEasKMKELNHYLEaeks 221
|
....*..
gi 1034567240 1702 CNLDQEM 1708
Cdd:pfam03528 222 CRTDLEM 228
|
|
| PRK11281 |
PRK11281 |
mechanosensitive channel MscK; |
1110-1468 |
1.08e-03 |
|
mechanosensitive channel MscK;
Pssm-ID: 236892 [Multi-domain] Cd Length: 1113 Bit Score: 44.13 E-value: 1.08e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1110 KIKKMEDKVNVLQRELSETKEIKSQLEHQKveweRELCSLRFSLNQEEEKRRNADTLYEKIREQLrrkeeqyrkEVEVKQ 1189
Cdd:PRK11281 50 KQKLLEAEDKLVQQDLEQTLALLDKIDRQK----EETEQLKQQLAQAPAKLRQAQAELEALKDDN---------DEETRE 116
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1190 QLE-LSLQTLEmelrtvkSNLNQVVQERNDAQRQLSrEQNARMlqdgiltnhLSKQKEIEMAQKKMNSENSHSHEEEKDL 1268
Cdd:PRK11281 117 TLStLSLRQLE-------SRLAQTLDQLQNAQNDLA-EYNSQL---------VSLQTQPERAQAALYANSQRLQQIRNLL 179
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1269 SH----KNSMLQEEIAMLRLEIDTIKNQNqekekkcfedlkivkekneDLQKTIKQNEETLTQTisqYNGRLSVLTAENA 1344
Cdd:PRK11281 180 KGgkvgGKALRPSQRVLLQAEQALLNAQN-------------------DLQRKSLEGNTQLQDL---LQKQRDYLTARIQ 237
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1345 mlnsklenekqskeRLEAEVESyhsrLAAAIHDR--DQSETSKRELElafqrARDECSRLQDkmNFDVSNLKDNNEILSQ 1422
Cdd:PRK11281 238 --------------RLEHQLQL----LQEAINSKrlTLSEKTVQEAQ-----SQDEAARIQA--NPLVAQELEINLQLSQ 292
|
330 340 350 360
....*....|....*....|....*....|....*....|....*.
gi 1034567240 1423 QLFKTESKLNSLeiefhhTRDALREKTLgLERvqkdLSQTQCQMKE 1468
Cdd:PRK11281 293 RLLKATEKLNTL------TQQNLRVKNW-LDR----LTQSERNIKE 327
|
|
| PRK04778 |
PRK04778 |
septation ring formation regulator EzrA; Provisional |
1618-1927 |
1.25e-03 |
|
septation ring formation regulator EzrA; Provisional
Pssm-ID: 179877 [Multi-domain] Cd Length: 569 Bit Score: 43.67 E-value: 1.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1618 KKKLGQIRNQLQEAQDRHteavrcaEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLLNANLS------------ED 1685
Cdd:PRK04778 104 KHEINEIESLLDLIEEDI-------EQILEELQELLESEEKNREEVEQLKDLYRELRKSLLANRFSfgpaldelekqlEN 176
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1686 EKEQLKKLMELKQS---------LEcNLDQEMKknvELEREITGFKNLLKMTRKK----LNEYENGefsfHGDLKTS--- 1749
Cdd:PRK04778 177 LEEEFSQFVELTESgdyveareiLD-QLEEELA---ALEQIMEEIPELLKELQTElpdqLQELKAG----YRELVEEgyh 248
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1750 --QFEMDIQINKLKHKIDDLTAELETagskcLHLD---TKNQILQEELLSM-----------KTVQKKCEKLQKNKKKLE 1813
Cdd:PRK04778 249 ldHLDIEKEIQDLKEQIDENLALLEE-----LDLDeaeEKNEEIQERIDQLydilerevkarKYVEKNSDTLPDFLEHAK 323
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1814 QEVINLRSHIER-------NMVELGQVKQYKQEIE--ERARQEIAEKLKEvnlflQAQAAS--QENLEQfrennfasmks 1882
Cdd:PRK04778 324 EQNKELKEEIDRvkqsytlNESELESVRQLEKQLEslEKQYDEITERIAE-----QEIAYSelQEELEE----------- 387
|
330 340 350 360
....*....|....*....|....*....|....*....|....*
gi 1034567240 1883 qmelrikdLESELSKIKTSQEDFNktelEKYKQLYLEELKVRKSL 1927
Cdd:PRK04778 388 --------ILKQLEEIEKEQEKLS----EMLQGLRKDELEAREKL 420
|
|
| Crescentin |
pfam19220 |
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament ... |
1260-1578 |
1.30e-03 |
|
Crescentin protein; This entry represents a bacterial equivalent to Intermediate Filament proteins, named crescentin, whose cytoskeletal function is required for the vibrioid and helical shapes of Caulobacter crescentus. Without crescentin, the cells adopt a straight-rod morphology. Crescentin has characteriztic features of IF proteins including the ability to assemble into filaments in vitro without energy or cofactor requirements. In vivo, crescentin forms a helical structure that colocalizes with the inner cell curvatures beneath the cytoplasmic membrane.
Pssm-ID: 437057 [Multi-domain] Cd Length: 401 Bit Score: 43.52 E-value: 1.30e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1260 HSHEEEKDLSHKNSMLQEEIAMLRLEIDTIKNQNQEKEKKCFEDLKivkeKNEDLQKTIKQNEETltqtISQYNGRLSVL 1339
Cdd:pfam19220 45 QAKSRLLELEALLAQERAAYGKLRRELAGLTRRLSAAEGELEELVA----RLAKLEAALREAEAA----KEELRIELRDK 116
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1340 TAENAMLNSKLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARDECSRLQ---DKMNFDVSNLKDN 1416
Cdd:pfam19220 117 TAQAEALERQLAAETEQNRALEEENKALREEAQAAEKALQRAEGELATARERLALLEQENRRLQalsEEQAAELAELTRR 196
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1417 NEILSQQLFKTESKLNSLEIEFHHTRdALREK----------TLGLER----------------VQKDLSQTQCQMKEME 1470
Cdd:pfam19220 197 LAELETQLDATRARLRALEGQLAAEQ-AERERaeaqleeaveAHRAERaslrmklealtaraaaTEQLLAEARNQLRDRD 275
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1471 QKYQNEQVKVNKYIGKQESVEERLSQLQSENMLLRQQLDDAH-------------NKA--------DNKEKTVINIQDQf 1529
Cdd:pfam19220 276 EAIRAAERRLKEASIERDTLERRLAGLEADLERRTQQFQEMQraraeleeraemlTKAlaakdaalERAEERIASLSDR- 354
|
330 340 350 360
....*....|....*....|....*....|....*....|....*....
gi 1034567240 1530 haiVQKLQAESEKQSLLLEERNKELISEcnhlkerqyqYENEKAEREVV 1578
Cdd:pfam19220 355 ---IAELTKRFEVERAALEQANRRLKEE----------LQRERAERALA 390
|
|
| PHA02736 |
PHA02736 |
Viral ankyrin protein; Provisional |
75-203 |
1.38e-03 |
|
Viral ankyrin protein; Provisional
Pssm-ID: 165103 [Multi-domain] Cd Length: 154 Bit Score: 41.40 E-value: 1.38e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 75 DRDKMNRTALHLACANGhpEVVTLLVDRkcqlNVCDNENRTALMK----AVQC-----------QEEKCaTILLEHGADP 139
Cdd:PHA02736 12 EPDIEGENILHYLCRNG--GVTDLLAFK----NAISDENRYLVLEynrhGKQCvhivsnpdkadPQEKL-KLLMEWGADI 84
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1034567240 140 NLAD-VHGNTALHYAVYNEDISVATKLLLY-DANIEAKNKDDLTPLLLAVSGKKQQMVEFLIKKKA 203
Cdd:PHA02736 85 NGKErVFGNTPLHIAVYTQNYELATWLCNQpGVNMEILNYAFKTPYYVACERHDAKMMNILRAKGA 150
|
|
| Mitofilin |
pfam09731 |
Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. ... |
1347-1693 |
1.40e-03 |
|
Mitochondrial inner membrane protein; Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence rich in positive-charged and hydroxylated residues and a membrane anchor domain. In addition, it has three centrally located coiled coil domains.
Pssm-ID: 430783 [Multi-domain] Cd Length: 618 Bit Score: 43.59 E-value: 1.40e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1347 NSKLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARDECSRLQDKMnfdVSNLKDNNEILSQQLFK 1426
Cdd:pfam09731 121 KSEQEKEKALEEVLKEAISKAESATAVAKEAKDDAIQAVKAHTDSLKEASDTAEISREKA---TDSALQKAEALAEKLKE 197
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1427 TESKLNSLEIEfhhtrdALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVKVNKYigkQESVEErlsqlqsENMLLRQ 1506
Cdd:pfam09731 198 VINLAKQSEEE------AAPPLLDAAPETPPKLPEHLDNVEEKVEKAQSLAKLVDQY---KELVAS-------ERIVFQQ 261
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1507 QLDDAHN------KADNKEKTviniqDQFHAIVQKLQAEsekqsllLEERNKELIsecnHLKERqyqyENEKAEREVVVR 1580
Cdd:pfam09731 262 ELVSIFPdiipvlKEDNLLSN-----DDLNSLIAHAHRE-------IDQLSKKLA----ELKKR----EEKHIERALEKQ 321
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1581 QLQQELADTLKKQSMSEASLEVTSRYRINLEDETQDLKKKL-GQIRNQLQEAQDRHTEAVRCAEK----------MQDHK 1649
Cdd:pfam09731 322 KEELDKLAEELSARLEEVRAADEAQLRLEFEREREEIRESYeEKLRTELERQAEAHEEHLKDVLVeqeielqrefLQDIK 401
|
330 340 350 360
....*....|....*....|....*....|....*....|....
gi 1034567240 1650 QKLEKDNAKLKVTVKKQMDKIEELQKNLLNANLSEDEKEQLKKL 1693
Cdd:pfam09731 402 EKVEEERAGRLLKLNELLANLKGLEKATSSHSEVEDENRKAQQL 445
|
|
| Ank |
pfam00023 |
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ... |
145-177 |
1.44e-03 |
|
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin- actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity.
Pssm-ID: 459634 [Multi-domain] Cd Length: 34 Bit Score: 38.04 E-value: 1.44e-03
10 20 30
....*....|....*....|....*....|....
gi 1034567240 145 HGNTALHYAVYNE-DISVATKLLLYDANIEAKNK 177
Cdd:pfam00023 1 DGNTPLHLAAGRRgNLEIVKLLLSKGADVNARDK 34
|
|
| COG4913 |
COG4913 |
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown]; |
1262-1856 |
1.45e-03 |
|
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
Pssm-ID: 443941 [Multi-domain] Cd Length: 1089 Bit Score: 43.75 E-value: 1.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1262 HEEEKDLSHKNSMLQEEIAMLRLEIDTIK-NQNQEKEKKCFEDLKIVKEKNEDLQKTIKQNEETLTQTISQYNG----RL 1336
Cdd:COG4913 261 AERYAAARERLAELEYLRAALRLWFAQRRlELLEAELEELRAELARLEAELERLEARLDALREELDELEAQIRGnggdRL 340
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1337 SVLTAEnamlnskLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARDECSRLQDKMNFDVSNLKDN 1416
Cdd:COG4913 341 EQLERE-------IERLERELEERERRRARLEALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAA 413
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1417 NEILSQQLFKTESKLNSLE-------IEFHHTRDALREKtLGLERVQ----KDLsqtqCQMKEMEQKYQNeqvKVNKYIG 1485
Cdd:COG4913 414 LRDLRRELRELEAEIASLErrksnipARLLALRDALAEA-LGLDEAElpfvGEL----IEVRPEEERWRG---AIERVLG 485
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1486 KQesveeRLSqlqsenMLLRQQLDDAHNKA--DNKEKTVINIQDQFHAIVQKLQAESEKQSLL--LEERNKELISECNHL 1561
Cdd:COG4913 486 GF-----ALT------LLVPPEHYAAALRWvnRLHLRGRLVYERVRTGLPDPERPRLDPDSLAgkLDFKPHPFRAWLEAE 554
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1562 KERQYQYEneKAEREvvvrqlqQELADTlkKQSMSEASL--EVTSRYRINLEDET-------QDLKKKLGQIRNQLQEAQ 1632
Cdd:COG4913 555 LGRRFDYV--CVDSP-------EELRRH--PRAITRAGQvkGNGTRHEKDDRRRIrsryvlgFDNRAKLAALEAELAELE 623
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1633 DRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMD---------KIEELQKNLLNANLSEDE----KEQLKKLMELKQS 1699
Cdd:COG4913 624 EELAEAEERLEALEAELDALQERREALQRLAEYSWDeidvasaerEIAELEAELERLDASSDDlaalEEQLEELEAELEE 703
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1700 LECNLDQEMKKNVELEREITGFKNLLKMTRKKLNEYENGEFSFH--------GDLKTSQFEMDIQiNKLKHKIDDLTAEL 1771
Cdd:COG4913 704 LEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELralleerfAAALGDAVERELR-ENLEERIDALRARL 782
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1772 ETAGSKCL----------HLDTKNqiLQEELLSMKTVQKKCEKLQKNK-KKLEQEvinLRSHIERNMVElgQVKQYKQEI 1840
Cdd:COG4913 783 NRAEEELEramrafnrewPAETAD--LDADLESLPEYLALLDRLEEDGlPEYEER---FKELLNENSIE--FVADLLSKL 855
|
650
....*....|....*.
gi 1034567240 1841 EeRARQEIAEKLKEVN 1856
Cdd:COG4913 856 R-RAIREIKERIDPLN 870
|
|
| CwlO1 |
COG3883 |
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ... |
1420-1638 |
1.47e-03 |
|
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];
Pssm-ID: 443091 [Multi-domain] Cd Length: 379 Bit Score: 43.28 E-value: 1.47e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1420 LSQQLFKTESKLNSLEIEFhhtrDALREKtlgLERVQKDLSQTQCQMKEMEQKYQNEQVKVNKyigKQESVEERLSQLQS 1499
Cdd:COG3883 28 LQAELEAAQAELDALQAEL----EELNEE---YNELQAELEALQAEIDKLQAEIAEAEAEIEE---RREELGERARALYR 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1500 E-------NMLLRQQ-LDDAHNKADNKEKtvinIQDQFHAIV---QKLQAESEKQSLLLEERNKELISECNHLKERQYQY 1568
Cdd:COG3883 98 SggsvsylDVLLGSEsFSDFLDRLSALSK----IADADADLLeelKADKAELEAKKAELEAKLAELEALKAELEAAKAEL 173
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1569 ENEKAEREVVVRQLQQELADTLKKQSMSEASLEVTSRYRINLEDETQDLKKKLGQIRNQLQEAQDRHTEA 1638
Cdd:COG3883 174 EAQQAEQEALLAQLSAEEAAAEAQLAELEAELAAAEAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 243
|
|
| SPEC |
cd00176 |
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members ... |
1540-1737 |
1.73e-03 |
|
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here
Pssm-ID: 238103 [Multi-domain] Cd Length: 213 Bit Score: 42.05 E-value: 1.73e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1540 SEKQSLLLEERNKELISECNHLKERQYQYENEKAEREVVVRQLQQeLADTLKKQSmSEASLEVTSRYRiNLEDETQDLKK 1619
Cdd:cd00176 17 SEKEELLSSTDYGDDLESVEALLKKHEALEAELAAHEERVEALNE-LGEQLIEEG-HPDAEEIQERLE-ELNQRWEELRE 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1620 KLGQIRNQLQEAQDRHT---EAVRCAEKMQDHKQKLEK-DNAKLKVTVKKQMDKIEELQKNLLNAnlsedeKEQLKKLME 1695
Cdd:cd00176 94 LAEERRQRLEEALDLQQffrDADDLEQWLEEKEAALASeDLGKDLESVEELLKKHKELEEELEAH------EPRLKSLNE 167
|
170 180 190 200
....*....|....*....|....*....|....*....|....
gi 1034567240 1696 LKQSL--ECNLDQEMKKNVELEREITGFKNLLKMTRKKLNEYEN 1737
Cdd:cd00176 168 LAEELleEGHPDADEEIEEKLEELNERWEELLELAEERQKKLEE 211
|
|
| PHA02875 |
PHA02875 |
ankyrin repeat protein; Provisional |
79-215 |
1.75e-03 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165206 [Multi-domain] Cd Length: 413 Bit Score: 43.06 E-value: 1.75e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 79 MNRTALHLACANGHPEVVTLLVDRKCQLNVCDNENRTALMKAVQCQEEKCATILLEHGADPNLADVHGNTALHYAVYNED 158
Cdd:PHA02875 1 MDQVALCDAILFGELDIARRLLDIGINPNFEIYDGISPIKLAMKFRDSEAIKLLMKHGAIPDVKYPDIESELHDAVEEGD 80
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 159 ISVATKLLLYDANI-EAKNKDDLTPLLLAVSGKKQQMVEFLIKKKA--NVNAVDKLESSH 215
Cdd:PHA02875 81 VKAVEELLDLGKFAdDVFYKDGMTPLHLATILKKLDIMKLLIARGAdpDIPNTDKFSPLH 140
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
1023-1777 |
1.88e-03 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 43.50 E-value: 1.88e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1023 KKTSNEKNKVKNQIQSMDDVDDLTQSSETASEdcelphssyknfmllIEQlgmeckdsvsllKIQDAALSCErllelkkn 1102
Cdd:TIGR01612 1139 KKSENYIDEIKAQINDLEDVADKAISNDDPEE---------------IEK------------KIENIVTKID-------- 1183
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1103 hcelltvKIKKMEDKVNVLQRELSETKEIKSQLEHQK---VEWERELCSLrFSLNQEEEKRRNADTL-----YEKIREQL 1174
Cdd:TIGR01612 1184 -------KKKNIYDEIKKLLNEIAEIEKDKTSLEEVKginLSYGKNLGKL-FLEKIDEEKKKSEHMIkameaYIEDLDEI 1255
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1175 RRKEEQYRKEVEVKQQLELSLQTLEMELRTVKSnlNQVVQERNDAQRQLSREQNARMLQDgiltnhLSKQKEIEMAQKKM 1254
Cdd:TIGR01612 1256 KEKSPEIENEMGIEMDIKAEMETFNISHDDDKD--HHIISKKHDENISDIREKSLKIIED------FSEESDINDIKKEL 1327
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1255 NSENSHSHEEEKDLSHKNSMLQEEIAMLRL-EIDTIKNQNQEKEKKCFEDLKIVK---EKNEDLQKTIKQNE--ETLTQT 1328
Cdd:TIGR01612 1328 QKNLLDAQKHNSDINLYLNEIANIYNILKLnKIKKIIDEVKEYTKEIEENNKNIKdelDKSEKLIKKIKDDInlEECKSK 1407
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1329 ISQyngrlsvlTAENAMLNSKLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRA----RDECSRLQD 1404
Cdd:TIGR01612 1408 IES--------TLDDKDIDECIKKIKELKNHILSEESNIDTYFKNADENNENVLLLFKNIEMADNKSqhilKIKKDNATN 1479
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1405 KMNFDVSNLKDNNEilSQQLFKTESKLNSLEIEfhhtrdalREKTLgLERVQKDLSQTQCQMKEMEQKYQNEQVKV---- 1480
Cdd:TIGR01612 1480 DHDFNINELKEHID--KSKGCKDEADKNAKAIE--------KNKEL-FEQYKKDVTELLNKYSALAIKNKFAKTKKdsei 1548
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1481 ---------NKYIGKQESVEERLSQLQSENMllrqQLDDAHNKADNKEKTVINIQdqfhaivqkLQAESEKQSLLLEERN 1551
Cdd:TIGR01612 1549 iikeikdahKKFILEAEKSEQKIKEIKKEKF----RIEDDAAKNDKSNKAAIDIQ---------LSLENFENKFLKISDI 1615
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1552 KELISECnhLKERQyqyENEKAEREVVVRQLQQELADTLKKQSMSEASLEVTSRYRINLED---ETQDLKKKLGQIRNQL 1628
Cdd:TIGR01612 1616 KKKINDC--LKETE---SIEKKISSFSIDSQDTELKENGDNLNSLQEFLESLKDQKKNIEDkkkELDELDSEIEKIEIDV 1690
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1629 QEAQDRHTEAV--RCAEKMQDHKQKLEKDNAKLKVTVKKQMDKI--EELQKNLLNANLSEDEKEQ---LKKLMELKQSLE 1701
Cdd:TIGR01612 1691 DQHKKNYEIGIieKIKEIAIANKEEIESIKELIEPTIENLISSFntNDLEGIDPNEKLEEYNTEIgdiYEEFIELYNIIA 1770
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1702 CNLDQEMKKNVELER----EITGFKNLLKM--TRKKLNEYENgefsfhgDLKTSQFemDIQINKLKHKIDDLTAELETAG 1775
Cdd:TIGR01612 1771 GCLETVSKEPITYDEikntRINAQNEFLKIieIEKKSKSYLD-------DIEAKEF--DRIINHFKKKLDHVNDKFTKEY 1841
|
..
gi 1034567240 1776 SK 1777
Cdd:TIGR01612 1842 SK 1843
|
|
| Ank |
pfam00023 |
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ... |
178-210 |
1.98e-03 |
|
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin- actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity.
Pssm-ID: 459634 [Multi-domain] Cd Length: 34 Bit Score: 37.65 E-value: 1.98e-03
10 20 30
....*....|....*....|....*....|....
gi 1034567240 178 DDLTPLLLAV-SGKKQQMVEFLIKKKANVNAVDK 210
Cdd:pfam00023 1 DGNTPLHLAAgRRGNLEIVKLLLSKGADVNARDK 34
|
|
| COG4372 |
COG4372 |
Uncharacterized protein, contains DUF3084 domain [Function unknown]; |
1400-1697 |
2.10e-03 |
|
Uncharacterized protein, contains DUF3084 domain [Function unknown];
Pssm-ID: 443500 [Multi-domain] Cd Length: 370 Bit Score: 42.58 E-value: 2.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1400 SRLQDKMNFDVSNLKDNNEILSQQLFKTESKLNSLEIEFHHTRDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVK 1479
Cdd:COG4372 30 SEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEE 109
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1480 VNKYIGKQESVEERLSQLQSENMLLRQQLDDAHNKADNKEKTVINIQDQFHAIVQKLQAESEKQSLLLEERNKELISECN 1559
Cdd:COG4372 110 AEELQEELEELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAEAEQALDELL 189
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1560 HLKERQYQYENEKAEREVVVRQLQQELADTLKKQSMSEASLEVTSRYRINLEDETQDLKKKLGQIRNQLQEAQDRHTEAV 1639
Cdd:COG4372 190 KEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILV 269
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*...
gi 1034567240 1640 RCAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLLNANLSEDEKEQLKKLMELK 1697
Cdd:COG4372 270 EKDTEEEELEIAALELEALEEAALELKLLALLLNLAALSLIGALEDALLAALLELAKK 327
|
|
| GumC |
COG3206 |
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis]; |
1149-1360 |
2.57e-03 |
|
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442439 [Multi-domain] Cd Length: 687 Bit Score: 43.08 E-value: 2.57e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1149 LRFSLNQEEEKRRNADTLYEK----IREQLRRKE---EQYRKE-------------VEVKQQLELSLQTLEMELRTVKSN 1208
Cdd:COG3206 162 LEQNLELRREEARKALEFLEEqlpeLRKELEEAEaalEEFRQKnglvdlseeakllLQQLSELESQLAEARAELAEAEAR 241
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1209 LNQVvqernDAQRQLSREQNARMLQDGILTNHLSKQKEIEMAQKKMNSENSHSHeeekdlshknsmlqEEIAMLRLEIDT 1288
Cdd:COG3206 242 LAAL-----RAQLGSGPDALPELLQSPVIQQLRAQLAELEAELAELSARYTPNH--------------PDVIALRAQIAA 302
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1034567240 1289 IKNQNQEKEKKCFEDLKIVKEknedlqkTIKQNEETLTQTISQYNGRLSVLTAENAMLNSkLENEKQSKERL 1360
Cdd:COG3206 303 LRAQLQQEAQRILASLEAELE-------ALQAREASLQAQLAQLEARLAELPELEAELRR-LEREVEVAREL 366
|
|
| CALCOCO1 |
pfam07888 |
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ... |
1583-1834 |
3.16e-03 |
|
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Pssm-ID: 462303 [Multi-domain] Cd Length: 488 Bit Score: 42.57 E-value: 3.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1583 QQELADTLKKQSMS----EASLEVTSRYRINLEDETQDLKKKLGQIRNQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAK 1658
Cdd:pfam07888 40 LQERAELLQAQEAAnrqrEKEKERYKRDREQWERQRRELESRVAELKEELRQSREKHEELEEKYKELSASSEELSEEKDA 119
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1659 LKVTVKKQMDKIEELQKNLlnANLSEDEKEQLKKLMELKQSLECNLDQemKKNVELEREitGFKNLLKMTRKKLNEYeNG 1738
Cdd:pfam07888 120 LLAQRAAHEARIRELEEDI--KTLTQRVLERETELERMKERAKKAGAQ--RKEEEAERK--QLQAKLQQTEEELRSL-SK 192
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1739 EFSfhgDLKTSQFEMDIQINKLKHKIDDLTAELETAGSKclhlDTKNQILQEEllsMKTVQKKCEKLQKNKKKLEQEVIN 1818
Cdd:pfam07888 193 EFQ---ELRNSLAQRDTQVLQLQDTITTLTQKLTTAHRK----EAENEALLEE---LRSLQERLNASERKVEGLGEELSS 262
|
250
....*....|....*.
gi 1034567240 1819 LRSHIERNMVELGQVK 1834
Cdd:pfam07888 263 MAAQRDRTQAELHQAR 278
|
|
| CEP63 |
pfam17045 |
Centrosomal protein of 63 kDa; CEP63 is a family of eukaryotic proteins involved in centriole ... |
1124-1404 |
3.67e-03 |
|
Centrosomal protein of 63 kDa; CEP63 is a family of eukaryotic proteins involved in centriole activity.
Pssm-ID: 465338 [Multi-domain] Cd Length: 264 Bit Score: 41.34 E-value: 3.67e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1124 ELSE-TKEIKSQLEHQKVEWERELCSLRFSLNQEEEKRRNAdtlyekiREQLRRKEeqyrKEVEVKQQLELSLQTLEMEL 1202
Cdd:pfam17045 7 ELQElMKQIDIMVAHKKSEWEGQTRALETRLDIREEELLSA-------RNTLERKH----KEIGLLRQQLEELEKGKQEL 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1203 rTVKSNlnqvvQERNDAQRQLSReqnarmlqdgiLTNHLSKqkeIEMAQKKMNSENSHSHEEEKDLSHKNSMLQEEIAML 1282
Cdd:pfam17045 76 -VAKYE-----QQLQKLQEELSK-----------LKRSYEK---LQRKQLKEAREEAKSREEDRSELSRLNGKLEEFRQK 135
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1283 RLEIDTIKNQNQEKEKKCFEDLKIVKEKNEDLQKTIKQneetltqtiSQYNGRLSVLTA---ENAMLNSKLENEKQSKER 1359
Cdd:pfam17045 136 SLEWEQQRLQYQQQVASLEAQRKALAEQSSLIQSAAYQ---------VQLEGRKQCLEAsqsEIQRLRSKLERAQDSLCA 206
|
250 260 270 280
....*....|....*....|....*....|....*....|....*
gi 1034567240 1360 LEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARDECSRLQD 1404
Cdd:pfam17045 207 QELELERLRMRVSELGDSNRKLLEEQQRLLEELRMSQRQLQVLQN 251
|
|
| mukB |
PRK04863 |
chromosome partition protein MukB; |
1105-1818 |
4.01e-03 |
|
chromosome partition protein MukB;
Pssm-ID: 235316 [Multi-domain] Cd Length: 1486 Bit Score: 42.64 E-value: 4.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1105 ELLTVKIKKMEDKVNVLQRELSETKEIKSQLEhqkvEWERELCSLRFSLNQEEEKrrnadtlyEKIREQLRRKEEQyRKE 1184
Cdd:PRK04863 445 EEFQAKEQEATEELLSLEQKLSVAQAAHSQFE----QAYQLVRKIAGEVSRSEAW--------DVARELLRRLREQ-RHL 511
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1185 VEVKQQLELSLQTLEMELRtvksNLNQVVQERNDAQRQLSR-EQNARMLQDgiltnhlsKQKEIEMAQKKMNSENSHSHE 1263
Cdd:PRK04863 512 AEQLQQLRMRLSELEQRLR----QQQRAERLLAEFCKRLGKnLDDEDELEQ--------LQEELEARLESLSESVSEARE 579
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1264 EEKDLSHKNSMLQEEIAMLR------LEIDTIKNQNQEKEKKCFEDLKIVKEKNEDLQ---KTIKQNEETLTQTISQYNG 1334
Cdd:PRK04863 580 RRMALRQQLEQLQARIQRLAarapawLAAQDALARLREQSGEEFEDSQDVTEYMQQLLereRELTVERDELAARKQALDE 659
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1335 RLSVLTAENAmlnSKLENEKQSKERLEAEVES------------YHSRL----AAAIHDRDQSeTSKRELElafqrARDE 1398
Cdd:PRK04863 660 EIERLSQPGG---SEDPRLNALAERFGGVLLSeiyddvsledapYFSALygpaRHAIVVPDLS-DAAEQLA-----GLED 730
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1399 CSRLQDKMNFDVSNLkDNNEILSQQLFKTES-KLNslEIEFHHTR--------DALREKTLGLERVQKDLSQTQCQMKEM 1469
Cdd:PRK04863 731 CPEDLYLIEGDPDSF-DDSVFSVEELEKAVVvKIA--DRQWRYSRfpevplfgRAAREKRIEQLRAEREELAERYATLSF 807
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1470 E-QKYQNEQVKVNKYIGKQESV------EERLSQLQSEnmllRQQLDDAHNKADNKEKTVINIQDQFHAIVQKLQAESEK 1542
Cdd:PRK04863 808 DvQKLQRLHQAFSRFIGSHLAVafeadpEAELRQLNRR----RVELERALADHESQEQQQRSQLEQAKEGLSALNRLLPR 883
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1543 QSLLLEERNKELISECN-HLKE--------RQYQYENEKAEREVVVRQLQQELADTLKKQSMSEASLEVTSRYRINLEDE 1613
Cdd:PRK04863 884 LNLLADETLADRVEEIReQLDEaeeakrfvQQHGNALAQLEPIVSVLQSDPEQFEQLKQDYQQAQQTQRDAKQQAFALTE 963
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1614 tqdlkkkLGQIRNQL--QEAQDRHTEAVRCAEKM-QDHKQKlEKDNAKLKVTVKKQMDKIEELQKNLLNANLSEDEKEQL 1690
Cdd:PRK04863 964 -------VVQRRAHFsyEDAAEMLAKNSDLNEKLrQRLEQA-EQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQM 1035
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1691 kkLMELKQSLEcnldqemkknvelereitgfknllkmtrkklneyengEFSFHGDlktsqFEMDIQinkLKHKIDDLTAE 1770
Cdd:PRK04863 1036 --LQELKQELQ-------------------------------------DLGVPAD-----SGAEER---ARARRDELHAR 1068
|
730 740 750 760
....*....|....*....|....*....|....*....|....*...
gi 1034567240 1771 LETAGSKCLHLDTKNQILQEEllsMKTVQKKCEKLQKNKKKLEQEVIN 1818
Cdd:PRK04863 1069 LSANRSRRNQLEKQLTFCEAE---MDNLTKKLRKLERDYHEMREQVVN 1113
|
|
| ANK |
smart00248 |
ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four ... |
145-174 |
4.02e-03 |
|
ankyrin repeats; Ankyrin repeats are about 33 amino acids long and occur in at least four consecutive copies. They are involved in protein-protein interactions. The core of the repeat seems to be an helix-loop-helix structure.
Pssm-ID: 197603 [Multi-domain] Cd Length: 30 Bit Score: 36.41 E-value: 4.02e-03
10 20 30
....*....|....*....|....*....|
gi 1034567240 145 HGNTALHYAVYNEDISVATKLLLYDANIEA 174
Cdd:smart00248 1 DGRTPLHLAAENGNLEVVKLLLDKGADINA 30
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1661-1953 |
4.06e-03 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 42.42 E-value: 4.06e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1661 VTVKKQMDKIEELQKNLLnanlsEDEKEQLKKLMELKQSLEcnlDQEMKKNVELEREITGF--KNLLKMTRKKLNEYENG 1738
Cdd:pfam17380 284 VSERQQQEKFEKMEQERL-----RQEKEEKAREVERRRKLE---EAEKARQAEMDRQAAIYaeQERMAMERERELERIRQ 355
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1739 EFSFHGDLKTSQFEMDIQINKLKhkiddltaELETAGskcLHLDTKNQILQEELLSMKTVQKKCEKLQKNkkkleqevin 1818
Cdd:pfam17380 356 EERKRELERIRQEEIAMEISRMR--------ELERLQ---MERQQKNERVRQELEAARKVKILEEERQRK---------- 414
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1819 lrshIERNMVELGQVKQYKQEIEERARQEIAE-KLKEVNLFLQAQAASQENLEQFRENNFASMKSQMELrikdlESELSK 1897
Cdd:pfam17380 415 ----IQQQKVEMEQIRAEQEEARQREVRRLEEeRAREMERVRLEEQERQQQVERLRQQEEERKRKKLEL-----EKEKRD 485
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*....
gi 1034567240 1898 IKTSQEDFNKT---ELEKYKQLYLEELKVRKSLSSKLTKTNERLAEVNTKLLVEKQQSR 1953
Cdd:pfam17380 486 RKRAEEQRRKIlekELEERKQAMIEEERKRKLLEKEMEERQKAIYEEERRREAEEERRK 544
|
|
| PHA02878 |
PHA02878 |
ankyrin repeat protein; Provisional |
50-224 |
4.11e-03 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 222939 [Multi-domain] Cd Length: 477 Bit Score: 42.17 E-value: 4.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 50 IHKAASAGNVAKVQQiLLLRKNGLNDRDKMNRTALHLACANGHPEVVTLLVDRKCQLNVCDNEnrTALMKAVQCQEEKCA 129
Cdd:PHA02878 41 LHQAVEARNLDVVKS-LLTRGHNVNQPDHRDLTPLHIICKEPNKLGMKEMIRSINKCSVFYTL--VAIKDAFNNRNVEIF 117
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 130 TILLEHGADPN--LADVHGNTALHYAVYNEDIsvaTKLLL-YDANIEAKNKDDL-TPLLLAVSGKKQQMVEFLIKKKANV 205
Cdd:PHA02878 118 KIILTNRYKNIqtIDLVYIDKKSKDDIIEAEI---TKLLLsYGADINMKDRHKGnTALHYATENKDQRLTELLLSYGANV 194
|
170 180
....*....|....*....|.
gi 1034567240 206 NAVDKLESS--HQLISEYKEE 224
Cdd:PHA02878 195 NIPDKTNNSplHHAVKHYNKP 215
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1135-1365 |
4.31e-03 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 42.03 E-value: 4.31e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1135 LEHQKVEWERelcslrfslnQEEEKrrnadtlYEKI-REQLRRKEEQYRKEVEVKQQLELSLQTLEMELR---TVKSNLN 1210
Cdd:pfam17380 278 VQHQKAVSER----------QQQEK-------FEKMeQERLRQEKEEKAREVERRRKLEEAEKARQAEMDrqaAIYAEQE 340
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1211 QVVQERN---DAQRQLSREQNARMLQDGILTNHLSKQKEIEMAQKKMNSENSHShEEEKDLSHKNSMLQEE----IAMLR 1283
Cdd:pfam17380 341 RMAMERErelERIRQEERKRELERIRQEEIAMEISRMRELERLQMERQQKNERV-RQELEAARKVKILEEErqrkIQQQK 419
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1284 LEIDTIKNQNQEKEKkcfEDLKIVKEKNEDLQKTIKQNEETLTQTISqyngRLSVLTAENAmlNSKLENEKQSKERLEAE 1363
Cdd:pfam17380 420 VEMEQIRAEQEEARQ---REVRRLEEERAREMERVRLEEQERQQQVE----RLRQQEEERK--RKKLELEKEKRDRKRAE 490
|
..
gi 1034567240 1364 VE 1365
Cdd:pfam17380 491 EQ 492
|
|
| MukB |
COG3096 |
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell ... |
1331-1734 |
4.45e-03 |
|
Chromosome condensin MukBEF, ATPase and DNA-binding subunit MukB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 442330 [Multi-domain] Cd Length: 1470 Bit Score: 42.25 E-value: 4.45e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1331 QYNGRLSVLTAE--------NAMLNSKLENEKqsKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARDECSRL 1402
Cdd:COG3096 317 ELSARESDLEQDyqaasdhlNLVQTALRQQEK--IERYQEDLEELTERLEEQEEVVEEAAEQLAEAEARLEAAEEEVDSL 394
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1403 QdkmnfdvSNLKDNNEILSQQlfKTESklnsleIEFHHTRDALREktlglervqkdlSQTQCQMKEMEQkyqnEQVKvnk 1482
Cdd:COG3096 395 K-------SQLADYQQALDVQ--QTRA------IQYQQAVQALEK------------ARALCGLPDLTP----ENAE--- 440
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1483 yiGKQESVEERLSQLQSENMLLRQQL---DDAHNKADNKEKTVINIQDQfhaiVQKLQAESEKQSLLLEERnkelisECN 1559
Cdd:COG3096 441 --DYLAAFRAKEQQATEEVLELEQKLsvaDAARRQFEKAYELVCKIAGE----VERSQAWQTARELLRRYR------SQQ 508
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1560 HLKERQYQYENEKAEREVVVRQLQ--QELADTLKKQSmseaSLEVTSryRINLEDEtqdlkkklgqiRNQLQEAQDRHTE 1637
Cdd:COG3096 509 ALAQRLQQLRAQLAELEQRLRQQQnaERLLEEFCQRI----GQQLDA--AEELEEL-----------LAELEAQLEELEE 571
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1638 AVR-CAEKMQDHKQKLEKDNAKLKvTVKKQMDKIEELQKNLlnANLSEDEKEQLKKLMElkqslecnLDQEMKKNVELER 1716
Cdd:COG3096 572 QAAeAVEQRSELRQQLEQLRARIK-ELAARAPAWLAAQDAL--ERLREQSGEALADSQE--------VTAAMQQLLERER 640
|
410
....*....|....*...
gi 1034567240 1717 EITGFKNLLKMTRKKLNE 1734
Cdd:COG3096 641 EATVERDELAARKQALES 658
|
|
| PRK10929 |
PRK10929 |
putative mechanosensitive channel protein; Provisional |
1198-1521 |
4.55e-03 |
|
putative mechanosensitive channel protein; Provisional
Pssm-ID: 236798 [Multi-domain] Cd Length: 1109 Bit Score: 42.35 E-value: 4.55e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1198 LEMELRTVKSNLN----QVVQERNDAQRQLSR----EQNARMLQDGI-----LTNHLSKQKEIEMAQKKMNSENSHSHEE 1264
Cdd:PRK10929 28 ITQELEQAKAAKTpaqaEIVEALQSALNWLEErkgsLERAKQYQQVIdnfpkLSAELRQQLNNERDEPRSVPPNMSTDAL 107
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1265 EKDLSHKNSMLQEEIAMLrleidtikNQNQEKEKKCFEDLKIVKEKNEDLQKTIKQNEETLtQTISQYN-----GRLSVL 1339
Cdd:PRK10929 108 EQEILQVSSQLLEKSRQA--------QQEQDRAREISDSLSQLPQQQTEARRQLNEIERRL-QTLGTPNtplaqAQLTAL 178
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1340 TAENAMLNSKLE---------NEKQSKERLEAEV-ESYHSRLAAAIHD-RDQ-SETSKRELELAFQRARDECSRLQDKMN 1407
Cdd:PRK10929 179 QAESAALKALVDelelaqlsaNNRQELARLRSELaKKRSQQLDAYLQAlRNQlNSQRQREAERALESTELLAEQSGDLPK 258
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1408 FDVSNLKDNNEI---LSQQ------LFKTESKLNSLEIEFHHTRDALREKT--LGLERVQKDLSQTQ-CQMKEMEqKYQn 1475
Cdd:PRK10929 259 SIVAQFKINRELsqaLNQQaqrmdlIASQQRQAASQTLQVRQALNTLREQSqwLGVSNALGEALRAQvARLPEMP-KPQ- 336
|
330 340 350 360
....*....|....*....|....*....|....*....|....*.
gi 1034567240 1476 eQVKvnkyigkQESVEERLSQLQSENMLLRQQLDDAHNKADNKEKT 1521
Cdd:PRK10929 337 -QLD-------TEMAQLRVQRLRYEDLLNKQPQLRQIRQADGQPLT 374
|
|
| PHA02946 |
PHA02946 |
ankyin-like protein; Provisional |
96-215 |
5.04e-03 |
|
ankyin-like protein; Provisional
Pssm-ID: 165256 [Multi-domain] Cd Length: 446 Bit Score: 41.58 E-value: 5.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 96 VTLLVDRKCQLNVCDNENRTALMKAVQCQEEKCATILLEHGADPNLADVHGNTALHYAVYNED--ISVATKLLLYDANIE 173
Cdd:PHA02946 55 VEELLHRGYSPNETDDDGNYPLHIASKINNNRIVAMLLTHGADPNACDKQHKTPLYYLSGTDDevIERINLLVQYGAKIN 134
|
90 100 110 120
....*....|....*....|....*....|....*....|..
gi 1034567240 174 AKNKDDLTPLLLAVSGKKQQMVEFLIKKKANVNAVDKLESSH 215
Cdd:PHA02946 135 NSVDEEGCGPLLACTDPSERVFKKIMSIGFEARIVDKFGKNH 176
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1351-1630 |
5.11e-03 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 42.03 E-value: 5.11e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1351 ENEKQSKERLEAEVESYHSRLAAAIH---DRDQSETSKRELELAFQRARD-ECSRLQDKMNFDVSNLKDNNEIlsQQLFK 1426
Cdd:pfam17380 322 EKARQAEMDRQAAIYAEQERMAMERErelERIRQEERKRELERIRQEEIAmEISRMRELERLQMERQQKNERV--RQELE 399
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1427 TESKLNSLEIEFHHTRDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVKVnkyigkqesvEERLSQLQSEnmLLRQ 1506
Cdd:pfam17380 400 AARKVKILEEERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAREMERVRL----------EEQERQQQVE--RLRQ 467
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1507 QLDDAHNKADNKEKTVIN---IQDQFHAIVQKlQAESEKQSLLlEERNKELISEcNHLKERQYQ-YENEKAEREVVVRQL 1582
Cdd:pfam17380 468 QEEERKRKKLELEKEKRDrkrAEEQRRKILEK-ELEERKQAMI-EEERKRKLLE-KEMEERQKAiYEEERRREAEEERRK 544
|
250 260 270 280
....*....|....*....|....*....|....*....|....*....
gi 1034567240 1583 QQELADTLK-KQSMSEASLEvtsRYRINLEDETQDLKKKLGQIRNQLQE 1630
Cdd:pfam17380 545 QQEMEERRRiQEQMRKATEE---RSRLEAMEREREMMRQIVESEKARAE 590
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1680-1908 |
5.32e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 41.29 E-value: 5.32e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1680 ANLSEDEKEQLKKLMELKQSLECNLDQEMKKNVELEREITGFKNLLKMTRKKLNEYENgefsfhgdlktsqfemdiQINK 1759
Cdd:COG4942 19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQ------------------ELAA 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1760 LKHKIDDLTAELETAGSKclhLDTKNQILQEELLSM-KTVQKKCEKL---QKNKKKLEQEVINLRSHIERNMVELGQVKQ 1835
Cdd:COG4942 81 LEAELAELEKEIAELRAE---LEAQKEELAELLRALyRLGRQPPLALllsPEDFLDAVRRLQYLKYLAPARREQAEELRA 157
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034567240 1836 YKQEIEERaRQEIAEKLKEVNLFLQAQAASQENLEQfRENNFASMKSQMELRIKDLESELSKIKTSQEDFNKT 1908
Cdd:COG4942 158 DLAELAAL-RAELEAERAELEALLAELEEERAALEA-LKAERQKLLARLEKELAELAAELAELQQEAEELEAL 228
|
|
| PRK09039 |
PRK09039 |
peptidoglycan -binding protein; |
1325-1435 |
5.42e-03 |
|
peptidoglycan -binding protein;
Pssm-ID: 181619 [Multi-domain] Cd Length: 343 Bit Score: 41.49 E-value: 5.42e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1325 LTQTISQYNGRLSVLTAENAMLNSKLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELElafqRARDECSRLQD 1404
Cdd:PRK09039 44 LSREISGKDSALDRLNSQIAELADLLSLERQGNQDLQDSVANLRASLSAAEAERSRLQALLAELA----GAGAAAEGRAG 119
|
90 100 110
....*....|....*....|....*....|....*...
gi 1034567240 1405 KMNFDVSNLKDNN-------EILSQQLFKTESKLNSLE 1435
Cdd:PRK09039 120 ELAQELDSEKQVSaralaqvELLNQQIAALRRQLAALE 157
|
|
| PRK12704 |
PRK12704 |
phosphodiesterase; Provisional |
1659-1854 |
5.46e-03 |
|
phosphodiesterase; Provisional
Pssm-ID: 237177 [Multi-domain] Cd Length: 520 Bit Score: 41.69 E-value: 5.46e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1659 LKVTVKKQMDKIEELQKNLLNAnlSEDEKEQLKKLMELKQSlecnlDQEMKKNVELEREItgfknllkmtRKKLNEYENG 1738
Cdd:PRK12704 25 RKKIAEAKIKEAEEEAKRILEE--AKKEAEAIKKEALLEAK-----EEIHKLRNEFEKEL----------RERRNELQKL 87
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1739 EfsfhgdlktsqfemdiqiNKLKHKIDDLTAELETAGSKCLHLDTKNQILQEELLSMKTVQKKCEKLQKNKKKLEQEVIN 1818
Cdd:PRK12704 88 E------------------KRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISG 149
|
170 180 190
....*....|....*....|....*....|....*.
gi 1034567240 1819 LRSHIERNMVelgqvkqyKQEIEERARQEIAEKLKE 1854
Cdd:PRK12704 150 LTAEEAKEIL--------LEKVEEEARHEAAVLIKE 177
|
|
| 235kDa-fam |
TIGR01612 |
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ... |
1157-1902 |
5.52e-03 |
|
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain] Cd Length: 2757 Bit Score: 41.96 E-value: 5.52e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1157 EEKRRNADTLYEKIREQLRRKEEQYRKEVEVKQQlELSLQTLEMELRT---VKSNLNQVVQERNDAQRQLSREQNARMLQ 1233
Cdd:TIGR01612 699 DDLKSKIDKEYDKIQNMETATVELHLSNIENKKN-ELLDIIVEIKKHIhgeINKDLNKILEDFKNKEKELSNKINDYAKE 777
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1234 DGILTNHLSKQKEIEmaqkkmnsenshsheeekdlSHKNSmlqeeiamlRLEIDTIKNqnqekekkcfEDLKIVKEKNED 1313
Cdd:TIGR01612 778 KDELNKYKSKISEIK--------------------NHYND---------QINIDNIKD----------EDAKQNYDKSKE 818
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1314 LQKTIKQNEETLTQTISQY-NGRLSVLTAENAMLNskLENekQSKERLEAEVESYhsrlaAAIHDRDQSETSKRELELAF 1392
Cdd:TIGR01612 819 YIKTISIKEDEIFKIINEMkFMKDDFLNKVDKFIN--FEN--NCKEKIDSEHEQF-----AELTNKIKAEISDDKLNDYE 889
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1393 QRARDECSRLQDKMNF------DVSNLKDNNEIL------SQQLFKTESKLNSLEIEFHHTRDALREKTLgLERVQKDLS 1460
Cdd:TIGR01612 890 KKFNDSKSLINEINKSieeeyqNINTLKKVDEYIkicentKESIEKFHNKQNILKEILNKNIDTIKESNL-IEKSYKDKF 968
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1461 QTQCQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQS------ENMLLRQ------QLDDAHNKADNKEKTVINIQDQ 1528
Cdd:TIGR01612 969 DNTLIDKINELDKAFKDASLNDYEAKNNELIKYFNDLKAnlgknkENMLYHQfdekekATNDIEQKIEDANKNIPNIEIA 1048
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1529 FHAIVQKLQAESEKQ-SLLLEERNKELISEC-------NHLKERQYQYENEKAEREVVVRqlqqeLADTLKKQSMSEASL 1600
Cdd:TIGR01612 1049 IHTSIYNIIDEIEKEiGKNIELLNKEILEEAeinitnfNEIKEKLKHYNFDDFGKEENIK-----YADEINKIKDDIKNL 1123
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1601 EVTSRYRINledETQDLKKKLGQIRNQLQeAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLLNA 1680
Cdd:TIGR01612 1124 DQKIDHHIK---ALEEIKKKSENYIDEIK-AQINDLEDVADKAISNDDPEEIEKKIENIVTKIDKKKNIYDEIKKLLNEI 1199
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1681 NLSEDEKEQLKKLMELKQSLECNL--------DQEMKKNVELEREITGFKNLLKMTRKKLNEYENgEFSFHGDLKTSQFE 1752
Cdd:TIGR01612 1200 AEIEKDKTSLEEVKGINLSYGKNLgklflekiDEEKKKSEHMIKAMEAYIEDLDEIKEKSPEIEN-EMGIEMDIKAEMET 1278
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1753 MDIQINKLK--HKIDDLTAE-LETAGSKCLHLDTKNQILQEELLSMKTVQKKCEKLQKNKKKLEQ---EVINLRSHIERN 1826
Cdd:TIGR01612 1279 FNISHDDDKdhHIISKKHDEnISDIREKSLKIIEDFSEESDINDIKKELQKNLLDAQKHNSDINLylnEIANIYNILKLN 1358
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1827 MVE--LGQVKQYKQEIEERARQEIAEKLKevnlflqaqaaSQENLEQFREN-NFASMKSQMELRI--KDLESELSKIKTS 1901
Cdd:TIGR01612 1359 KIKkiIDEVKEYTKEIEENNKNIKDELDK-----------SEKLIKKIKDDiNLEECKSKIESTLddKDIDECIKKIKEL 1427
|
.
gi 1034567240 1902 Q 1902
Cdd:TIGR01612 1428 K 1428
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
1107-1646 |
6.25e-03 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 41.70 E-value: 6.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1107 LTVKIKKMEDKVNVLQRELSETKEIKSQLEH-------QKVEWERELCSLRFSLNQEEEKRRNADTLYEKIREQLRRKEE 1179
Cdd:pfam01576 487 LSTRLRQLEDERNSLQEQLEEEEEAKRNVERqlstlqaQLSDMKKKLEEDAGTLEALEEGKKRLQRELEALTQQLEEKAA 566
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1180 QYRKEVEVKQQLELSLQTLEMEL---RTVKSNL------------------NQVVQERNDAQRQlSREQNARMLQ-DGIL 1237
Cdd:pfam01576 567 AYDKLEKTKNRLQQELDDLLVDLdhqRQLVSNLekkqkkfdqmlaeekaisARYAEERDRAEAE-AREKETRALSlARAL 645
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1238 TNHLSKQKEIEMAQKKMNSENSHSHEEEKDLSHKNSMLQEEIAMLRLEIDTIKNQNQEKEK--KCFEDLKIVKEKNedLQ 1315
Cdd:pfam01576 646 EEALEAKEELERTNKQLRAEMEDLVSSKDDVGKNVHELERSKRALEQQVEEMKTQLEELEDelQATEDAKLRLEVN--MQ 723
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1316 KTIKQNEETLTQTISQYNGRLSVLTAENAMLNSKLENEKQ-------SKERLEAEVESYHSRLAAAIHDRDQSETSKREL 1388
Cdd:pfam01576 724 ALKAQFERDLQARDEQGEEKRRQLVKQVRELEAELEDERKqraqavaAKKKLELDLKELEAQIDAANKGREEAVKQLKKL 803
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1389 ELA---FQRARDECSRLQDkmnfdvsnlkdnnEILSQQLfKTESKLNSLEIEFHHtrdaLREKTLGLERVQKdlsQTQCQ 1465
Cdd:pfam01576 804 QAQmkdLQRELEEARASRD-------------EILAQSK-ESEKKLKNLEAELLQ----LQEDLAASERARR---QAQQE 862
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1466 MKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQSEnmllrqqLDDAHNKADNKEKTVINIQDQFHAIVQKLQAE------ 1539
Cdd:pfam01576 863 RDELADEIASGASGKSALQDEKRRLEARIAQLEEE-------LEEEQSNTELLNDRLRKSTLQVEQLTTELAAErstsqk 935
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1540 SEKQSLLLEERNKELISECNHLKER-QYQYENEKAEREVVVRQLQQELADTLKKQSMS------------EASLEVTSRY 1606
Cdd:pfam01576 936 SESARQQLERQNKELKAKLQEMEGTvKSKFKSSIAALEAKIAQLEEQLEQESRERQAAnklvrrtekklkEVLLQVEDER 1015
|
570 580 590 600
....*....|....*....|....*....|....*....|..
gi 1034567240 1607 RI--NLEDETQDLKKKLGQIRNQLQEAQDRHTEAVRCAEKMQ 1646
Cdd:pfam01576 1016 RHadQYKDQAEKGNSRMKQLKRQLEEAEEEASRANAARRKLQ 1057
|
|
| PHA02876 |
PHA02876 |
ankyrin repeat protein; Provisional |
95-209 |
6.64e-03 |
|
ankyrin repeat protein; Provisional
Pssm-ID: 165207 [Multi-domain] Cd Length: 682 Bit Score: 41.59 E-value: 6.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 95 VVTLLVDRKCQLNVCDNENRTALMKAVQCQEEKCATILLEHGADPNLADVHGNTALHYAVYNEDISVATKLLLYDANIea 174
Cdd:PHA02876 160 IAEMLLEGGADVNAKDIYCITPIHYAAERGNAKMVNLLLSYGADVNIIALDDLSVLECAVDSKNIDTIKAIIDNRSNI-- 237
|
90 100 110
....*....|....*....|....*....|....*
gi 1034567240 175 kNKDDLTpLLLAVSGKKQQMVEFLIKKKANVNAVD 209
Cdd:PHA02876 238 -NKNDLS-LLKAIRNEDLETSLLLYDAGFSVNSID 270
|
|
| Myosin_tail_1 |
pfam01576 |
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ... |
873-1713 |
6.85e-03 |
|
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Pssm-ID: 460256 [Multi-domain] Cd Length: 1081 Bit Score: 41.70 E-value: 6.85e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 873 QQDMQRFKNEIGMLKVEFQALEKEKVQLQK----------EVEEERKKHRNNEMEVSANIHDGATDDAEDDDDDDGLIQK 942
Cdd:pfam01576 18 KERQQKAESELKELEKKHQQLCEEKNALQEqlqaetelcaEAEEMRARLAARKQELEEILHELESRLEEEEERSQQLQNE 97
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 943 RKSGETDHQQFPRKENKEYASSgPALQMKEVKSTEKEKRTSKEsvnspvfgkaslltggLLQVDDDSSlseidedegrpt 1022
Cdd:pfam01576 98 KKKMQQHIQDLEEQLDEEEAAR-QKLQLEKVTTEAKIKKLEED----------------ILLLEDQNS------------ 148
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1023 kKTSNEKNKVKNQIQSMDDvddltqssetasedcelphssyknfmllieQLGMECKDSVSLLKIQDAALSCERLLELKKN 1102
Cdd:pfam01576 149 -KLSKERKLLEERISEFTS------------------------------NLAEEEEKAKSLSKLKNKHEAMISDLEERLK 197
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1103 HCELLTVKIKKMEDKvnvLQRELSETKEiksqlehQKVEWERELCSLRFSLNQEEEKRRNAdtlyekireQLRRKEEQYR 1182
Cdd:pfam01576 198 KEEKGRQELEKAKRK---LEGESTDLQE-------QIAELQAQIAELRAQLAKKEEELQAA---------LARLEEETAQ 258
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1183 KEVEVKQQLELSLQTLEmelrtVKSNLNQVVQERNDAQRQ---LSREQNA--RMLQDGILT----NHLSKQKEIEMAQ-K 1252
Cdd:pfam01576 259 KNNALKKIRELEAQISE-----LQEDLESERAARNKAEKQrrdLGEELEAlkTELEDTLDTtaaqQELRSKREQEVTElK 333
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1253 KMNSENSHSHEEE-KDLSHKNSMLQEEIAMlRLEIDTIKNQNQEKEKKCFEdlkivkEKNEDLQKTIKqneeTLTQTISQ 1331
Cdd:pfam01576 334 KALEEETRSHEAQlQEMRQKHTQALEELTE-QLEQAKRNKANLEKAKQALE------SENAELQAELR----TLQQAKQD 402
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1332 YNGRLSVLTAENAMLNSKL-ENEKQSKERLEaevesyhsRLAAAIHDRDQSETSKRELELAFQRARDECSRLQdkmnfdv 1410
Cdd:pfam01576 403 SEHKRKKLEGQLQELQARLsESERQRAELAE--------KLSKLQSELESVSSLLNEAEGKNIKLSKDVSSLE------- 467
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1411 SNLKDNNEILS---QQLFKTESKLNSLEIEFHHTRDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQvkvnkyiGKQ 1487
Cdd:pfam01576 468 SQLQDTQELLQeetRQKLNLSTRLRQLEDERNSLQEQLEEEEEAKRNVERQLSTLQAQLSDMKKKLEEDA-------GTL 540
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1488 ESVEERLSQLQSENMLLRQQLDDAHNKADNKEKTVINIQ----------DQFHAIVQKLQAESEK-QSLLLEERNkelIS 1556
Cdd:pfam01576 541 EALEEGKKRLQRELEALTQQLEEKAAAYDKLEKTKNRLQqelddllvdlDHQRQLVSNLEKKQKKfDQMLAEEKA---IS 617
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1557 EcnhlkerQYQYENEKAEREV--------------------------VVRQLQQELADTLKKQSMSEASLEVTSRYRINL 1610
Cdd:pfam01576 618 A-------RYAEERDRAEAEAreketralslaraleealeakeelerTNKQLRAEMEDLVSSKDDVGKNVHELERSKRAL 690
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1611 EDETQDLKKKLGQIRNQLQEAQDrhtEAVRCAEKMQDHKQKLEKDnakLKVTVKKQMDKIEELQKNL--LNANLsEDEKE 1688
Cdd:pfam01576 691 EQQVEEMKTQLEELEDELQATED---AKLRLEVNMQALKAQFERD---LQARDEQGEEKRRQLVKQVreLEAEL-EDERK 763
|
890 900
....*....|....*....|....*
gi 1034567240 1689 QLKKLMELKQSLECNLdQEMKKNVE 1713
Cdd:pfam01576 764 QRAQAVAAKKKLELDL-KELEAQID 787
|
|
| DUF3584 |
pfam12128 |
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ... |
1152-1461 |
8.04e-03 |
|
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.
Pssm-ID: 432349 [Multi-domain] Cd Length: 1191 Bit Score: 41.36 E-value: 8.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1152 SLNQEEEKRRNADTLYEKI---REQLRRKEEQY---RKEVE-VKQQLELSLQTLEMELRTVKSNLNQVVQERNDAQRQLS 1224
Cdd:pfam12128 595 WAASEEELRERLDKAEEALqsaREKQAAAEEQLvqaNGELEkASREETFARTALKNARLDLRRLFDEKQSEKDKKNKALA 674
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1225 REQNARMLQ----DGILTNHLSKQKEIEMAQKKMNSENSHSHEE-----EKDLSHKNSMLQEEIAMLRLEIDTIKNQNQE 1295
Cdd:pfam12128 675 ERKDSANERlnslEAQLKQLDKKHQAWLEEQKEQKREARTEKQAywqvvEGALDAQLALLKAAIAARRSGAKAELKALET 754
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1296 KEKKCFEDLKIVKEKNEDLQKTIKqneeTLTQTISQYNGRLSVLTAENAMLNSKLENEKQSKERLEAEVESYHSRLAAAI 1375
Cdd:pfam12128 755 WYKRDLASLGVDPDVIAKLKREIR----TLERKIERIAVRRQEVLRYFDWYQETWLQRRPRLATQLSNIERAISELQQQL 830
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1376 hDRDQSETSKRELELafQRARDECSRLQDKMNFDVSNLKDNNEILSQqlFKTESKLNSLEIEFHHTRDALREKTLGLERV 1455
Cdd:pfam12128 831 -ARLIADTKLRRAKL--EMERKASEKQQVRLSENLRGLRCEMSKLAT--LKEDANSEQAQGSIGERLAQLEDLKLKRDYL 905
|
....*.
gi 1034567240 1456 QKDLSQ 1461
Cdd:pfam12128 906 SESVKK 911
|
|
| CCCAP |
pfam15964 |
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in ... |
1107-1327 |
8.13e-03 |
|
Centrosomal colon cancer autoantigen protein family; CCCAP is a family of proteins found in eukaryotes. CCCAP is also known as SDCCAG8, serologically defined colon cancer antigen 8. It is associated with the centrosome.
Pssm-ID: 435040 [Multi-domain] Cd Length: 703 Bit Score: 41.43 E-value: 8.13e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1107 LTVKIKKMEDKVNVLQRE----LSETKEIKSQLEHQKVEWERELCSLRFSLNQEEEKRRNADTLYEKIREQLRRKEEQYR 1182
Cdd:pfam15964 408 LSQNVAQLEAQVEKVTREknslVSQLEEAQKQLASQEMDVTKVCGEMRYQLNQTKMKKDEAEKEHREYRTKTGRQLEIKD 487
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1183 KEVEvKQQLELSLQTLEME-------------------LRTVKSNLNQVVQERNDAQRQLSREQNARMLQdgiltnhlSK 1243
Cdd:pfam15964 488 QEIE-KLGLELSESKQRLEqaqqdaarareeclkltelLGESEHQLHLTRLEKESIQQSFSNEAKAQALQ--------AQ 558
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1244 QKEIEMAQKKMNSENSHS---HEEEKDLSHKNSM---LQEEIAMLRLEIDTIKNQNQEKEKKCFEDLKIVKEKNEDLQKT 1317
Cdd:pfam15964 559 QREQELTQKMQQMEAQHDktvNEQYSLLTSQNTFiakLKEECCTLAKKLEEITQKSRSEVEQLSQEKEYLQDRLEKLQKR 638
|
250
....*....|
gi 1034567240 1318 IKQNEETLTQ 1327
Cdd:pfam15964 639 NEELEEQCVQ 648
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
1112-1430 |
8.35e-03 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 41.67 E-value: 8.35e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1112 KKMEDKVNVLQRELSETKEIKSQLEHQKVEWERELCSLRFSLNQEEEKR-----RNADTLYEKIREQLRRKEEQYRKEVE 1186
Cdd:PTZ00121 1627 KAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKkaeeaKKAEEDEKKAAEALKKEAEEAKKAEE 1706
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1187 VKQQLELSLQTLEMELRTVKSNLNQVVQERNDAQRQLSREQNARmlqdgiltnhlskqKEIEMAQKKMNSENSHSHEEEK 1266
Cdd:PTZ00121 1707 LKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAK--------------KDEEEKKKIAHLKKEEEKKAEE 1772
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1267 DLSHKNSMLQEEiamLRLEIDTIKNQNQEKEKKCFEDLKIVKEKNEDLQKTIKQNEETLTQTISQyngrlsVLTAENAML 1346
Cdd:PTZ00121 1773 IRKEKEAVIEEE---LDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGNLVINDSKEMEDSAIKE------VADSKNMQL 1843
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1347 NS--KLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARDECSRLQDKMNFDvsnlKDNNEILSQQL 1424
Cdd:PTZ00121 1844 EEadAFEKHKFNKNNENGEDGNKEADFNKEKDLKEDDEEEIEEADEIEKIDKDDIEREIPNNNMA----GKNNDIIDDKL 1919
|
....*.
gi 1034567240 1425 FKTESK 1430
Cdd:PTZ00121 1920 DKDEYI 1925
|
|
| EnvC |
COG4942 |
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ... |
1419-1640 |
8.41e-03 |
|
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain] Cd Length: 377 Bit Score: 40.90 E-value: 8.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1419 ILSQQLFKTESKLNSLEIEFHHTRDALREKTLGLERVQKDLSQTQCQMKEMEQKYQNEQVKVNKYIGKQESVEERLSQLQ 1498
Cdd:COG4942 10 LLALAAAAQADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELE 89
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1499 SENMLLRQQLDD-----------AHNKADNKEKTVINIQDQFHAIVQKLQAeSEKQSLLLEERNKELISECNHLKERQYQ 1567
Cdd:COG4942 90 KEIAELRAELEAqkeelaellraLYRLGRQPPLALLLSPEDFLDAVRRLQY-LKYLAPARREQAEELRADLAELAALRAE 168
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1034567240 1568 YENEKAEREVVVRQLQQELADTLKKQSMSEASLEVTSRYRINLEDETQDLKKKLGQIRNQLQEAQDRHTEAVR 1640
Cdd:COG4942 169 LEAERAELEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAE 241
|
|
| YhaN |
COG4717 |
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown]; |
1271-1654 |
8.86e-03 |
|
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
Pssm-ID: 443752 [Multi-domain] Cd Length: 641 Bit Score: 41.29 E-value: 8.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1271 KNSMLQEEIAMLRLEIDTIKNQNQEkekkcFEDLKIVKEKNEDLQKTIKQNEETLTQTISQYNGRLSVLTAENAM--LNS 1348
Cdd:COG4717 65 KPELNLKELKELEEELKEAEEKEEE-----YAELQEELEELEEELEELEAELEELREELEKLEKLLQLLPLYQELeaLEA 139
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1349 KLENEKQSKERLEAEVESYHSRLAAAIHDRDQSETSKRELELAFQRARDECSRLQDKMNFDVSNLKDNNEILSQQLFKTE 1428
Cdd:COG4717 140 ELAELPERLEELEERLEELRELEEELEELEAELAELQEELEELLEQLSLATEEELQDLAEELEELQQRLAELEEELEEAQ 219
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1429 SKLNSLEIEFHHTRDALREKTLgLERVQK-----------------------------------------DLSQTQCQMK 1467
Cdd:COG4717 220 EELEELEEELEQLENELEAAAL-EERLKEarlllliaaallallglggsllsliltiagvlflvlgllalLFLLLAREKA 298
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1468 EMEQKYQNEQVKVNKYIGKQESVEERLSQLQSENMLLRQQLDDAHNKADNKEKTVINIQDQFHAIvqKLQAESEKQSLLL 1547
Cdd:COG4717 299 SLGKEAEELQALPALEELEEEELEELLAALGLPPDLSPEELLELLDRIEELQELLREAEELEEEL--QLEELEQEIAALL 376
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1548 EERNKELISECNHLKERQYQYENEKAEREVVVRQLQQ------ELADTLKKQSMSEASLEVTSRYRiNLEDETQDLKKKL 1621
Cdd:COG4717 377 AEAGVEDEEELRAALEQAEEYQELKEELEELEEQLEEllgeleELLEALDEEELEEELEELEEELE-ELEEELEELREEL 455
|
410 420 430
....*....|....*....|....*....|...
gi 1034567240 1622 GQIRNQLQEAQDRHteavRCAEKMQDHKQKLEK 1654
Cdd:COG4717 456 AELEAELEQLEEDG----ELAELLQELEELKAE 484
|
|
| DUF5401 |
pfam17380 |
Family of unknown function (DUF5401); This is a family of unknown function found in ... |
1094-1297 |
9.09e-03 |
|
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain] Cd Length: 722 Bit Score: 41.26 E-value: 9.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1094 ERLLELKKNHCELLTVKIKKMEDKVNVLQRELSETKEIKSQLEHQKVEWERELCSLRfslnQEEEKRRNADTLYEKIREQ 1173
Cdd:pfam17380 410 ERQRKIQQQKVEMEQIRAEQEEARQREVRRLEEERAREMERVRLEEQERQQQVERLR----QQEEERKRKKLELEKEKRD 485
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1174 LRRKEEQYRKEVEvkQQLELSLQTLEMELRTVKsnlnQVVQERNDAQRQLSREQNARMLQDgiltnhlSKQKEIEMAQKK 1253
Cdd:pfam17380 486 RKRAEEQRRKILE--KELEERKQAMIEEERKRK----LLEKEMEERQKAIYEEERRREAEE-------ERRKQQEMEERR 552
|
170 180 190 200
....*....|....*....|....*....|....*....|....
gi 1034567240 1254 MNSENSHSHEEEKDlshKNSMLQEEIAMLRleiDTIKNQNQEKE 1297
Cdd:pfam17380 553 RIQEQMRKATEERS---RLEAMEREREMMR---QIVESEKARAE 590
|
|
| ClyA_Cry6Aa-like |
cd22656 |
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes ... |
1611-1734 |
9.68e-03 |
|
Bacillus thuringiensis crystal 6Aa (Cry6Aa) toxin, and similar proteins; This model includes pesticidal Cry6Aa toxin from Bacillus thuringiensis, one of the many parasporal crystal (Cry) toxins produced during the sporulation phase of growth. Many of these proteins are toxic to numerous insect species and have been effectively used as proteinaceous insecticides to directly kill insect pests; some have been used to control insect growth on transgenic agricultural plants. Cry6Aa exists as a protoxin, which is activated by cleavage using trypsin. Structure studies for Cry6Aa support a mechanism of action by pore formation, similar to cytolysin A (ClyA)-type alpha pore-forming toxins (alpha-PFTs) such as HblB, and bioassay and mutation studies show that Cry6Aa is an active pore-forming toxin. Cry6Aa shows atypical features compared to other members of alpha-PFTs, including internal repeat sequences and small loop regions within major alpha helices.
Pssm-ID: 439154 [Multi-domain] Cd Length: 309 Bit Score: 40.43 E-value: 9.68e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1611 EDETQDLKKKL-GQIRNQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKvTVKKQMD-------------KIEELQKN 1676
Cdd:cd22656 109 DEELEEAKKTIkALLDDLLKEAKKYQDKAAKVVDKLTDFENQTEKDQTALE-TLEKALKdlltdeggaiarkEIKDLQKE 187
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*...
gi 1034567240 1677 LLNANlsEDEKEQLKKLMELKQSLECNLDQEMKKNVELEREITGFKNLLKMTRKKLNE 1734
Cdd:cd22656 188 LEKLN--EEYAAKLKAKIDELKALIADDEAKLAAALRLIADLTAADTDLDNLLALIGP 243
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
1609-1945 |
9.80e-03 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 41.16 E-value: 9.80e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1609 NLEDETQDLKKKLGQIRNQLQEAQDRHTEAVRCAEKMQDHKQKLEKDNAKLKVTVKKQMDKIEELQKNLLNANLSEDEKE 1688
Cdd:TIGR04523 37 QLEKKLKTIKNELKNKEKELKNLDKNLNKDEEKINNSNNKIKILEQQIKDLNDKLKKNKDKINKLNSDLSKINSEIKNDK 116
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1689 QLKKLMELKQSlecNLDQEMKKNvelEREITGFKNLLKMTRKKLNEYENGefsfHGDLKTSQFEMDIQINKLKHKIDDLT 1768
Cdd:TIGR04523 117 EQKNKLEVELN---KLEKQKKEN---KKNIDKFLTEIKKKEKELEKLNNK----YNDLKKQKEELENELNLLEKEKLNIQ 186
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1769 AELETAGSKCLHLDTKnqilqeeLLSMKTVQKKCEKLQKNKKKLEQEVINLRSHIERNMVELGQVKQYKQEIEERARQEI 1848
Cdd:TIGR04523 187 KNIDKIKNKLLKLELL-------LSNLKKKIQKNKSLESQISELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQLNQLK 259
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1849 AEKLKEVNLFLQAQAASQENLEQfrennfasmksqmelrIKDLESELSKIKTSQEDFNKTELEKYKQLYLEELKVRKS-- 1926
Cdd:TIGR04523 260 DEQNKIKKQLSEKQKELEQNNKK----------------IKELEKQLNQLKSEISDLNNQKEQDWNKELKSELKNQEKkl 323
|
330 340
....*....|....*....|.
gi 1034567240 1927 --LSSKLTKTNERLAEVNTKL 1945
Cdd:TIGR04523 324 eeIQNQISQNNKIISQLNEQI 344
|
|
| PRK11281 |
PRK11281 |
mechanosensitive channel MscK; |
1447-1677 |
9.87e-03 |
|
mechanosensitive channel MscK;
Pssm-ID: 236892 [Multi-domain] Cd Length: 1113 Bit Score: 41.05 E-value: 9.87e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1447 EKTLGL----ERVQKDLSQTQCQMKEMEQKYQNEQVKVNKYIGK-QESVEERLSQL---QSENML--LRQQLDDAHNKAD 1516
Cdd:PRK11281 66 EQTLALldkiDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDDnDEETRETLSTLslrQLESRLaqTLDQLQNAQNDLA 145
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1517 NKEKTVINIQDQFHAiVQKLQAESEKQSLLLEER-NKELISECNHLKERQYQYENEKAEREVVVRQLQQELADTLKKQSM 1595
Cdd:PRK11281 146 EYNSQLVSLQTQPER-AQAALYANSQRLQQIRNLlKGGKVGGKALRPSQRVLLQAEQALLNAQNDLQRKSLEGNTQLQDL 224
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034567240 1596 SEASLEVTSRYRINLEDETQDLK-----KKLGQIRNQLQEAQdrhteAVRCAEKMQDH---KQKLEKdNAKLKVTVKKQM 1667
Cdd:PRK11281 225 LQKQRDYLTARIQRLEHQLQLLQeainsKRLTLSEKTVQEAQ-----SQDEAARIQANplvAQELEI-NLQLSQRLLKAT 298
|
250
....*....|.
gi 1034567240 1668 DKIEEL-QKNL 1677
Cdd:PRK11281 299 EKLNTLtQQNL 309
|
|
|