NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1370473301|ref|XP_024306891|]
View 

microtubule cross-linking factor 1 isoform X20 [Homo sapiens]

Protein Classification

SOGA and DUF4482 domain-containing protein( domain architecture ID 13530637)

protein containing domains SMC_prok_B, SOGA, and DUF4482

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
DUF4482 pfam14818
Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately ...
906-1029 6.46e-51

Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately 140 amino acids in length. The family is found in association with pfam11365.


:

Pssm-ID: 464333 [Multi-domain]  Cd Length: 138  Bit Score: 176.41  E-value: 6.46e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  906 MDLRWQIHHSEKNWNREKVELLDRLDRDRQEWERQKKEFLWRIEQLQKENSPRR------------GGSFLCDQKDGNVR 973
Cdd:pfam14818    1 MDLRWQLQHTEKNWHREKMELLDRFDRERQEWESQKKIMQKKIEQLQREVSLRRkinmnerakvidGEKFVPDQKESSSP 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1370473301  974 PFPHQGSLRMPR--PVAMWPCADADSIPFEDRPLSKLKESDRCSASENLYLDALSLDD 1029
Cdd:pfam14818   81 PFPDSGQCEFPRmnHPGSLSKSDSDEESFLDEGNQKLKEQKRCKASENLFLDALSLDN 138
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
141-235 2.94e-37

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


:

Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 135.50  E-value: 2.94e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  141 DSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANI 220
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSDSSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 1370473301  221 LGRKIVELEVENRGL 235
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
270-362 9.41e-37

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


:

Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 133.96  E-value: 9.41e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  270 SSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFKFEPPREPGWLGEGASPGA--GGGAPLQEELKSARLQISE 347
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSdsSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 1370473301  348 LSGKVLKLQHENHAL 362
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
SMC_prok_B super family cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
9-296 1.31e-11

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


The actual alignment was detected with superfamily member TIGR02168:

Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 69.70  E-value: 1.31e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    9 LEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAK 88
Cdd:TIGR02168  707 LEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEE 786
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   89 AEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGR 168
Cdd:TIGR02168  787 LEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEE 866
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  169 EKDELEQELQKYKSLYGDVDSPLptgeaggppstreAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQER 248
Cdd:TIGR02168  867 LIEELESELEALLNERASLEEAL-------------ALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEG 933
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 1370473301  249 EGPGRDHAPSIPTSPFGDSLEsstELRRHLQFVEEEAELLRRSISEIE 296
Cdd:TIGR02168  934 LEVRIDNLQERLSEEYSLTLE---EAEALENKIEDDEEEARRRLKRLE 978
 
Name Accession Description Interval E-value
DUF4482 pfam14818
Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately ...
906-1029 6.46e-51

Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately 140 amino acids in length. The family is found in association with pfam11365.


Pssm-ID: 464333 [Multi-domain]  Cd Length: 138  Bit Score: 176.41  E-value: 6.46e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  906 MDLRWQIHHSEKNWNREKVELLDRLDRDRQEWERQKKEFLWRIEQLQKENSPRR------------GGSFLCDQKDGNVR 973
Cdd:pfam14818    1 MDLRWQLQHTEKNWHREKMELLDRFDRERQEWESQKKIMQKKIEQLQREVSLRRkinmnerakvidGEKFVPDQKESSSP 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1370473301  974 PFPHQGSLRMPR--PVAMWPCADADSIPFEDRPLSKLKESDRCSASENLYLDALSLDD 1029
Cdd:pfam14818   81 PFPDSGQCEFPRmnHPGSLSKSDSDEESFLDEGNQKLKEQKRCKASENLFLDALSLDN 138
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
141-235 2.94e-37

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 135.50  E-value: 2.94e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  141 DSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANI 220
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSDSSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 1370473301  221 LGRKIVELEVENRGL 235
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
270-362 9.41e-37

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 133.96  E-value: 9.41e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  270 SSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFKFEPPREPGWLGEGASPGA--GGGAPLQEELKSARLQISE 347
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSdsSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 1370473301  348 LSGKVLKLQHENHAL 362
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
9-296 1.31e-11

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 69.70  E-value: 1.31e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    9 LEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAK 88
Cdd:TIGR02168  707 LEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEE 786
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   89 AEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGR 168
Cdd:TIGR02168  787 LEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEE 866
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  169 EKDELEQELQKYKSLYGDVDSPLptgeaggppstreAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQER 248
Cdd:TIGR02168  867 LIEELESELEALLNERASLEEAL-------------ALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEG 933
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 1370473301  249 EGPGRDHAPSIPTSPFGDSLEsstELRRHLQFVEEEAELLRRSISEIE 296
Cdd:TIGR02168  934 LEVRIDNLQERLSEEYSLTLE---EAEALENKIEDDEEEARRRLKRLE 978
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
15-415 2.06e-08

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 59.18  E-value: 2.06e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   15 QLQELRRELDRANKNcRILQYRLRKAEQKSLKVAetgqvdgelIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENE 94
Cdd:COG1196    201 QLEPLERQAEKAERY-RELKEELKELEAELLLLK---------LRELEAELEELEAELEELEAELEELEAELAELEAELE 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   95 TLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELE 174
Cdd:COG1196    271 ELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAE 350
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  175 QELQkykslygdvdsplptgeaggppSTREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQEREGPGRD 254
Cdd:COG1196    351 EELE----------------------EAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEA 408
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  255 HApsiptspfgdslESSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKfkfepprepgwlgegaspgaggGAPL 334
Cdd:COG1196    409 EE------------ALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEE----------------------EAEL 454
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  335 QEELKSARLQISELSGKVLKLQHENHALLSNIQRCDLAAHLGLRApsprdsdAESDAGKKESDGEESRLPQPKREGPVGG 414
Cdd:COG1196    455 EEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEA-------EADYEGFLEGVKAALLLAGLRGLAGAVA 527

                   .
gi 1370473301  415 E 415
Cdd:COG1196    528 V 528
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
2-310 1.30e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 50.06  E-value: 1.30e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    2 EEMRDSYLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVD--GELIRSLEQDLKVakdvsvrlhHEL 79
Cdd:PRK03918   447 EEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKelAEQLKELEEKLKK---------YNL 517
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   80 KTVEEKrakaEDENETLRQQMIEVEISKQALQNELERLKEssLKRRStREMYKEKKTfnqddsadlrcqlqfAKEEAFLM 159
Cdd:PRK03918   518 EELEKK----AEEYEKLKEKLIKLKGEIKSLKKELEKLEE--LKKKL-AELEKKLDE---------------LEEELAEL 575
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  160 RKKMAKLGREK-DELEQELQKYKSLYGDVDsplptgEAGGPPSTREAELKlRLKLVEEEANILGRKIVELEVENRGLKAE 238
Cdd:PRK03918   576 LKELEELGFESvEELEERLKELEPFYNEYL------ELKDAEKELEREEK-ELKKLEEELDKAFEELAETEKRLEELRKE 648
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1370473301  239 MEDMRGQQEREgpgrdhapsiptsPFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFK 310
Cdd:PRK03918   649 LEELEKKYSEE-------------EYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEERE 707
HCR pfam07111
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...
15-179 1.27e-04

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.


Pssm-ID: 284517 [Multi-domain]  Cd Length: 749  Bit Score: 46.67  E-value: 1.27e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   15 QLQELRRELDRANKNcriLQYRLRKAEQKSLKVAETGQVD----GELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAE 90
Cdd:pfam07111  482 ELEQLREERNRLDAE---LQLSAHLIQQEVGRAREQGEAErqqlSEVAQQLEQELQRAQESLASVGQQLEVARQGQQEST 558
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   91 DENETLRQQMI-EVEISKQALQN---ELE-RLKE--SSLKRR---STREMYK---------EKKTFNQDDSADL-RCQLQ 150
Cdd:pfam07111  559 EEAASLRQELTqQQEIYGQALQEkvaEVEtRLREqlSDTKRRlneARREQAKavvslrqiqHRATQEKERNQELrRLQDE 638
                          170       180
                   ....*....|....*....|....*....
gi 1370473301  151 FAKEEAFLMRKKMAKLGREKDELEQELQK 179
Cdd:pfam07111  639 ARKEEGQRLARRVQELERDKNLMLATLQQ 667
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
60-535 3.42e-03

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 42.34  E-value: 3.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   60 SLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVE------ISKQALQNELERLKESSLKRRSTREM--- 130
Cdd:TIGR00606  581 SKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEdklfdvCGSQDEESDLERLKEEIEKSSKQRAMlag 660
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  131 --------------------------YKEKKTFNQDdSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLY 184
Cdd:TIGR00606  661 atavysqfitqltdenqsccpvcqrvFQTEAELQEF-ISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSII 739
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  185 GDVDSPLP-TGEAGGPPSTREAELKLRLklvEEEANILGRKIVELEVENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSP 263
Cdd:TIGR00606  740 DLKEKEIPeLRNKLQKVNRDIQRLKNDI---EEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQ 816
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  264 FGDSLESSTELRrhlQFVEEEAELLRRSISEIE-------DHNRQLTHELSKFKfepprepgwlgegaspgagggaplqe 336
Cdd:TIGR00606  817 GSDLDRTVQQVN---QEKQEKQHELDTVVSKIElnrkliqDQQEQIQHLKSKTN-------------------------- 867
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  337 ELKSARLQISELSGKVLKLQHENHALLSNIQRCDLAAHLGLRAPSPRDSDAESDAGKKESDGEESRLPQPKREGPVggeS 416
Cdd:TIGR00606  868 ELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKKAQDKV---N 944
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  417 DSEEMFEKTSGFGSGKPSEASEPCPTELL-KAREDSEYLVTLKHEAQRLERTVERLITDTDSFlhdaglrggaplpgpgl 495
Cdd:TIGR00606  945 DIKEKVKNIHGYMKDIENKIQDGKDDYLKqKETELNTVNAQLEECEKHQEKINEDMRLMRQDI----------------- 1007
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|
gi 1370473301  496 qGEEEQGEGDQQEPQLLGTINAKMKAFKKELQAFLEQVNR 535
Cdd:TIGR00606 1008 -DTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQ 1046
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
74-241 5.34e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 40.29  E-value: 5.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   74 RLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEK--KTFNQDDSADLRCQLQF 151
Cdd:COG1579     21 RLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQlgNVRNNKEYEALQKEIES 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  152 AKEEAFLMRKKMAKLGREKDELEQELQKYKSLYgdvdsplptgeaggppSTREAELKLRLKLVEEEANILGRKIVELEVE 231
Cdd:COG1579    101 LKRRISDLEDEILELMERIEELEEELAELEAEL----------------AELEAELEEKKAELDEELAELEAELEELEAE 164
                          170
                   ....*....|
gi 1370473301  232 NRGLKAEMED 241
Cdd:COG1579    165 REELAAKIPP 174
 
Name Accession Description Interval E-value
DUF4482 pfam14818
Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately ...
906-1029 6.46e-51

Domain of unknown function (DUF4482); This family is found in eukaryotes, and is approximately 140 amino acids in length. The family is found in association with pfam11365.


Pssm-ID: 464333 [Multi-domain]  Cd Length: 138  Bit Score: 176.41  E-value: 6.46e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  906 MDLRWQIHHSEKNWNREKVELLDRLDRDRQEWERQKKEFLWRIEQLQKENSPRR------------GGSFLCDQKDGNVR 973
Cdd:pfam14818    1 MDLRWQLQHTEKNWHREKMELLDRFDRERQEWESQKKIMQKKIEQLQREVSLRRkinmnerakvidGEKFVPDQKESSSP 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 1370473301  974 PFPHQGSLRMPR--PVAMWPCADADSIPFEDRPLSKLKESDRCSASENLYLDALSLDD 1029
Cdd:pfam14818   81 PFPDSGQCEFPRmnHPGSLSKSDSDEESFLDEGNQKLKEQKRCKASENLFLDALSLDN 138
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
141-235 2.94e-37

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 135.50  E-value: 2.94e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  141 DSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANI 220
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSDSSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 1370473301  221 LGRKIVELEVENRGL 235
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
SOGA pfam11365
Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, ...
270-362 9.41e-37

Protein SOGA; The SOGA (suppressor of glucose by autophagy) family consists of proteins SOGA1, SOGA2, and SOGA3. SOGA1 regulates autophagy by playing a role in the reduction of glucose production in an adiponectin and insulin dependent manner.


Pssm-ID: 463264 [Multi-domain]  Cd Length: 95  Bit Score: 133.96  E-value: 9.41e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  270 SSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFKFEPPREPGWLGEGASPGA--GGGAPLQEELKSARLQISE 347
Cdd:pfam11365    1 SSAELRRQLQFVEEEAELLRRSLSEIEDHNKQLTNELNKYKSKYGPDESSLSDGEGGGSdsSREAELQEELKLARLQINE 80
                           90
                   ....*....|....*
gi 1370473301  348 LSGKVLKLQHENHAL 362
Cdd:pfam11365   81 LSGKVMKLQYENRVL 95
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
9-296 1.31e-11

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 69.70  E-value: 1.31e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    9 LEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAK 88
Cdd:TIGR02168  707 LEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEE 786
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   89 AEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGR 168
Cdd:TIGR02168  787 LEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEE 866
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  169 EKDELEQELQKYKSLYGDVDSPLptgeaggppstreAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQER 248
Cdd:TIGR02168  867 LIEELESELEALLNERASLEEAL-------------ALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEG 933
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 1370473301  249 EGPGRDHAPSIPTSPFGDSLEsstELRRHLQFVEEEAELLRRSISEIE 296
Cdd:TIGR02168  934 LEVRIDNLQERLSEEYSLTLE---EAEALENKIEDDEEEARRRLKRLE 978
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
58-356 1.29e-09

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 63.54  E-value: 1.29e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   58 IRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRS-------TREM 130
Cdd:TIGR02168  679 IEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEEriaqlskELTE 758
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  131 YKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDvdsplpTGEAGGPPSTREAELKLR 210
Cdd:TIGR02168  759 LEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTL------LNEEAANLRERLESLERR 832
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  211 LKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQEREGPGRDHApsiptspfgdsLESSTELRRHLQFVEEEAELLRR 290
Cdd:TIGR02168  833 IAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEAL-----------LNERASLEEALALLRSELEELSE 901
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1370473301  291 SISEIEDHNRQLTHELSKfkfepprepgwlgegaspgagggapLQEELKSARLQISELSGKVLKLQ 356
Cdd:TIGR02168  902 ELRELESKRSELRRELEE-------------------------LREKLAQLELRLEGLEVRIDNLQ 942
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
15-415 2.06e-08

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 59.18  E-value: 2.06e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   15 QLQELRRELDRANKNcRILQYRLRKAEQKSLKVAetgqvdgelIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENE 94
Cdd:COG1196    201 QLEPLERQAEKAERY-RELKEELKELEAELLLLK---------LRELEAELEELEAELEELEAELEELEAELAELEAELE 270
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   95 TLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELE 174
Cdd:COG1196    271 ELRLELEELELELEEAQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEELEELEEELEEAE 350
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  175 QELQkykslygdvdsplptgeaggppSTREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQEREGPGRD 254
Cdd:COG1196    351 EELE----------------------EAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEA 408
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  255 HApsiptspfgdslESSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKfkfepprepgwlgegaspgaggGAPL 334
Cdd:COG1196    409 EE------------ALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEE----------------------EAEL 454
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  335 QEELKSARLQISELSGKVLKLQHENHALLSNIQRCDLAAHLGLRApsprdsdAESDAGKKESDGEESRLPQPKREGPVGG 414
Cdd:COG1196    455 EEEEEALLELLAELLEEAALLEAALAELLEELAEAAARLLLLLEA-------EADYEGFLEGVKAALLLAGLRGLAGAVA 527

                   .
gi 1370473301  415 E 415
Cdd:COG1196    528 V 528
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
83-358 1.11e-07

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 57.00  E-value: 1.11e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   83 EEKRAKAEDENETLRQQMIEVEISKQALQNELERLKesslKRRSTREMYKEKKTFNQDDSADLRC-QLQFAKEEAFLMRK 161
Cdd:TIGR02169  169 DRKKEKALEELEEVEENIERLDLIIDEKRQQLERLR----REREKAERYQALLKEKREYEGYELLkEKEALERQKEAIER 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  162 KMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANI----------------LGRKI 225
Cdd:TIGR02169  245 QLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIaslersiaekereledAEERL 324
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  226 VELEVENRGLKAEMEDMRGQQEREGPGRDHApsipTSPFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHE 305
Cdd:TIGR02169  325 AKLEAEIDKLLAEIEELEREIEEERKRRDKL----TEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKRE 400
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1370473301  306 LSKFKFEPPRepgwLGEGASPGAGGGAPLQEELKSARLQISELSGKVLKLQHE 358
Cdd:TIGR02169  401 INELKRELDR----LQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALE 449
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
10-312 1.42e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 56.60  E-value: 1.42e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   10 EEDVYQLQELRRELDRANKNCRI-----LQYRLRKAEQKSLKVAETGQVDGELIRSLEQdLKVAKDVSVRLHHELktvEE 84
Cdd:TIGR02168  185 RENLDRLEDILNELERQLKSLERqaekaERYKELKAELRELELALLVLRLEELREELEE-LQEELKEAEEELEEL---TA 260
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   85 KRAKAEDENETLRQQMIEVEISKQALQNELERLKE--SSLKRRstREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKK 162
Cdd:TIGR02168  261 ELQELEEKLEELRLEVSELEEEIEELQKELYALANeiSRLEQQ--KQILRERLANLERQLEELEAQLEELESKLDELAEE 338
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  163 MAKLGREKDELEQELQkykslygDVDSPLPTGEAggppstREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDM 242
Cdd:TIGR02168  339 LAELEEKLEELKEELE-------SLEAELEELEA------ELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERL 405
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1370473301  243 RGQQER--------EGPGRDHAPSIPTSPFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFKFE 312
Cdd:TIGR02168  406 EARLERledrrerlQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERE 483
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
9-308 5.23e-06

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 50.67  E-value: 5.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    9 LEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAK 88
Cdd:COG4372     54 LEQAREELEQLEEELEQARSELEQLEEELEELNEQLQAAQAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQ 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   89 AEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDS--ADLRCQLQFAKEEAFLMRKKMAKL 166
Cdd:COG4372    134 LEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQALSEAEAEQALDEllKEANRNAEKEEELAEAEKLIESLP 213
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  167 GREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPST---REAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMR 243
Cdd:COG4372    214 RELAEELLEAKDSLEAKLGLALSALLDALELEEDKEellEEVILKEIEELELAILVEKDTEEEELEIAALELEALEEAAL 293
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1370473301  244 GQQEREGPGRDHAPSIPTSPFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSK 308
Cdd:COG4372    294 ELKLLALLLNLAALSLIGALEDALLAALLELAKKLELALAILLAELADLLQLLLVGLLDNDVLEL 358
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
3-301 5.34e-06

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 51.61  E-value: 5.34e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    3 EMRDSYLEEDVYQLQELRRELDRankncRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKvakdvsvRLHHElktv 82
Cdd:TIGR02169  764 EARIEELEEDLHKLEEALNDLEA-----RLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLN-------RLTLE---- 827
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   83 eekRAKAEDENETLRQQMIEVEISKQALQNELERLKessLKRRSTREMYKEKKTFnqddSADLRCQLQFAKEEAFLMRKK 162
Cdd:TIGR02169  828 ---KEYLEKEIQELQEQRIDLKEQIKSIEKEIENLN---GKKEELEEELEELEAA----LRDLESRLGDLKKERDELEAQ 897
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  163 MAKLGREKDELEQELQKYKSLYGDVDSPLptGEAGGPPSTREAELKLRLKLVEEEANI--LGRKIVELEVENRGLkaEME 240
Cdd:TIGR02169  898 LRELERKIEELEAQIEKKRKRLSELKAKL--EALEEELSEIEDPKGEDEEIPEEELSLedVQAELQRVEEEIRAL--EPV 973
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1370473301  241 DMRGQQEREgpgrdhapsiptspfgDSLESSTELRRHLQFVEEEAELLRRSISEIEDHNRQ 301
Cdd:TIGR02169  974 NMLAIQEYE----------------EVLKRLDELKEKRAKLEEERKAILERIEEYEKKKRE 1018
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
10-365 6.71e-06

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 50.79  E-value: 6.71e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   10 EEDVYQLQELRRELDRANKNCRILQYRLR--KAEQKSLKvaetGQVDGELIRSLEQDLKVAKDVSVRLHHELKTVEEKRA 87
Cdd:TIGR04523  263 NKIKKQLSEKQKELEQNNKKIKELEKQLNqlKSEISDLN----NQKEQDWNKELKSELKNQEKKLEEIQNQISQNNKIIS 338
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   88 KAEDENETLRQQMIEVEISKQALQNELERlKESSLKR-RSTREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKL 166
Cdd:TIGR04523  339 QLNEQISQLKKELTNSESENSEKQRELEE-KQNEIEKlKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKL 417
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  167 GREKDELEQELQKYKSLYGDVDSPLPTGEaggppsTREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMrgQQ 246
Cdd:TIGR04523  418 QQEKELLEKEIERLKETIIKNNSEIKDLT------NQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQK--QK 489
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  247 EREGPGRDHapSIPTSPFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFKFepprepgwlgegasp 326
Cdd:TIGR04523  490 ELKSKEKEL--KKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDF--------------- 552
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|.
gi 1370473301  327 gagggaplqeELKSARL--QISELSGKVLKLQHENHALLSN 365
Cdd:TIGR04523  553 ----------ELKKENLekEIDEKNKEIEELKQTQKSLKKK 583
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
15-267 6.92e-06

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 50.15  E-value: 6.92e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   15 QLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETgqvdgelIRSLEQDLKVAKDvsvrlhhELKTVEEKRAKAEDENE 94
Cdd:COG4942     21 AAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQ-------LAALERRIAALAR-------RIRALEQELAALEAELA 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   95 TLRQQMIEVEISKQALQNELERLkesslkrrsTREMYKEKKT------FNQDDSAD-------LRCQLQFAKEEAFLMRK 161
Cdd:COG4942     87 ELEKEIAELRAELEAQKEELAEL---------LRALYRLGRQpplallLSPEDFLDavrrlqyLKYLAPARREQAEELRA 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  162 KMAKLGREKDELEQELQKYKSLYGDVdsplptgeaggppSTREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMED 241
Cdd:COG4942    158 DLAELAALRAELEAERAELEALLAEL-------------EEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEE 224
                          250       260
                   ....*....|....*....|....*..
gi 1370473301  242 MRGQQER-EGPGRDHAPSIPTSPFGDS 267
Cdd:COG4942    225 LEALIARlEAEAAAAAERTPAAGFAAL 251
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
92-378 7.51e-06

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 50.83  E-value: 7.51e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   92 ENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKD 171
Cdd:TIGR02168  678 EIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELT 757
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  172 ELEQELQKYKSLYGDVDSPLPTGEAggppstREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEmedmrgqqeregp 251
Cdd:TIGR02168  758 ELEAEIEELEERLEEAEEELAEAEA------EIEELEAQIEQLKEELKALREALDELRAELTLLNEE------------- 818
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  252 grdhapsiptspFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFKfEPPREpgwLGEGASPGAGGG 331
Cdd:TIGR02168  819 ------------AANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELE-ELIEE---LESELEALLNER 882
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 1370473301  332 APLQEELKSARLQISELSGKVLKLQHENHALLSNIQRC-DLAAHLGLR 378
Cdd:TIGR02168  883 ASLEEALALLRSELEELSEELRELESKRSELRRELEELrEKLAQLELR 930
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
15-261 1.01e-05

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 49.90  E-value: 1.01e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   15 QLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETgqvdgelIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENE 94
Cdd:COG4372     46 ELEQLREELEQAREELEQLEEELEQARSELEQLEEE-------LEELNEQLQAAQAELAQAQEELESLQEEAEELQEELE 118
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   95 TLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQfakeeaflmRKKMAKLGREKDELE 174
Cdd:COG4372    119 ELQKERQDLEQQRKQLEAQIAELQSEIAEREEELKELEEQLESLQEELAALEQELQ---------ALSEAEAEQALDELL 189
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  175 QELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQEREGPGRD 254
Cdd:COG4372    190 KEANRNAEKEEELAEAEKLIESLPRELAEELLEAKDSLEAKLGLALSALLDALELEEDKEELLEEVILKEIEELELAILV 269

                   ....*..
gi 1370473301  255 HAPSIPT 261
Cdd:COG4372    270 EKDTEEE 276
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
2-310 1.30e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 50.06  E-value: 1.30e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    2 EEMRDSYLEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVD--GELIRSLEQDLKVakdvsvrlhHEL 79
Cdd:PRK03918   447 EEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKelAEQLKELEEKLKK---------YNL 517
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   80 KTVEEKrakaEDENETLRQQMIEVEISKQALQNELERLKEssLKRRStREMYKEKKTfnqddsadlrcqlqfAKEEAFLM 159
Cdd:PRK03918   518 EELEKK----AEEYEKLKEKLIKLKGEIKSLKKELEKLEE--LKKKL-AELEKKLDE---------------LEEELAEL 575
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  160 RKKMAKLGREK-DELEQELQKYKSLYGDVDsplptgEAGGPPSTREAELKlRLKLVEEEANILGRKIVELEVENRGLKAE 238
Cdd:PRK03918   576 LKELEELGFESvEELEERLKELEPFYNEYL------ELKDAEKELEREEK-ELKKLEEELDKAFEELAETEKRLEELRKE 648
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1370473301  239 MEDMRGQQEREgpgrdhapsiptsPFGDSLESSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFK 310
Cdd:PRK03918   649 LEELEKKYSEE-------------EYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEERE 707
PTZ00121 PTZ00121
MAEBL; Provisional
1-302 1.47e-05

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 50.14  E-value: 1.47e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    1 MEEMRDSYLEEDVYQLQELRREldranKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAKDVSVRLHHELK 80
Cdd:PTZ00121  1594 IEEVMKLYEEEKKMKAEEAKKA-----EEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAK 1668
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   81 TVEEKRAKAEdenETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLqfAKEEAFLMR 160
Cdd:PTZ00121  1669 KAEEDKKKAE---EAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEE--AKKEAEEDK 1743
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  161 KKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANILGRKIVELEVENRG-----L 235
Cdd:PTZ00121  1744 KKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNFANIIEGGKEGnlvinD 1823
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1370473301  236 KAEMEDMRGQQ--EREGPGRDHAPSIPTSPFGDSLESSTELRRHLQFvEEEAELLRRSISEIE--DHNRQL 302
Cdd:PTZ00121  1824 SKEMEDSAIKEvaDSKNMQLEEADAFEKHKFNKNNENGEDGNKEADF-NKEKDLKEDDEEEIEeaDEIEKI 1893
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
16-310 3.44e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 48.91  E-value: 3.44e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   16 LQELRRELDRankncrilqyrlRKAEQKSLkVAETGQVDgELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENET 95
Cdd:PRK03918   167 LGEVIKEIKR------------RIERLEKF-IKRTENIE-ELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKE 232
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   96 L---RQQMIEVEISKQALQNELERLKEsslKRRSTREMYKEKKtfnqDDSADLRCQ------LQFAKEEAFLMRKKMAKL 166
Cdd:PRK03918   233 LeelKEEIEELEKELESLEGSKRKLEE---KIRELEERIEELK----KEIEELEEKvkelkeLKEKAEEYIKLSEFYEEY 305
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  167 GREKDELEQELQKYKSLYGDVDSPLPTGEAggpPSTREAELKLRLKLVEEEANILGRKIVELEVenrgLKAEMEDMRGQQ 246
Cdd:PRK03918   306 LDELREIEKRLSRLEEEINGIEERIKELEE---KEERLEELKKKLKELEKRLEELEERHELYEE----AKAKKEELERLK 378
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1370473301  247 EREGPgrdhapsiptspfgdslESSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFK 310
Cdd:PRK03918   379 KRLTG-----------------LTPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIKELK 425
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
9-310 6.89e-05

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 47.75  E-value: 6.89e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    9 LEEDVYQLQELRRELDRANKNCRILQYRLR---------KAEQKSL--KVAETGQVDG------ELIRSLEQDLKVAKDV 71
Cdd:PRK03918   233 LEELKEEIEELEKELESLEGSKRKLEEKIReleerieelKKEIEELeeKVKELKELKEkaeeyiKLSEFYEEYLDELREI 312
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   72 SVRL---HHELKTVEEKRAKAEDENETLRqqmiEVEISKQALQNELERLKESSLK----RRSTREMYKEKKTFNQDDSAD 144
Cdd:PRK03918   313 EKRLsrlEEEINGIEERIKELEEKEERLE----ELKKKLKELEKRLEELEERHELyeeaKAKKEELERLKKRLTGLTPEK 388
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  145 LRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANILGRK 224
Cdd:PRK03918   389 LEKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKKAKGKCPVCGRELTEEHRKELLEEYTAELKRIEKEL 468
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  225 IvELEVENRGLKAEMEDMRGQQEREgpgRDHAPSIPTSPFGDSLESSTEL--RRHLQFVEEEAELLRRSISEIEDHNRQL 302
Cdd:PRK03918   469 K-EIEEKERKLRKELRELEKVLKKE---SELIKLKELAEQLKELEEKLKKynLEELEKKAEEYEKLKEKLIKLKGEIKSL 544

                   ....*...
gi 1370473301  303 THELSKFK 310
Cdd:PRK03918   545 KKELEKLE 552
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
9-199 1.06e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 47.22  E-value: 1.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    9 LEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGElIRSLEQDLKVAKDVSVrlhhELKTVEEKRAK 88
Cdd:COG4913    622 LEEELAEAEERLEALEAELDALQERREALQRLAEYSWDEIDVASAERE-IAELEAELERLDASSD----DLAALEEQLEE 696
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   89 AEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKM----- 163
Cdd:COG4913    697 LEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEERFAAALGDAVERELRENLeerid 776
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 1370473301  164 ---AKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGP 199
Cdd:COG4913    777 alrARLNRAEEELERAMRAFNREWPAETADLDADLESLP 815
HCR pfam07111
Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha ...
15-179 1.27e-04

Alpha helical coiled-coil rod protein (HCR); This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.


Pssm-ID: 284517 [Multi-domain]  Cd Length: 749  Bit Score: 46.67  E-value: 1.27e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   15 QLQELRRELDRANKNcriLQYRLRKAEQKSLKVAETGQVD----GELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAE 90
Cdd:pfam07111  482 ELEQLREERNRLDAE---LQLSAHLIQQEVGRAREQGEAErqqlSEVAQQLEQELQRAQESLASVGQQLEVARQGQQEST 558
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   91 DENETLRQQMI-EVEISKQALQN---ELE-RLKE--SSLKRR---STREMYK---------EKKTFNQDDSADL-RCQLQ 150
Cdd:pfam07111  559 EEAASLRQELTqQQEIYGQALQEkvaEVEtRLREqlSDTKRRlneARREQAKavvslrqiqHRATQEKERNQELrRLQDE 638
                          170       180
                   ....*....|....*....|....*....
gi 1370473301  151 FAKEEAFLMRKKMAKLGREKDELEQELQK 179
Cdd:pfam07111  639 ARKEEGQRLARRVQELERDKNLMLATLQQ 667
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
9-244 2.24e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 46.21  E-value: 2.24e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    9 LEEDVYQLQELRRELDRANKNCRILQYRLRKAEqkslkvaetgqvdgELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAK 88
Cdd:PRK03918   226 LEKEVKELEELKEEIEELEKELESLEGSKRKLE--------------EKIRELEERIEELKKEIEELEEKVKELKELKEK 291
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   89 AEdENETLRQQMIEVEISKQALQNELERLKEsslKRRSTREMYKEkktfnqddsadlrcqLQFAKEEAFLMRKKMAKLGR 168
Cdd:PRK03918   292 AE-EYIKLSEFYEEYLDELREIEKRLSRLEE---EINGIEERIKE---------------LEEKEERLEELKKKLKELEK 352
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  169 EKDELEQELQKY---KSLYGDVDSpLPTGEAGGPPSTREAELKL---RLKLVEEEANILGRKIVELEVENRGLKAEMEDM 242
Cdd:PRK03918   353 RLEELEERHELYeeaKAKKEELER-LKKRLTGLTPEKLEKELEElekAKEEIEEEISKITARIGELKKEIKELKKAIEEL 431

                   ..
gi 1370473301  243 RG 244
Cdd:PRK03918   432 KK 433
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
1-175 2.39e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 45.88  E-value: 2.39e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    1 MEEMRDSYLEEDVYQLQELRRELDRANKN--CRILQYRLRKAEQKSLKVAETgQVDGELIRSLEQDLKVAKDVSVRLHHE 78
Cdd:pfam17380  407 LEEERQRKIQQQKVEMEQIRAEQEEARQRevRRLEEERAREMERVRLEEQER-QQQVERLRQQEEERKRKKLELEKEKRD 485
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   79 LKTVEEKRAKA-EDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQ-DDSADLRCQLQFAKEEa 156
Cdd:pfam17380  486 RKRAEEQRRKIlEKELEERKQAMIEEERKRKLLEKEMEERQKAIYEEERRREAEEERRKQQEmEERRRIQEQMRKATEE- 564
                          170
                   ....*....|....*....
gi 1370473301  157 flmRKKMAKLGREKDELEQ 175
Cdd:pfam17380  565 ---RSRLEAMEREREMMRQ 580
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
71-302 2.88e-04

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 45.83  E-value: 2.88e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   71 VSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQ 150
Cdd:TIGR02169  668 FSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLS 747
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  151 FAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDV-----DSPLPTGEAGGPP----------STREAELKLRLKLVE 215
Cdd:TIGR02169  748 SLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLearlsHSRIPEIQAELSKleeevsrieaRLREIEQKLNRLTLE 827
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  216 EEanILGRKIVELEVENRGLKaEMEDMRGQQEREGPGRdhapsiptspFGDSLESSTELRRHLQFVEEEAELLRRSISEI 295
Cdd:TIGR02169  828 KE--YLEKEIQELQEQRIDLK-EQIKSIEKEIENLNGK----------KEELEEELEELEAALRDLESRLGDLKKERDEL 894

                   ....*..
gi 1370473301  296 EDHNRQL 302
Cdd:TIGR02169  895 EAQLREL 901
Tropomyosin pfam00261
Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 ...
15-191 3.22e-04

Tropomyosin; Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites. The protein is best known for its role in regulating the interaction between actin and myosin in muscle contraction, but is also involved in the organization and dynamics of the cytoskeleton in non-muscle cells. There are multiple cell-specific isoforms, expressed by alternative promoters and alternative RNA processing of at least four genes. Muscle isoforms of tropomyosin are characterized by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region.


Pssm-ID: 459736 [Multi-domain]  Cd Length: 235  Bit Score: 44.25  E-value: 3.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   15 QLQELRRELDRANKNCRILQYRLRKAEQKS------LKVAE-TGQVDGELIRSLEQDLKVAKDVSVRLHHELKTVEEKRA 87
Cdd:pfam00261   44 RIQLLEEELERTEERLAEALEKLEEAEKAAdesergRKVLEnRALKDEEKMEILEAQLKEAKEIAEEADRKYEEVARKLV 123
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   88 KAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLG 167
Cdd:pfam00261  124 VVEGDLERAEERAELAESKIVELEEELKVVGNNLKSLEASEEKASEREDKYEEQIRFLTEKLKEAETRAEFAERSVQKLE 203
                          170       180
                   ....*....|....*....|....
gi 1370473301  168 REKDELEQELQKYKSLYGDVDSPL 191
Cdd:pfam00261  204 KEVDRLEDELEAEKEKYKAISEEL 227
MAD pfam05557
Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint ...
15-248 4.87e-04

Mitotic checkpoint protein; This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.


Pssm-ID: 461677 [Multi-domain]  Cd Length: 660  Bit Score: 44.73  E-value: 4.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   15 QLQELRRELDRANKNCRILQYRLRKAEQkslkvaetgqvdgeLIRSLEQDlkvakdvsvrlHHELKTVEEKRAKAEDENE 94
Cdd:pfam05557  126 ELQSTNSELEELQERLDLLKAKASEAEQ--------------LRQNLEKQ-----------QSSLAEAEQRIKELEFEIQ 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   95 TLRQQMIEVEISKQAL------QNELERLKESSLKRRSTRE---MYKEKKtfnqddsADLRCQLQfaKEEAflMRKKMAK 165
Cdd:pfam05557  181 SQEQDSEIVKNSKSELaripelEKELERLREHNKHLNENIEnklLLKEEV-------EDLKRKLE--REEK--YREEAAT 249
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  166 LGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPP----STREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMED 241
Cdd:pfam05557  250 LELEKEKLEQELQSWVKLAQDTGLNLRSPEDLSRRieqlQQREIVLKEENSSLTSSARQLEKARRELEQELAQYLKKIED 329

                   ....*..
gi 1370473301  242 MRGQQER 248
Cdd:pfam05557  330 LNKKLKR 336
DUF5401 pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
13-302 6.29e-04

Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.


Pssm-ID: 375164 [Multi-domain]  Cd Length: 722  Bit Score: 44.34  E-value: 6.29e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   13 VYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQ--VDGELIRSLEQDlkvaKDVSVRLHHELKTVEEKRA--- 87
Cdd:pfam17380  277 IVQHQKAVSERQQQEKFEKMEQERLRQEKEEKAREVERRRklEEAEKARQAEMD----RQAAIYAEQERMAMEREREler 352
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   88 -KAED---ENETLRQQMIEVEISKqalQNELERLK--------------ESSLKRRSTREMYKEKKTFNQDDSADLRCQL 149
Cdd:pfam17380  353 iRQEErkrELERIRQEEIAMEISR---MRELERLQmerqqknervrqelEAARKVKILEEERQRKIQQQKVEMEQIRAEQ 429
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  150 QFAKEEAflMRKKMAKLGRE-----KDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANILGRK 224
Cdd:pfam17380  430 EEARQRE--VRRLEEERAREmervrLEEQERQQQVERLRQQEEERKRKKLELEKEKRDRKRAEEQRRKILEKELEERKQA 507
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  225 IVELEVENRGLKAEMEDMRG----QQER---EGPGRDHAPSIPTSPFGDSLESSTELRRHLQFVEEEAELLRRsISEIED 297
Cdd:pfam17380  508 MIEEERKRKLLEKEMEERQKaiyeEERRreaEEERRKQQEMEERRRIQEQMRKATEERSRLEAMEREREMMRQ-IVESEK 586

                   ....*
gi 1370473301  298 HNRQL 302
Cdd:pfam17380  587 ARAEY 591
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
15-182 7.35e-04

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 44.52  E-value: 7.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   15 QLQELRRELDRANKNCRILQYRLRKAEqkslkvAETGQVDGELIRSLEQDLKVAKDvsvRLHHELKTVEEKRAKAEDENE 94
Cdd:COG4913    256 PIRELAERYAAARERLAELEYLRAALR------LWFAQRRLELLEAELEELRAELA---RLEAELERLEARLDALREELD 326
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   95 TLRQQMIEVE-ISKQALQNELERLKESSLKRRSTREMYKE------------KKTFN------QDDSADLRCQLQFAKEE 155
Cdd:COG4913    327 ELEAQIRGNGgDRLEQLEREIERLERELEERERRRARLEAllaalglplpasAEEFAalraeaAALLEALEEELEALEEA 406
                          170       180
                   ....*....|....*....|....*..
gi 1370473301  156 AFLMRKKMAKLGREKDELEQELQKYKS 182
Cdd:COG4913    407 LAEAEAALRDLRRELRELEAEIASLER 433
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
9-311 7.85e-04

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 44.24  E-value: 7.85e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    9 LEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVakdvsvrLHHELKTVEEKRAK 88
Cdd:TIGR04523  365 LEEKQNEIEKLKKENQSYKQEIKNLESQINDLESKIQNQEKLNQQKDEQIKKLQQEKEL-------LEKEIERLKETIIK 437
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   89 AEDENETLRQQMIEVEI-------SKQALQNELERLKESSLKRRSTREmyKEKKTFNQDDSadlrcQLQFAKEEAFLMRK 161
Cdd:TIGR04523  438 NNSEIKDLTNQDSVKELiiknldnTRESLETQLKVLSRSINKIKQNLE--QKQKELKSKEK-----ELKKLNEEKKELEE 510
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  162 KMAKLGREKDELEQELQKYKSLYGDVDSPLptgeaggppSTREAELK-----LRLKLVEEEANILGRKIVELEVENRGLK 236
Cdd:TIGR04523  511 KVKDLTKKISSLKEKIEKLESEKKEKESKI---------SDLEDELNkddfeLKKENLEKEIDEKNKEIEELKQTQKSLK 581
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1370473301  237 A---EMEDMRGQQEREgpgrdhapsiptspfgdslesSTELRRHLQFVEEEAELLRRSISEIEDHNRQLTHELSKFKF 311
Cdd:TIGR04523  582 KkqeEKQELIDQKEKE---------------------KKDLIKEIEEKEKKISSLEKELEKAKKENEKLSSIIKNIKS 638
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
139-368 1.36e-03

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 42.83  E-value: 1.36e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  139 QDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVdsplptgeaggppSTREAELKLRLKLVEEEA 218
Cdd:COG4942     19 ADAAAEAEAELEQLQQEIAELEKELAALKKEEKALLKQLAALERRIAAL-------------ARRIRALEQELAALEAEL 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  219 NILGRKIVELEVENRGLKAEMEDMRGQQEREgpGRDHAPSIPTSP--FGDSLESSTELRRHLQFVEEEAELLRRSISEIE 296
Cdd:COG4942     86 AELEKEIAELRAELEAQKEELAELLRALYRL--GRQPPLALLLSPedFLDAVRRLQYLKYLAPARREQAEELRADLAELA 163
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1370473301  297 DHNRQLTHELSKfkfepprepgwLGEGASPGAGGGAPLQEELKSARLQISELSGKVLKLQHENHALLSNIQR 368
Cdd:COG4942    164 ALRAELEAERAE-----------LEALLAELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEE 224
PTZ00121 PTZ00121
MAEBL; Provisional
10-249 1.39e-03

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 43.59  E-value: 1.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   10 EEDVYQLQELRRELDRANKncrilQYRLRKAEQK----SLKVAETGQVDGELIRSLE----QDLKVAKDVsvRLHHELKT 81
Cdd:PTZ00121  1493 EEAKKKADEAKKAAEAKKK-----ADEAKKAEEAkkadEAKKAEEAKKADEAKKAEEkkkaDELKKAEEL--KKAEEKKK 1565
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   82 VEEKRAKAEDENETLR--------------QQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQ---DDSAD 144
Cdd:PTZ00121  1566 AEEAKKAEEDKNMALRkaeeakkaeearieEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQlkkKEAEE 1645
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  145 LRCQLQFAKEEaflmRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAggppSTREAELKLRLKLVEEEANILGRK 224
Cdd:PTZ00121  1646 KKKAEELKKAE----EENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEA----LKKEAEEAKKAEELKKKEAEEKKK 1717
                          250       260
                   ....*....|....*....|....*
gi 1370473301  225 IVELEVENRGLKAEMEDMRGQQERE 249
Cdd:PTZ00121  1718 AEELKKAEEENKIKAEEAKKEAEED 1742
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
9-471 1.80e-03

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 43.00  E-value: 1.80e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    9 LEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAK 88
Cdd:COG1196    332 LEELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAEAEEELEELAEELLEALRAAAELAAQLEELEEAEEA 411
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   89 AEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQFAKEEAflmRKKMAKLGR 168
Cdd:COG1196    412 LLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEEEEALLELLAELLEEAALLEAAL---AELLEELAE 488
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  169 EKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQER 248
Cdd:COG1196    489 AAARLLLLLEAEADYEGFLEGVKAALLLAGLRGLAGAVAVLIGVEAAYEAALEAALAAALQNIVVEDDEVAAAAIEYLKA 568
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  249 EGPGRDHAPSIPTSPFGDSLESSTELRRHLQFVEEEAELLRRSisEIEDHNRQLTHELSKFKFEPPREPGWLGEGASPGA 328
Cdd:COG1196    569 AKAGRATFLPLDKIRARAALAAALARGAIGAAVDLVASDLREA--DARYYVLGDTLLGRTLVAARLEAALRRAVTLAGRL 646
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  329 GGGAPLQEELKSARLQISELSGKVLKLQHENHALLSNIQRCDLAAHLGLRAPSPRDSDAESDAGKKESDGEESRLPQPKR 408
Cdd:COG1196    647 REVTLEGEGGSAGGSLTGGSRRELLAALLEAEAELEELAERLAEEELELEEALLAEEEEERELAEAEEERLEEELEEEAL 726
                          410       420       430       440       450       460
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1370473301  409 EGPVGGESDSEEMFEktsgfgsgkpSEASEPCPTELLKAREDSEYLVTLKHEAQRLERTVERL 471
Cdd:COG1196    727 EEQLEAEREELLEEL----------LEEEELLEEEALEELPEPPDLEELERELERLEREIEAL 779
COG5022 COG5022
Myosin heavy chain [General function prediction only];
1-183 2.38e-03

Myosin heavy chain [General function prediction only];


Pssm-ID: 227355 [Multi-domain]  Cd Length: 1463  Bit Score: 42.76  E-value: 2.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    1 MEEMRDSYLEEDVYQLQE------LRRELDRANKNCRILQ--------YRLRKAEQKSL-------------KVAETGQV 53
Cdd:COG5022    736 LEDMRDAKLDNIATRIQRairgryLRRRYLQALKRIKKIQviqhgfrlRRLVDYELKWRlfiklqpllsllgSRKEYRSY 815
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   54 DgELIRSLEQDLKVAKDVSVRLHHELKTVEE-------------KRAKAEDENETLRQQMIEVEISK---QALQNELERL 117
Cdd:COG5022    816 L-ACIIKLQKTIKREKKLRETEEVEFSLKAEvliqkfgrslkakKRFSLLKKETIYLQSAQRVELAErqlQELKIDVKSI 894
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1370473301  118 KESSLKRRSTREMYKEkktFNQDDSADLRCQLQFaKEEAFLMRKKMAKLGREKDELEQELQKYKSL 183
Cdd:COG5022    895 SSLKLVNLELESEIIE---LKKSLSSDLIENLEF-KTELIARLKKLLNNIDLEEGPSIEYVKLPEL 956
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
1-297 3.19e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 42.36  E-value: 3.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    1 MEEMRDSYLEEdvyqLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQVDGELIRSLE---------QDLKVAKDV 71
Cdd:PRK03918   298 LSEFYEEYLDE----LREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEeleerhelyEEAKAKKEE 373
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   72 SVRLH------------HELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLK-----------------ESSL 122
Cdd:PRK03918   374 LERLKkrltgltpekleKELEELEKAKEEIEEEISKITARIGELKKEIKELKKAIEELKkakgkcpvcgrelteehRKEL 453
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  123 KRRSTREM---YKEKKTFNQDDSaDLRCQLQ-----FAKEEAFLMRKKMAKLGRE---------KDELEQELQKYKSLYG 185
Cdd:PRK03918   454 LEEYTAELkriEKELKEIEEKER-KLRKELRelekvLKKESELIKLKELAEQLKEleeklkkynLEELEKKAEEYEKLKE 532
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  186 DVDsplptgEAGGPPST------REAELKLRLKLVEEEANILGRKIVEL--EVENRGLKAEMEDMRGQQEREgpgrdhap 257
Cdd:PRK03918   533 KLI------KLKGEIKSlkkeleKLEELKKKLAELEKKLDELEEELAELlkELEELGFESVEELEERLKELE-------- 598
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|
gi 1370473301  258 siptsPFGDSLESSTELRRHLQFVEEEAELLRRSISEIED 297
Cdd:PRK03918   599 -----PFYNEYLELKDAEKELEREEKELKKLEEELDKAFE 633
COG4372 COG4372
Uncharacterized protein, contains DUF3084 domain [Function unknown];
38-296 3.41e-03

Uncharacterized protein, contains DUF3084 domain [Function unknown];


Pssm-ID: 443500 [Multi-domain]  Cd Length: 370  Bit Score: 41.81  E-value: 3.41e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   38 RKAEQKSLKVAETGQVDGELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERL 117
Cdd:COG4372      6 EKVGKARLSLFGLRPKTGILIAALSEQLRKALFELDKLQEELEQLREELEQAREELEQLEEELEQARSELEQLEEELEEL 85
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  118 KEsSLKRRstremykekktfnQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLptgeag 197
Cdd:COG4372     86 NE-QLQAA-------------QAELAQAQEELESLQEEAEELQEELEELQKERQDLEQQRKQLEAQIAELQSEI------ 145
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  198 gppSTREAELK-LRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSPFGDSLESSTELRR 276
Cdd:COG4372    146 ---AEREEELKeLEEQLESLQEELAALEQELQALSEAEAEQALDELLKEANRNAEKEEELAEAEKLIESLPRELAEELLE 222
                          250       260
                   ....*....|....*....|
gi 1370473301  277 HLQFVEEEAELLRRSISEIE 296
Cdd:COG4372    223 AKDSLEAKLGLALSALLDAL 242
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
60-535 3.42e-03

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 42.34  E-value: 3.42e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   60 SLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENETLRQQMIEVE------ISKQALQNELERLKESSLKRRSTREM--- 130
Cdd:TIGR00606  581 SKSKEINQTRDRLAKLNKELASLEQNKNHINNELESKEEQLSSYEdklfdvCGSQDEESDLERLKEEIEKSSKQRAMlag 660
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  131 --------------------------YKEKKTFNQDdSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLY 184
Cdd:TIGR00606  661 atavysqfitqltdenqsccpvcqrvFQTEAELQEF-ISDLQSKLRLAPDKLKSTESELKKKEKRRDEMLGLAPGRQSII 739
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  185 GDVDSPLP-TGEAGGPPSTREAELKLRLklvEEEANILGRKIVELEVENRGLKAEMEDMRGQQEREGPGRDHAPSIPTSP 263
Cdd:TIGR00606  740 DLKEKEIPeLRNKLQKVNRDIQRLKNDI---EEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQ 816
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  264 FGDSLESSTELRrhlQFVEEEAELLRRSISEIE-------DHNRQLTHELSKFKfepprepgwlgegaspgagggaplqe 336
Cdd:TIGR00606  817 GSDLDRTVQQVN---QEKQEKQHELDTVVSKIElnrkliqDQQEQIQHLKSKTN-------------------------- 867
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  337 ELKSARLQISELSGKVLKLQHENHALLSNIQRCDLAAHLGLRAPSPRDSDAESDAGKKESDGEESRLPQPKREGPVggeS 416
Cdd:TIGR00606  868 ELKSEKLQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKKAQDKV---N 944
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  417 DSEEMFEKTSGFGSGKPSEASEPCPTELL-KAREDSEYLVTLKHEAQRLERTVERLITDTDSFlhdaglrggaplpgpgl 495
Cdd:TIGR00606  945 DIKEKVKNIHGYMKDIENKIQDGKDDYLKqKETELNTVNAQLEECEKHQEKINEDMRLMRQDI----------------- 1007
                          490       500       510       520
                   ....*....|....*....|....*....|....*....|
gi 1370473301  496 qGEEEQGEGDQQEPQLLGTINAKMKAFKKELQAFLEQVNR 535
Cdd:TIGR00606 1008 -DTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQ 1046
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
14-191 3.69e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 41.06  E-value: 3.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   14 YQLQELRRELDRANKNCRILQYRLRKAEQKslkvaetgqvdgelIRSLEQDLKVAKDvsvrlhhELKTVEEKRAKAEDEN 93
Cdd:COG1579     10 LDLQELDSELDRLEHRLKELPAELAELEDE--------------LAALEARLEAAKT-------ELEDLEKEIKRLELEI 68
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   94 ETLRQ-------QMIEVEISK--QALQNELERLKesslKRRSTRE------MYK-EKKtfnQDDSADLRCQLQFAKEEaf 157
Cdd:COG1579     69 EEVEArikkyeeQLGNVRNNKeyEALQKEIESLK----RRISDLEdeilelMERiEEL---EEELAELEAELAELEAE-- 139
                          170       180       190
                   ....*....|....*....|....*....|....*...
gi 1370473301  158 lMRKKMAKLGREKDELEQELQKYKS----LYGDVDSPL 191
Cdd:COG1579    140 -LEEKKAELDEELAELEAELEELEAereeLAAKIPPEL 176
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
15-247 4.59e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 41.98  E-value: 4.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   15 QLQELRRELDRANKNCRILQYRLRKAEQKSLK-VAETGQVDGElIRSLEQDLKvakdvsvRLHHELKTVEEKRAKAEDEN 93
Cdd:TIGR02169  295 KIGELEAEIASLERSIAEKERELEDAEERLAKlEAEIDKLLAE-IEELEREIE-------EERKRRDKLTEEYAELKEEL 366
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   94 ETLRQQMIEVEISKQALQNELERLKESslKRRSTREMYKEKKTFN--QDDSADLRCQLQFAKEEAFLMRKKMAKLGREKD 171
Cdd:TIGR02169  367 EDLRAELEEVDKEFAETRDELKDYREK--LEKLKREINELKRELDrlQEELQRLSEELADLNAAIAGIEAKINELEEEKE 444
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  172 ELEQEL----QKYKSLYGDVDsplptgeaggppSTREAELKLRLKL--VEEEANILGRKIVELEVENRGLKAEMEDMRGQ 245
Cdd:TIGR02169  445 DKALEIkkqeWKLEQLAADLS------------KYEQELYDLKEEYdrVEKELSKLQRELAEAEAQARASEERVRGGRAV 512

                   ..
gi 1370473301  246 QE 247
Cdd:TIGR02169  513 EE 514
DUF4201 pfam13870
Domain of unknown function (DUF4201); This is a family of coiled-coil proteins from eukaryotes. ...
16-179 4.73e-03

Domain of unknown function (DUF4201); This is a family of coiled-coil proteins from eukaryotes. The function is not known.


Pssm-ID: 464008 [Multi-domain]  Cd Length: 177  Bit Score: 39.90  E-value: 4.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   16 LQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGQ----VD-----------GELIRSLEQDLKVAKDVSVRLHHELK 80
Cdd:pfam13870    1 MRAKRNELSKLRLELITLKHTLAKIQEKLEQKEELGEgltmIDflqlqienqalNEKIEERNKELKRLKLKVTNTVHALT 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   81 TVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKEsslkrrsTREMYKekktfnqDDSADLRCQL----------Q 150
Cdd:pfam13870   81 HLKEKLHFLSAELSRLKKELRERQELLAKLRKELYRVKL-------ERDKLR-------KQNKKLRQQGgllhvpallhD 146
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1370473301  151 FAKEEAFL--MRKKMAKLGREKDELEQELQK 179
Cdd:pfam13870  147 YDKTKAEVeeKRKSVKKLRRKVKILEMRIKE 177
Myosin_tail_1 pfam01576
Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and ...
15-297 5.07e-03

Myosin tail; The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.


Pssm-ID: 460256 [Multi-domain]  Cd Length: 1081  Bit Score: 41.70  E-value: 5.07e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   15 QLQELRRELDRANKNCRILQYRLRKAEqkslkvAETGQVDGELIRSLEQDLKVAKDVSvRLHHELKTV-----EEKRAK- 88
Cdd:pfam01576  413 QLQELQARLSESERQRAELAEKLSKLQ------SELESVSSLLNEAEGKNIKLSKDVS-SLESQLQDTqellqEETRQKl 485
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   89 --------AEDENETLRQQMIEVEISKQALQNELERLKE--SSLKRR-----STREMYKEKKTFNQDDSADLRCQLQFAK 153
Cdd:pfam01576  486 nlstrlrqLEDERNSLQEQLEEEEEAKRNVERQLSTLQAqlSDMKKKleedaGTLEALEEGKKRLQRELEALTQQLEEKA 565
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  154 EEAFLMRKKMAKLGREKDELEQELQKYKSLYGDV-------DSPLPTGEAGGPPSTRE---AELKLRLKlvEEEANILGR 223
Cdd:pfam01576  566 AAYDKLEKTKNRLQQELDDLLVDLDHQRQLVSNLekkqkkfDQMLAEEKAISARYAEErdrAEAEAREK--ETRALSLAR 643
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  224 -------KIVELEVENRGLKAEMEDMRGQQEREGpgrdhapsiptspfgdslESSTELRRHLQFVEEEAELLRRSISEIE 296
Cdd:pfam01576  644 aleealeAKEELERTNKQLRAEMEDLVSSKDDVG------------------KNVHELERSKRALEQQVEEMKTQLEELE 705

                   .
gi 1370473301  297 D 297
Cdd:pfam01576  706 D 706
DR0291 COG1579
Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General ...
74-241 5.34e-03

Predicted nucleic acid-binding protein DR0291, contains C4-type Zn-ribbon domain [General function prediction only];


Pssm-ID: 441187 [Multi-domain]  Cd Length: 236  Bit Score: 40.29  E-value: 5.34e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   74 RLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEK--KTFNQDDSADLRCQLQF 151
Cdd:COG1579     21 RLEHRLKELPAELAELEDELAALEARLEAAKTELEDLEKEIKRLELEIEEVEARIKKYEEQlgNVRNNKEYEALQKEIES 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  152 AKEEAFLMRKKMAKLGREKDELEQELQKYKSLYgdvdsplptgeaggppSTREAELKLRLKLVEEEANILGRKIVELEVE 231
Cdd:COG1579    101 LKRRISDLEDEILELMERIEELEEELAELEAEL----------------AELEAELEEKKAELDEELAELEAELEELEAE 164
                          170
                   ....*....|
gi 1370473301  232 NRGLKAEMED 241
Cdd:COG1579    165 REELAAKIPP 174
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
74-305 6.02e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 41.44  E-value: 6.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   74 RLHHELKTVEEKRAKAEDENETLRQQMIEVEISKQALQnELERLKESSLKRRSTREMYKEKKTF------NQDDSADLRC 147
Cdd:COG4913    614 ALEAELAELEEELAEAEERLEALEAELDALQERREALQ-RLAEYSWDEIDVASAEREIAELEAElerldaSSDDLAALEE 692
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  148 QLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLYGDVDSPLPTGEAGGPPSTReAELKLRL------KLVEEEANIL 221
Cdd:COG4913    693 QLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELR-ALLEERFaaalgdAVERELRENL 771
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  222 GRKIVELEVENRGLKAEMEDMRGQQEREGPGrdhapsiPTSPFGDSLESSTELRRHLQFVEEE---------AELLRR-S 291
Cdd:COG4913    772 EERIDALRARLNRAEEELERAMRAFNREWPA-------ETADLDADLESLPEYLALLDRLEEDglpeyeerfKELLNEnS 844
                          250
                   ....*....|....
gi 1370473301  292 ISEIEDHNRQLTHE 305
Cdd:COG4913    845 IEFVADLLSKLRRA 858
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
32-409 6.17e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 41.58  E-value: 6.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   32 ILQYRLRK--AEQKslkvaetgqvdgelIRSLEQDLKVAKDVSVRLHHELKTVEEKRAKAEDENEtLRQQMIEVEIS--- 106
Cdd:TIGR02168  167 ISKYKERRkeTERK--------------LERTRENLDRLEDILNELERQLKSLERQAEKAERYKE-LKAELRELELAllv 231
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  107 --KQALQNELERLKESSLKRRSTREMYKEKKTFNQDDSADLRCQLQFAKEEAFLMRKKMAKLGREKDELEQELQKYKSLY 184
Cdd:TIGR02168  232 lrLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERL 311
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  185 GDVDSPLPTGEAggppstREAELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQEregpgrdhapsiptspf 264
Cdd:TIGR02168  312 ANLERQLEELEA------QLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELE----------------- 368
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  265 gdslesstELRRHLQFVEEEAELLRRSISEiedhnrqlthelskfkfepprepgwlgegaspgagggapLQEELKSARLQ 344
Cdd:TIGR02168  369 --------ELESRLEELEEQLETLRSKVAQ---------------------------------------LELQIASLNNE 401
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1370473301  345 ISELSGKVLKLQHENHALLSNIQRCDLAAHLGLRAPSPRDSdAESDAGKKESDGEESRLPQPKRE 409
Cdd:TIGR02168  402 IERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAEL-EELEEELEELQEELERLEEALEE 465
Rootletin pfam15035
Ciliary rootlet component, centrosome cohesion;
31-133 7.19e-03

Ciliary rootlet component, centrosome cohesion;


Pssm-ID: 464459 [Multi-domain]  Cd Length: 190  Bit Score: 39.64  E-value: 7.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   31 RILQYRLRKAEQKSLKVAETGQVDGELIRSLEQDL-----KVAKDVSVRLHHELKTVEEKRAKAED---ENETLRQQMIE 102
Cdd:pfam15035   24 KVLQYKKRCSELEQQLLEKTSELEKTELLLRKLTLeprlqRLEREHSADLEEALIRLEEERQRSESlsqVNSLLREQLEQ 103
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1370473301  103 VEISKQALQNELERL-------------KESSLKRRstREMYKE 133
Cdd:pfam15035  104 ASRANEALREDLQKLtndwerareeleqKESEWRKE--EEAFNE 145
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
9-181 7.38e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 40.91  E-value: 7.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    9 LEEDVYQLQELRRELDRANKNCRILQYRLRKAEQKslkvaetgqvdgelIRSLEQDLKVAkdvsvRLHHELKTVEEKRAK 88
Cdd:COG4717     83 AEEKEEEYAELQEELEELEEELEELEAELEELREE--------------LEKLEKLLQLL-----PLYQELEALEAELAE 143
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   89 AEDENETLRQQM---IEVEISKQALQNELERLKES--SLKRRSTREMYKEKKTFnQDDSADLRCQLQFAKEEAFLMRKKM 163
Cdd:COG4717    144 LPERLEELEERLeelRELEEELEELEAELAELQEEleELLEQLSLATEEELQDL-AEELEELQQRLAELEEELEEAQEEL 222
                          170
                   ....*....|....*...
gi 1370473301  164 AKLGREKDELEQELQKYK 181
Cdd:COG4717    223 EELEEELEQLENELEAAA 240
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
7-184 7.58e-03

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 41.16  E-value: 7.58e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    7 SYLEEdvyQLQELRRELDRANKncRILQYRlrkAEQKSLKVAETGQVDGELIRSLEQDLKVAKDVSVRLHHELKTVEEKR 86
Cdd:COG3206    178 EFLEE---QLPELRKELEEAEA--ALEEFR---QKNGLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAALRAQL 249
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   87 AKAEDEN---------ETLRQQMIEVEISK--------------QALQNELERLKESSLKR-RSTREMYKEKKTFNQDDS 142
Cdd:COG3206    250 GSGPDALpellqspviQQLRAQLAELEAELaelsarytpnhpdvIALRAQIAALRAQLQQEaQRILASLEAELEALQARE 329
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|..
gi 1370473301  143 ADLRCQLQFAKEEAflmrKKMAKLGREKDELEQELQKYKSLY 184
Cdd:COG3206    330 ASLQAQLAQLEARL----AELPELEAELRRLEREVEVARELY 367
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
2-125 8.73e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 41.05  E-value: 8.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301    2 EEMRDSYLEEDVYQLQELRRELDRANKncrilqyRLRKAEQKSLKVAetgqvdgELIRSLEQDLKVAKDVSVRLHHELKt 81
Cdd:COG4913    326 DELEAQIRGNGGDRLEQLEREIERLER-------ELEERERRRARLE-------ALLAALGLPLPASAEEFAALRAEAA- 390
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 1370473301   82 veEKRAKAEDENETLRQQMIEVEISKQALQNELERLKE--SSLKRR 125
Cdd:COG4913    391 --ALLEALEEELEALEEALAEAEAALRDLRRELRELEAeiASLERR 434
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
11-302 9.28e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 40.52  E-value: 9.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   11 EDVYQLQELRRELDRANKNCRILQYRLRKAEQKSLKVAETGqvdgELIRSLEQDLKVAKDVSVRLHHELKTVEEKRAK-A 89
Cdd:COG4717    122 EKLLQLLPLYQELEALEAELAELPERLEELEERLEELRELE----EELEELEAELAELQEELEELLEQLSLATEEELQdL 197
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   90 EDENETLRQQMIEVEISKQALQNELERLKESSLKRRSTREMYKEKKTFNQDD----SADLRCQLQFAKEE---------- 155
Cdd:COG4717    198 AEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAAALEERLKEARllllIAAALLALLGLGGSllsliltiag 277
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  156 ------------AFLMRKKMAKLGREKDEL----------EQELQKYKSLYGdVDSPLPTGEAGGPPST----------- 202
Cdd:COG4717    278 vlflvlgllallFLLLAREKASLGKEAEELqalpaleeleEEELEELLAALG-LPPDLSPEELLELLDRieelqellrea 356
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  203 REAELKLRLKLVEEEANILG-----------RKIVELEVENRGLKAEMEDMRGQQEREGPGRDHAPSIPTspfGDSLES- 270
Cdd:COG4717    357 EELEEELQLEELEQEIAALLaeagvedeeelRAALEQAEEYQELKEELEELEEQLEELLGELEELLEALD---EEELEEe 433
                          330       340       350
                   ....*....|....*....|....*....|..
gi 1370473301  271 STELRRHLQFVEEEAELLRRSISEIEDHNRQL 302
Cdd:COG4717    434 LEELEEELEELEEELEELREELAELEAELEQL 465
YhaN COG4717
Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];
32-249 9.52e-03

Uncharacterized conserved protein YhaN, contains AAA domain [Function unknown];


Pssm-ID: 443752 [Multi-domain]  Cd Length: 641  Bit Score: 40.52  E-value: 9.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301   32 ILQYRLRKAEQKSLKV-AETGQVDGELIRSLEQDLKvakdvsvrlhhELKTVEEKRAKAEDENETLRQQMIEVEISKQAL 110
Cdd:COG4717     46 MLLERLEKEADELFKPqGRKPELNLKELKELEEELK-----------EAEEKEEEYAELQEELEELEEELEELEAELEEL 114
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1370473301  111 QNELERLKesslKRRSTREMYKEKKTFNQDDsADLRCQLQFAKEEaflmRKKMAKLGREKDELEQELQKYKSLygdvdsp 190
Cdd:COG4717    115 REELEKLE----KLLQLLPLYQELEALEAEL-AELPERLEELEER----LEELRELEEELEELEAELAELQEE------- 178
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1370473301  191 LPTGEAGGPPSTREA--ELKLRLKLVEEEANILGRKIVELEVENRGLKAEMEDMRGQQERE 249
Cdd:COG4717    179 LEELLEQLSLATEEElqDLAEELEELQQRLAELEEELEEAQEELEELEEELEQLENELEAA 239
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH