NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1952735391|ref|XP_038316066|]
View 

keratin, type II cytoskeletal 7 [Canis lupus familiaris]

Protein Classification

type II keratin( domain architecture ID 12177255)

type II keratin is an intermediate filament-forming protein that provides mechanical support and fulfills a variety of additional functions in epithelial cells

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Filament pfam00038
Intermediate filament protein;
90-402 1.00e-142

Intermediate filament protein;


:

Pssm-ID: 459643 [Multi-domain]  Cd Length: 313  Bit Score: 410.85  E-value: 1.00e-142
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  90 EEREQIKTLNNRFASFIDKVRFLEQQNKLLETKWALLQEQRSARSSCLPSIFEAHIAGLRRHLEGLQTDGGRLEVELRNM 169
Cdd:pfam00038   1 NEKEQLQELNDRLASYIDKVRFLEQQNKLLETKISELRQKKGAEPSRLYSLYEKEIEDLRRQLDTLTVERARLQLELDNL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 170 QDVLEDFKNKYEDEINRRTAAENEFVVLKKDVDAAYMSKVELEAKVEALNDEINFFKTLYETELAELQSQISDTSVVLSM 249
Cdd:pfam00038  81 RLAAEDFRQKYEDELNLRTSAENDLVGLRKDLDEATLARVDLEAKIESLKEELAFLKKNHEEEVRELQAQVSDTQVNVEM 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 250 DNSRSLDLDGIIAEVKAQYEEMANRSRAETETMYQSKFETLQAQAGKHGDDLRNTRNEIAEMNRAIQRLQAEIDNIKNQR 329
Cdd:pfam00038 161 DAARKLDLTSALAEIRAQYEEIAAKNREEAEEWYQSKLEELQQAAARNGDALRSAKEEITELRRTIQSLEIELQSLKKQK 240
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1952735391 330 TKLEAAIAEAEERGELALKDAHAKQEELEAALQRAKQDMARQLREYQELMNVKLALDIEIATYRKLLEGEESR 402
Cdd:pfam00038 241 ASLERQLAETEERYELQLADYQELISELEAELQETRQEMARQLREYQELLNVKLALDIEIATYRKLLEGEECR 313
Keratin_2_head pfam16208
Keratin type II head;
10-87 5.83e-11

Keratin type II head;


:

Pssm-ID: 465068 [Multi-domain]  Cd Length: 156  Bit Score: 60.82  E-value: 5.83e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  10 FSSRSAAFPG--RGLQVRLSSAR------------PGGFGSRSLLGLGASR---PRVAVRSASGGPGGAG---------- 62
Cdd:pfam16208   2 FSSCSAVVPSrsRRSYSSVSSSRrgggggggggggGGGFGSRSLYNLGGSKsisISVAGGGSRPGSGFGFgggggggfgg 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1952735391  63 --------------------------------------------------IRQVTVNQSLLAPLRVDIDPAIQQV 87
Cdd:pfam16208  82 gfgggggggfgggggfgggfggggyggggfggggfggrggfggppcppggIQEVTVNQSLLQPLNLEIDPEIQRV 156
 
Name Accession Description Interval E-value
Filament pfam00038
Intermediate filament protein;
90-402 1.00e-142

Intermediate filament protein;


Pssm-ID: 459643 [Multi-domain]  Cd Length: 313  Bit Score: 410.85  E-value: 1.00e-142
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  90 EEREQIKTLNNRFASFIDKVRFLEQQNKLLETKWALLQEQRSARSSCLPSIFEAHIAGLRRHLEGLQTDGGRLEVELRNM 169
Cdd:pfam00038   1 NEKEQLQELNDRLASYIDKVRFLEQQNKLLETKISELRQKKGAEPSRLYSLYEKEIEDLRRQLDTLTVERARLQLELDNL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 170 QDVLEDFKNKYEDEINRRTAAENEFVVLKKDVDAAYMSKVELEAKVEALNDEINFFKTLYETELAELQSQISDTSVVLSM 249
Cdd:pfam00038  81 RLAAEDFRQKYEDELNLRTSAENDLVGLRKDLDEATLARVDLEAKIESLKEELAFLKKNHEEEVRELQAQVSDTQVNVEM 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 250 DNSRSLDLDGIIAEVKAQYEEMANRSRAETETMYQSKFETLQAQAGKHGDDLRNTRNEIAEMNRAIQRLQAEIDNIKNQR 329
Cdd:pfam00038 161 DAARKLDLTSALAEIRAQYEEIAAKNREEAEEWYQSKLEELQQAAARNGDALRSAKEEITELRRTIQSLEIELQSLKKQK 240
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1952735391 330 TKLEAAIAEAEERGELALKDAHAKQEELEAALQRAKQDMARQLREYQELMNVKLALDIEIATYRKLLEGEESR 402
Cdd:pfam00038 241 ASLERQLAETEERYELQLADYQELISELEAELQETRQEMARQLREYQELLNVKLALDIEIATYRKLLEGEECR 313
Keratin_2_head pfam16208
Keratin type II head;
10-87 5.83e-11

Keratin type II head;


Pssm-ID: 465068 [Multi-domain]  Cd Length: 156  Bit Score: 60.82  E-value: 5.83e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  10 FSSRSAAFPG--RGLQVRLSSAR------------PGGFGSRSLLGLGASR---PRVAVRSASGGPGGAG---------- 62
Cdd:pfam16208   2 FSSCSAVVPSrsRRSYSSVSSSRrgggggggggggGGGFGSRSLYNLGGSKsisISVAGGGSRPGSGFGFgggggggfgg 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1952735391  63 --------------------------------------------------IRQVTVNQSLLAPLRVDIDPAIQQV 87
Cdd:pfam16208  82 gfgggggggfgggggfgggfggggyggggfggggfggrggfggppcppggIQEVTVNQSLLQPLNLEIDPEIQRV 156
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
84-384 1.97e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 56.99  E-value: 1.97e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391   84 IQQVRREEREQIKTLNNRFASFIDKVRFLEQQNKLLETKWALLQEQRSARSSCLPSIF------EAHIAGLRRHLEGLQT 157
Cdd:TIGR02168  717 LRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEeelaeaEAEIEELEAQIEQLKE 796
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  158 DGGRLEVELRNMQDVLEDFKNKYEDEINRRTAAENEFVVLKKDVDAAYMSKVELEAKVEALNDEINffktLYETELAELQ 237
Cdd:TIGR02168  797 ELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIE----ELEELIEELE 872
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  238 SQisdtsvvlsmdnsrsldLDGIIAEVKAQYEEMANRSRAETETMyqSKFETLQAQAGKHGDDLRNTRNEIAEMNRAIQR 317
Cdd:TIGR02168  873 SE-----------------LEALLNERASLEEALALLRSELEELS--EELRELESKRSELRRELEELREKLAQLELRLEG 933
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1952735391  318 LQAEIDNIKNQrtkleaaiaeAEERGELALKDAHAKQEELEAALQRAKQDMARQLREYQELMNVKLA 384
Cdd:TIGR02168  934 LEVRIDNLQER----------LSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIKELGPVNLA 990
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
117-410 1.80e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 50.71  E-value: 1.80e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 117 KLLETKWALLQEQRSARssclpsifEAHIAGLRRHLEGLQTDGGRLEVELRNMQDVLEDFKNKYEDEINRRTAAENEFVV 196
Cdd:COG1196   235 RELEAELEELEAELEEL--------EAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIAR 306
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 197 LKKDVDAAYMSKVELEAKVEALNDEINffktLYETELAELQSQISDTSVVLSMDNSRSLDLDGIIAEVKAQYEEmanrSR 276
Cdd:COG1196   307 LEERRRELEERLEELEEELAELEEELE----ELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAE----AE 378
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 277 AETETMYQSKFETLQAQAGKHgDDLRNTRNEIAEMNRAIQRLQAEIDNIKNQRTKLEAAIAEAEERGELALKDAHAKQEE 356
Cdd:COG1196   379 EELEELAEELLEALRAAAELA-AQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEE 457
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1952735391 357 LEAALQRAKQDmARQLREYQELMNVKLALDIEIATYRKLLEGEESRLAGDGVGA 410
Cdd:COG1196   458 EEALLELLAEL-LEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGV 510
46 PHA02562
endonuclease subunit; Provisional
107-332 2.90e-06

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 49.63  E-value: 2.90e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 107 DKVRFLEQQNKLLETKWALLQEQrsarssclpsifeahIAGLRRHLEGLQTDGGRLEVELRNMQDVLEDFKNKYEDEINR 186
Cdd:PHA02562  174 DKIRELNQQIQTLDMKIDHIQQQ---------------IKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEE 238
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 187 RTAAENEFVVLKKDVDAAY----MSKVELEAKVEALNDEINFFKTlyETELAELQSQISDTSVVLSMDNSRSLDLDGIIA 262
Cdd:PHA02562  239 LTDELLNLVMDIEDPSAALnklnTAAAKIKSKIEQFQKVIKMYEK--GGVCPTCTQQISEGPDRITKIKDKLKELQHSLE 316
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 263 EVKAQYEEMAnrSRAETETMYQSKFETLQAQAGKHGDDLRNTRNEIAEMNRAIQRLQAEIDNIKNQRTKL 332
Cdd:PHA02562  317 KLDTAIDELE--EIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKL 384
 
Name Accession Description Interval E-value
Filament pfam00038
Intermediate filament protein;
90-402 1.00e-142

Intermediate filament protein;


Pssm-ID: 459643 [Multi-domain]  Cd Length: 313  Bit Score: 410.85  E-value: 1.00e-142
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  90 EEREQIKTLNNRFASFIDKVRFLEQQNKLLETKWALLQEQRSARSSCLPSIFEAHIAGLRRHLEGLQTDGGRLEVELRNM 169
Cdd:pfam00038   1 NEKEQLQELNDRLASYIDKVRFLEQQNKLLETKISELRQKKGAEPSRLYSLYEKEIEDLRRQLDTLTVERARLQLELDNL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 170 QDVLEDFKNKYEDEINRRTAAENEFVVLKKDVDAAYMSKVELEAKVEALNDEINFFKTLYETELAELQSQISDTSVVLSM 249
Cdd:pfam00038  81 RLAAEDFRQKYEDELNLRTSAENDLVGLRKDLDEATLARVDLEAKIESLKEELAFLKKNHEEEVRELQAQVSDTQVNVEM 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 250 DNSRSLDLDGIIAEVKAQYEEMANRSRAETETMYQSKFETLQAQAGKHGDDLRNTRNEIAEMNRAIQRLQAEIDNIKNQR 329
Cdd:pfam00038 161 DAARKLDLTSALAEIRAQYEEIAAKNREEAEEWYQSKLEELQQAAARNGDALRSAKEEITELRRTIQSLEIELQSLKKQK 240
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1952735391 330 TKLEAAIAEAEERGELALKDAHAKQEELEAALQRAKQDMARQLREYQELMNVKLALDIEIATYRKLLEGEESR 402
Cdd:pfam00038 241 ASLERQLAETEERYELQLADYQELISELEAELQETRQEMARQLREYQELLNVKLALDIEIATYRKLLEGEECR 313
Keratin_2_head pfam16208
Keratin type II head;
10-87 5.83e-11

Keratin type II head;


Pssm-ID: 465068 [Multi-domain]  Cd Length: 156  Bit Score: 60.82  E-value: 5.83e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  10 FSSRSAAFPG--RGLQVRLSSAR------------PGGFGSRSLLGLGASR---PRVAVRSASGGPGGAG---------- 62
Cdd:pfam16208   2 FSSCSAVVPSrsRRSYSSVSSSRrgggggggggggGGGFGSRSLYNLGGSKsisISVAGGGSRPGSGFGFgggggggfgg 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1952735391  63 --------------------------------------------------IRQVTVNQSLLAPLRVDIDPAIQQV 87
Cdd:pfam16208  82 gfgggggggfgggggfgggfggggyggggfggggfggrggfggppcppggIQEVTVNQSLLQPLNLEIDPEIQRV 156
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
84-384 1.97e-08

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 56.99  E-value: 1.97e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391   84 IQQVRREEREQIKTLNNRFASFIDKVRFLEQQNKLLETKWALLQEQRSARSSCLPSIF------EAHIAGLRRHLEGLQT 157
Cdd:TIGR02168  717 LRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEeelaeaEAEIEELEAQIEQLKE 796
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  158 DGGRLEVELRNMQDVLEDFKNKYEDEINRRTAAENEFVVLKKDVDAAYMSKVELEAKVEALNDEINffktLYETELAELQ 237
Cdd:TIGR02168  797 ELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIE----ELEELIEELE 872
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  238 SQisdtsvvlsmdnsrsldLDGIIAEVKAQYEEMANRSRAETETMyqSKFETLQAQAGKHGDDLRNTRNEIAEMNRAIQR 317
Cdd:TIGR02168  873 SE-----------------LEALLNERASLEEALALLRSELEELS--EELRELESKRSELRRELEELREKLAQLELRLEG 933
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1952735391  318 LQAEIDNIKNQrtkleaaiaeAEERGELALKDAHAKQEELEAALQRAKQDMARQLREYQELMNVKLA 384
Cdd:TIGR02168  934 LEVRIDNLQER----------LSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIKELGPVNLA 990
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
85-406 7.04e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 51.98  E-value: 7.04e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391   85 QQVRREEREQIKTLNNRFASFIDKVRFLEQQNKLLETKWALLQEQRSARSSCLPSIfEAHIAGLRRHLEGLQTDGGRLEV 164
Cdd:TIGR02168  669 NSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEEL-SRQISALRKDLARLEAEVEQLEE 747
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  165 ELRNMQDVLEDFKNKYEDEINRRTAAENEFVVLKKDvdaaymsKVELEAKVEALNDEINffktLYETELAELQSQISDTS 244
Cdd:TIGR02168  748 RIAQLSKELTELEAEIEELEERLEEAEEELAEAEAE-------IEELEAQIEQLKEELK----ALREALDELRAELTLLN 816
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  245 VVLSMDNSRSLDLDGIIAEVKAQYEEMANRSRAETETMyqskfETLQAqagkhgddlrntrnEIAEMNRAIQRLQAEIDN 324
Cdd:TIGR02168  817 EEAANLRERLESLERRIAATERRLEDLEEQIEELSEDI-----ESLAA--------------EIEELEELIEELESELEA 877
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  325 IKNQRTKLEAAIAEAeergELALKDAHAKQEELEAALQRAKQDMaRQLREYQELMNVKLA-LDIEIATYRKLLEGEESRL 403
Cdd:TIGR02168  878 LLNERASLEEALALL----RSELEELSEELRELESKRSELRREL-EELREKLAQLELRLEgLEVRIDNLQERLSEEYSLT 952

                   ...
gi 1952735391  404 AGD 406
Cdd:TIGR02168  953 LEE 955
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
117-410 1.80e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 50.71  E-value: 1.80e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 117 KLLETKWALLQEQRSARssclpsifEAHIAGLRRHLEGLQTDGGRLEVELRNMQDVLEDFKNKYEDEINRRTAAENEFVV 196
Cdd:COG1196   235 RELEAELEELEAELEEL--------EAELEELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIAR 306
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 197 LKKDVDAAYMSKVELEAKVEALNDEINffktLYETELAELQSQISDTSVVLSMDNSRSLDLDGIIAEVKAQYEEmanrSR 276
Cdd:COG1196   307 LEERRRELEERLEELEEELAELEEELE----ELEEELEELEEELEEAEEELEEAEAELAEAEEALLEAEAELAE----AE 378
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 277 AETETMYQSKFETLQAQAGKHgDDLRNTRNEIAEMNRAIQRLQAEIDNIKNQRTKLEAAIAEAEERGELALKDAHAKQEE 356
Cdd:COG1196   379 EELEELAEELLEALRAAAELA-AQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAEEEAELEEE 457
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....
gi 1952735391 357 LEAALQRAKQDmARQLREYQELMNVKLALDIEIATYRKLLEGEESRLAGDGVGA 410
Cdd:COG1196   458 EEALLELLAEL-LEEAALLEAALAELLEELAEAAARLLLLLEAEADYEGFLEGV 510
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
149-404 2.14e-06

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 50.32  E-value: 2.14e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 149 RRHLEgLQTDGGRLEVELRNMQdvLEDFKNKYEDEINRRTAAENEFVVLKKDVDAAYMSKVELEAKVEALNDEINffktL 228
Cdd:COG1196   213 ERYRE-LKEELKELEAELLLLK--LRELEAELEELEAELEELEAELEELEAELAELEAELEELRLELEELELELE----E 285
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 229 YETELAELQSQISDTSVVLSMDNSRSLDLDGIIAEVKAQYEEMANRSRAETETMyqskfETLQAQAGKHGDDLRNTRNEI 308
Cdd:COG1196   286 AQAEEYELLAELARLEQDIARLEERRRELEERLEELEEELAELEEELEELEEEL-----EELEEELEEAEEELEEAEAEL 360
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 309 AEMNRAIQRLQAEIDNIKNQRTKLEAAIAEAeergELALKDAHAKQEELEAALQRAKQDMARQLREYQELMNVKLALDIE 388
Cdd:COG1196   361 AEAEEALLEAEAELAEAEEELEELAEELLEA----LRAAAELAAQLEELEEAEEALLERLERLEEELEELEEALAELEEE 436
                         250
                  ....*....|....*.
gi 1952735391 389 IATYRKLLEGEESRLA 404
Cdd:COG1196   437 EEEEEEALEEAAEEEA 452
46 PHA02562
endonuclease subunit; Provisional
107-332 2.90e-06

endonuclease subunit; Provisional


Pssm-ID: 222878 [Multi-domain]  Cd Length: 562  Bit Score: 49.63  E-value: 2.90e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 107 DKVRFLEQQNKLLETKWALLQEQrsarssclpsifeahIAGLRRHLEGLQTDGGRLEVELRNMQDVLEDFKNKYEDEINR 186
Cdd:PHA02562  174 DKIRELNQQIQTLDMKIDHIQQQ---------------IKTYNKNIEEQRKKNGENIARKQNKYDELVEEAKTIKAEIEE 238
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 187 RTAAENEFVVLKKDVDAAY----MSKVELEAKVEALNDEINFFKTlyETELAELQSQISDTSVVLSMDNSRSLDLDGIIA 262
Cdd:PHA02562  239 LTDELLNLVMDIEDPSAALnklnTAAAKIKSKIEQFQKVIKMYEK--GGVCPTCTQQISEGPDRITKIKDKLKELQHSLE 316
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 263 EVKAQYEEMAnrSRAETETMYQSKFETLQAQAGKHGDDLRNTRNEIAEMNRAIQRLQAEIDNIKNQRTKL 332
Cdd:PHA02562  317 KLDTAIDELE--EIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAEFVDNAEELAKL 384
CALCOCO1 pfam07888
Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are ...
88-379 6.42e-06

Calcium binding and coiled-coil domain (CALCOCO1) like; Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.


Pssm-ID: 462303 [Multi-domain]  Cd Length: 488  Bit Score: 48.35  E-value: 6.42e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  88 RREEREQIKTLNNRFASFIDKVRFLEQQNKLLETKWALLQEQRSArsscLPSIFEAHIAGLRRHLEGLQTDGGRL---EV 164
Cdd:pfam07888  75 RRELESRVAELKEELRQSREKHEELEEKYKELSASSEELSEEKDA----LLAQRAAHEARIRELEEDIKTLTQRVlerET 150
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 165 ELRNMQDVLEDFKN-KYEDEINRRT------AAENEFVVLKKDVDAAYMSKVELEAKVEALNDEINFFKTL------YET 231
Cdd:pfam07888 151 ELERMKERAKKAGAqRKEEEAERKQlqaklqQTEEELRSLSKEFQELRNSLAQRDTQVLQLQDTITTLTQKlttahrKEA 230
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 232 ELAELQSQISDTSVVLSMDNSRSLDLDGIIAEVKAQyeemanRSRAETEtMYQSKFETLQ----------------AQAG 295
Cdd:pfam07888 231 ENEALLEELRSLQERLNASERKVEGLGEELSSMAAQ------RDRTQAE-LHQARLQAAQltlqladaslalregrARWA 303
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 296 KHGDDLRNT----RNEIAEMNRAIQRLQAEIDNIKNQRTKLEAAIAEAEERGELALKDAHAKQEELEAALQRAKQDMARQ 371
Cdd:pfam07888 304 QERETLQQSaeadKDRIEKLSAELQRLEERLQEERMEREKLEVELGREKDCNRVQLSESRRELQELKASLRVAQKEKEQL 383

                  ....*...
gi 1952735391 372 LREYQELM 379
Cdd:pfam07888 384 QAEKQELL 391
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
253-404 1.55e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 47.74  E-value: 1.55e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  253 RSLDLDGIIAEVKAQYEEMANRSraETETMYQSKFETLQAQAGKHGDDLRNTRNEIAEMNRAIQRLQAEIDNIKNQRTKL 332
Cdd:TIGR02168  223 RELELALLVLRLEELREELEELQ--EELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRL 300
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1952735391  333 EAAIAEAEERGELA---LKDAHAKQEELEAALQRAKQDMARQLREYQELMNVKLALDIEIATYRKLLEGEESRLA 404
Cdd:TIGR02168  301 EQQKQILRERLANLerqLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLE 375
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
110-403 8.32e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 45.43  E-value: 8.32e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  110 RFLEQQNKLLETKWALL----QEQRSARSSclpsiFEAHIAGLRRHLEGLQTDGGRLEVELrnmqDVLEDFKNKYEDEIN 185
Cdd:TIGR02168  214 RYKELKAELRELELALLvlrlEELREELEE-----LQEELKEAEEELEELTAELQELEEKL----EELRLEVSELEEEIE 284
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  186 RRTAAENEFVVLKKDVDaayMSKVELEAKVEALNDEInffkTLYETELAELQSQISDTSVVLSMDNSRSLDLDGIIAEVK 265
Cdd:TIGR02168  285 ELQKELYALANEISRLE---QQKQILRERLANLERQL----EELEAQLEELESKLDELAEELAELEEKLEELKEELESLE 357
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  266 AQYEEmANRSRAETETMYQSKFETLQAQAGK---HGDDLRNTRNEIAEMNRAIQRLQAEIDNIKNQRTKLEAAIAEAEE- 341
Cdd:TIGR02168  358 AELEE-LEAELEELESRLEELEEQLETLRSKvaqLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEEAELk 436
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1952735391  342 --RGELALKDahAKQEELEAALQRAKQDMARQLREYQELMNVKLALDIEIATYRKLLEGEESRL 403
Cdd:TIGR02168  437 elQAELEELE--EELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQ 498
EnvC COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
161-374 1.03e-04

Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 443969 [Multi-domain]  Cd Length: 377  Bit Score: 44.37  E-value: 1.03e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 161 RLEVELRNMQDVLEDFKNKYEDEINRRTAAENEFVVLKKDVDAAYMSKVELEAKVEALNDEINffktLYETELAELQSQI 240
Cdd:COG4942    31 QLQQEIAELEKELAALKKEEKALLKQLAALERRIAALARRIRALEQELAALEAELAELEKEIA----ELRAELEAQKEEL 106
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 241 SDTSVVLSMdNSRSLDLDGII-------AEVKAQYEEMANRSRAETETMYQSKFETLQAQAgkhgDDLRNTRNEIAEMNR 313
Cdd:COG4942   107 AELLRALYR-LGRQPPLALLLspedfldAVRRLQYLKYLAPARREQAEELRADLAELAALR----AELEAERAELEALLA 181
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1952735391 314 AIQRLQAEIDNIKNQRTKLEAAIAEAEERGELALKDAHAKQEELEAALQRAKQDMARQLRE 374
Cdd:COG4942   182 ELEEERAALEALKAERQKLLARLEKELAELAAELAELQQEAEELEALIARLEAEAAAAAER 242
PRK01156 PRK01156
chromosome segregation protein; Provisional
77-400 1.19e-03

chromosome segregation protein; Provisional


Pssm-ID: 100796 [Multi-domain]  Cd Length: 895  Bit Score: 41.43  E-value: 1.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  77 RVDIDPAIQQVRREE-REQIKTLNNRFASFIDKVRFLEQQNKLLETKWALLqEQRSARSSCLPSIFEAHIAGLRRH---- 151
Cdd:PRK01156  399 IQEIDPDAIKKELNEiNVKLQDISSKVSSLNQRIRALRENLDELSRNMEML-NGQSVCPVCGTTLGEEKSNHIINHynek 477
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 152 LEGLQTDGGRLEVELRNMQDVLEDFKNKYE----DEINRRTAAENEFVVLKKDVDAAYMSKVELEAKVEALNDEINFFKT 227
Cdd:PRK01156  478 KSRLEEKIREIEIEVKDIDEKIVDLKKRKEylesEEINKSINEYNKIESARADLEDIKIKINELKDKHDKYEEIKNRYKS 557
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 228 LyetELAELQSQisDTSVVLSMDNSRSLDLDGIIA---EVKAQYEEMANRSRaETETMYQSKFETLQAQAGKHGDDLRNT 304
Cdd:PRK01156  558 L---KLEDLDSK--RTSWLNALAVISLIDIETNRSrsnEIKKQLNDLESRLQ-EIEIGFPDDKSYIDKSIREIENEANNL 631
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 305 R---NEIAEMNRAIQRLQAEIDNIKNQRTKLEAAIAEAEERGELALkDAHAKQEELEAALQRAKQDMARQLREYQELMNV 381
Cdd:PRK01156  632 NnkyNEIQENKILIEKLRGKIDNYKKQIAEIDSIIPDLKEITSRIN-DIEDNLKKSRKALDDAKANRARLESTIEILRTR 710
                         330
                  ....*....|....*....
gi 1952735391 382 KLALDIEIATYRKLLEGEE 400
Cdd:PRK01156  711 INELSDRINDINETLESMK 729
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
210-416 1.47e-03

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 41.16  E-value: 1.47e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 210 ELEAKVEALNDEINFFKTlyETELAELQSQISDTSVVLSMDNSRSLDLDGIIAEVKAQYEE----MANRSRAETETMYQS 285
Cdd:COG3206   186 ELRKELEEAEAALEEFRQ--KNGLVDLSEEAKLLLQQLSELESQLAEARAELAEAEARLAAlraqLGSGPDALPELLQSP 263
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 286 KFETLQAQagkhgddLRNTRNEIAEMNR-------AIQRLQAEIDNIKNQrtkLEAAIAEAEERGELALKDAHAKQEELE 358
Cdd:COG3206   264 VIQQLRAQ-------LAELEAELAELSArytpnhpDVIALRAQIAALRAQ---LQQEAQRILASLEAELEALQAREASLQ 333
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1952735391 359 AALQRAKQDMARQLREYQELMNVKLALDIEIATYRKLLEG-EESRLAGDGVGAvNISVV 416
Cdd:COG3206   334 AQLAQLEARLAELPELEAELRRLEREVEVARELYESLLQRlEEARLAEALTVG-NVRVI 391
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
145-404 1.74e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 40.82  E-value: 1.74e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  145 IAGLRRHLEGLQTDGGRLEVELRNMQDVLEDFKNKYEDEINRRTAAENEFVVLKKDVDAAYMSKVELEAKVEALNDEINF 224
Cdd:TIGR02169  655 MTGGSRAPRGGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEK 734
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  225 FKTlyetELAELQSQISDTSVVLSMDNSRSLDLDGIIAEVKAQYEEmanrsraetetmYQSKFETLQAQAGKHGddlrnt 304
Cdd:TIGR02169  735 LKE----RLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHK------------LEEALNDLEARLSHSR------ 792
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  305 rneiaemnraIQRLQAEIDNIKNQRTKLEaaiaeaeergelalkdahAKQEELEAALQRAKQDMARQLREYQELMNVKLA 384
Cdd:TIGR02169  793 ----------IPEIQAELSKLEEEVSRIE------------------ARLREIEQKLNRLTLEKEYLEKEIQELQEQRID 844
                          250       260
                   ....*....|....*....|
gi 1952735391  385 LDIEIATYRKLLEGEESRLA 404
Cdd:TIGR02169  845 LKEQIKSIEKEIENLNGKKE 864
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
83-386 1.92e-03

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 40.87  E-value: 1.92e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391   83 AIQQVRREEREQIKTLNNRFASFIDKVRFL--EQQNKLletkwALLQEQRSARSSCLPSIFEAHIAGLRRHLEGLQTDGG 160
Cdd:pfam15921  221 AISKILRELDTEISYLKGRIFPVEDQLEALksESQNKI-----ELLLQQHQDRIEQLISEHEVEITGLTEKASSARSQAN 295
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  161 RLEVELRNMQDvledfknkyedeinrrtAAENEfvvlkkdvDAAYMSKV-ELEAKVEALNDEINFFKTLYETELAELQSQ 239
Cdd:pfam15921  296 SIQSQLEIIQE-----------------QARNQ--------NSMYMRQLsDLESTVSQLRSELREAKRMYEDKIEELEKQ 350
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  240 ISDTSVVLSMDNSRSLDLDGIIAEVKAQYEE-MANRSRAETE-TMYQSKFETLQAQAGKHGDDLRNTRNEIAEMNRAIQR 317
Cdd:pfam15921  351 LVLANSELTEARTERDQFSQESGNLDDQLQKlLADLHKREKElSLEKEQNKRLWDRDTGNSITIDHLRRELDDRNMEVQR 430
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  318 LQAEIDNIKNQ-RTKLEAAIAEAEERGElalkdAHAKQEELEAALQRAKQdMARQLREyqELMNVKLALD 386
Cdd:pfam15921  431 LEALLKAMKSEcQGQMERQMAAIQGKNE-----SLEKVSSLTAQLESTKE-MLRKVVE--ELTAKKMTLE 492
PRK09039 PRK09039
peptidoglycan -binding protein;
230-393 1.96e-03

peptidoglycan -binding protein;


Pssm-ID: 181619 [Multi-domain]  Cd Length: 343  Bit Score: 40.33  E-value: 1.96e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 230 ETELAELQSQISDTSVVLSMDNSRSLDLDGIIAEVKAQYEEM-ANRSRAETETMYQSKFET-LQAQAGKHGDDLRNTRNE 307
Cdd:PRK09039   52 DSALDRLNSQIAELADLLSLERQGNQDLQDSVANLRASLSAAeAERSRLQALLAELAGAGAaAEGRAGELAQELDSEKQV 131
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 308 IAEMNRAIQRLQAEIDNIKNQRTKLeaaiaeaeergELALKDAHAKQEELEAALqrakQDMARQLreyqelmNVKLALDI 387
Cdd:PRK09039  132 SARALAQVELLNQQIAALRRQLAAL-----------EAALDASEKRDRESQAKI----ADLGRRL-------NVALAQRV 189

                  ....*..
gi 1952735391 388 -EIATYR 393
Cdd:PRK09039  190 qELNRYR 196
rad50 TIGR00606
rad50; All proteins in this family for which functions are known are involvedin recombination, ...
78-331 2.40e-03

rad50; All proteins in this family for which functions are known are involvedin recombination, recombinational repair, and/or non-homologous end joining.They are components of an exonuclease complex with MRE11 homologs. This family is distantly related to the SbcC family of bacterial proteins.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).


Pssm-ID: 129694 [Multi-domain]  Cd Length: 1311  Bit Score: 40.42  E-value: 2.40e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391   78 VDIDPAIQQVRREEREQIKTLNNRFASFIDKVRFLEQQNKLLETKWALLQEQRSARSSCLPSI-----FEAHIAGLRRHL 152
Cdd:TIGR00606  818 SDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEKLQIGTNLqrrqqFEEQLVELSTEV 897
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  153 EGLQTDGGRLEVELRNMQDVLEDFKNKYEDEINRRTA----AENEFVVLKKDVDAAYMSKVELEAKVEALNDEinfFKTL 228
Cdd:TIGR00606  898 QSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETsnkkAQDKVNDIKEKVKNIHGYMKDIENKIQDGKDD---YLKQ 974
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  229 YETELAELQSQISDTSVVLSMDNSrslDLDGIIAEVKAQYEEMANRSRAETETMYQSKFETLQAQAGKHGDDLRNTRneI 308
Cdd:TIGR00606  975 KETELNTVNAQLEECEKHQEKINE---DMRLMRQDIDTQKIQERWLQDNLTLRKRENELKEVEEELKQHLKEMGQMQ--V 1049
                          250       260
                   ....*....|....*....|...
gi 1952735391  309 AEMNRAIQRLQAEIDNIKNQRTK 331
Cdd:TIGR00606 1050 LQMKQEHQKLEENIDLIKRNHVL 1072
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
84-332 2.46e-03

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 40.39  E-value: 2.46e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  84 IQQVRREEREQIKTLNNRFASFIDKVRFLEQQNKLLETKwalLQEQrsarssclpsifEAHIAGLRRHLEGLQTDGGRLE 163
Cdd:TIGR04523 361 KQRELEEKQNEIEKLKKENQSYKQEIKNLESQINDLESK---IQNQ------------EKLNQQKDEQIKKLQQEKELLE 425
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 164 VELRNmqdvLEDFKNKYEDEINRrtaAENEFVVLKKDVDAAYMSKVELEAKVEALNDEINFFKTLYETELAELQSQISDt 243
Cdd:TIGR04523 426 KEIER----LKETIIKNNSEIKD---LTNQDSVKELIIKNLDNTRESLETQLKVLSRSINKIKQNLEQKQKELKSKEKE- 497
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 244 svvLSMDNSRSLDLDGIIAEVKAQYEEMANRSRAETETMYQSKFETLQAQAGKHGDDLRNTRN----EIAEMNRAIQRLQ 319
Cdd:TIGR04523 498 ---LKKLNEEKKELEEKVKDLTKKISSLKEKIEKLESEKKEKESKISDLEDELNKDDFELKKEnlekEIDEKNKEIEELK 574
                         250
                  ....*....|...
gi 1952735391 320 AEIDNIKNQRTKL 332
Cdd:TIGR04523 575 QTQKSLKKKQEEK 587
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
93-400 3.71e-03

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 40.05  E-value: 3.71e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  93 EQIKTLNNRFASFIDKVRFLEQQNKLLETKWALLqeqrSARSSCLPSIFEAhIAGLRRHLEGLQTDGGRLEVELRNMQDV 172
Cdd:PRK03918  193 ELIKEKEKELEEVLREINEISSELPELREELEKL----EKEVKELEELKEE-IEELEKELESLEGSKRKLEEKIRELEER 267
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 173 LEDFKNKYED-EINRRTAAEnefvvLKKDVDaAYMSKVELEAKVEALNDEINFFKTLYETELAELQSQISDtsvvLSMDN 251
Cdd:PRK03918  268 IEELKKEIEElEEKVKELKE-----LKEKAE-EYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKE----LEEKE 337
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 252 SRSLDLDGIIAEVKAQYEEMANRSRaETETMYQSKFETLQAQAGKHGDDLRNTRNEIAEMNRAIQRLQAEIDNIKNQRTK 331
Cdd:PRK03918  338 ERLEELKKKLKELEKRLEELEERHE-LYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITARIGE 416
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1952735391 332 LEAAIAEAEERGElALKDAHAK---------QEELEAALQRAKQDMARQLREYQELMNVKLALDIEIATYRKLLEGEE 400
Cdd:PRK03918  417 LKKEIKELKKAIE-ELKKAKGKcpvcgreltEEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKES 493
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
107-393 5.50e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 39.28  E-value: 5.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  107 DKVRFLEQQNKLLETKWALLQEQRSARSSCLPSIFE-AHIAGLRRHLEGLQTDGGRLEVELRNMQDVLEDFKNKYEDEIN 185
Cdd:TIGR02169  637 GKYRMVTLEGELFEKSGAMTGGSRAPRGGILFSRSEpAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASR 716
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  186 RRTAAENEFVVLKKDVDAAYMSKVELEAKVEALNDEInffkTLYETELAELQSQISdtsvvlsmdnsrslDLDGIIAEVK 265
Cdd:TIGR02169  717 KIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEI----ENVKSELKELEARIE--------------ELEEDLHKLE 778
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  266 AQYEEMANRSRaetetmyQSKFETLQAQAGKHGDDLRNTRNEIAEMNRAIQR-------LQAEIDNIKNQRTKLEAAIAE 338
Cdd:TIGR02169  779 EALNDLEARLS-------HSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRltlekeyLEKEIQELQEQRIDLKEQIKS 851
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1952735391  339 AEERGELALKDAHAKQEELE-------------AALQRAKQDMARQLREYQELMNvKLALDIEIATYR 393
Cdd:TIGR02169  852 IEKEIENLNGKKEELEEELEeleaalrdlesrlGDLKKERDELEAQLRELERKIE-ELEAQIEKKRKR 918
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
230-403 5.85e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 39.13  E-value: 5.85e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  230 ETELAELQSQISDtsvvlsmdnsrsldLDGIIAEVKAQYEEMANRSRA----ETETMYQSKFETLQAQAGKHGD---DLR 302
Cdd:COG4913    616 EAELAELEEELAE--------------AEERLEALEAELDALQERREAlqrlAEYSWDEIDVASAEREIAELEAeleRLD 681
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  303 NTRNEIAEMNRAIQRLQAEIDNIKNQRTKLEAAIAEAEERGELALKDAHAKQEELEAALQRAKQDMARQLRE-YQELM-- 379
Cdd:COG4913    682 ASSDDLAALEEQLEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDRLEAAEDLARLELRALLEErFAAALgd 761
                          170       180
                   ....*....|....*....|....*...
gi 1952735391  380 ----NVKLALDIEIATYRKLLEGEESRL 403
Cdd:COG4913    762 averELRENLEERIDALRARLNRAEEEL 789
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
83-328 7.69e-03

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 38.90  E-value: 7.69e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391   83 AIQQVRREEREQIKTLNNRFASFIDKVRFLEQQNKLLETKWALLQEQRSARssclpsifEAHIAGLRRHLEGLQTDGGRL 162
Cdd:TIGR02169  802 KLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSI--------EKEIENLNGKKEELEEELEEL 873
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  163 EVELRNMQDVLEDFKNKYEDEINRRTAAENEFVVLKKDVDAAYMSKVELEAKVEALNDEInffkTLYETELAELQSQISD 242
Cdd:TIGR02169  874 EAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEEL----SEIEDPKGEDEEIPEE 949
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  243 TsvvlsmdnsrsLDLDGIIAEVKAQYEEMAN----RSRAETEtmYQSKFETLqaqagkhgDDLRNTRNEIAEMNRAIQRL 318
Cdd:TIGR02169  950 E-----------LSLEDVQAELQRVEEEIRAlepvNMLAIQE--YEEVLKRL--------DELKEKRAKLEEERKAILER 1008
                          250
                   ....*....|
gi 1952735391  319 QAEIDNIKNQ 328
Cdd:TIGR02169 1009 IEEYEKKKRE 1018
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
84-406 8.10e-03

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 38.95  E-value: 8.10e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391   84 IQQVRREEREQIKTLNNRFASFIDKVRFLEQQNKLLETKWALLQEQRSARSSCLPSIfeAHIAGLRRHLEGLQTDGGRLE 163
Cdd:pfam15921  477 LRKVVEELTAKKMTLESSERTVSDLTASLQEKERAIEATNAEITKLRSRVDLKLQEL--QHLKNEGDHLRNVQTECEALK 554
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  164 VELRNMQDVLEDFKNKYEDEI-----NRRTAAenefvvlkkdvdAAYMSKVELEAKVEALNDEINFFKTLYE---TELAE 235
Cdd:pfam15921  555 LQMAEKDKVIEILRQQIENMTqlvgqHGRTAG------------AMQVEKAQLEKEINDRRLELQEFKILKDkkdAKIRE 622
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  236 LQSQISDTSV-VLSMDNSRSLDLDGIiAEVKAQYEEMANR---SRAETETMYQsKFETLQaqagkhgddlRNTRNEIAEM 311
Cdd:pfam15921  623 LEARVSDLELeKVKLVNAGSERLRAV-KDIKQERDQLLNEvktSRNELNSLSE-DYEVLK----------RNFRNKSEEM 690
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391  312 NRAIQRLQAEIDNIKNQRTKLEAAIAEAEERGELALKDAHAKQEE----------LEAALQRAKQDMARQLREYQELMNV 381
Cdd:pfam15921  691 ETTTNKLKMQLKSAQSELEQTRNTLKSMEGSDGHAMKVAMGMQKQitakrgqidaLQSKIQFLEEAMTNANKEKHFLKEE 770
                          330       340
                   ....*....|....*....|....*
gi 1952735391  382 KLALDIEIATyrklLEGEESRLAGD 406
Cdd:pfam15921  771 KNKLSQELST----VATEKNKMAGE 791
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
164-403 8.81e-03

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 38.46  E-value: 8.81e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 164 VELRNMQDVLEDFKNKYEDEINRRTAAENEFVVLKKDVD--------------AAYMSKVELEAKVE---ALNDEINFFK 226
Cdd:TIGR04523 145 TEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEKLniqknidkiknkllKLELLLSNLKKKIQknkSLESQISELK 224
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 227 ---TLYETELAELQSQISDTSVVLSMDNSRSLDLDGIIAEVKAQYE------EMANRSRAETETM---YQSKFETL--QA 292
Cdd:TIGR04523 225 kqnNQLKDNIEKKQQEINEKTTEISNTQTQLNQLKDEQNKIKKQLSekqkelEQNNKKIKELEKQlnqLKSEISDLnnQK 304
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952735391 293 QAGKHGD----------DLRNTRNEIAEMNRAIQRLQAEIDNIKNQRT-------KLEAAIAEAEERGELALKDAHAKQE 355
Cdd:TIGR04523 305 EQDWNKElkselknqekKLEEIQNQISQNNKIISQLNEQISQLKKELTnsesensEKQRELEEKQNEIEKLKKENQSYKQ 384
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1952735391 356 ELEaALQRAKQDMARQLREYQELMNVKlalDIEIATYRK---LLEGEESRL 403
Cdd:TIGR04523 385 EIK-NLESQINDLESKIQNQEKLNQQK---DEQIKKLQQekeLLEKEIERL 431
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH