NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|50400333|sp|Q6NVD9|]
View 

RecName: Full=Phakinin; AltName: Full=49 kDa cytoskeletal protein; AltName: Full=Beaded filament structural protein 2; AltName: Full=Lens fiber cell beaded filament protein CP 47; Short=CP47; AltName: Full=Lens fiber cell beaded filament protein CP 49; Short=CP49; AltName: Full=Lens intermediate filament-like light; Short=LIFL-L

Protein Classification

intermediate filament family protein( domain architecture ID 705869)

intermediate filament (IF) family protein is a primordial component of the cytoskeleton and the nuclear envelope; such as type I keratins

CATH:  1.20.5.170
Gene Ontology:  GO:0005882

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Filament super family cl25641
Intermediate filament protein;
116-416 1.24e-57

Intermediate filament protein;


The actual alignment was detected with superfamily member pfam00038:

Pssm-ID: 459643 [Multi-domain]  Cd Length: 313  Bit Score: 191.29  E-value: 1.24e-57
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333   116 LVEYMTKVHALEQVSQELETQLRAHLESK-AKSSGGWDALRASWASSYQQVGEAVLENARLLLQMETIQAGADDFKERYE 194
Cdd:pfam00038  13 LASYIDKVRFLEQQNKLLETKISELRQKKgAEPSRLYSLYEKEIEDLRRQLDTLTVERARLQLELDNLRLAAEDFRQKYE 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333   195 NEQPFRKAAEEEVSSLYKVIDEANLTKTDLEHQIESLKEELGFLSRSYEEDVKVLYKQLA-GSELEQADVPMGTGLDDVL 273
Cdd:pfam00038  93 DELNLRTSAENDLVGLRKDLDEATLARVDLEAKIESLKEELAFLKKNHEEEVRELQAQVSdTQVNVEMDAARKLDLTSAL 172
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333   274 ETIRVQWERDVEKNRAEAGALLQAKqqTEVVHVSQTQEEKLAAALSVELHDTSRQVQSLQAETESLRALKRGLENSLHDA 353
Cdd:pfam00038 173 AEIRAQYEEIAAKNREEAEEWYQSK--LEELQQAAARNGDALRSAKEEITELRRTIQSLEIELQSLKKQKASLERQLAET 250
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 50400333   354 QHWHDMELQNLGAVVGRLEAELAEIRSETEQQQQERAHLLACKSQLQKDVASYHALLDREENN 416
Cdd:pfam00038 251 EERYELQLADYQELISELEAELQETRQEMARQLREYQELLNVKLALDIEIATYRKLLEGEECR 313
 
Name Accession Description Interval E-value
Filament pfam00038
Intermediate filament protein;
116-416 1.24e-57

Intermediate filament protein;


Pssm-ID: 459643 [Multi-domain]  Cd Length: 313  Bit Score: 191.29  E-value: 1.24e-57
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333   116 LVEYMTKVHALEQVSQELETQLRAHLESK-AKSSGGWDALRASWASSYQQVGEAVLENARLLLQMETIQAGADDFKERYE 194
Cdd:pfam00038  13 LASYIDKVRFLEQQNKLLETKISELRQKKgAEPSRLYSLYEKEIEDLRRQLDTLTVERARLQLELDNLRLAAEDFRQKYE 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333   195 NEQPFRKAAEEEVSSLYKVIDEANLTKTDLEHQIESLKEELGFLSRSYEEDVKVLYKQLA-GSELEQADVPMGTGLDDVL 273
Cdd:pfam00038  93 DELNLRTSAENDLVGLRKDLDEATLARVDLEAKIESLKEELAFLKKNHEEEVRELQAQVSdTQVNVEMDAARKLDLTSAL 172
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333   274 ETIRVQWERDVEKNRAEAGALLQAKqqTEVVHVSQTQEEKLAAALSVELHDTSRQVQSLQAETESLRALKRGLENSLHDA 353
Cdd:pfam00038 173 AEIRAQYEEIAAKNREEAEEWYQSK--LEELQQAAARNGDALRSAKEEITELRRTIQSLEIELQSLKKQKASLERQLAET 250
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 50400333   354 QHWHDMELQNLGAVVGRLEAELAEIRSETEQQQQERAHLLACKSQLQKDVASYHALLDREENN 416
Cdd:pfam00038 251 EERYELQLADYQELISELEAELQETRQEMARQLREYQELLNVKLALDIEIATYRKLLEGEECR 313
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
131-415 1.00e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 54.29  E-value: 1.00e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    131 QELETQLRAHleSKAKSSGGWDALRASWASSYQQVGEAVLENARLLLQMETIQAGADDFK-ERYEneqpfrkaAEEEVSS 209
Cdd:TIGR02168  216 KELKAELREL--ELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRlEVSE--------LEEEIEE 285
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    210 LYKVIDEANLTKTDLEHQIESLKEELGFLSRSYEEDVKVLYKQLAGSELEQADVPMGTGLDDVLETIRVQWERDVEKNRA 289
Cdd:TIGR02168  286 LQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEA 365
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    290 EAGALLQAKQQTEVVH------VSQTQEEKLAAA-----LSVELHDTSRQVQSLQAETESLRALKRGLENSLHDAQ-HWH 357
Cdd:TIGR02168  366 ELEELESRLEELEEQLetlrskVAQLELQIASLNneierLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAElEEL 445
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 50400333    358 DMELQNLGAVVGRLEAELAEIRSETEQQQQERAHLLACKSQLQKDVASYHALLDREEN 415
Cdd:TIGR02168  446 EEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEG 503
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
123-415 1.72e-07

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 53.40  E-value: 1.72e-07
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333 123 VHALEQVSQELETQLRAHLESKAKSsggwDALRASWASSYQQVGEAVLENARLLLQMETIQAGADDFKERYENEQPFRKA 202
Cdd:COG1196 231 LLKLRELEAELEELEAELEELEAEL----EELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIAR 306
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333 203 AEEEVSSLYKVIDEANLTKTDLEHQIESLKEELgflsRSYEEDVKVLYKQLAGSELEQADVpmgTGLDDVLETIRVQWER 282
Cdd:COG1196 307 LEERRRELEERLEELEEELAELEEELEELEEEL----EELEEELEEAEEELEEAEAELAEA---EEALLEAEAELAEAEE 379
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333 283 DVEKNRAEAGALLQAKQQTEvvhVSQTQEEKLAAALSVELHDTSRQVQSLQAETESLRALKRGLENSLHDAQHwhdmELQ 362
Cdd:COG1196 380 ELEELAEELLEALRAAAELA---AQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAE----EEA 452
                       250       260       270       280       290
                ....*....|....*....|....*....|....*....|....*....|...
gi 50400333 363 NLGAVVGRLEAELAEIRSETEQQQQERAHLLAcksQLQKDVASYHALLDREEN 415
Cdd:COG1196 453 ELEEEEEALLELLAELLEEAALLEAALAELLE---ELAEAAARLLLLLEAEAD 502
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
186-411 4.11e-05

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 45.80  E-value: 4.11e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333  186 ADDFKERYENEQpfrkaaeEEVSSLYKVIDEANLTKTDLEHQIESLKEELGFLSRSYEEDVKVLYKQLAGSELEQADVPM 265
Cdd:PRK02224 239 ADEVLEEHEERR-------EELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEA 311
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333  266 GTGLDDVLETIRVQWERDVEKNRAEAGALL-QAKQQTEVVHVSQTQEEKL---AAALSVELHDTSRQVQSLQAETESLRA 341
Cdd:PRK02224 312 VEARREELEDRDEELRDRLEECRVAAQAHNeEAESLREDADDLEERAEELreeAAELESELEEAREAVEDRREEIEELEE 391
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333  342 LKRGLENSLHDAqhwhdmelqnlGAVVGRLEAELAEIRSETEQQQQERAHLLACKSQLQKDVASYHALLD 411
Cdd:PRK02224 392 EIEELRERFGDA-----------PVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALLE 450
 
Name Accession Description Interval E-value
Filament pfam00038
Intermediate filament protein;
116-416 1.24e-57

Intermediate filament protein;


Pssm-ID: 459643 [Multi-domain]  Cd Length: 313  Bit Score: 191.29  E-value: 1.24e-57
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333   116 LVEYMTKVHALEQVSQELETQLRAHLESK-AKSSGGWDALRASWASSYQQVGEAVLENARLLLQMETIQAGADDFKERYE 194
Cdd:pfam00038  13 LASYIDKVRFLEQQNKLLETKISELRQKKgAEPSRLYSLYEKEIEDLRRQLDTLTVERARLQLELDNLRLAAEDFRQKYE 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333   195 NEQPFRKAAEEEVSSLYKVIDEANLTKTDLEHQIESLKEELGFLSRSYEEDVKVLYKQLA-GSELEQADVPMGTGLDDVL 273
Cdd:pfam00038  93 DELNLRTSAENDLVGLRKDLDEATLARVDLEAKIESLKEELAFLKKNHEEEVRELQAQVSdTQVNVEMDAARKLDLTSAL 172
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333   274 ETIRVQWERDVEKNRAEAGALLQAKqqTEVVHVSQTQEEKLAAALSVELHDTSRQVQSLQAETESLRALKRGLENSLHDA 353
Cdd:pfam00038 173 AEIRAQYEEIAAKNREEAEEWYQSK--LEELQQAAARNGDALRSAKEEITELRRTIQSLEIELQSLKKQKASLERQLAET 250
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 50400333   354 QHWHDMELQNLGAVVGRLEAELAEIRSETEQQQQERAHLLACKSQLQKDVASYHALLDREENN 416
Cdd:pfam00038 251 EERYELQLADYQELISELEAELQETRQEMARQLREYQELLNVKLALDIEIATYRKLLEGEECR 313
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
131-415 1.00e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 54.29  E-value: 1.00e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    131 QELETQLRAHleSKAKSSGGWDALRASWASSYQQVGEAVLENARLLLQMETIQAGADDFK-ERYEneqpfrkaAEEEVSS 209
Cdd:TIGR02168  216 KELKAELREL--ELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRlEVSE--------LEEEIEE 285
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    210 LYKVIDEANLTKTDLEHQIESLKEELGFLSRSYEEDVKVLYKQLAGSELEQADVPMGTGLDDVLETIRVQWERDVEKNRA 289
Cdd:TIGR02168  286 LQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEA 365
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    290 EAGALLQAKQQTEVVH------VSQTQEEKLAAA-----LSVELHDTSRQVQSLQAETESLRALKRGLENSLHDAQ-HWH 357
Cdd:TIGR02168  366 ELEELESRLEELEEQLetlrskVAQLELQIASLNneierLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAElEEL 445
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 50400333    358 DMELQNLGAVVGRLEAELAEIRSETEQQQQERAHLLACKSQLQKDVASYHALLDREEN 415
Cdd:TIGR02168  446 EEELEELQEELERLEEALEELREELEEAEQALDAAERELAQLQARLDSLERLQENLEG 503
Smc COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
123-415 1.72e-07

Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 440809 [Multi-domain]  Cd Length: 983  Bit Score: 53.40  E-value: 1.72e-07
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333 123 VHALEQVSQELETQLRAHLESKAKSsggwDALRASWASSYQQVGEAVLENARLLLQMETIQAGADDFKERYENEQPFRKA 202
Cdd:COG1196 231 LLKLRELEAELEELEAELEELEAEL----EELEAELAELEAELEELRLELEELELELEEAQAEEYELLAELARLEQDIAR 306
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333 203 AEEEVSSLYKVIDEANLTKTDLEHQIESLKEELgflsRSYEEDVKVLYKQLAGSELEQADVpmgTGLDDVLETIRVQWER 282
Cdd:COG1196 307 LEERRRELEERLEELEEELAELEEELEELEEEL----EELEEELEEAEEELEEAEAELAEA---EEALLEAEAELAEAEE 379
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333 283 DVEKNRAEAGALLQAKQQTEvvhVSQTQEEKLAAALSVELHDTSRQVQSLQAETESLRALKRGLENSLHDAQHwhdmELQ 362
Cdd:COG1196 380 ELEELAEELLEALRAAAELA---AQLEELEEAEEALLERLERLEEELEELEEALAELEEEEEEEEEALEEAAE----EEA 452
                       250       260       270       280       290
                ....*....|....*....|....*....|....*....|....*....|...
gi 50400333 363 NLGAVVGRLEAELAEIRSETEQQQQERAHLLAcksQLQKDVASYHALLDREEN 415
Cdd:COG1196 453 ELEEEEEALLELLAELLEEAALLEAALAELLE---ELAEAAARLLLLLEAEAD 502
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
121-415 6.06e-07

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 51.98  E-value: 6.06e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    121 TKVHALEQVSQELETQLRAHLESKAKSSGGWDALRASWASSYQQVGEAVLENARLLLQMETIQAGADDFKERYENEQPFR 200
Cdd:TIGR02168  698 KALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEEL 777
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    201 KAAEEEVSSLYKVIDEANLTKTDLEHQIESLKEELGFLSRSYEEdvkvlyKQLAGSELEQADVPMGTGLDDVLETIRVQW 280
Cdd:TIGR02168  778 AEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAAN------LRERLESLERRIAATERRLEDLEEQIEELS 851
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    281 ER--DVEKNRAEAGALLqAKQQTEVVHVSQ--TQEEKLAAALSVELHDTSRQVQSLQAETESLRALKRGLENSLHDAQhw 356
Cdd:TIGR02168  852 EDieSLAAEIEELEELI-EELESELEALLNerASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLE-- 928
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    357 hdMELQnlgavvgRLEAELAEIRSE-TEQQQQERAHLLACKSQLQKDVASYHALLDREEN 415
Cdd:TIGR02168  929 --LRLE-------GLEVRIDNLQERlSEEYSLTLEEAEALENKIEDDEEEARRRLKRLEN 979
PRK02224 PRK02224
DNA double-strand break repair Rad50 ATPase;
186-411 4.11e-05

DNA double-strand break repair Rad50 ATPase;


Pssm-ID: 179385 [Multi-domain]  Cd Length: 880  Bit Score: 45.80  E-value: 4.11e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333  186 ADDFKERYENEQpfrkaaeEEVSSLYKVIDEANLTKTDLEHQIESLKEELGFLSRSYEEDVKVLYKQLAGSELEQADVPM 265
Cdd:PRK02224 239 ADEVLEEHEERR-------EELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADAEA 311
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333  266 GTGLDDVLETIRVQWERDVEKNRAEAGALL-QAKQQTEVVHVSQTQEEKL---AAALSVELHDTSRQVQSLQAETESLRA 341
Cdd:PRK02224 312 VEARREELEDRDEELRDRLEECRVAAQAHNeEAESLREDADDLEERAEELreeAAELESELEEAREAVEDRREEIEELEE 391
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333  342 LKRGLENSLHDAqhwhdmelqnlGAVVGRLEAELAEIRSETEQQQQERAHLLACKSQLQKDVASYHALLD 411
Cdd:PRK02224 392 EIEELRERFGDA-----------PVDLGNAEDFLEELREERDELREREAELEATLRTARERVEEAEALLE 450
SMC_prok_A TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
131-400 4.30e-05

chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274009 [Multi-domain]  Cd Length: 1164  Bit Score: 45.83  E-value: 4.30e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    131 QELETQLRAHLESKAKSSGGWDALRASWASSYQQVGEAVLENARLLLQMETIQAGADDFKERYENEQPFRKAAEEEVSSL 210
Cdd:TIGR02169  684 EGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKEL 763
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    211 YKVIDEanltktdLEHQIESLKEELGFLSRSyeedvkvlykqLAGSELEQADvpmgtGLDDVLETIRVQWErdveknrae 290
Cdd:TIGR02169  764 EARIEE-------LEEDLHKLEEALNDLEAR-----------LSHSRIPEIQ-----AELSKLEEEVSRIE--------- 811
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    291 aGALLQAKQQTEVVHVSQTQEEKLAAALSVELHDTSRQVQSLQAETESLRALKRGLENSLHDAQHW---HDMELQNLGAV 367
Cdd:TIGR02169  812 -ARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAAlrdLESRLGDLKKE 890
                          250       260       270
                   ....*....|....*....|....*....|....*..
gi 50400333    368 VGRLEAELAEIRSETEQQ----QQERAHLLACKSQLQ 400
Cdd:TIGR02169  891 RDELEAQLRELERKIEELeaqiEKKRKRLSELKAKLE 927
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
126-401 5.72e-05

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 45.43  E-value: 5.72e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    126 LEQVSQELETQLRAHLESKAKSSGGWDALRASWASSYQQVGEAVLENARLLLQMETIQAGADDFKERYENEQPFRKAAEE 205
Cdd:TIGR02168  752 LSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLER 831
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    206 EVSSLYKVIDEANLTKTDLEHQIESLKEELGFLSRSYEE---DVKVLYKQLAgsELEQADVPMGTGLDDVLETIRvQWER 282
Cdd:TIGR02168  832 RIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEElesELEALLNERA--SLEEALALLRSELEELSEELR-ELES 908
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    283 DVEKNRAEAGALlQAKQQTEVVHVS------QTQEEKLAAALSVELHDTSRQVQSLQAETESLRALKRGLENslhdaqhw 356
Cdd:TIGR02168  909 KRSELRRELEEL-REKLAQLELRLEglevriDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLEN-------- 979
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 50400333    357 hdmELQNLGAVvgRLEAeLAEIRSETEQQQ---QERAHLLACKSQLQK 401
Cdd:TIGR02168  980 ---KIKELGPV--NLAA-IEEYEELKERYDfltAQKEDLTEAKETLEE 1021
CCDC158 pfam15921
Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. ...
222-384 1.82e-03

Coiled-coil domain-containing protein 158; CCDC158 is a family of proteins found in eukaryotes. The function is not known.


Pssm-ID: 464943 [Multi-domain]  Cd Length: 1112  Bit Score: 40.87  E-value: 1.82e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    222 TDLEHQIESLKEELGFLSRSYEEDVKVLYKQ--LAGSELEQADVPM------GTGLDDVLETIRVqwerDVEKNRAEaga 293
Cdd:pfam15921  320 SDLESTVSQLRSELREAKRMYEDKIEELEKQlvLANSELTEARTERdqfsqeSGNLDDQLQKLLA----DLHKREKE--- 392
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    294 LLQAKQQTEVVHVSQTQEEKLAAALSVELHDTSRQVQSLQAETESLRALKRGLENSLHDAQHWHDMELQNLGAVVGRLEA 373
Cdd:pfam15921  393 LSLEKEQNKRLWDRDTGNSITIDHLRRELDDRNMEVQRLEALLKAMKSECQGQMERQMAAIQGKNESLEKVSSLTAQLES 472
                          170
                   ....*....|.
gi 50400333    374 ELAEIRSETEQ 384
Cdd:pfam15921  473 TKEMLRKVVEE 483
CEP63 pfam17045
Centrosomal protein of 63 kDa; CEP63 is a family of eukaryotic proteins involved in centriole ...
204-414 3.88e-03

Centrosomal protein of 63 kDa; CEP63 is a family of eukaryotic proteins involved in centriole activity.


Pssm-ID: 465338 [Multi-domain]  Cd Length: 264  Bit Score: 38.65  E-value: 3.88e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333   204 EEEVSSLYKVIDEANLTKTDL----EHQIESLKEELGFLSRSYEedvKVLYKQL----AGSELEQADVPMGTGLDDVLET 275
Cdd:pfam17045  55 HKEIGLLRQQLEELEKGKQELvakyEQQLQKLQEELSKLKRSYE---KLQRKQLkearEEAKSREEDRSELSRLNGKLEE 131
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333   276 IRvQWERDVEKNRAEAG---ALLQAKQQTEVVHVSQTQeeklAAALSVELHDTSRQVQSLQAETESLRALKRGLENSLHD 352
Cdd:pfam17045 132 FR-QKSLEWEQQRLQYQqqvASLEAQRKALAEQSSLIQ----SAAYQVQLEGRKQCLEASQSEIQRLRSKLERAQDSLCA 206
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 50400333   353 aqhwHDMELQNLGAVVGRLEAELAEIRSETEQQQQERAHLLACKSQLQKDVASYHALLDREE 414
Cdd:pfam17045 207 ----QELELERLRMRVSELGDSNRKLLEEQQRLLEELRMSQRQLQVLQNELMELKATLQSQD 264
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
219-405 4.91e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 39.13  E-value: 4.91e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333  219 LTKTDLEHQIESLKEELGFLSRSYEEdVKVLYKQL--------AGSELEQADVPMGTgLDDVLETIRVQ--------WER 282
Cdd:COG4913  218 LEEPDTFEAADALVEHFDDLERAHEA-LEDAREQIellepireLAERYAAARERLAE-LEYLRAALRLWfaqrrlelLEA 295
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333  283 DVEKNRAEAGALLQAKQQTEVvHVSQTQEEKLAAALSVELHDTsRQVQSLQAETESLRALKRGLENSLHDAQHW-HDMEL 361
Cdd:COG4913  296 ELEELRAELARLEAELERLEA-RLDALREELDELEAQIRGNGG-DRLEQLEREIERLERELEERERRRARLEALlAALGL 373
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|
gi 50400333  362 QN------LGAVVGRLEAELAEIRSETEQQQQERAHLLACKSQLQKDVAS 405
Cdd:COG4913  374 PLpasaeeFAALRAEAAALLEALEEELEALEEALAEAEAALRDLRRELRE 423
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
202-380 5.74e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 39.13  E-value: 5.74e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333  202 AAEEEVSSLYKVIDEANLTKTDLEHQIESLKEELGFLSRSYEEdvkvLYKQLAGSELEQADvpmgtglddvletirvQWE 281
Cdd:COG4913  285 FAQRRLELLEAELEELRAELARLEAELERLEARLDALREELDE----LEAQIRGNGGDRLE----------------QLE 344
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333  282 RDVEKNRAEAGALLQAKQQTEvvhvsqtqeeKLAAALSVELHDTSRQVQSLQAETESLRALKRGLENSLHDAQHWHDMEL 361
Cdd:COG4913  345 REIERLERELEERERRRARLE----------ALLAALGLPLPASAEEFAALRAEAAALLEALEEELEALEEALAEAEAAL 414
                        170
                 ....*....|....*....
gi 50400333  362 QNLGAVVGRLEAELAEIRS 380
Cdd:COG4913  415 RDLRRELRELEAEIASLER 433
GumC COG3206
Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];
223-414 7.63e-03

Exopolysaccharide export protein/domain GumC/Wzc1 [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442439 [Multi-domain]  Cd Length: 687  Bit Score: 38.46  E-value: 7.63e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333 223 DLEHQIESLKEELGFLSrsYEEDVKVLYKQLAG--SELEQADVPMgtgldDVLETIRVQWERDVEKNRAEAGALLQAKQQ 300
Cdd:COG3206 193 EAEAALEEFRQKNGLVD--LSEEAKLLLQQLSEleSQLAEARAEL-----AEAEARLAALRAQLGSGPDALPELLQSPVI 265
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333 301 TEVVHVSQTQEEKLAAaLSVELHDTSRQVQSLQAETESLRA-LKRGLENSLHDAQHwhdmELQNLGAVVGRLEAELAEIR 379
Cdd:COG3206 266 QQLRAQLAELEAELAE-LSARYTPNHPDVIALRAQIAALRAqLQQEAQRILASLEA----ELEALQAREASLQAQLAQLE 340
                       170       180       190
                ....*....|....*....|....*....|....*
gi 50400333 380 SETEQQQQERAHLLACKSQLQKDVASYHALLDREE 414
Cdd:COG3206 341 ARLAELPELEAELRRLEREVEVARELYESLLQRLE 375
COG4913 COG4913
Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];
122-294 8.17e-03

Uncharacterized conserved protein, contains a C-terminal ATPase domain [Function unknown];


Pssm-ID: 443941 [Multi-domain]  Cd Length: 1089  Bit Score: 38.74  E-value: 8.17e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333  122 KVHALEQVSQELETQLRAHLESKAKSSGGWDALRASwASSYQQ----------VGEAVLENARLLLQMETIQAGADDFKE 191
Cdd:COG4913  611 KLAALEAELAELEEELAEAEERLEALEAELDALQER-REALQRlaeyswdeidVASAEREIAELEAELERLDASSDDLAA 689
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333  192 RYENeqpfRKAAEEEVSSLYKVIDEANLTKTDLEHQIESLKEELGFLSR---SYEEDVKVLYKQLAGSELEQADVpmgtg 268
Cdd:COG4913  690 LEEQ----LEELEAELEELEEELDELKGEIGRLEKELEQAEEELDELQDrleAAEDLARLELRALLEERFAAALG----- 760
                        170       180
                 ....*....|....*....|....*.
gi 50400333  269 lDDVLETIRVQWERDVEKNRAEAGAL 294
Cdd:COG4913  761 -DAVERELRENLEERIDALRARLNRA 785
DUF3584 pfam12128
Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. ...
200-405 8.43e-03

Protein of unknown function (DUF3584); This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 943 to 1234 amino acids in length. This family contains a P-loop motif suggesting it is a nucleotide binding protein. It may be involved in replication.


Pssm-ID: 432349 [Multi-domain]  Cd Length: 1191  Bit Score: 38.67  E-value: 8.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    200 RKAAEEEVSSLYKVIDEANLTKTDL--EHQIESLKEELGFLSrsyeedvkvlykqlAGSELEQADVPMGTGLDDVLETIR 277
Cdd:pfam12128  541 RKEAPDWEQSIGKVISPELLHRTDLdpEVWDGSVGGELNLYG--------------VKLDLKRIDVPEWAASEEELRERL 606
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 50400333    278 VQWERDVEKNRAEAGALLQAKQQTEVVHVSQTQEEKLAAA-----------LSVELHDTSRQVQSLQAE-----TESLRA 341
Cdd:pfam12128  607 DKAEEALQSAREKQAAAEEQLVQANGELEKASREETFARTalknarldlrrLFDEKQSEKDKKNKALAErkdsaNERLNS 686
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 50400333    342 LKRGLENSLHDAQHW----------HDME-LQNLGAVVGRLEAELAEIRSETEQQQ-QERAHLLACKSQLQKDVAS 405
Cdd:pfam12128  687 LEAQLKQLDKKHQAWleeqkeqkreARTEkQAYWQVVEGALDAQLALLKAAIAARRsGAKAELKALETWYKRDLAS 762
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH