NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1876941750|ref|NP_001372460|]
View 

caprin-2 isoform 5 [Homo sapiens]

Protein Classification

complement C1q tumor necrosis factor-related protein( domain architecture ID 12115385)

complement C1q tumor necrosis factor-related protein (C1q/TNF) plays diverse and important roles in immune, endocrine, skeletal, neuronal, reproductive, sensory, and vascular systems

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Caprin-1_C pfam12287
Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is ...
287-601 3.47e-163

Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is found in eukaryotes. Proteins in this family are typically between 343 and 708 amino acids in length. This family is the C terminal region of caprin-1. Caprin-1 is a protein involved in regulating cellular proliferation. In mutated phenotypes, the G1 phase of the cell cycle is greatly lengthened, impairing normal proliferation. The C terminal region of caprin-1 contains RGG motifs which are characteriztic of RNA binding domains. It is possible that caprin-1 functions through an RNA binding mechanism.


:

Pssm-ID: 463522 [Multi-domain]  Cd Length: 320  Bit Score: 475.44  E-value: 3.47e-163
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 287 LQDLMTQIQGTCNFMQESVLDFDKPS-SAIPTSQPPSATP-----GSPVASKEQNLSSQSDFLQEPLQATSSPVTCSSNA 360
Cdd:pfam12287   1 LQDLMAQIQGTYNFMQDSMLDFDKPSdSAIVSAQPPSQSPdlsqmVCPPASPEQRLSQQSDVLQQPEQTQVSPVSPSSNA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 361 ClvttdqASSGSETEFMTSET--PEAAIPP---GKQPSSLASPNPPMAKGSE-QGFQSPPASSSSVTINTAPFQAMQTVF 434
Cdd:pfam12287  81 C------ASSGSEYQFHTSEPpqPEAIDPIqssMSLPSELAPPSPPLSPASQpQVFQSKPASSSGINVNAAPFQSMQTVF 154
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 435 NVNAPLPPRKEQEIKE-SPYSPGYNQSFTTASTQTPPQCQLPSIHVEQTVhsqeTANYHPDGTIQVSNGSLAFYPAQTNV 513
Cdd:pfam12287 155 NVNAPVPPRNEQELKEsSQYSSGYNQSFSSQSTQTVPQCQLPSEQLEQTV----VGAYHPDGTIQVSNGHLAFYPAQTNG 230
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 514 FPRPTQPFVNSRGSVRGCTRGGRLITNSYRSPGGYK-GFDTYRG-LPSISNGNYSQLQFQAREYSGAPYSQRDNFQQCYK 591
Cdd:pfam12287 231 FPRPPQPFYNSRGSPRGGPRGGRGLMNGYRGPNGFKgGFDGYRGpFPNTPNGGYGQLQFQARDYSGTPYSQRDGYQQNYK 310
                         330
                  ....*....|
gi 1876941750 592 RGGTSGGPRA 601
Cdd:pfam12287 311 RGGTQSGPRA 320
C1q pfam00386
C1q domain; C1q is a subunit of the C1 enzyme complex that activates the serum complement ...
665-790 3.94e-41

C1q domain; C1q is a subunit of the C1 enzyme complex that activates the serum complement system.


:

Pssm-ID: 395310 [Multi-domain]  Cd Length: 126  Bit Score: 146.66  E-value: 3.94e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 665 AFSAARTSNLAPGTlDQPIVFDLLLNNLGETFDLQLGRFNCPVNGTYVFIFHMLKLAvNVPLYVNLMKNEEVLVSAYAND 744
Cdd:pfam00386   1 AFSAGRTTGLTAPN-EQPVRFDKVLTNIGGHYDPATGKFTCPVPGVYYFSYHITTVD-GKSLYVSLVKNGQEVVSFYDQP 78
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 1876941750 745 GAPDHETASNHAILQLFQGDQIWLRLH--RGAIYGSSWKYSTFSGYLL 790
Cdd:pfam00386  79 QKGSLDVASGSVVLELQRGDEVWLQLTgyNGLYYDGSDTDSTFSGFLL 126
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
97-469 2.92e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 2.92e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750  97 QEVSKPAVSLEQRKQDTSKLRSTLPEEQKKQEISKSKPSPSQwkqdTPKSKAGYVQEEQKKQETPKLWPVQLQKEQDPKK 176
Cdd:pfam03154 175 QAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQ----PPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQ 250
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 177 QTPKSWTPSMQSEQNTTKSW------------------------TTPMCEEQDSKQPETPKSWENNVESQKHsltsQSQI 232
Cdd:pfam03154 251 PMTQPPPPSQVSPQPLPQPSlhgqmppmphslqtgpshmqhpvpPQPFPLTPQSSQSQVPPGPSPAAPGQSQ----QRIH 326
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 233 SPKSWGVATASLIPNDQLLPRKLNTEPKDVPKPvhqpvgsSSTLPKDPVLRKEKLQdlmTQIQGTCNFMQESVLDFD--- 309
Cdd:pfam03154 327 TPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPP-------TTPIPQLPNPQSHKHP---PHLSGPSPFQMNSNLPPPpal 396
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 310 KPSSAIPTSQPPSATPgSPVaskeqNLSSQSDFLQEPlqATSSPVTCSSNACLVTTDQASSGSETEFMTSETPEAAIP-- 387
Cdd:pfam03154 397 KPLSSLSTHHPPSAHP-PPL-----QLMPQSQQLPPP--PAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPfv 468
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 388 PGKQPSSLASPNPPMAKGSEQGFQSPPASSSSVTINTAPFQAMQTVFNVNAPLPPRKEQEIKESPYSPGYNQSFTTASTQ 467
Cdd:pfam03154 469 PGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVN 548

                  ..
gi 1876941750 468 TP 469
Cdd:pfam03154 549 TP 550
 
Name Accession Description Interval E-value
Caprin-1_C pfam12287
Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is ...
287-601 3.47e-163

Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is found in eukaryotes. Proteins in this family are typically between 343 and 708 amino acids in length. This family is the C terminal region of caprin-1. Caprin-1 is a protein involved in regulating cellular proliferation. In mutated phenotypes, the G1 phase of the cell cycle is greatly lengthened, impairing normal proliferation. The C terminal region of caprin-1 contains RGG motifs which are characteriztic of RNA binding domains. It is possible that caprin-1 functions through an RNA binding mechanism.


Pssm-ID: 463522 [Multi-domain]  Cd Length: 320  Bit Score: 475.44  E-value: 3.47e-163
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 287 LQDLMTQIQGTCNFMQESVLDFDKPS-SAIPTSQPPSATP-----GSPVASKEQNLSSQSDFLQEPLQATSSPVTCSSNA 360
Cdd:pfam12287   1 LQDLMAQIQGTYNFMQDSMLDFDKPSdSAIVSAQPPSQSPdlsqmVCPPASPEQRLSQQSDVLQQPEQTQVSPVSPSSNA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 361 ClvttdqASSGSETEFMTSET--PEAAIPP---GKQPSSLASPNPPMAKGSE-QGFQSPPASSSSVTINTAPFQAMQTVF 434
Cdd:pfam12287  81 C------ASSGSEYQFHTSEPpqPEAIDPIqssMSLPSELAPPSPPLSPASQpQVFQSKPASSSGINVNAAPFQSMQTVF 154
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 435 NVNAPLPPRKEQEIKE-SPYSPGYNQSFTTASTQTPPQCQLPSIHVEQTVhsqeTANYHPDGTIQVSNGSLAFYPAQTNV 513
Cdd:pfam12287 155 NVNAPVPPRNEQELKEsSQYSSGYNQSFSSQSTQTVPQCQLPSEQLEQTV----VGAYHPDGTIQVSNGHLAFYPAQTNG 230
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 514 FPRPTQPFVNSRGSVRGCTRGGRLITNSYRSPGGYK-GFDTYRG-LPSISNGNYSQLQFQAREYSGAPYSQRDNFQQCYK 591
Cdd:pfam12287 231 FPRPPQPFYNSRGSPRGGPRGGRGLMNGYRGPNGFKgGFDGYRGpFPNTPNGGYGQLQFQARDYSGTPYSQRDGYQQNYK 310
                         330
                  ....*....|
gi 1876941750 592 RGGTSGGPRA 601
Cdd:pfam12287 311 RGGTQSGPRA 320
C1q pfam00386
C1q domain; C1q is a subunit of the C1 enzyme complex that activates the serum complement ...
665-790 3.94e-41

C1q domain; C1q is a subunit of the C1 enzyme complex that activates the serum complement system.


Pssm-ID: 395310 [Multi-domain]  Cd Length: 126  Bit Score: 146.66  E-value: 3.94e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 665 AFSAARTSNLAPGTlDQPIVFDLLLNNLGETFDLQLGRFNCPVNGTYVFIFHMLKLAvNVPLYVNLMKNEEVLVSAYAND 744
Cdd:pfam00386   1 AFSAGRTTGLTAPN-EQPVRFDKVLTNIGGHYDPATGKFTCPVPGVYYFSYHITTVD-GKSLYVSLVKNGQEVVSFYDQP 78
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 1876941750 745 GAPDHETASNHAILQLFQGDQIWLRLH--RGAIYGSSWKYSTFSGYLL 790
Cdd:pfam00386  79 QKGSLDVASGSVVLELQRGDEVWLQLTgyNGLYYDGSDTDSTFSGFLL 126
C1Q smart00110
Complement component C1q domain; Globular domain found in many collagens and eponymously in ...
659-793 2.13e-32

Complement component C1q domain; Globular domain found in many collagens and eponymously in complement C1q. When part of full length proteins these domains form a 'bouquet' due to the multimerization of heterotrimers. The C1q fold is similar to that of tumour necrosis factor.


Pssm-ID: 128420  Cd Length: 135  Bit Score: 122.03  E-value: 2.13e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750  659 PQQMRVAFSAARTSNLAPGtlDQPIVFDLLLNNLGETFDLQLGRFNCPVNGTYVFIFHMLKLAVNVplYVNLMKNEEVLV 738
Cdd:smart00110   3 KAQPRSAFSVIRSNRPPPP--GQPIRFDKVLYNQQGHYDPRTGKFTCPVPGVYYFSYHVESKGRNV--KVSLMKNGIQVM 78
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1876941750  739 SAYANDGAPDHETASNHAILQLFQGDQIWLRLHR--GAIYGSSWKYSTFSGYLLYQD 793
Cdd:smart00110  79 STYDEYQKGLYDVASGGALLQLRQGDQVWLELPDekNGLYAGEYVDSTFSGFLLFPD 135
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
97-469 2.92e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 2.92e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750  97 QEVSKPAVSLEQRKQDTSKLRSTLPEEQKKQEISKSKPSPSQwkqdTPKSKAGYVQEEQKKQETPKLWPVQLQKEQDPKK 176
Cdd:pfam03154 175 QAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQ----PPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQ 250
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 177 QTPKSWTPSMQSEQNTTKSW------------------------TTPMCEEQDSKQPETPKSWENNVESQKHsltsQSQI 232
Cdd:pfam03154 251 PMTQPPPPSQVSPQPLPQPSlhgqmppmphslqtgpshmqhpvpPQPFPLTPQSSQSQVPPGPSPAAPGQSQ----QRIH 326
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 233 SPKSWGVATASLIPNDQLLPRKLNTEPKDVPKPvhqpvgsSSTLPKDPVLRKEKLQdlmTQIQGTCNFMQESVLDFD--- 309
Cdd:pfam03154 327 TPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPP-------TTPIPQLPNPQSHKHP---PHLSGPSPFQMNSNLPPPpal 396
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 310 KPSSAIPTSQPPSATPgSPVaskeqNLSSQSDFLQEPlqATSSPVTCSSNACLVTTDQASSGSETEFMTSETPEAAIP-- 387
Cdd:pfam03154 397 KPLSSLSTHHPPSAHP-PPL-----QLMPQSQQLPPP--PAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPfv 468
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 388 PGKQPSSLASPNPPMAKGSEQGFQSPPASSSSVTINTAPFQAMQTVFNVNAPLPPRKEQEIKESPYSPGYNQSFTTASTQ 467
Cdd:pfam03154 469 PGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVN 548

                  ..
gi 1876941750 468 TP 469
Cdd:pfam03154 549 TP 550
PHA03247 PHA03247
large tegument protein UL36; Provisional
314-475 1.43e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 1.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750  314 AIPTSQPPSATPGSPVASkeqnLSSQSDFLQEPLQATSSPVTCSSNACLVTTDQASSGSETEFMTSETPEAAIPPGKQPS 393
Cdd:PHA03247  2773 AAPAAGPPRRLTRPAVAS----LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPP 2848
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750  394 SLaSPNPPMAKGSEqgFQSPPASSSSVTINTAPfqAMQTVFNVNAPLPPRKEQEIKESPYSPgynQSFTTASTQTPPQCQ 473
Cdd:PHA03247  2849 SL-PLGGSVAPGGD--VRRRPPSRSPAAKPAAP--ARPPVRRLARPAVSRSTESFALPPDQP---ERPPQPQAPPPPQPQ 2920

                   ..
gi 1876941750  474 LP 475
Cdd:PHA03247  2921 PQ 2922
 
Name Accession Description Interval E-value
Caprin-1_C pfam12287
Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is ...
287-601 3.47e-163

Cytoplasmic activation/proliferation-associated protein-1 C term; This family of proteins is found in eukaryotes. Proteins in this family are typically between 343 and 708 amino acids in length. This family is the C terminal region of caprin-1. Caprin-1 is a protein involved in regulating cellular proliferation. In mutated phenotypes, the G1 phase of the cell cycle is greatly lengthened, impairing normal proliferation. The C terminal region of caprin-1 contains RGG motifs which are characteriztic of RNA binding domains. It is possible that caprin-1 functions through an RNA binding mechanism.


Pssm-ID: 463522 [Multi-domain]  Cd Length: 320  Bit Score: 475.44  E-value: 3.47e-163
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 287 LQDLMTQIQGTCNFMQESVLDFDKPS-SAIPTSQPPSATP-----GSPVASKEQNLSSQSDFLQEPLQATSSPVTCSSNA 360
Cdd:pfam12287   1 LQDLMAQIQGTYNFMQDSMLDFDKPSdSAIVSAQPPSQSPdlsqmVCPPASPEQRLSQQSDVLQQPEQTQVSPVSPSSNA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 361 ClvttdqASSGSETEFMTSET--PEAAIPP---GKQPSSLASPNPPMAKGSE-QGFQSPPASSSSVTINTAPFQAMQTVF 434
Cdd:pfam12287  81 C------ASSGSEYQFHTSEPpqPEAIDPIqssMSLPSELAPPSPPLSPASQpQVFQSKPASSSGINVNAAPFQSMQTVF 154
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 435 NVNAPLPPRKEQEIKE-SPYSPGYNQSFTTASTQTPPQCQLPSIHVEQTVhsqeTANYHPDGTIQVSNGSLAFYPAQTNV 513
Cdd:pfam12287 155 NVNAPVPPRNEQELKEsSQYSSGYNQSFSSQSTQTVPQCQLPSEQLEQTV----VGAYHPDGTIQVSNGHLAFYPAQTNG 230
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 514 FPRPTQPFVNSRGSVRGCTRGGRLITNSYRSPGGYK-GFDTYRG-LPSISNGNYSQLQFQAREYSGAPYSQRDNFQQCYK 591
Cdd:pfam12287 231 FPRPPQPFYNSRGSPRGGPRGGRGLMNGYRGPNGFKgGFDGYRGpFPNTPNGGYGQLQFQARDYSGTPYSQRDGYQQNYK 310
                         330
                  ....*....|
gi 1876941750 592 RGGTSGGPRA 601
Cdd:pfam12287 311 RGGTQSGPRA 320
C1q pfam00386
C1q domain; C1q is a subunit of the C1 enzyme complex that activates the serum complement ...
665-790 3.94e-41

C1q domain; C1q is a subunit of the C1 enzyme complex that activates the serum complement system.


Pssm-ID: 395310 [Multi-domain]  Cd Length: 126  Bit Score: 146.66  E-value: 3.94e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 665 AFSAARTSNLAPGTlDQPIVFDLLLNNLGETFDLQLGRFNCPVNGTYVFIFHMLKLAvNVPLYVNLMKNEEVLVSAYAND 744
Cdd:pfam00386   1 AFSAGRTTGLTAPN-EQPVRFDKVLTNIGGHYDPATGKFTCPVPGVYYFSYHITTVD-GKSLYVSLVKNGQEVVSFYDQP 78
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 1876941750 745 GAPDHETASNHAILQLFQGDQIWLRLH--RGAIYGSSWKYSTFSGYLL 790
Cdd:pfam00386  79 QKGSLDVASGSVVLELQRGDEVWLQLTgyNGLYYDGSDTDSTFSGFLL 126
C1Q smart00110
Complement component C1q domain; Globular domain found in many collagens and eponymously in ...
659-793 2.13e-32

Complement component C1q domain; Globular domain found in many collagens and eponymously in complement C1q. When part of full length proteins these domains form a 'bouquet' due to the multimerization of heterotrimers. The C1q fold is similar to that of tumour necrosis factor.


Pssm-ID: 128420  Cd Length: 135  Bit Score: 122.03  E-value: 2.13e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750  659 PQQMRVAFSAARTSNLAPGtlDQPIVFDLLLNNLGETFDLQLGRFNCPVNGTYVFIFHMLKLAVNVplYVNLMKNEEVLV 738
Cdd:smart00110   3 KAQPRSAFSVIRSNRPPPP--GQPIRFDKVLYNQQGHYDPRTGKFTCPVPGVYYFSYHVESKGRNV--KVSLMKNGIQVM 78
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1876941750  739 SAYANDGAPDHETASNHAILQLFQGDQIWLRLHR--GAIYGSSWKYSTFSGYLLYQD 793
Cdd:smart00110  79 STYDEYQKGLYDVASGGALLQLRQGDQVWLELPDekNGLYAGEYVDSTFSGFLLFPD 135
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
238-625 1.24e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 48.76  E-value: 1.24e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 238 GVATASLIPNDQLLPRklNTEPKDVPKPVHQPVGSSSTLPKDPVLRKEKLQDlmtqiqgtcNFMQESVLDFDKPSSAIPT 317
Cdd:pfam05109 447 GLPSSTHVPTNLTAPA--STGPTVSTADVTSPTPAGTTSGASPVTPSPSPRD---------NGTESKAPDMTSPTSAVTT 515
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 318 SQPPSATPGSPVASKEQNLSSqsdflqePLQATSSPVTCssnaclVTTDQASSGSETEFMTSETPEAAIPP-GK-QPSSL 395
Cdd:pfam05109 516 PTPNATSPTPAVTTPTPNATS-------PTLGKTSPTSA------VTTPTPNATSPTPAVTTPTPNATIPTlGKtSPTSA 582
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 396 ASPNPPMAKGSEQGFQSPPA---------SSSSVTINTAPFQAMQTVFNVNAPLPPRKEQEIKESPYSPGYNQSFTTAST 466
Cdd:pfam05109 583 VTTPTPNATSPTVGETSPQAnttnhtlggTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDN 662
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 467 QTPPQCQLPSIHVE-----QTVHSQETANYHpdgtiqVSNGSLAFYPAQTNVFPRPTQPFVNSRGSVRGCTRGGRLI-TN 540
Cdd:pfam05109 663 STSHMPLLTSAHPTggeniTQVTPASTSTHH------VSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKnAT 736
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 541 SYRSPGGYKgfdtyRGLPSI-SNGNYSQLQFQAREYSGapYSQRDNFQQCYKRGGTSGGPRANSRAGWSDSSQVSSPERD 619
Cdd:pfam05109 737 SPQAPSGQK-----TAVPTVtSTGGKANSTTGGKHTTG--HGARTSTEPTTDYGGDSTTPRTRYNATTYLPPSTSSKLRP 809

                  ....*.
gi 1876941750 620 NETFNS 625
Cdd:pfam05109 810 RWTFTS 815
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
97-469 2.92e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 2.92e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750  97 QEVSKPAVSLEQRKQDTSKLRSTLPEEQKKQEISKSKPSPSQwkqdTPKSKAGYVQEEQKKQETPKLWPVQLQKEQDPKK 176
Cdd:pfam03154 175 QAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQ----PPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQ 250
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 177 QTPKSWTPSMQSEQNTTKSW------------------------TTPMCEEQDSKQPETPKSWENNVESQKHsltsQSQI 232
Cdd:pfam03154 251 PMTQPPPPSQVSPQPLPQPSlhgqmppmphslqtgpshmqhpvpPQPFPLTPQSSQSQVPPGPSPAAPGQSQ----QRIH 326
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 233 SPKSWGVATASLIPNDQLLPRKLNTEPKDVPKPvhqpvgsSSTLPKDPVLRKEKLQdlmTQIQGTCNFMQESVLDFD--- 309
Cdd:pfam03154 327 TPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPP-------TTPIPQLPNPQSHKHP---PHLSGPSPFQMNSNLPPPpal 396
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 310 KPSSAIPTSQPPSATPgSPVaskeqNLSSQSDFLQEPlqATSSPVTCSSNACLVTTDQASSGSETEFMTSETPEAAIP-- 387
Cdd:pfam03154 397 KPLSSLSTHHPPSAHP-PPL-----QLMPQSQQLPPP--PAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPfv 468
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750 388 PGKQPSSLASPNPPMAKGSEQGFQSPPASSSSVTINTAPFQAMQTVFNVNAPLPPRKEQEIKESPYSPGYNQSFTTASTQ 467
Cdd:pfam03154 469 PGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAVSCPLPPVQIKEEALDEAEEPESPPPPPRSPSPEPTVVN 548

                  ..
gi 1876941750 468 TP 469
Cdd:pfam03154 549 TP 550
PHA03247 PHA03247
large tegument protein UL36; Provisional
314-475 1.43e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 1.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750  314 AIPTSQPPSATPGSPVASkeqnLSSQSDFLQEPLQATSSPVTCSSNACLVTTDQASSGSETEFMTSETPEAAIPPGKQPS 393
Cdd:PHA03247  2773 AAPAAGPPRRLTRPAVAS----LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPP 2848
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750  394 SLaSPNPPMAKGSEqgFQSPPASSSSVTINTAPfqAMQTVFNVNAPLPPRKEQEIKESPYSPgynQSFTTASTQTPPQCQ 473
Cdd:PHA03247  2849 SL-PLGGSVAPGGD--VRRRPPSRSPAAKPAAP--ARPPVRRLARPAVSRSTESFALPPDQP---ERPPQPQAPPPPQPQ 2920

                   ..
gi 1876941750  474 LP 475
Cdd:PHA03247  2921 PQ 2922
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
314-678 1.49e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.08  E-value: 1.49e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750  314 AIPTSQPPSATPGSPVAskeqnlssqsdflQEPLQATSSPVTCSSNACLVTT-DQASSGSETEFMTSETPEAAIPPGKQP 392
Cdd:PHA03307    61 ACDRFEPPTGPPPGPGT-------------EAPANESRSTPTWSLSTLAPASpAREGSPTPPGPSSPDPPPPTPPPASPP 127
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750  393 SSLASPNPPMAKGSEQGFQSPPASSSSvtintapfqamqtvfnvnAPLPPRKEQEIKESPYSPGYNQSFTTASTQTPPQC 472
Cdd:PHA03307   128 PSPAPDLSEMLRPVGSPGPPPAASPPA------------------AGASPAAVASDAASSRQAALPLSSPEETARAPSSP 189
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750  473 QlPSIHVEQTVHSQETANYHPDGTIQVSNGSLAFYPAQTNVFPRPTQPFVNSRGSVRGCTRGGRLITNSYRSPGGYKGFD 552
Cdd:PHA03307   190 P-AEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTR 268
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1876941750  553 TYRGLPSISNGNYSQLQFQA---REYSGAPYSQRDnfqqcyKRGGTSGGPRANSRAGWSDSSQVSSPERDNETFNSGDSG 629
Cdd:PHA03307   269 IWEASGWNGPSSRPGPASSSsspRERSPSPSPSSP------GSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVS 342
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*....
gi 1876941750  630 QGDSRSMTPVDVPVTNPAATilpvhvyPLPQQMRVAFSAARTSNLAPGT 678
Cdd:PHA03307   343 PGPSPSRSPSPSRPPPPADP-------SSPRKRPRPSRAPSSPAASAGR 384
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH